home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: NOUN

There are 814 NOUN lemmas (44%), 1033 NOUN types (40%) and 1990 NOUN tokens (20%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: den, levr, bro, ti, labour, yezh, brezhoneg, bugel, rannvro, breur

The 10 most frequent NOUN types: levr, dud, den, ti, brezhoneg, labour, vugale, yezh, vro, istor

The 10 most frequent ambiguous lemmas: den (NOUN 73, X 1), yezh (NOUN 22, ADJ 1), tra (NOUN 18, PRON 3, ADV 2, X 1), kinnig (VERB 35, NOUN 12), miz (X 15, NOUN 6), stourm (NOUN 5, VERB 4), kelenn (NOUN 4, VERB 1), en-dro (ADV 3, NOUN 3, X 1), gwech (NOUN 3, X 1), koulz (NOUN 3, X 3)

The 10 most frequent ambiguous types: dud (NOUN 30, X 1), labour (NOUN 18, VERB 4), yezh (NOUN 17, ADJ 1), stourm (NOUN 5, VERB 4), dro (NOUN 3, VERB 1), en-dro (ADV 3, NOUN 3, X 1), degemer (NOUN 2, VERB 1), deiz (NOUN 2, ADV 1), dibab (NOUN 2, VERB 2), gont (NOUN 2, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.269042 (the average of all parts of speech is 1.406011).

The 1st highest number of forms (6) was observed with the lemma “den”: den, dud, nen, tud, zen, zud.

The 2nd highest number of forms (5) was observed with the lemma “kevredigezh”: c’hevredigezhioù, c’hevredigezhioù, gevredigezh, kevredigezh, kevredigezhioù.

The 3rd highest number of forms (5) was observed with the lemma “kinnig”: c’hinnig, c’hinnigoù, c’hinnig, kinnig, kinnigoù.

NOUN occurs with 2 features: Gender (1990; 100% instances), Number (1960; 98% instances)

NOUN occurs with 4 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

NOUN occurs with 5 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing (978 tokens). Examples: levr, den, ti, brezhoneg, labour, istor, anv, bloaz, perzh, bed

Relations

NOUN nodes are attached to their parents using 25 different relations: obl (421; 21% instances), nsubj (373; 19% instances), obj (345; 17% instances), nmod:gen (305; 15% instances), nmod (153; 8% instances), conj (124; 6% instances), root (121; 6% instances), obl:agent (47; 2% instances), appos (27; 1% instances), dep (27; 1% instances), compound (17; 1% instances), parataxis (8; 0% instances), xcomp (4; 0% instances), acl (2; 0% instances), dislocated (2; 0% instances), flat:name (2; 0% instances), nmod:poss (2; 0% instances), nsubj:appos (2; 0% instances), vocative (2; 0% instances), advcl (1; 0% instances), list (1; 0% instances), nsubj:cop (1; 0% instances), nummod (1; 0% instances), obl:x (1; 0% instances), orphan (1; 0% instances)

Parents of NOUN nodes belong to 9 different parts of speech: VERB (1085; 55% instances), NOUN (627; 32% instances), (121; 6% instances), ADJ (78; 4% instances), NUM (27; 1% instances), PROPN (20; 1% instances), PRON (18; 1% instances), ADV (13; 1% instances), CCONJ (1; 0% instances)

181 (9%) NOUN nodes are leaves.

691 (35%) NOUN nodes have one child.

640 (32%) NOUN nodes have two children.

478 (24%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 11.

Children of NOUN nodes are attached using 32 different relations: det (1163; 31% instances), case (664; 18% instances), nmod:gen (368; 10% instances), amod (280; 7% instances), punct (233; 6% instances), nmod (171; 5% instances), conj (123; 3% instances), advmod (117; 3% instances), acl (103; 3% instances), cc (95; 3% instances), nummod (93; 2% instances), cop (80; 2% instances), nsubj (56; 1% instances), aux (40; 1% instances), dep (33; 1% instances), appos (30; 1% instances), obl (29; 1% instances), compound (19; 1% instances), advcl (14; 0% instances), mark (7; 0% instances), flat:name (6; 0% instances), parataxis (6; 0% instances), csubj (4; 0% instances), nmod:poss (4; 0% instances), aux:pass (2; 0% instances), list (2; 0% instances), orphan (2; 0% instances), acl:relcl (1; 0% instances), advmod:neg (1; 0% instances), discourse (1; 0% instances), obj (1; 0% instances), obl:agent (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: DET (1162; 31% instances), ADP (669; 18% instances), NOUN (627; 17% instances), ADJ (276; 7% instances), PUNCT (232; 6% instances), VERB (217; 6% instances), NUM (158; 4% instances), ADV (124; 3% instances), PROPN (113; 3% instances), CCONJ (94; 3% instances), PART (31; 1% instances), PRON (20; 1% instances), X (20; 1% instances), SCONJ (4; 0% instances), SYM (2; 0% instances)