
This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: X

There are 46 X lemmas (2%), 49 X types (2%) and 152 X tokens (2%). Out of 17 observed tags, the rank of X is: 8 in number of lemmas, 10 in number of types and 13 in number of tokens.
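Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names `pos_counts` and `rank` are hypothetical, forms are compared case-sensitively, and ties in rank are broken arbitrarily, all of which are assumptions.

```python
from collections import defaultdict

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}

def rank(counts, upos, index):
    """1-based rank of `upos` by one statistic (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(counts, key=lambda p: counts[p][index], reverse=True)
    return ordered.index(upos) + 1
```

Calling `pos_counts` on the concatenated treebank files and then `rank(counts, "X", 2)` would give the token rank of X under these assumptions.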

The 10 most frequent X lemmas: em, ar, miz, da, ur, ma, meur, Ofis, hag, pezh

The 10 most frequent X types: em, ar, ur, viz, ma, da, meur, miz, Ofis, hag

The 10 most frequent ambiguous lemmas: miz (X 15, NOUN 6), da (ADP 246, X 10, DET 3), ma (DET 27, SCONJ 17, X 8), Ofis (X 5, PROPN 1), pezh (X 5, NOUN 1), an (DET 863, X 2), mont (VERB 36, X 3), pep (DET 5, X 3), a (AUX 303, ADP 57, DET 6, PRON 5, X 2), kalz (ADP 6, ADV 2, X 2)

The 10 most frequent ambiguous types: ar (DET 382, X 13), ur (DET 62, X 9), ma (DET 14, SCONJ 8, X 8), da (ADP 116, X 6, DET 3), miz (X 6, NOUN 2), Ofis (X 5, PROPN 1), hag (CCONJ 53, X 5), an (DET 256, X 4), d’ (ADP 61, X 4), mont (VERB 4, X 3)
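A lemma or type is ambiguous here when it occurs with X and with at least one other tag. The sketch below shows one way to collect such items from CoNLL-U text. The function name `ambiguous_items` is hypothetical, and sorting by the count under the chosen tag is an assumption about the ordering, which the published lemma list does not follow exactly.

```python
from collections import Counter, defaultdict

def ambiguous_items(conllu_text, upos="X", column=2):
    """List (item, Counter of tags) for items seen with `upos` and another tag.

    `column` selects the CoNLL-U field: 2 for lemmas, 1 for word forms (types).
    """
    tags_by_item = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        tags_by_item[cols[column]][cols[3]] += 1
    ambiguous = [
        (item, tags) for item, tags in tags_by_item.items()
        if upos in tags and len(tags) > 1
    ]
    # Assumed ordering: most frequent under `upos` first.
    return sorted(ambiguous, key=lambda pair: pair[1][upos], reverse=True)
```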

Morphology

The form / lemma ratio of X is 1.065217 (the average of all parts of speech is 1.395336).
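The reported ratio for X is consistent with dividing the number of X types by the number of X lemmas given above (49 and 46). A short check, with the hypothetical helper name `form_lemma_ratio`:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas

# 49 X types and 46 X lemmas, as reported on this page.
print(f"{form_lemma_ratio(49, 46):.6f}")  # prints 1.065217
```

A ratio this close to 1 means that almost every X lemma appears in a single form, which fits the three two-form lemmas listed below.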

The 1st highest number of forms (2) was observed with the lemma “da”: d’, da.

The 2nd highest number of forms (2) was observed with the lemma “d’an”: d’an, d’an.

The 3rd highest number of forms (2) was observed with the lemma “miz”: miz, viz.

X does not occur with any features.

Relations

X nodes are attached to their parents using 4 different relations: dep (92; 61% of instances), fixed (58; 38% of instances), det (1; 1% of instances), xcomp (1; 1% of instances)

Parents of X nodes belong to 11 different parts of speech: AUX (34; 22% of instances), ADV (23; 15% of instances), NOUN (21; 14% of instances), PROPN (18; 12% of instances), PRON (17; 11% of instances), ADP (15; 10% of instances), SCONJ (10; 7% of instances), DET (6; 4% of instances), NUM (6; 4% of instances), CCONJ (1; 1% of instances), VERB (1; 1% of instances)
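The relation and parent distributions above can be sketched as a single pass over the CoNLL-U data, reading the DEPREL column of each X node and the UPOS of the node its HEAD column points to. The function names `attachment_stats` and `percentages` are hypothetical, and rounding to whole percentages is an assumption inferred from the figures shown, which is why they can sum to slightly more than 100.

```python
from collections import Counter

def attachment_stats(conllu_text, upos="X"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    sentence = {}  # token ID -> column list for the current sentence

    # A trailing empty string forces the last sentence to be processed.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue
        if line.strip():
            cols = line.split("\t")
            if len(cols) == 10 and cols[0].isdigit():
                sentence[cols[0]] = cols
            continue
        # Blank line: the sentence is complete.
        for cols in sentence.values():
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            parent = sentence.get(cols[6])  # None when HEAD is 0 (root)
            if parent is not None:
                parent_tags[parent[3]] += 1
        sentence = {}
    return relations, parent_tags

def percentages(counter):
    """Map each key to its share of the total, rounded to a whole percent."""
    total = sum(counter.values())
    if total == 0:
        return {}
    return {key: round(100 * n / total) for key, n in counter.items()}
```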

151 (99%) X nodes are leaves.

1 (1%) X node has one child.

The highest child degree of an X node is 1.

Children of X nodes are attached using 1 relation: fixed (1; 100% of instances)

Children of X nodes belong to 1 part of speech: ADP (1; 100% of instances)
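The leaf and child-degree figures above amount to counting, for each X node, how many tokens in the same sentence name it as their head. A minimal sketch under that reading, with the hypothetical function name `child_degrees`:

```python
from collections import Counter

def child_degrees(conllu_text, upos="X"):
    """Return a Counter mapping number of children -> number of `upos` nodes."""
    degrees = Counter()
    sentence = []  # column lists for the current sentence

    # A trailing empty string forces the last sentence to be processed.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue
        if line.strip():
            cols = line.split("\t")
            if len(cols) == 10 and cols[0].isdigit():
                sentence.append(cols)
            continue
        # Blank line: count how often each token ID is used as a HEAD.
        children_of = Counter(cols[6] for cols in sentence)
        for cols in sentence:
            if cols[3] == upos:
                degrees[children_of[cols[0]]] += 1
        sentence = []
    return degrees
```

In the result, `degrees[0]` is the number of leaves and `max(degrees)` is the highest child degree.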