home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Breton-KEB: POS Tags: X

There are 49 X lemmas (3%), 51 X types (2%) and 153 X tokens (2%). Out of 16 observed tags, the rank of X is: 8 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent X lemmas: en, ar, miz, meur, Ofis, evit, pezh, an, da, d’an

The 10 most frequent X types: en, ar, viz, meur, miz, Ofis, evit, pezh, an, da

The 10 most frequent ambiguous lemmas: en (X 36, ADP 1), miz (X 15, NOUN 6), Ofis (X 5, PROPN 1), evit (ADP 73, X 5), pezh (X 5, NOUN 1), an (DET 863, X 2), da (ADP 253, DET 3, X 3), kalz (ADP 5, X 3, ADV 2), koulz (NOUN 3, X 3), mont (VERB 36, X 3)

The 10 most frequent ambiguous types: en (ADP 43, X 26), ar (DET 382, X 13), miz (X 6, NOUN 2), Ofis (X 5, PROPN 1), evit (ADP 60, X 5), an (DET 256, X 4), da (ADP 119, DET 3, X 3), kalz (ADP 4, ADV 1, X 1), mont (VERB 4, X 3), pep (DET 4, X 3)

Morphology

The form / lemma ratio of X is 1.040816 (the average of all parts of speech is 1.406011).

The 1st highest number of forms (2) was observed with the lemma “d’an”: d’an, d’an.

The 2nd highest number of forms (2) was observed with the lemma “miz”: miz, viz.

The 3rd highest number of forms (1) was observed with the lemma “150”: 150.

X does not occur with any features.

Relations

X nodes are attached to their parents using 2 different relations: dep (152; 99% instances), xcomp (1; 1% instances)

Parents of X nodes belong to 11 different parts of speech: PART (34; 22% instances), ADV (23; 15% instances), NOUN (20; 13% instances), PROPN (18; 12% instances), PRON (17; 11% instances), ADP (16; 10% instances), SCONJ (10; 7% instances), NUM (7; 5% instances), DET (6; 4% instances), CCONJ (1; 1% instances), VERB (1; 1% instances)

153 (100%) X nodes are leaves.

The highest child degree of a X node is 0.