Treebank Statistics: UD_Occitan-TTB: POS Tags: X
There are 16 X lemmas (0%), 18 X types (0%) and 19 X tokens (0%).
Out of 16 observed tags, the rank of X is: 14 in number of lemmas, 14 in number of types and 16 in number of tokens.
The 10 most frequent X lemmas: z, God, USA, birds, club, du, editor, hoc, la, little
The 10 most frequent X types: z’, -s, -z-, Club, Editor, God, La, Peuple, Three, USA
The 10 most frequent ambiguous lemmas: club (NOUN 1, X 1), editor (NOUN 1, X 1), la (ADV 4, X 1)
The 10 most frequent ambiguous types: La (DET 90, PRON 9, X 1)
- La
Morphology
The form / lemma ratio of X is 1.125000 (the average of all parts of speech is 1.368971).
The 1st highest number of forms (3) was observed with the lemma “z”: -z-, z’, z-.
The 2nd highest number of forms (1) was observed with the lemma “God”: God.
The 3rd highest number of forms (1) was observed with the lemma “USA”: USA.
X occurs with 1 features: ExtPos (3; 16% instances)
X occurs with 1 feature-value pairs: ExtPos=PRON
X occurs with 2 feature combinations.
The most frequent feature combination is _ (16 tokens).
Examples: -s, Club, Editor, God, La, Peuple, Three, USA, Voix, birds
Relations
X nodes are attached to their parents using 7 different relations: flat (9; 47% instances), obj (3; 16% instances), fixed (2; 11% instances), parataxis (2; 11% instances), appos (1; 5% instances), nmod (1; 5% instances), root (1; 5% instances)
Parents of X nodes belong to 5 different parts of speech: X (9; 47% instances), VERB (4; 21% instances), NOUN (3; 16% instances), ADP (2; 11% instances), (1; 5% instances)
8 (42%) X nodes are leaves.
5 (26%) X nodes have one child.
3 (16%) X nodes have two children.
3 (16%) X nodes have three or more children.
The highest child degree of a X node is 3.
Children of X nodes are attached using 5 different relations: flat (9; 45% instances), punct (6; 30% instances), fixed (3; 15% instances), case (1; 5% instances), parataxis (1; 5% instances)
Children of X nodes belong to 5 different parts of speech: X (9; 45% instances), PUNCT (6; 30% instances), PRON (3; 15% instances), ADP (1; 5% instances), PROPN (1; 5% instances)