This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X

There are 82 X lemmas (5%), 86 X types (3%) and 233 X tokens (1%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 9 in number of types and 14 in number of tokens.
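Counts of this kind can be reproduced from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names `read_tokens` and `tag_counts` are hypothetical, the sample rows are toy data, and it assumes that a "type" is a distinct word form compared as written.

```python
def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular word line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def tag_counts(conllu_text, tag):
    """Return (number of lemmas, number of types, number of tokens) for a UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for form, lemma, upos in read_tokens(conllu_text):
        if upos == tag:
            lemmas.add(lemma)
            forms.add(form)
            tokens += 1
    return len(lemmas), len(forms), tokens


# Toy sample (illustrative only, not taken from the treebank files).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "ya", "ya", "X", "_", "Foreign=Yes", "0", "root", "_", "_"),
    ("2", "yáː", "ya", "X", "_", "_", "1", "flat:foreign", "_", "_"),
    ("3", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
])

print(tag_counts(SAMPLE, "X"))  # (1, 2, 2)
```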

The 10 most frequent X lemmas: ʃèː, XX, x, ba, shi, a, nan, ɗaya, ya, ga

The 10 most frequent X types: ʃèː, XX, ba, shi, a, nan, ɗaya, X, ga, gaba

The 10 most frequent ambiguous lemmas: XX (X 16, ADV 2, INTJ 1, VERB 1), ba (X 7, PART 1), a (X 5, ADP 2), nan (X 5, ADV 3), hár (ADP 20, SCONJ 14, ADV 6, CCONJ 5, X 3), wannan (X 3, PRON 2), wàːtòː (PART 7, X 3, SCONJ 2, INTJ 1), yànzú (X 2, ADV 1), da (X 2, ADP 1), dàː (ADV 9, NOUN 1, X 1)

The 10 most frequent ambiguous types: XX (X 17, ADV 2, INTJ 1, VERB 1), a (X 5, ADP 1), nan (X 5, ADV 3), ga (X 3, AUX 1), hár (ADP 19, SCONJ 14, ADV 6, CCONJ 5, X 3), wannan (X 3, PRON 1), wàːtòː (PART 7, X 3, SCONJ 2, INTJ 1), yànzú (X 3, ADV 1), da (X 2, ADP 1), ka (AUX 90, X 2)

Morphology

The form / lemma ratio of X is 1.048780 (the average of all parts of speech is 1.729120).
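The reported ratio is consistent with dividing the number of X types (86) by the number of X lemmas (82), both given above. A short check, assuming that is how the figure is derived (the function name `form_lemma_ratio` is hypothetical):

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Ratio of distinct word forms to distinct lemmas for one tag."""
    return n_types / n_lemmas


print(f"{form_lemma_ratio(86, 82):.6f}")  # 1.048780
```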

The highest number of forms (6) was observed with the lemma “X”: X, ki, kú~, tə́, wace, ƙasa.

The second highest number of forms (2) was observed with the lemma “ya”: ya, yáː.

The third highest number of forms (1) was observed with the lemma “XX”: XX.

X occurs with 2 features: Foreign (132; 57% instances), ExtPos (4; 2% instances)

X occurs with 2 feature-value pairs: ExtPos=ADV, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (128 tokens). Examples: ba, shi, a, nan, ɗaya, ga, hau, ina, ne, sa

Relations

X nodes are attached to their parents using 16 different relations: dep (85; 36% instances), flat:foreign (70; 30% instances), root (22; 9% instances), discourse (16; 7% instances), obl (8; 3% instances), obj (7; 3% instances), fixed (6; 3% instances), reparandum (5; 2% instances), advmod (4; 2% instances), parataxis (3; 1% instances), conj (2; 1% instances), advcl (1; 0% instances), compound:redup (1; 0% instances), dislocated (1; 0% instances), nmod (1; 0% instances), vocative (1; 0% instances)
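The percentages in this list appear to be each relation's count divided by the 233 X tokens, rounded to the nearest whole percent. The sketch below rebuilds the line from the counts given above under that assumption; `summarize` is a hypothetical helper, and ordering ties alphabetically is inferred from the order shown here.

```python
from collections import Counter

# Relation counts for X nodes, copied from the list above.
X_RELATIONS = Counter({
    "dep": 85, "flat:foreign": 70, "root": 22, "discourse": 16,
    "obl": 8, "obj": 7, "fixed": 6, "reparandum": 5, "advmod": 4,
    "parataxis": 3, "conj": 2, "advcl": 1, "compound:redup": 1,
    "dislocated": 1, "nmod": 1, "vocative": 1,
})


def summarize(counts):
    """Format counts as 'name (count; pct% instances)', most frequent first."""
    total = sum(counts.values())
    # Sort by descending count, then alphabetically to break ties.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ", ".join(
        f"{name} ({n}; {round(100 * n / total)}% instances)" for name, n in ordered
    )


print(summarize(X_RELATIONS))
```

Applied to a parsed treebank, the same function could take a `Counter` built from the DEPREL column of all tokens tagged X.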

Parents of X nodes belong to 11 different parts of speech: VERB (90; 39% instances), X (80; 34% instances), [no parent: root] (22; 9% instances), NOUN (11; 5% instances), PRON (7; 3% instances), PROPN (7; 3% instances), SCONJ (7; 3% instances), ADV (3; 1% instances), INTJ (3; 1% instances), AUX (2; 1% instances), NUM (1; 0% instances)

160 (69%) X nodes are leaves.

33 (14%) X nodes have one child.

19 (8%) X nodes have two children.

21 (9%) X nodes have three or more children.

The highest child degree of an X node is 7.
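The child-degree figures above can be derived from the HEAD column of each sentence: a node's degree is the number of tokens that name it as their head. A minimal sketch, assuming each sentence is already available as a list of `(token_id, head_id, upos)` tuples; the function names and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degrees(sentence, tag):
    """Return the number of children of every node in the sentence tagged `tag`."""
    # Count how many tokens point at each head id.
    n_children = Counter(head for _, head, _ in sentence)
    return [n_children[tid] for tid, _, upos in sentence if upos == tag]


def bucket(degrees):
    """Group degrees into the four classes used on this page."""
    names = {0: "leaf", 1: "one child", 2: "two children"}
    return dict(Counter(names.get(d, "three or more children") for d in degrees))


# Toy tree (illustrative only): token 1 is the root and heads tokens 2, 3 and 4.
TOY = [(1, 0, "X"), (2, 1, "X"), (3, 1, "X"), (4, 1, "PUNCT")]

degrees = child_degrees(TOY, "X")
print(degrees)          # [3, 0, 0]
print(bucket(degrees))  # {'three or more children': 1, 'leaf': 2}
print(max(degrees))     # 3
```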

Children of X nodes are attached using 16 different relations: flat:foreign (66; 42% instances), punct (42; 27% instances), discourse (11; 7% instances), dep (6; 4% instances), fixed (6; 4% instances), case (5; 3% instances), parataxis (5; 3% instances), nmod (3; 2% instances), reparandum (3; 2% instances), acl (2; 1% instances), aux (2; 1% instances), advmod (1; 1% instances), cc (1; 1% instances), compound:redup (1; 1% instances), det (1; 1% instances), mark (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: X (80; 51% instances), PUNCT (42; 27% instances), SCONJ (12; 8% instances), NOUN (5; 3% instances), VERB (4; 3% instances), PART (3; 2% instances), PROPN (3; 2% instances), AUX (2; 1% instances), CCONJ (2; 1% instances), INTJ (2; 1% instances), DET (1; 1% instances)