Treebank Statistics: UD_Zaar-Autogramm: POS Tags: X
There are 81 X lemmas (5%), 85 X types (3%) and 232 X tokens (1%).
Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 9 in number of types and 14 in number of tokens.
The 10 most frequent X lemmas: ʃèː, XX, x, ba, shi, a, nan, ɗaya, ya, ga
The 10 most frequent X types: ʃèː, XX, ba, shi, a, nan, ɗaya, X, ga, gaba
The 10 most frequent ambiguous lemmas: XX (X 16, ADV 2, INTJ 1, VERB 1), ba (X 7, PART 1), shi (X 6, PRON 1), a (AUX 550, X 5, ADP 2), nan (X 5, ADV 3), hár (ADP 19, SCONJ 14, ADV 7, CCONJ 5, X 3), wannan (X 3, PRON 2), wàːtòː (PART 9, X 3, INTJ 1), yànzú (X 2, ADV 1), da (X 2, ADP 1)
The 10 most frequent ambiguous types: XX (X 17, ADV 2, INTJ 1, VERB 1), shi (X 6, PRON 1), a (X 5, ADP 1), nan (X 5, ADV 3), ga (X 3, AUX 1), hár (ADP 18, SCONJ 14, ADV 7, CCONJ 5, X 3), wannan (X 3, PRON 1), wàːtòː (PART 9, X 3, INTJ 1), yànzú (X 3, ADV 1), da (X 2, ADP 1)
- XX
- shi
- a
- nan
- ga
- hár
- ADP 18: Tʃôɣn yáː yâddéy kàm < hár wò mán ʃiː wò naː ɗàrí nandam //
- SCONJ 14: tôː ká rîːp //= ká riːp tíː hár fî ɗan gyópti ʧǐː //
- ADV 7: tôː < bàː tá wâː ɗan hár wò || wò ʧi ʃí ɣá raː sòːséy hŋ́ //
- CCONJ 5: m̀ː gíː mbə́ɬəŋ hár dzàŋ gə̀ mâːy ɣəndá kə ɬyá //
- X 3: tòː < tá fî tə naː íri ɮə̀pmgə̀n gíː hár yànzú //
- wannan
- wàːtòː
- yànzú
- da
Morphology
The form / lemma ratio of X is 1.049383 (the average of all parts of speech is 1.692524).
The 1st highest number of forms (6) was observed with the lemma “X”: X, ki, kú~, tə́, wace, ƙasa.
The 2nd highest number of forms (2) was observed with the lemma “ya”: ya, yáː.
The 3rd highest number of forms (1) was observed with the lemma “XX”: XX.
X occurs with 1 features: Foreign (131; 56% instances)
X occurs with 1 feature-value pairs: Foreign=Yes
X occurs with 2 feature combinations.
The most frequent feature combination is Foreign=Yes (131 tokens).
Examples: ba, shi, a, nan, ɗaya, ga, gaba, hau, hár, ina
Relations
X nodes are attached to their parents using 14 different relations: dep (86; 37% instances), flat:foreign (75; 32% instances), root (22; 9% instances), discourse (16; 7% instances), obl (8; 3% instances), obj (7; 3% instances), reparandum (5; 2% instances), nmod (4; 2% instances), parataxis (3; 1% instances), conj (2; 1% instances), advcl (1; 0% instances), compound:redup (1; 0% instances), dislocated (1; 0% instances), vocative (1; 0% instances)
Parents of X nodes belong to 11 different parts of speech: VERB (89; 38% instances), X (80; 34% instances), (22; 9% instances), NOUN (11; 5% instances), PRON (7; 3% instances), PROPN (7; 3% instances), SCONJ (7; 3% instances), ADV (3; 1% instances), INTJ (3; 1% instances), AUX (2; 1% instances), NUM (1; 0% instances)
159 (69%) X nodes are leaves.
33 (14%) X nodes have one child.
19 (8%) X nodes have two children.
21 (9%) X nodes have three or more children.
The highest child degree of a X node is 7.
Children of X nodes are attached using 14 different relations: flat:foreign (71; 46% instances), punct (42; 27% instances), discourse (13; 8% instances), dep (8; 5% instances), parataxis (5; 3% instances), case (3; 2% instances), nmod (3; 2% instances), reparandum (3; 2% instances), acl (2; 1% instances), aux (2; 1% instances), advmod (1; 1% instances), cc (1; 1% instances), compound:redup (1; 1% instances), det (1; 1% instances)
Children of X nodes belong to 11 different parts of speech: X (80; 51% instances), PUNCT (42; 27% instances), PART (10; 6% instances), NOUN (5; 3% instances), SCONJ (5; 3% instances), VERB (4; 3% instances), PROPN (3; 2% instances), AUX (2; 1% instances), CCONJ (2; 1% instances), INTJ (2; 1% instances), DET (1; 1% instances)