home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: X

There are 44 X lemmas (3%), 56 X types (3%) and 93 X tokens (1%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: x, lìllaːhì, àlhamdù, galalan, gar̃as, sùkûːl, tìr̃kaː, ƙut, Allàː, Riːmiː

The 10 most frequent X types: X, lìllaːhì, àlhamdù, galalan, k~, gar̃as, mut~, sùkûːl, tìr̃kaː, à~

The 10 most frequent ambiguous lemmas: lìllaːhì (X 5, INTJ 1), àlhamdù (X 4, INTJ 1), Allàː (PROPN 34, X 1), Riːmiː (PROPN 1, X 1), dubuː (NUM 5, X 1), huɗu (NUM 2, X 1), kilaːs (NOUN 1, X 1), police (NOUN 3, X 1), Ùngwan (PROPN 9, X 1)

The 10 most frequent ambiguous types: lìllaːhì (X 5, INTJ 1), àlhamdù (X 5, INTJ 1), Allàː (PROPN 33, X 1), Riːmiː (PROPN 1, X 1), dubuː (NUM 5, X 1), huɗu (NUM 2, X 1), kilaːs (NOUN 2, X 1), nàː (AUX 13, X 1), police (NOUN 3, X 1), saːkar̃ai (NOUN 1, X 1)

Morphology

The form / lemma ratio of X is 1.272727 (the average of all parts of speech is 1.303635).

The 1st highest number of forms (15) was observed with the lemma “X”: X, at~, ka~, kwa~, k~, mut~, měː, naː~, nàː, nə́m~, r̃~, saːkar̃ai, zː, à~, ƙò~.

The 2nd highest number of forms (1) was observed with the lemma “Allàː”: Allàː.

The 3rd highest number of forms (1) was observed with the lemma “Riːmiː”: Riːmiː.

X occurs with 2 features: Foreign (29; 31% instances), Definite (1; 1% instances)

X occurs with 2 feature-value pairs: Definite=Cons, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is _ (63 tokens). Examples: X, galalan, k~, gar̃as, mut~, tìr̃kaː, à~, ƙut, at~, a~

Relations

X nodes are attached to their parents using 14 different relations: reparandum (17; 18% instances), flat:foreign (16; 17% instances), dep (14; 15% instances), root (11; 12% instances), obj (8; 9% instances), conj (5; 5% instances), obl (5; 5% instances), obl:arg (5; 5% instances), xcomp (4; 4% instances), discourse (3; 3% instances), advcl:cleft (2; 2% instances), advcl (1; 1% instances), appos (1; 1% instances), nmod (1; 1% instances)

Parents of X nodes belong to 10 different parts of speech: VERB (27; 29% instances), X (21; 23% instances), NOUN (12; 13% instances), (11; 12% instances), AUX (10; 11% instances), PART (5; 5% instances), ADV (3; 3% instances), PROPN (2; 2% instances), NUM (1; 1% instances), PRON (1; 1% instances)

47 (51%) X nodes are leaves.

23 (25%) X nodes have one child.

16 (17%) X nodes have two children.

7 (8%) X nodes have three or more children.

The highest child degree of a X node is 6.

Children of X nodes are attached using 16 different relations: punct (25; 30% instances), flat:foreign (16; 20% instances), dep (6; 7% instances), discourse (6; 7% instances), case (5; 6% instances), conj (5; 6% instances), aux (4; 5% instances), cc (3; 4% instances), det (3; 4% instances), advmod (2; 2% instances), reparandum (2; 2% instances), advcl (1; 1% instances), advcl:cleft (1; 1% instances), amod (1; 1% instances), mark (1; 1% instances), xcomp (1; 1% instances)

Children of X nodes belong to 14 different parts of speech: PUNCT (25; 30% instances), X (21; 26% instances), AUX (6; 7% instances), ADP (5; 6% instances), DET (4; 5% instances), NOUN (4; 5% instances), SCONJ (4; 5% instances), CCONJ (3; 4% instances), INTJ (3; 4% instances), ADV (2; 2% instances), PART (2; 2% instances), ADJ (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)