home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: NUM

There are 24 NUM lemmas (2%), 24 NUM types (1%) and 94 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 12 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, shâː, àshìr̃in

The 10 most frequent NUM types: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, shâː, àshìr̃in

The 10 most frequent ambiguous lemmas: dubuː (NUM 5, X 1), shâː (NUM 4, VERB 1), huɗu (NUM 2, X 1)

The 10 most frequent ambiguous types: dubuː (NUM 5, X 1), shâː (NUM 4, VERB 4), huɗu (NUM 2, X 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.303635).

The 1st highest number of forms (1) was observed with the lemma “bakwài”: bakwài.

The 2nd highest number of forms (1) was observed with the lemma “biyu”: biyu.

The 3rd highest number of forms (1) was observed with the lemma “bìyar̃”: bìyar̃.

NUM occurs with 3 features: Definite (20; 21% instances), ExtPos (1; 1% instances), PronType (1; 1% instances)

NUM occurs with 4 feature-value pairs: Definite=Cons, Definite=Def, ExtPos=NOUN, PronType=Int

NUM occurs with 5 feature combinations. The most frequent feature combination is _ (73 tokens). Examples: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, shâː, huɗu, takwàs

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (44; 47% instances), flat (14; 15% instances), nmod (8; 9% instances), root (7; 7% instances), obl (4; 4% instances), xcomp (4; 4% instances), conj (3; 3% instances), nsubj (3; 3% instances), obj (3; 3% instances), fixed (2; 2% instances), advcl:cleft (1; 1% instances), obl:arg (1; 1% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (53; 56% instances), NUM (19; 20% instances), VERB (9; 10% instances), (7; 7% instances), PART (5; 5% instances), AUX (1; 1% instances)

69 (73%) NUM nodes are leaves.

9 (10%) NUM nodes have one child.

10 (11%) NUM nodes have two children.

6 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 12 different relations: flat (14; 28% instances), punct (9; 18% instances), advmod (6; 12% instances), discourse (5; 10% instances), nmod (5; 10% instances), conj (4; 8% instances), case (2; 4% instances), advcl (1; 2% instances), cc (1; 2% instances), cop (1; 2% instances), dislocated (1; 2% instances), nsubj (1; 2% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (19; 38% instances), PUNCT (9; 18% instances), NOUN (6; 12% instances), ADV (5; 10% instances), PART (5; 10% instances), ADP (2; 4% instances), AUX (1; 2% instances), CCONJ (1; 2% instances), INTJ (1; 2% instances), X (1; 2% instances)