home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hausa-SouthernAutogramm: POS Tags: NUM

There are 23 NUM lemmas (2%), 24 NUM types (1%) and 95 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 13 in number of types and 14 in number of tokens.

The 10 most frequent NUM lemmas: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, àshìr̃in, huɗu

The 10 most frequent NUM types: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, àshìr̃in, huɗu

The 10 most frequent ambiguous lemmas: shâː (NUM 3, ADP 1, VERB 1)

The 10 most frequent ambiguous types: shâː (VERB 4, NUM 3, ADP 1)

Morphology

The form / lemma ratio of NUM is 1.043478 (the average of all parts of speech is 1.352436).

The 1st highest number of forms (2) was observed with the lemma “goːmà”: goːmà, goːmànkà.

The 2nd highest number of forms (1) was observed with the lemma “bakwài”: bakwài.

The 3rd highest number of forms (1) was observed with the lemma “biyu”: biyu.

NUM occurs with 2 features: Definite (7; 7% instances), PronType (2; 2% instances)

NUM occurs with 2 feature-value pairs: Definite=Cons, PronType=Int

NUM occurs with 3 feature combinations. The most frequent feature combination is _ (86 tokens). Examples: biyu, goːmà, ɗaya, bakwài, bìyar̃, dubuː, gùdaː, àr̃bàʼin, huɗu, shâː

Relations

NUM nodes are attached to their parents using 11 different relations: nummod (47; 49% instances), flat (13; 14% instances), root (10; 11% instances), nmod (6; 6% instances), conj (4; 4% instances), obj (4; 4% instances), obl:mod (4; 4% instances), nsubj (3; 3% instances), advcl:cleft (2; 2% instances), flat:foreign (1; 1% instances), obl:arg (1; 1% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (53; 56% instances), NUM (20; 21% instances), (10; 11% instances), VERB (9; 9% instances), PRON (2; 2% instances), AUX (1; 1% instances)

67 (71%) NUM nodes are leaves.

9 (9%) NUM nodes have one child.

11 (12%) NUM nodes have two children.

8 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 16 different relations: flat (14; 23% instances), punct (12; 20% instances), discourse (9; 15% instances), advmod (5; 8% instances), nmod (5; 8% instances), conj (4; 7% instances), case (2; 3% instances), dislocated (2; 3% instances), advcl (1; 2% instances), advcl:cleft (1; 2% instances), aux (1; 2% instances), cc (1; 2% instances), cop (1; 2% instances), flat:foreign (1; 2% instances), nsubj (1; 2% instances), parataxis (1; 2% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (20; 33% instances), PUNCT (12; 20% instances), NOUN (8; 13% instances), PART (6; 10% instances), ADV (5; 8% instances), ADP (3; 5% instances), AUX (2; 3% instances), INTJ (2; 3% instances), VERB (2; 3% instances), CCONJ (1; 2% instances)