Treebank Statistics: UD_Kurmanji-MG: POS Tags: NUM
There are 117 NUM lemmas (6%), 151 NUM types (5%) and 218 NUM tokens (2%).
Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.
The 10 most frequent NUM lemmas: yek, du, 4, 15, hezar, pênc, sed, sê, 1980, çar
The 10 most frequent NUM types: du, yek, yekê, sê, 4, pênc, hezar, sed, siseyan, yekem
The 10 most frequent ambiguous lemmas: yek (NUM 22, NOUN 1)
The 10 most frequent ambiguous types: yekê (NUM 10, NOUN 1)
Morphology
The form / lemma ratio of NUM is 1.290598 (the average of all parts of speech is 1.511556).
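The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given at the top of this page. A minimal sketch of that arithmetic, assuming this is how the figure is derived (the variable names are illustrative only):

```python
# Counts reported on this page for NUM in UD_Kurmanji-MG.
num_types = 151   # distinct word forms tagged NUM
num_lemmas = 117  # distinct lemmas tagged NUM

# Assumption: form / lemma ratio = types / lemmas.
form_lemma_ratio = num_types / num_lemmas
print(f"{form_lemma_ratio:.6f}")
```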
The 1st highest number of forms (4) was observed with the lemma “15”: 15, 15’ê, 15em, 15ê.
The 2nd highest number of forms (4) was observed with the lemma “1980”: 1980, 1980’an, 1980’î, 1980an.
The 3rd highest number of forms (3) was observed with the lemma “1970”: 1970, 1970yî, 1970’î.
NUM occurs with 4 features: NumType (218; 100% instances), Number (109; 50% instances), Case (97; 44% instances), Definite (60; 28% instances)
NUM occurs with 6 feature-value pairs: Case=Acc, Case=Nom, Definite=Def, NumType=Card, Number=Plur, Number=Sing
NUM occurs with 7 feature combinations.
The most frequent feature combination is NumType=Card (109 tokens).
Examples: du, yek, yekê, 4, siseyan, yekem, sê, 1, 10, 15’ê
Relations
NUM nodes are attached to their parents using 11 different relations: nummod (77; 35% instances), nmod:poss (56; 26% instances), nmod (30; 14% instances), conj (20; 9% instances), flat (15; 7% instances), root (13; 6% instances), nsubj (3; 1% instances), compound (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)
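The percentages can be reproduced from the raw counts: the 11 relation counts sum to the 218 NUM tokens, and each share is the count divided by that total, rounded to a whole percent. The sketch below assumes simple rounding to the nearest integer, which is why relations with a single instance show as 0%:

```python
# Relation counts for NUM nodes, copied from the list above.
relation_counts = {
    "nummod": 77, "nmod:poss": 56, "nmod": 30, "conj": 20, "flat": 15,
    "root": 13, "nsubj": 3, "compound": 1, "obj": 1, "orphan": 1,
    "parataxis": 1,
}

total = sum(relation_counts.values())  # should equal the NUM token count

# Assumption: shares are rounded to the nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```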
Parents of NUM nodes belong to 9 different parts of speech: NOUN (132; 61% instances), NUM (36; 17% instances), VERB (20; 9% instances), [no parent: root] (13; 6% instances), ADJ (10; 5% instances), PROPN (3; 1% instances), ADP (2; 1% instances), AUX (1; 0% instances), SYM (1; 0% instances)
127 (58%) NUM nodes are leaves.
45 (21%) NUM nodes have one child.
13 (6%) NUM nodes have two children.
33 (15%) NUM nodes have three or more children.
The highest child degree of a NUM node is 7.
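The child-degree figures above group NUM nodes by how many dependents each one has. The following is a rough sketch of such a count, not the tool that produced this page: `child_degree_buckets` is a hypothetical helper, and the toy sentence is invented for illustration. Tokens are assumed to be `(id, head, upos)` tuples with CoNLL-U style numbering, where head 0 marks the root.

```python
from collections import Counter

def child_degree_buckets(tokens, upos="NUM"):
    """Bucket nodes of the given tag by their number of children.

    tokens: list of (id, head, upos) tuples for one sentence.
    """
    # Number of children per node id = how often that id appears as a head.
    n_children = Counter(head for _, head, _ in tokens)
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    buckets = Counter()
    for tok_id, _, tag in tokens:
        if tag == upos:
            buckets[labels.get(n_children[tok_id], "three or more")] += 1
    return buckets

# Toy sentence (made up, not from the treebank).
toy = [
    (1, 2, "NUM"),   # no dependents: a leaf
    (2, 0, "NOUN"),  # root of the sentence
    (3, 2, "NUM"),   # governs token 4: one child
    (4, 3, "ADP"),
]
buckets = child_degree_buckets(toy)
print(dict(buckets))
```

Summing such buckets over all sentences of a treebank would give counts comparable to the four lines above, which together account for all 218 NUM tokens.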
Children of NUM nodes are attached using 14 different relations: punct (41; 21% instances), case (39; 20% instances), flat (28; 14% instances), conj (23; 12% instances), cop (16; 8% instances), nmod (16; 8% instances), nsubj (14; 7% instances), det (10; 5% instances), advmod (3; 2% instances), cc (3; 2% instances), nmod:poss (3; 2% instances), advcl (1; 1% instances), compound (1; 1% instances), parataxis (1; 1% instances)
Children of NUM nodes belong to 12 different parts of speech: PUNCT (41; 21% instances), NOUN (40; 20% instances), ADP (39; 20% instances), NUM (36; 18% instances), AUX (16; 8% instances), DET (10; 5% instances), PRON (4; 2% instances), VERB (4; 2% instances), CCONJ (3; 2% instances), PART (3; 2% instances), ADJ (2; 1% instances), PROPN (1; 1% instances)