This page pertains to UD version 2.

Treebank Statistics: UD_Kurmanji-MG: POS Tags: NUM

There are 117 NUM lemmas (6%), 151 NUM types (5%) and 218 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: yek, du, 4, 15, hezar, pênc, sed, sê, 1980, çar

The 10 most frequent NUM types: du, yek, yekê, sê, 4, pênc, hezar, sed, siseyan, yekem

Only one NUM lemma is ambiguous: yek (NUM 22, NOUN 1)

Only one NUM type is ambiguous: yekê (NUM 10, NOUN 1)

Morphology

The form / lemma ratio of NUM is 1.290598 (the average of all parts of speech is 1.511556).
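The ratio above is consistent with dividing the number of distinct NUM types by the number of distinct NUM lemmas reported earlier on this page (151 and 117). A minimal check, assuming that this is how the ratio is defined (the variable names are illustrative only):

```python
# Counts reported on this page for NUM in UD_Kurmanji-MG.
num_types = 151   # distinct word forms tagged NUM
num_lemmas = 117  # distinct lemmas tagged NUM

# Assumption: form/lemma ratio = distinct forms / distinct lemmas.
form_lemma_ratio = num_types / num_lemmas
print(f"{form_lemma_ratio:.6f}")
```

A ratio below the all-POS average of 1.511556 suggests that numerals in this treebank inflect less than the average word class.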

The 1st highest number of forms (4) was observed with the lemma “15”: 15, 15’ê, 15em, 15ê.

The 2nd highest number of forms (4) was observed with the lemma “1980”: 1980, 1980’an, 1980’î, 1980an.

The 3rd highest number of forms (3) was observed with the lemma “1970”: 1970, 1970yî, 1970’î.

NUM occurs with 4 features: NumType (218; 100% instances), Number (109; 50% instances), Case (97; 44% instances), Definite (60; 28% instances)

NUM occurs with 6 feature-value pairs: Case=Acc, Case=Nom, Definite=Def, NumType=Card, Number=Plur, Number=Sing

NUM occurs with 7 feature combinations. The most frequent feature combination is NumType=Card (109 tokens). Examples: du, yek, yekê, 4, siseyan, yekem, sê, 1, 10, 15’ê

Relations

NUM nodes are attached to their parents using 11 different relations: nummod (77; 35% instances), nmod:poss (56; 26% instances), nmod (30; 14% instances), conj (20; 9% instances), flat (15; 7% instances), root (13; 6% instances), nsubj (3; 1% instances), compound (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)
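The percentages in the list above can be reproduced from the raw counts. The following sketch assumes that each percentage is the count divided by the total number of NUM tokens, rounded to the nearest integer; the helper name `percentages` is hypothetical.

```python
# Relation counts for NUM nodes, copied from the list above.
relation_counts = {
    "nummod": 77, "nmod:poss": 56, "nmod": 30, "conj": 20, "flat": 15,
    "root": 13, "nsubj": 3, "compound": 1, "obj": 1, "orphan": 1,
    "parataxis": 1,
}

def percentages(counts):
    """Return each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}

total_num_tokens = sum(relation_counts.values())
shares = percentages(relation_counts)
for name, n in relation_counts.items():
    print(f"{name} ({n}; {shares[name]}% instances)")
```

The counts sum to 218, the number of NUM tokens, so every NUM node is accounted for exactly once. Relations with a single instance round down to 0%.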

Parents of NUM nodes belong to 9 different parts of speech: NOUN (132; 61% instances), NUM (36; 17% instances), VERB (20; 9% instances), no part of speech, i.e. the artificial root (13; 6% instances), ADJ (10; 5% instances), PROPN (3; 1% instances), ADP (2; 1% instances), AUX (1; 0% instances), SYM (1; 0% instances)

127 (58%) NUM nodes are leaves.

45 (21%) NUM nodes have one child.

13 (6%) NUM nodes have two children.

33 (15%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.
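The child-degree figures above (leaves, one child, two children, three or more) could be derived from a CoNLL-U file along the following lines. This is only a sketch, not the script that generated this page: the function name `child_degree_histogram` is hypothetical, and the two-sentence sample is toy data with placeholder tokens, not taken from the treebank.

```python
from collections import Counter

def child_degree_histogram(conllu_text, upos="NUM"):
    """Count nodes with the given UPOS that have 0, 1, 2, or 3+ children."""
    histogram = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic words only: skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        # Column 7 (index 6) is HEAD; count how many dependents each ID has.
        children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:  # column 4 (index 3) is UPOS
                degree = children[r[0]]
                histogram["3+" if degree >= 3 else str(degree)] += 1
    return histogram

def row(*cols):
    """Build one tab-separated CoNLL-U line from 10 column values."""
    return "\t".join(cols)

# Toy data (assumption): placeholder tokens, not real treebank sentences.
sample = "\n".join([
    "# sent_id = toy-1",
    row("1", "du", "du", "NUM", "_", "_", "2", "nummod", "_", "_"),
    row("2", "w2", "w2", "NOUN", "_", "_", "0", "root", "_", "_"),
    row("3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
    "",
    "# sent_id = toy-2",
    row("1", "yek", "yek", "NUM", "_", "_", "0", "root", "_", "_"),
    row("2", "w2", "w2", "NOUN", "_", "_", "1", "nmod", "_", "_"),
    row("3", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
])

histogram = child_degree_histogram(sample)
print(dict(histogram))
```

In the toy sample, the first NUM node is a leaf and the second has two children, so the histogram has one entry under "0" and one under "2". On the real treebank the four buckets would be expected to sum to 218, as the figures above do (127 + 45 + 13 + 33).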

Children of NUM nodes are attached using 14 different relations: punct (41; 21% instances), case (39; 20% instances), flat (28; 14% instances), conj (23; 12% instances), cop (16; 8% instances), nmod (16; 8% instances), nsubj (14; 7% instances), det (10; 5% instances), advmod (3; 2% instances), cc (3; 2% instances), nmod:poss (3; 2% instances), advcl (1; 1% instances), compound (1; 1% instances), parataxis (1; 1% instances)

Children of NUM nodes belong to 12 different parts of speech: PUNCT (41; 21% instances), NOUN (40; 20% instances), ADP (39; 20% instances), NUM (36; 18% instances), AUX (16; 8% instances), DET (10; 5% instances), PRON (4; 2% instances), VERB (4; 2% instances), CCONJ (3; 2% instances), PART (3; 2% instances), ADJ (2; 1% instances), PROPN (1; 1% instances)