home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kyrgyz-KTMU: POS Tags: NUM

There are 123 NUM lemmas (5%), 131 NUM types (4%) and 420 NUM tokens (6%). Out of 13 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 5 in number of tokens.

The 10 most frequent NUM lemmas: млрд, миң, бир, эки, 4, 1, 5, млн, ,5, 3

The 10 most frequent NUM types: млрд, миң, бир, эки, 4, 1, 5, млн, ,5, 3

The 10 most frequent ambiguous lemmas: млрд (NUM 36, NOUN 4), миң (NUM 28, NOUN 2), бир (NUM 14, ADJ 2, NOUN 1), 1 (NUM 12, NOUN 1), миллион (NUM 2, NOUN 1)

The 10 most frequent ambiguous types: млрд (NUM 36, NOUN 4), 1 (NUM 12, NOUN 1), миңден (NUM 5, NOUN 2), миллион (NUM 2, NOUN 1)

Morphology

The form / lemma ratio of NUM is 1.065041 (the average of all parts of speech is 1.500863).

The 1st highest number of forms (3) was observed with the lemma “эки”: эки, экинчи, экөө.

The 2nd highest number of forms (3) was observed with the lemma “үч”: үч, үчүнчү, үчөө.

The 3rd highest number of forms (2) was observed with the lemma “200”: 200, 200гө.

NUM occurs with 5 features: NumType (414; 99% instances), Case (11; 3% instances), Number (6; 1% instances), Person (3; 1% instances), PronType (3; 1% instances)

NUM occurs with 9 feature-value pairs: Case=Abl, Case=Dat, Case=Nom, NumType=Card, NumType=Ord, Number=Sing, Person=3, PronType=Ind, PronType=Prs

NUM occurs with 7 feature combinations. The most frequent feature combination is NumType=Card (377 tokens). Examples: млрд, миң, бир, эки, 4, 1, 5, млн, ,5, 13

Relations

NUM nodes are attached to their parents using 5 different relations: nummod (265; 63% instances), compound (138; 33% instances), nmod (11; 3% instances), nsubj (5; 1% instances), fixed (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (250; 60% instances), NUM (131; 31% instances), VERB (18; 4% instances), ADJ (13; 3% instances), ADV (5; 1% instances), PROPN (2; 0% instances), CCONJ (1; 0% instances)

247 (59%) NUM nodes are leaves.

157 (37%) NUM nodes have one child.

16 (4%) NUM nodes have two children.

The highest child degree of a NUM node is 2.

Children of NUM nodes are attached using 11 different relations: compound (132; 70% instances), punct (26; 14% instances), advmod (11; 6% instances), nmod (10; 5% instances), amod (2; 1% instances), cc (2; 1% instances), mark (2; 1% instances), acl (1; 1% instances), fixed (1; 1% instances), nsubj (1; 1% instances), nummod (1; 1% instances)

Children of NUM nodes belong to 9 different parts of speech: NUM (131; 69% instances), PUNCT (26; 14% instances), ADV (12; 6% instances), NOUN (7; 4% instances), ADJ (4; 2% instances), CCONJ (4; 2% instances), PROPN (2; 1% instances), VERB (2; 1% instances), PRON (1; 1% instances)