Treebank Statistics: UD_Kyrgyz-KTMU: POS Tags: NUM
There are 123 NUM
lemmas (5%), 131 NUM
types (4%) and 420 NUM
tokens (6%).
Out of 13 observed tags, the rank of NUM
is: 5 in number of lemmas, 5 in number of types and 5 in number of tokens.
The 10 most frequent NUM
lemmas: млрд, миң, бир, эки, 4, 1, 5, млн, ,5, 3
The 10 most frequent NUM
types: млрд, миң, бир, эки, 4, 1, 5, млн, ,5, 3
The 10 most frequent ambiguous lemmas: млрд (NUM 36, NOUN 4), миң (NUM 28, NOUN 2), бир (NUM 14, ADJ 2, NOUN 1), 1 (NUM 12, NOUN 1), миллион (NUM 2, NOUN 1)
The 10 most frequent ambiguous types: млрд (NUM 36, NOUN 4), 1 (NUM 12, NOUN 1), миңден (NUM 5, NOUN 2), миллион (NUM 2, NOUN 1)
- млрд
- 1
- миңден
- миллион
Morphology
The form / lemma ratio of NUM
is 1.065041 (the average of all parts of speech is 1.500863).
The 1st highest number of forms (3) was observed with the lemma “эки”: эки, экинчи, экөө.
The 2nd highest number of forms (3) was observed with the lemma “үч”: үч, үчүнчү, үчөө.
The 3rd highest number of forms (2) was observed with the lemma “200”: 200, 200гө.
NUM
occurs with 5 features: NumType (414; 99% instances), Case (11; 3% instances), Number (6; 1% instances), Person (3; 1% instances), PronType (3; 1% instances)
NUM
occurs with 9 feature-value pairs: Case=Abl
, Case=Dat
, Case=Nom
, NumType=Card
, NumType=Ord
, Number=Sing
, Person=3
, PronType=Ind
, PronType=Prs
NUM
occurs with 7 feature combinations.
The most frequent feature combination is NumType=Card
(377 tokens).
Examples: млрд, миң, бир, эки, 4, 1, 5, млн, ,5, 13
Relations
NUM
nodes are attached to their parents using 5 different relations: nummod (265; 63% instances), compound (138; 33% instances), nmod (11; 3% instances), nsubj (5; 1% instances), flat (1; 0% instances)
Parents of NUM
nodes belong to 7 different parts of speech: NOUN (250; 60% instances), NUM (131; 31% instances), VERB (18; 4% instances), ADJ (13; 3% instances), ADV (5; 1% instances), PROPN (2; 0% instances), CCONJ (1; 0% instances)
247 (59%) NUM
nodes are leaves.
157 (37%) NUM
nodes have one child.
16 (4%) NUM
nodes have two children.
The highest child degree of a NUM
node is 2.
Children of NUM
nodes are attached using 11 different relations: compound (132; 70% instances), punct (26; 14% instances), advmod (11; 6% instances), nmod (10; 5% instances), amod (2; 1% instances), cc (2; 1% instances), mark (2; 1% instances), acl (1; 1% instances), fixed (1; 1% instances), nsubj (1; 1% instances), nummod (1; 1% instances)
Children of NUM
nodes belong to 9 different parts of speech: NUM (131; 69% instances), PUNCT (26; 14% instances), ADV (12; 6% instances), NOUN (7; 4% instances), ADJ (4; 2% instances), CCONJ (4; 2% instances), PROPN (2; 1% instances), VERB (2; 1% instances), PRON (1; 1% instances)