home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Komi_Zyrian-IKDP: POS Tags: NUM

There are 30 NUM lemmas (4%), 32 NUM types (3%) and 65 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 8 in number of tokens.

The 10 most frequent NUM lemmas: куим, нёль, кык, сизим, дас, вит, десятой, кызь, сорок, ӧти

The 10 most frequent NUM types: куим, нёль, кык, сизим, дас, вит, десятой, кызь, сорок, Девять

The 10 most frequent ambiguous lemmas: кык (NUM 7, NOUN 1), мӧд (PRON 4, DET 1, NUM 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.066667 (the average of all parts of speech is 1.332474).

The 1st highest number of forms (2) was observed with the lemma “куим”: Куимсэ, куим.

The 2nd highest number of forms (2) was observed with the lemma “ӧти”: Ӧтікес, ӧтік.

The 3rd highest number of forms (1) was observed with the lemma “Девять”: Девять.

NUM occurs with 6 features: NumType (62; 95% instances), Case (53; 82% instances), Number (51; 78% instances), Number[psor] (2; 3% instances), Person[psor] (2; 3% instances), Clitic (1; 2% instances)

NUM occurs with 9 feature-value pairs: Case=Acc, Case=Nom, Clitic=So, NumType=Card, NumType=Card,Ord, NumType=Ord, Number=Sing, Number[psor]=Sing, Person[psor]=1

NUM occurs with 10 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|NumType=Card (44 tokens). Examples: нёль, куим, кык, сизим, дас, кызь, вит, кызь-вит, три, тысяча

Relations

NUM nodes are attached to their parents using 8 different relations: nummod (53; 82% instances), compound (2; 3% instances), fixed (2; 3% instances), nmod (2; 3% instances), obj (2; 3% instances), root (2; 3% instances), amod (1; 2% instances), conj (1; 2% instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (47; 72% instances), NUM (10; 15% instances), ADJ (4; 6% instances), (2; 3% instances), VERB (2; 3% instances)

46 (71%) NUM nodes are leaves.

16 (25%) NUM nodes have one child.

2 (3%) NUM nodes have two children.

1 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 12 different relations: nummod (6; 26% instances), punct (4; 17% instances), fixed (3; 13% instances), advmod (2; 9% instances), advcl (1; 4% instances), advmod:tmod (1; 4% instances), case (1; 4% instances), cc (1; 4% instances), compound (1; 4% instances), conj (1; 4% instances), orphan (1; 4% instances), reparandum (1; 4% instances)

Children of NUM nodes belong to 6 different parts of speech: NUM (10; 43% instances), ADV (4; 17% instances), PUNCT (4; 17% instances), ADJ (3; 13% instances), ADP (1; 4% instances), CCONJ (1; 4% instances)