home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: POS Tags: NUM

There are 80 NUM lemmas (3%), 104 NUM types (2%) and 216 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: два, 3, одинъ, много, 10, 2, сколко, 5, сто, 8

The 10 most frequent NUM types: 3, много, 2, два, дву, 3-х, сколко, 10, 5, 8

The 10 most frequent ambiguous lemmas: 3 (NUM 18, ADJ 1), одинъ (NUM 16, DET 3, ADJ 1), много (NUM 10, ADV 6), 10 (NUM 8, ADJ 3), 2 (NUM 8, ADJ 2), сто (NUM 7, NOUN 1), 8 (NUM 5, ADJ 3), 1 (ADJ 2, NUM 1), 13 (ADJ 1, NUM 1), 190 (ADJ 1, NUM 1)

The 10 most frequent ambiguous types: 3 (NUM 11, ADJ 1, X 1), много (NUM 10, ADV 4), 2 (NUM 8, ADJ 2), 10 (NUM 6, ADJ 3, X 1), 5 (NUM 6, X 1), 8 (NUM 5, ADJ 3, X 1), 4 (NUM 4, X 1), сто (NUM 4, NOUN 1), 15 (NUM 3, X 1), 9 (NUM 3, X 1)

Morphology

The form / lemma ratio of NUM is 1.300000 (the average of all parts of speech is 1.900114).

The 1st highest number of forms (10) was observed with the lemma “одинъ”: адин, аднꙋ, один, одиного, одна, однова, одново, одном, одну, ѡдинъ.

The 2nd highest number of forms (5) was observed with the lemma “два”: два, две, двема, дву, двѣ.

The 3rd highest number of forms (2) was observed with the lemma “10”: 10, 10-ти.

NUM occurs with 5 features: Case (208; 96% instances), Gender (64; 30% instances), Number (24; 11% instances), Degree (2; 1% instances), Animacy (1; 0% instances)

NUM occurs with 15 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 29 feature combinations. The most frequent feature combination is Case=Nom (75 tokens). Examples: много, 2, 8, 3, 10, 100, сто, 9, сколко, 119

Relations

NUM nodes are attached to their parents using 10 different relations: nummod:gov (109; 50% instances), nummod (69; 32% instances), compound (18; 8% instances), nsubj:pass (5; 2% instances), obj (5; 2% instances), obl (4; 2% instances), amod (2; 1% instances), conj (2; 1% instances), nsubj (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (173; 80% instances), NUM (17; 8% instances), VERB (15; 7% instances), ADJ (6; 3% instances), PRON (3; 1% instances), ADV (1; 0% instances), (1; 0% instances)

181 (84%) NUM nodes are leaves.

30 (14%) NUM nodes have one child.

4 (2%) NUM nodes have two children.

1 (0%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 11 different relations: compound (16; 36% instances), case (7; 16% instances), advmod (3; 7% instances), cc (3; 7% instances), conj (3; 7% instances), nmod (3; 7% instances), parataxis (3; 7% instances), punct (3; 7% instances), det (1; 2% instances), nsubj (1; 2% instances), obl (1; 2% instances)

Children of NUM nodes belong to 10 different parts of speech: NUM (17; 39% instances), NOUN (8; 18% instances), ADP (7; 16% instances), CCONJ (3; 7% instances), PUNCT (3; 7% instances), PART (2; 5% instances), ADV (1; 2% instances), DET (1; 2% instances), PRON (1; 2% instances), VERB (1; 2% instances)