This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: POS Tags: NUM

There are 51 NUM lemmas (2%), 67 NUM types (2%) and 115 NUM tokens (1%). Out of 17 observed tags, NUM ranks 6th by number of lemmas, 9th by number of types and 14th by number of tokens.

The 10 most frequent NUM lemmas: два, много, одинъ, сколко, сто, оба, пять, три, 3, восмь

The 10 most frequent NUM types: много, два, сколко, дву, сто, 3, восмь, две, двесте, двѣ

The 10 most frequent ambiguous lemmas (only 7 are listed): много (NUM 9, ADV 1), одинъ (NUM 8, DET 2, ADJ 1), сто (NUM 7, NOUN 1), 3 (NUM 3, ADJ 1), 2 (ADJ 1, NUM 1), болши (ADV 3, NUM 1), колико (ADV 1, NUM 1)

The 10 most frequent ambiguous types: много (NUM 9, ADV 1), сто (NUM 4, NOUN 1), 3 (NUM 3, ADJ 1, X 1), 5 (NUM 2, X 1), 1 (NUM 1, X 1), 15 (NUM 1, X 1), 2 (ADJ 1, NUM 1), 4 (NUM 1, X 1), болши (ADV 1, NUM 1), колико (ADV 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.313725 (the average of all parts of speech is 1.860579).
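The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given earlier on this page. The following is a minimal sketch of that arithmetic; it assumes this is how the figure is derived, which the page itself does not state.

```python
# Counts taken from the NUM summary on this page.
num_types = 67   # distinct NUM word forms (types)
num_lemmas = 51  # distinct NUM lemmas

# Assumption: the form / lemma ratio is types divided by lemmas.
ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # 1.313725
```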

The highest number of forms (7) was observed with the lemma “одинъ”: аднꙋ, один, одиного, одна, одново, одну, ѡдинъ.

The second-highest number of forms (4) was observed with the lemma “два”: два, две, дву, двѣ.

The third-highest number of forms (2) was observed with the lemma “оба”: оба, обѣ.

NUM occurs with 5 features: Case (108; 94% of instances), Gender (31; 27% of instances), Number (18; 16% of instances), Animacy (1; 1% of instances), Degree (1; 1% of instances)

NUM occurs with 13 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Gender=Fem, Gender=Masc, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 23 feature combinations. The most frequent feature combination is Case=Nom (30 tokens). Examples: много, сколко, сто, восмь, двесте, 24, два, две, девеноста, десять
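Feature counts like these can be gathered from the FEATS column of a CoNLL-U file, where features are written as `Name=Value` pairs separated by `|` and `_` marks an empty column. The sketch below illustrates the idea; the helper name `parse_feats` and the sample FEATS strings are hypothetical and are not taken from the treebank or from the script that generated this page.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values, for illustration only (not treebank data).
feats_column = ["Case=Nom", "Case=Gen|Gender=Masc|Number=Dual", "_"]

# Count how many tokens carry each feature name.
feature_counts = Counter()
for feats in feats_column:
    feature_counts.update(parse_feats(feats).keys())

total = len(feats_column)
for name, n in feature_counts.most_common():
    print(f"{name}: {n} of {total} tokens ({n / total:.0%})")
```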

Relations

NUM nodes are attached to their parents using 8 different relations: nummod:gov (50; 43% of instances), nummod (37; 32% of instances), compound (16; 14% of instances), nsubj:pass (4; 3% of instances), obj (3; 3% of instances), amod (2; 2% of instances), conj (2; 2% of instances), obl (1; 1% of instances)

Parents of NUM nodes belong to 5 different parts of speech: NOUN (84; 73% of instances), NUM (15; 13% of instances), VERB (9; 8% of instances), ADJ (4; 3% of instances), PRON (3; 3% of instances)
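A relation distribution of this kind can be tallied from the UPOS and DEPREL columns (the fourth and eighth of the ten tab-separated CoNLL-U columns). The following is a minimal sketch, not the script that produced this page; the function name `deprel_counts` and the two-token sample with its annotations are hypothetical.

```python
from collections import Counter

def deprel_counts(conllu_text, upos="NUM"):
    """Count DEPREL values of tokens whose UPOS equals `upos`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            counts[cols[7]] += 1
    return counts

# Hypothetical two-token fragment, for illustration only (not treebank data).
sample = "\n".join([
    "# text = два дни",
    "1\tдва\tдва\tNUM\t_\tCase=Acc\t2\tnummod:gov\t_\t_",
    "2\tдни\tдень\tNOUN\t_\tCase=Gen\t0\troot\t_\t_",
])

counts = deprel_counts(sample)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel}: {n} ({n / total:.0%})")
```

Applied to the counts listed above, the same percentage formatting gives 50 / 115 ≈ 43% for nummod:gov and 37 / 115 ≈ 32% for nummod.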

91 (79%) NUM nodes are leaves.

20 (17%) NUM nodes have one child.

4 (3%) NUM nodes have two children.

The highest child degree of a NUM node is 2.
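As a quick consistency check, the degree counts above should add up to the 115 NUM tokens, and they imply 28 children in total, which matches the sum of the child counts reported on this page. A small sketch of that arithmetic, using only figures from this page:

```python
# Child degree -> number of NUM nodes with that many children (from this page).
degree_counts = {0: 91, 1: 20, 2: 4}

total_nodes = sum(degree_counts.values())
total_children = sum(degree * n for degree, n in degree_counts.items())
max_degree = max(degree_counts)

print(total_nodes, total_children, max_degree)  # 115 28 2
```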

Children of NUM nodes are attached using 7 different relations: compound (14; 50% of instances), case (6; 21% of instances), cc (2; 7% of instances), conj (2; 7% of instances), punct (2; 7% of instances), det (1; 4% of instances), nmod (1; 4% of instances)

Children of NUM nodes belong to 6 different parts of speech: NUM (15; 54% of instances), ADP (6; 21% of instances), CCONJ (2; 7% of instances), NOUN (2; 7% of instances), PUNCT (2; 7% of instances), DET (1; 4% of instances)