
This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: POS Tags: NUM

There are 118 NUM lemmas (3%), 162 NUM types (2%) and 370 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: два, 3, одинъ, оба, 2, 5, много, три, 10, двадцать

The 10 most frequent NUM types: 3, 2, обе, много, 5, два, дву, две, сколко, 10

The 10 most frequent ambiguous lemmas: 3 (NUM 22, ADJ 6), одинъ (NUM 20, DET 3), 2 (NUM 13, ADJ 2), 5 (NUM 10, ADJ 2), много (ADV 10, NUM 10), 10 (NUM 9, ADJ 4), сто (NUM 9, NOUN 1), 8 (NUM 7, ADJ 3), 6 (NUM 5, ADJ 4), 1 (ADJ 6, NUM 4)

The 10 most frequent ambiguous types: 3 (NUM 15, ADJ 4, ADV 1, X 1), 2 (NUM 13, ADJ 2), много (NUM 10, ADV 6), 5 (NUM 9, ADJ 2), 10 (NUM 7, ADJ 4, X 1), 8 (NUM 7, ADJ 3, X 1), сто (NUM 6, NOUN 1), 6 (NUM 5, ADJ 4, X 1), 1 (ADJ 6, NUM 4, X 1), 30 (NUM 4, ADJ 3)

Morphology

The form / lemma ratio of NUM is 1.372881 (the average of all parts of speech is 1.947446).
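The reported ratio is consistent with dividing the number of NUM types (162) by the number of NUM lemmas (118). A minimal sketch of that arithmetic follows; the function name `form_lemma_ratio` is hypothetical, and the assumption that the page computes the ratio exactly this way is inferred from the numbers, not stated by the source.

```python
# Counts taken from the NUM statistics on this page.
num_types = 162   # distinct NUM word forms (types)
num_lemmas = 118  # distinct NUM lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (hypothetical helper)."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(num_types, num_lemmas)
print(f"{ratio:.6f}")
```

A ratio below the all-POS average (1.947446) indicates that NUM lemmas appear in fewer distinct forms, on average, than lemmas of other parts of speech in this treebank.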

The highest number of forms (11) was observed with the lemma “одинъ”: адин, аднꙋ, один, одиного, одна, однова, одново, одном, одною, одну, ѡдинъ.

The second-highest number of forms (6) was observed with the lemma “два”: два, две, двема, двома, дву, двѣ.

The third-highest number of forms (4) was observed with the lemma “единъ”: единаго, единеми, едино, единъ.

NUM occurs with 6 features: Case (367; 99% instances), Gender (155; 42% instances), NumForm (136; 37% instances), Number (38; 10% instances), Degree (5; 1% instances), Animacy (2; 1% instances)

NUM occurs with 17 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Cyril, NumForm=Digit, NumForm=Roman, Number=Dual, Number=Sing

NUM occurs with 49 feature combinations. The most frequent feature combination is Case=Nom|NumForm=Digit (56 tokens). Examples: 8, 10, 100, 6, 9, 119, 12, 13, 15, 16

Relations

NUM nodes are attached to their parents using 15 different relations: nummod:gov (186; 50% instances), nummod (103; 28% instances), compound (29; 8% instances), root (12; 3% instances), appos (9; 2% instances), amod (5; 1% instances), conj (5; 1% instances), nsubj:pass (5; 1% instances), obj (5; 1% instances), obl (4; 1% instances), nsubj (3; 1% instances), acl (1; 0% instances), dep (1; 0% instances), nmod (1; 0% instances), parataxis (1; 0% instances)
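As a sanity check, the percentages above can be recomputed from the raw counts. The sketch below assumes each percentage is the count divided by the total number of NUM tokens, rounded to the nearest integer; the variable names are illustrative only.

```python
# Counts copied from the relation list above.
relation_counts = {
    "nummod:gov": 186, "nummod": 103, "compound": 29, "root": 12,
    "appos": 9, "amod": 5, "conj": 5, "nsubj:pass": 5, "obj": 5,
    "obl": 4, "nsubj": 3, "acl": 1, "dep": 1, "nmod": 1, "parataxis": 1,
}

# Every NUM token has exactly one incoming relation, so the counts
# should add up to the total number of NUM tokens.
total = sum(relation_counts.values())

# Assumed rounding rule: nearest whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```

The counts sum to the 370 NUM tokens reported at the top of the page, and relations that occur only once round down to 0%.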

Parents of NUM nodes belong to 7 different categories (6 parts of speech plus the root, which has no part of speech): NOUN (296; 80% instances), NUM (29; 8% instances), VERB (19; 5% instances), root (12; 3% instances), ADJ (8; 2% instances), PRON (5; 1% instances), ADV (1; 0% instances)

303 (82%) NUM nodes are leaves.

45 (12%) NUM nodes have one child.

15 (4%) NUM nodes have two children.

7 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
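The child-degree figures above could be gathered from CoNLL-U data roughly as follows. This is a minimal sketch, not the script that generated this page: the sentence is an invented toy example (not taken from the treebank), and the helper name `num_child_degrees` is hypothetical.

```python
from collections import Counter

# Invented toy sentence in CoNLL-U column order:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
ROWS = [
    ("1", "двадцать", "двадцать", "NUM", "_", "_", "2", "compound", "_", "_"),
    ("2", "два", "два", "NUM", "_", "_", "3", "nummod:gov", "_", "_"),
    ("3", "стола", "столъ", "NOUN", "_", "_", "0", "root", "_", "_"),
]
CONLLU = "\n".join("\t".join(row) for row in ROWS) + "\n"


def num_child_degrees(conllu_sentence: str) -> Counter:
    """Map child degree -> number of NUM nodes with that many children."""
    tokens = []
    for line in conllu_sentence.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        tokens.append(cols)
    # Count how many tokens point to each HEAD id.
    children = Counter(cols[6] for cols in tokens)
    # Look up the child count of every NUM token (0 if it is a leaf).
    return Counter(children[cols[0]] for cols in tokens if cols[3] == "NUM")


degrees = num_child_degrees(CONLLU)
print(dict(degrees))
```

In the toy sentence, one NUM node is a leaf and the other has a single child attached via `compound`, which mirrors the two most common configurations reported above.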

Children of NUM nodes are attached using 16 different relations: punct (29; 28% instances), compound (28; 27% instances), case (10; 10% instances), advmod (5; 5% instances), nmod (5; 5% instances), cc (4; 4% instances), conj (4; 4% instances), nsubj (4; 4% instances), obl (3; 3% instances), parataxis (3; 3% instances), appos (2; 2% instances), goeswith (2; 2% instances), cop (1; 1% instances), dep (1; 1% instances), det (1; 1% instances), iobj (1; 1% instances)

Children of NUM nodes belong to 14 different parts of speech: NUM (29; 28% instances), PUNCT (29; 28% instances), NOUN (14; 14% instances), ADP (10; 10% instances), CCONJ (4; 4% instances), ADJ (3; 3% instances), PART (3; 3% instances), PROPN (3; 3% instances), ADV (2; 2% instances), PRON (2; 2% instances), AUX (1; 1% instances), DET (1; 1% instances), VERB (1; 1% instances), X (1; 1% instances)