Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: NUM
There are 144 NUM lemmas (3%), 336 NUM types (2%) and 927 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: два, оба, три, одинъ, четыри, пять, сто, 10, тридцать, двадцать
The 10 most frequent NUM types: два, три, ѡбѣ, две, 10, 5, сто, 4, ѡбе, двесте
The 10 most frequent ambiguous lemmas: одинъ (NUM 58, DET 16), 10 (NUM 21, ADJ 12), 5 (NUM 19, ADJ 14), 4 (NUM 16, ADJ 11), много (NUM 16, ADV 2), 3 (NUM 14, ADJ 7), 6 (ADJ 12, NUM 11), 7 (NUM 11, ADJ 7), 20 (ADJ 11, NUM 10), семь (NUM 10, ADV 5)
The 10 most frequent ambiguous types: 10 (NUM 19, ADJ 12), 5 (NUM 16, ADJ 14), 4 (NUM 13, ADJ 11), 3 (NUM 10, ADJ 7), 20 (ADJ 11, NUM 9), 6 (ADJ 12, NUM 9), 7 (NUM 9, ADJ 7), 12 (ADJ 24, NUM 8), много (NUM 8, ADV 2), 14 (ADJ 28, NUM 7)
Morphology
The form / lemma ratio of NUM is 2.333333 (the average of all parts of speech is 2.909188).
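The NUM ratio reported here matches the number of distinct NUM types divided by the number of NUM lemmas, using the counts given at the top of the page (336 types, 144 lemmas). That reading is an assumption inferred from the numbers, not a documented formula. A minimal Python check:

```python
# Counts reported at the top of this page for the NUM tag.
num_types = 336   # distinct word forms tagged NUM
num_lemmas = 144  # distinct lemmas tagged NUM

# Assumption: the form / lemma ratio is simply types divided by lemmas.
form_lemma_ratio = num_types / num_lemmas

print(f"{form_lemma_ratio:.6f}")  # prints 2.333333
```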
The 1st highest number of forms (23) was observed with the lemma “одинъ”: одно, одног(о), одного, однои, одномъ, одною, однъ, одным, одінъ, ѡдин, ѡдинъ, ѡдины, ѡдна, ѡдног(о), ѡдного, ѡднои, ѡдному, ѡдномꙋ, ѡдною, ѡдным, ѡднымъ, ѡднымь, ѡдъномꙋ.
The 2nd highest number of forms (14) was observed with the lemma “оба”: абею, абою, обею, обою, обу, обѣ, обꙋ, ѡба, ѡбаи, ѡбе, ѡбема, ѡбою, ѡбу, ѡбѣ.
The 3rd highest number of forms (10) was observed with the lemma “два”: два, две, двема, двоу, дву, двух, двѣ, двѣма, двꙋ, двꙋхъ.
NUM occurs with 7 features: NumForm (927; 100% instances), NumType (927; 100% instances), Case (925; 100% instances), Gender (432; 47% instances), Number (73; 8% instances), Animacy (12; 1% instances), Degree (2; 0% instances)
NUM occurs with 20 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Combi, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Frac, NumType=Sets, Number=Dual, Number=Plur, Number=Sing
NUM occurs with 73 feature combinations.
The most frequent feature combination is Case=Nom|NumForm=Digit|NumType=Card (149 tokens).
Examples: 10, 5, 7, 20, 30, 15, 8, 1000, 6, 400
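A feature combination such as the one above is written in the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`. The following sketch shows one way to split such a string into a dictionary; `parse_feats` is a hypothetical helper name chosen for illustration, not part of any UD tool.

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string such as 'Case=Nom|NumType=Card' into a dict."""
    if feats == "_":  # CoNLL-U uses '_' for an empty feature set
        return {}
    # Each pair has the form Name=Value; split on the first '=' only.
    return dict(pair.split("=", 1) for pair in feats.split("|"))

most_frequent = parse_feats("Case=Nom|NumForm=Digit|NumType=Card")
print(most_frequent)
```

With the features in a dictionary, a lookup such as `most_frequent["NumForm"]` identifies digit-written numerals directly.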
Relations
NUM nodes are attached to their parents using 20 different relations: nummod (359; 39% instances), nummod:gov (353; 38% instances), parataxis (66; 7% instances), compound (37; 4% instances), conj (34; 4% instances), dep (29; 3% instances), obl (13; 1% instances), obj (8; 1% instances), nmod (4; 0% instances), orphan (4; 0% instances), root (4; 0% instances), advcl (3; 0% instances), nsubj (3; 0% instances), appos (2; 0% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances)
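The percentages in this list are consistent with each count divided by the 927 NUM tokens and rounded to the nearest whole percent, which is why relations with only a few instances appear as 0%. The rounding rule is an assumption inferred from the figures. A short sketch over a few of the counts above:

```python
# A few of the relation counts listed above; the total is the NUM token count.
total_num_tokens = 927
relation_counts = {"nummod": 359, "nummod:gov": 353, "parataxis": 66, "obj": 8, "nmod": 4}

def share(count: int, total: int) -> int:
    """Percentage of total, rounded to the nearest whole number (assumed rule)."""
    return round(100 * count / total)

shares = {rel: share(n, total_num_tokens) for rel, n in relation_counts.items()}
print(shares)
```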
Parents of NUM nodes belong to 9 different parts of speech: NOUN (736; 79% instances), VERB (109; 12% instances), ADJ (39; 4% instances), NUM (25; 3% instances), PROPN (6; 1% instances), PRON (4; 0% instances), (4; 0% instances), DET (3; 0% instances), INTJ (1; 0% instances)
805 (87%) NUM nodes are leaves.
77 (8%) NUM nodes have one child.
24 (3%) NUM nodes have two children.
21 (2%) NUM nodes have three or more children.
The highest child degree of a NUM node is 10.
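The four buckets above add up to the 927 NUM tokens (805 + 77 + 24 + 21). As a rough illustration of how child degree can be derived from dependency heads, the sketch below counts children per NUM node in a toy sentence. The token rows are invented for the example and are not taken from the treebank.

```python
from collections import Counter

# Hypothetical toy sentence as (id, head, upos) triples; head 0 is the root.
# These rows are made up for illustration and are not taken from the treebank.
tokens = [
    (1, 2, "NUM"),    # numeral modifying the noun at 2; has no children
    (2, 3, "NOUN"),
    (3, 0, "VERB"),
    (4, 5, "CCONJ"),  # conjunction attached to the numeral at 5
    (5, 2, "NUM"),    # second numeral, with the conjunction as its child
]

# Count how many tokens name each id as their head.
child_counts = Counter(head for _, head, _ in tokens)

# Child degree of every NUM node (0 means the node is a leaf).
num_degrees = {tid: child_counts[tid] for tid, _, upos in tokens if upos == "NUM"}
leaves = [tid for tid, degree in num_degrees.items() if degree == 0]

print(num_degrees, leaves)
```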
Children of NUM nodes are attached using 23 different relations: cc (43; 20% instances), punct (34; 16% instances), case (24; 11% instances), conj (21; 10% instances), advmod (15; 7% instances), nmod (15; 7% instances), nsubj (11; 5% instances), cop (7; 3% instances), compound (6; 3% instances), obl (6; 3% instances), orphan (6; 3% instances), det (4; 2% instances), acl:relcl (3; 1% instances), appos (2; 1% instances), iobj (2; 1% instances), mark (2; 1% instances), nummod (2; 1% instances), nummod:gov (2; 1% instances), amod (1; 0% instances), discourse (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)
Children of NUM nodes belong to 14 different parts of speech: CCONJ (43; 20% instances), NOUN (35; 17% instances), PUNCT (34; 16% instances), NUM (25; 12% instances), ADP (24; 11% instances), PART (13; 6% instances), VERB (9; 4% instances), AUX (8; 4% instances), DET (5; 2% instances), PRON (4; 2% instances), ADV (3; 1% instances), PROPN (3; 1% instances), ADJ (2; 1% instances), SCONJ (2; 1% instances)