Treebank Statistics: UD_Old_East_Slavic-Ruthenian: POS Tags: NUM
There are 201 NUM lemmas (2%), 489 NUM types (2%) and 1299 NUM tokens (1%).
Out of 17 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 14 in number of tokens.
The 10 most frequent NUM lemmas: два, оба, три, одинъ, четыри, пять, сто, десять, двадцать, тридцать
The 10 most frequent NUM types: три, два, ѡбѣ, чотыри, две, 10, сто, 5, двѣ, шесть
The 10 most frequent ambiguous lemmas: одинъ (NUM 67, DET 20), 10 (ADJ 27, NUM 22), 5 (NUM 20, ADJ 18), много (NUM 20, ADV 4), 3 (NUM 17, ADJ 8), 4 (ADJ 25, NUM 17), 12 (ADJ 42, NUM 15), 14 (ADJ 71, NUM 15), 6 (ADJ 19, NUM 14), 7 (ADJ 16, NUM 12)
The 10 most frequent ambiguous types: 10 (NUM 20, ADJ 13), 5 (NUM 17, ADJ 16), 4 (NUM 14, ADJ 11), много (NUM 12, ADV 4), 3 (NUM 11, ADJ 7), 20 (ADJ 11, NUM 10), 6 (ADJ 17, NUM 10), 7 (NUM 10, ADJ 8), семъ (NUM 10, DET 5, PRON 4, ADV 2), 12 (ADJ 25, NUM 9)
- 10
- 5
- 4
- много
- 3
- 20
- 6
- 7
- семъ
- NUM 10: Венку обыскати семъ чоловеков против Кричова , а Кричов Анъдреиковичу .
- DET 5: Бо есмо им , ѡкромъ ѡдно сел(ь)ских путниковъ , в том праве ничого не рꙋшили и все писаное в перъвомъ нашомъ привил(ь)ю и в семъ нашом листе , што естъ вышеи выписано , потверъжаемъ симъ нашим листомъ вечно имъ и напотомъ бꙋдꙋчымъ их счадкомъ .
- PRON 4: На семъ же целуите ко мнѣ кр(е)стъ по любьви и в правду , без всѧкого извѣта .
- ADV 2: А хто тобѣ продал , даи семъ того » .
- 12
Morphology
The form / lemma ratio of NUM is 2.432836 (the average of all parts of speech is 2.698737).
The 1st highest number of forms (30) was observed with the lemma “одинъ”: о(д)но, о(д)ного, оди(н), один, одино, одинъ, одно, одног(о), одного, однои, одномъ, одною, однъ, одным, одінъ, ѡдин, ѡдинъ, ѡдины, ѡдна, ѡдно, ѡдног(о), ѡдного, ѡднои, ѡдному, ѡдномꙋ, ѡдною, ѡдным, ѡднымъ, ѡднымь, ѡдъномꙋ.
The 2nd highest number of forms (17) was observed with the lemma “оба”: абею, абою, обею, обо(х), обохъ, обою, обу, обѣ, обꙋ, ѡба, ѡбаи, ѡбе, ѡбема, ѡбохъ, ѡбою, ѡбу, ѡбѣ.
The 3rd highest number of forms (14) was observed with the lemma “два”: два, две, двема, дво(м), дво(х), двома, двоу, двохъ, дву, двух, двѣ, двѣма, двꙋ, двꙋхъ.
NUM occurs with 7 features: Case (1292; 99% instances), NumForm (1289; 99% instances), NumType (1289; 99% instances), Gender (561; 43% instances), Number (102; 8% instances), Animacy (15; 1% instances), Degree (2; 0% instances)
NUM occurs with 21 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Gender=Fem, Gender=Masc, Gender=Neut, NumForm=Combi, NumForm=Cyril, NumForm=Digit, NumForm=Word, NumType=Card, NumType=Frac, NumType=Sets, Number=Dual, Number=Plur, Number=Sing
NUM occurs with 86 feature combinations.
The most frequent feature combination is Case=Nom|NumForm=Digit|NumType=Card (230 tokens).
Examples: 10, 5, 7, 20, 8, 30, 15, 1000, 6, 50
Relations
NUM nodes are attached to their parents using 22 different relations: nummod:gov (521; 40% instances), nummod (460; 35% instances), root (79; 6% instances), parataxis (66; 5% instances), dep (41; 3% instances), compound (39; 3% instances), conj (37; 3% instances), obl (15; 1% instances), obj (8; 1% instances), nmod (6; 0% instances), orphan (5; 0% instances), nsubj (4; 0% instances), advcl (3; 0% instances), appos (3; 0% instances), acl:relcl (2; 0% instances), iobj (2; 0% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), obl:float (1; 0% instances)
Parents of NUM nodes belong to 9 different parts of speech: NOUN (994; 77% instances), VERB (122; 9% instances), (79; 6% instances), ADJ (49; 4% instances), NUM (37; 3% instances), PRON (8; 1% instances), PROPN (8; 1% instances), DET (1; 0% instances), INTJ (1; 0% instances)
1073 (83%) NUM nodes are leaves.
98 (8%) NUM nodes have one child.
26 (2%) NUM nodes have two children.
102 (8%) NUM nodes have three or more children.
The highest child degree of a NUM node is 10.
Children of NUM nodes are attached using 26 different relations: punct (271; 49% instances), compound (80; 14% instances), cc (46; 8% instances), case (27; 5% instances), conj (23; 4% instances), advmod (17; 3% instances), nmod (16; 3% instances), nsubj (15; 3% instances), dep (13; 2% instances), cop (10; 2% instances), obl (7; 1% instances), orphan (7; 1% instances), det (5; 1% instances), acl:relcl (3; 1% instances), mark (3; 1% instances), appos (2; 0% instances), iobj (2; 0% instances), nummod (2; 0% instances), nummod:gov (2; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), expl (1; 0% instances), fixed (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)
Children of NUM nodes belong to 15 different parts of speech: PUNCT (271; 49% instances), SYM (74; 13% instances), CCONJ (46; 8% instances), NOUN (40; 7% instances), NUM (37; 7% instances), ADP (27; 5% instances), PART (14; 3% instances), AUX (11; 2% instances), DET (10; 2% instances), VERB (9; 2% instances), ADJ (4; 1% instances), ADV (4; 1% instances), PRON (4; 1% instances), PROPN (4; 1% instances), SCONJ (3; 1% instances)