
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-TOROT: POS Tags: NUM

There are 63 NUM lemmas (1%), 301 NUM types (1%) and 3772 NUM tokens (3%). Out of 14 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent NUM lemmas: шесть.тысячь, единъ, дъва, трие, шестьсътъ, дъвадесяти, четыре, пятьсътъ, пять, осмь

The 10 most frequent NUM types: ҂ѕ҃, х҃, к҃, ѕ҃, ф҃, г҃, в҃, л҃, м҃, д҃

The most frequent ambiguous lemmas (only 7 were observed): съто (NUM 61, NOUN 1), пятьнадесять (NUM 35, ADJ 1), четырьнадесять (NUM 23, ADJ 2), осмьнадесять (NUM 17, ADJ 2), седмьнадесять (NUM 14, ADJ 1), шестьнадесять (NUM 12, ADJ 1), сорокъ (NUM 4, ADJ 1)

The 10 most frequent ambiguous types: х҃ (NUM 183, PROPN 1), а (CCONJ 2205, NUM 51, X 3, ADP 1, ADV 1), и (CCONJ 10267, ADV 722, PRON 517, NUM 51, ADP 32, VERB 2, ADJ 1, DET 1), и҃ (NUM 48, PRON 1), г (ADP 51, NUM 47), д (NUM 46, NOUN 5, ADP 1, CCONJ 1), е (NUM 44, PRON 27, AUX 4), з (NUM 44, ADP 31), в (ADP 1690, NUM 41, PROPN 1), а҃ (NUM 40, ADJ 2)
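The ambiguous-type list above can be reproduced in outline as follows. This is a minimal sketch, not the script that generated this page: the function name `ambiguous_types` and the toy token list are hypothetical, and it assumes forms are compared exactly as written (no case folding or normalization).

```python
from collections import Counter, defaultdict

def ambiguous_types(tagged_tokens, target="NUM"):
    """Return forms tagged as `target` and at least one other tag,
    sorted by how often they carry the `target` tag (descending)."""
    by_form = defaultdict(Counter)
    for form, upos in tagged_tokens:
        by_form[form][upos] += 1
    ambiguous = {
        form: tags
        for form, tags in by_form.items()
        if target in tags and len(tags) > 1
    }
    return sorted(ambiguous.items(), key=lambda item: -item[1][target])

# Invented toy input: (form, UPOS) pairs, not real treebank counts.
toy_tokens = [
    ("а", "CCONJ"), ("а", "NUM"), ("а", "CCONJ"),
    ("в҃", "NUM"),
    ("г", "ADP"), ("г", "NUM"), ("г", "NUM"),
]
result = ambiguous_types(toy_tokens)
for form, tags in result:
    print(form, dict(tags))
```

In the toy input, `в҃` is left out because it only ever appears as NUM, which mirrors why unambiguous forms do not show up in the list.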

Morphology

The form / lemma ratio of NUM is 4.777778 (the average of all parts of speech is 3.571475).
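The reported ratio is consistent with dividing the number of NUM types by the number of NUM lemmas given earlier on this page. A quick check, assuming that is how the figure is derived:

```python
num_types = 301   # distinct NUM word forms reported on this page
num_lemmas = 63   # distinct NUM lemmas reported on this page

ratio = num_types / num_lemmas
print(f"{ratio:.6f}")
```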

The 1st highest number of forms (47) was observed with the lemma “единъ”: а, а҃, еди, един, едина, единаго, едино, единого, единои, едином, единому, единомъ, единомь, единомꙋ, единою, едину, единъ, единым, единымъ, единыхъ, единѣмъ, единѣхъ, единѹ, едінъ, єдино, єдинъ, ѥдин, ѥдина, ѥдини, ѥдино, ѥдиного, ѥдинои, ѥдином, ѥдиномȣ, ѥдиному, ѥдиномь, ѥдиномѹ, ѥдиноꙗ, ѥдину, ѥдинъ, ѥдины, ѥдиныи, ѥдинѣмъ, ѥдинѣмь, ѥдинѣхъ, ѥдинѹ, ѥдін.

The 2nd highest number of forms (29) was observed with the lemma “одинъ”: а, а҃, дним, одиного, одинои, одиномь, одиноѣ, одинъ, одинѡ, одинѹ, одномъ, однѹ, одїного, одїнъ, ѡдина, ѡдини, ѡдино, ѡдиного, ѡдинои, ѡдином, ѡдиною, ѡдинъ, ѡдинѣхъ, ѡдно, ѡдного, ѡднѣ, ѡднѹ, ѡдїну, ҃а.

The 3rd highest number of forms (18) was observed with the lemma “дъва”: .в҃, в, в҃, два, двема, двоу, двою, дву, двѣ, двѣма, двꙋ, дова, довѣ, дъва, дъвою, дъвѣ, дъвѣма, дъвѹ.

NUM occurs with 3 features: Case (709; 19% instances), Gender (709; 19% instances), Number (709; 19% instances)

NUM occurs with 12 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 53 feature combinations. The most frequent feature combination is _ (3063 tokens). Examples: ҂ѕ҃, х҃, к҃, ѕ҃, ф҃, г҃, в҃, л҃, м҃, д҃
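Feature and feature-combination counts like those above can be gathered from the FEATS column of a CoNLL-U file. The sketch below is illustrative only: `num_feature_stats` is a hypothetical helper, and the sample rows are invented for the example rather than taken from the treebank.

```python
from collections import Counter

# Invented toy rows in CoNLL-U column order:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tдъва\tдъва\tNUM\t_\tCase=Nom|Gender=Masc|Number=Dual\t2\tnummod\t_\t_",
    "2\tмужа\tмужь\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Dual\t0\troot\t_\t_",
    "3\tв҃\tдъва\tNUM\t_\t_\t2\tconj\t_\t_",
    "4\tг҃\tтрие\tNUM\t_\t_\t3\tconj\t_\t_",
])

def num_feature_stats(conllu_text):
    """Count features and full FEATS combinations on NUM tokens."""
    features = Counter()
    combos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        token_id, upos, feats = cols[0], cols[3], cols[5]
        if "-" in token_id or "." in token_id:
            continue  # skip multiword-token ranges and empty nodes
        if upos != "NUM":
            continue
        combos[feats] += 1
        if feats != "_":
            for pair in feats.split("|"):
                name, _, _value = pair.partition("=")
                features[name] += 1
    return features, combos

features, combos = num_feature_stats(SAMPLE)
print(features)
print(combos.most_common(1))
```

Counting the raw FEATS string, including the placeholder `_`, is what lets the empty combination appear as the most frequent one, as it does in the statistics above.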

Relations

NUM nodes are attached to their parents using 18 different relations: conj (1496; 40% instances), nummod (1270; 34% instances), obl (338; 9% instances), nsubj (239; 6% instances), obj (177; 5% instances), nmod (93; 2% instances), appos (52; 1% instances), orphan (28; 1% instances), root (24; 1% instances), xcomp (24; 1% instances), iobj (10; 0% instances), nsubj:pass (7; 0% instances), dislocated (6; 0% instances), advcl (4; 0% instances), ccomp (1; 0% instances), dep (1; 0% instances), flat (1; 0% instances), vocative (1; 0% instances)
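The percentages above appear to be each relation's count divided by the total number of NUM tokens, rounded to the nearest whole percent. The following check uses the counts from this page; the rounding rule is an assumption, not something the page states.

```python
# Incoming relation counts for NUM nodes, copied from the list above.
incoming = {
    "conj": 1496, "nummod": 1270, "obl": 338, "nsubj": 239, "obj": 177,
    "nmod": 93, "appos": 52, "orphan": 28, "root": 24, "xcomp": 24,
    "iobj": 10, "nsubj:pass": 7, "dislocated": 6, "advcl": 4,
    "ccomp": 1, "dep": 1, "flat": 1, "vocative": 1,
}

total = sum(incoming.values())
# Assumed rounding: nearest whole percent.
shares = {rel: round(100 * count / total) for rel, count in incoming.items()}

print(total)
print(shares["conj"], shares["nummod"], shares["iobj"])
```

The total equals the 3772 NUM tokens reported at the top of the page, which confirms that every NUM token has exactly one incoming relation in this count.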

Parents of NUM nodes belong to 10 different parts of speech: NUM (1576; 42% instances), NOUN (1300; 34% instances), VERB (586; 16% instances), CCONJ (123; 3% instances), PROPN (47; 1% instances), AUX (34; 1% instances), ADJ (33; 1% instances), ADV (31; 1% instances), PRON (14; 0% instances), ADP (4; 0% instances). The remaining 24 NUM nodes (1% instances) have no parent, which matches the 24 root attachments listed above.

2290 (61%) NUM nodes are leaves.

560 (15%) NUM nodes have one child.

392 (10%) NUM nodes have two children.

530 (14%) NUM nodes have three or more children.

The highest child degree of a NUM node is 27.
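The leaf and child-count figures above come from counting, for each NUM node, how many tokens name it as their head. A minimal sketch of that computation on one sentence follows; `child_degree_buckets` and the toy tree are hypothetical and are not drawn from the treebank.

```python
from collections import Counter

def child_degree_buckets(sentence, target="NUM"):
    """sentence: list of (token_id, head_id, upos) tuples for one tree.
    Returns how many `target` nodes are leaves or have 1, 2, or 3+ children."""
    n_children = Counter(head for _, head, _ in sentence)
    buckets = Counter()
    for token_id, _, upos in sentence:
        if upos != target:
            continue
        degree = n_children[token_id]  # 0 when no token points at this node
        label = {0: "leaf", 1: "one", 2: "two"}.get(degree, "three_or_more")
        buckets[label] += 1
    return buckets

# Invented toy tree: head_id 0 marks the root.
toy = [
    (1, 0, "VERB"),
    (2, 1, "NUM"),   # two children: tokens 3 and 4
    (3, 2, "NUM"),   # leaf
    (4, 2, "NOUN"),
    (5, 1, "NUM"),   # one child: token 6
    (6, 5, "ADP"),
]
buckets = child_degree_buckets(toy)
print(dict(buckets))
```

As a consistency check on the reported figures, 2290 + 560 + 392 + 530 = 3772, the total number of NUM tokens.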

Children of NUM nodes are attached using 19 different relations: conj (1496; 48% instances), nmod (914; 29% instances), case (248; 8% instances), cc (152; 5% instances), appos (78; 3% instances), advmod (67; 2% instances), orphan (35; 1% instances), obl (25; 1% instances), discourse (15; 0% instances), nummod (15; 0% instances), nsubj (13; 0% instances), cop (12; 0% instances), acl (8; 0% instances), amod (7; 0% instances), dislocated (7; 0% instances), det (5; 0% instances), advcl (3; 0% instances), flat (1; 0% instances), mark (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NUM (1576; 51% instances), NOUN (897; 29% instances), ADP (249; 8% instances), CCONJ (154; 5% instances), ADV (83; 3% instances), PROPN (46; 1% instances), ADJ (42; 1% instances), PRON (26; 1% instances), VERB (13; 0% instances), AUX (12; 0% instances), DET (4; 0% instances)