home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Church_Slavonic-PROIEL: POS Tags: NUM

There are 51 NUM lemmas (1%), 425 NUM types (1%) and 1815 NUM tokens (1%). Out of 14 observed tags, the rank of NUM is: 9 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: ѥдинъ, дъва, десѧть, триѥ, пѧть, оба, съто, ѥдьнъ, седмь, четꙑрe

The 10 most frequent NUM types: единъ, дъва, десѧте, единого, три, ѥ҅динъ, р, пѧть, оба, десꙙте

The 10 most frequent ambiguous lemmas: ѥдинъ (NUM 509, PRON 76, DET 51), десѧть (NUM 185, ADJ 1, NOUN 1), съто (NUM 76, NOUN 3, ADP 1, INTJ 1), ѥдьнъ (NUM 54, DET 1), четꙑрe (NUM 53, ADJ 1), тꙑсѧщи (NUM 32, NOUN 1), осмь (NUM 22, ADJ 3), дъва.на.десѧте (NUM 16, ADJ 1), девѧтъ (ADJ 12, NUM 1), осмъ (ADJ 5, NUM 1)

The 10 most frequent ambiguous types: единъ (NUM 106, DET 20, PRON 14), единого (NUM 55, DET 2, PRON 1), ѥ҅динъ (NUM 47, PRON 22, DET 2), р (NUM 24, INTJ 1), ѥ҅дного (NUM 23, PRON 4, DET 1), ѥ͑динъ (NUM 21, PRON 2), единѫ (NUM 18, DET 5), единомоу (NUM 15, DET 3, PRON 1), ѥ҅дномоу (NUM 14, PRON 1), а (CCONJ 207, NUM 11, ADP 2, NOUN 1, PRON 1, PROPN 1, SCONJ 1)

Morphology

The form / lemma ratio of NUM is 8.333333 (the average of all parts of speech is 5.263244).

The 1st highest number of forms (101) was observed with the lemma “ѥдинъ”: ͑Единъ͗, Е҅динꙑ, а, а҃, а҃҃҃, едина, единааго, едини, едино, единого, единои, единомоу, единомъ, единомь, единоуемоу, единоѩ, единъ, единь, единѣмъ, единѣмь, единѫ, единѫѭ, единѫѭ҄, единꙑ, едіного, едіномоу, едінъ, е҅дꙿнои҆, е҅дꙿнѫ, ѢДИНА, ѥ͑дʼна, ѥ͑дʼно, ѥ͑дʼного, ѥ͑дʼномѹ, ѥ͑дʼноѭ҄, ѥ͑динʼ, ѥ͑дина, ѥ͑дини, ѥ͑дино, ѥ͑диномъ, ѥ͑диноѭ҄, ѥ͑динъ, ѥ͑динъи, ѥ͑динѣмъ, ѥ͑динѣмь͗, ѥ͑динѣхъ, ѥ͑динѫ, ѥ͑дно, ѥ͑дного, ѥ͑дномоу, ѥ͑дномъ, ѥ͑дномь͗, ѥ͑дноѭ҄, ѥ͑днѣмъ, ѥ͗дʼного, ѥ͗дʼнои͗, ѥдʼного, ѥдʼномоу, ѥдино, ѥдинъ, ѥднои, ѥдномоу, ѥдноѧ, ѥ҅дин, ѥ҅дина, ѥ҅дини, ѥ҅дино, ѥ҅динои҆, ѥ҅диномоу, ѥ҅диноуо̑умꙋ, ѥ҅диноѧ, ѥ҅динъ, ѥ҅динь҆, ѥ҅динѣмъ, ѥ҅динѣмь҆, ѥ҅динѣмꙿ, ѥ҅динѣхъ, ѥ҅динѫ, ѥ҅динꙑ, ѥ҅динꙑи, ѥ҅динꙑи҆, ѥ҅ди҆нъ, ѥ҅дно, ѥ҅дного, ѥ҅днои, ѥ҅днои҆, ѥ҅дномоу, ѥ҅дномь҆, ѥ҅дноѭ̑, ѥ҅дноѭ҄, ѥ҅днѣмъ, ѥ҅днѣмъ҆, ѥ҅днѣмь, ѥ҅днѣмь҆, ѥ҅днѫ, ѥ҅дінъ, ѥ҅дꙿномоу, ѥ҅дꙿномь, ѥ҅дꙿнѣми, ѥ҅дꙿнѫ, ҅Ѥдинъ.

The 2nd highest number of forms (33) was observed with the lemma “десѧть”: десеⷮ҇, десѧте, десѧтемъ, десѧти, десѧтии, десѧтиѭ, десѧтъ, десѧтъма, десѧть, десѧтьѭ, десѧтѣ, десѧтꙑ, десѩти, десⱕте, десⱕтемъ, десⱕти, десⱕтъ, десⱕты, десꙙте, десꙙтемь҆, десꙙтехъ, десꙙтехꙿ, десꙙти, десꙙтъ, десꙙть, десꙙть҆, десꙙть҆ма, десꙙть҆мь, десꙙтꙑѧ, ї, ї͆, ї꙯, ꙇ҃.

The 3rd highest number of forms (29) was observed with the lemma “дъва”: Б, б҃, в, в҃, в҃҃, в꙯, дʼва, дʼвѣма, два, двою̑, двою҄, двѣ, двѣма, дъва, дъвою, дъвою̑, дъвою҄, дъвоѭ, дъвѣ, дъвѣма, дь͗вѣма, дьва, дьвѣ, дьвѣма, дь҆ва, дь҆вѣ, дь҆вѣма, дꙿва, дꙿвѣма.

NUM occurs with 3 features: Case (1451; 80% instances), Number (1451; 80% instances), Gender (1367; 75% instances)

NUM occurs with 16 feature-value pairs: Case=Acc, Case=Dat, Case=Dat,Gen, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

NUM occurs with 81 feature combinations. The most frequent feature combination is _ (364 tokens). Examples: р, а, к҃, м͆, к꙯, м҃, о҃, р҃, в, г

Relations

NUM nodes are attached to their parents using 22 different relations: nummod (658; 36% instances), conj (221; 12% instances), root (182; 10% instances), nsubj (159; 9% instances), obj (141; 8% instances), obl (121; 7% instances), nmod (72; 4% instances), xcomp (71; 4% instances), appos (59; 3% instances), obl:arg (30; 2% instances), dislocated (24; 1% instances), orphan (22; 1% instances), advcl (17; 1% instances), parataxis (11; 1% instances), nsubj:pass (9; 0% instances), obl:agent (7; 0% instances), ccomp (3; 0% instances), advcl:cmp (2; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), acl (1; 0% instances), csubj (1; 0% instances)

Parents of NUM nodes belong to 13 different parts of speech: NOUN (538; 30% instances), VERB (508; 28% instances), NUM (410; 23% instances), (182; 10% instances), PRON (64; 4% instances), ADJ (48; 3% instances), ADV (25; 1% instances), AUX (22; 1% instances), PROPN (12; 1% instances), ADP (3; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

902 (50%) NUM nodes are leaves.

567 (31%) NUM nodes have one child.

254 (14%) NUM nodes have two children.

92 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 8.

Children of NUM nodes are attached using 23 different relations: nmod (416; 30% instances), conj (233; 17% instances), case (219; 16% instances), nummod (144; 10% instances), advmod (77; 5% instances), cc (68; 5% instances), cop (43; 3% instances), nsubj (33; 2% instances), acl (29; 2% instances), orphan (29; 2% instances), discourse (28; 2% instances), obl (19; 1% instances), appos (17; 1% instances), advcl (13; 1% instances), amod (12; 1% instances), mark (10; 1% instances), det (7; 0% instances), dislocated (3; 0% instances), ccomp (2; 0% instances), fixed (2; 0% instances), advcl:cmp (1; 0% instances), obl:arg (1; 0% instances), parataxis (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: NUM (410; 29% instances), NOUN (345; 25% instances), ADP (220; 16% instances), ADV (121; 9% instances), PRON (74; 5% instances), CCONJ (69; 5% instances), ADJ (46; 3% instances), AUX (44; 3% instances), VERB (44; 3% instances), PROPN (18; 1% instances), SCONJ (9; 1% instances), DET (7; 0% instances)