This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

NUM: numeral

This document is a placeholder for the language-specific documentation for NUM.


Treebank Statistics (UD_Old_Church_Slavonic)

There are 20 NUM lemmas (1%), 105 NUM types (1%) and 673 NUM tokens (1%). Out of 13 observed tags, the rank of NUM is: 8 in number of lemmas, 7 in number of types and 10 in number of tokens.
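
Counts of this kind can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the function name `count_upos` and the sample sentences are hypothetical, and it assumes that a "type" is a distinct FORM value compared case-sensitively.

```python
def count_upos(conllu_text, upos="NUM"):
    """Return (lemma_count, type_count, token_count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and malformed rows.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:      # column 4 is the universal POS tag
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical sample, not taken from the treebank.
sample = "\n".join([
    "# sent_id = example-1",
    "1\tединъ\tѥдинъ\tNUM\t_\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "",
    "# sent_id = example-2",
    "1\tдъва\tдъва\tNUM\t_\tCase=Nom|Gender=Masc|Number=Dual\t2\tnummod\t_\t_",
    "2\tединого\tѥдинъ\tNUM\t_\tCase=Gen|Gender=Masc|Number=Sing\t0\troot\t_\t_",
])

print(count_upos(sample))  # (2, 3, 3): two lemmas, three types, three tokens
```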

The 10 most frequent NUM lemmas: ѥдинъ, дъва, десѧть, оба, пѧть, триѥ, седмь, съто, тꙑсѧщи, четꙑрe

The 10 most frequent NUM types: единъ, десѧте, дъва, единого, три, пѧть, оба, седмь, десѧтъ, едино

The most frequent ambiguous lemmas (only 2 observed): ѥдинъ (NUM 224, PRON 93), десѧть (NUM 97, NOUN 1)

The 10 most frequent ambiguous types: единъ (NUM 89, PRON 32), единого (NUM 47, PRON 3), единѫ (NUM 16, PRON 5), единомоу (NUM 8, PRON 4), едина (NUM 7, PRON 7), единꙑ (NUM 5, PRON 3), единомъ (NUM 4, PRON 1), единомь (NUM 4, PRON 3), единѣмъ (NUM 4, PRON 1), едини (PRON 27, NUM 3)

Morphology

The form / lemma ratio of NUM is 5.250000 (the average of all parts of speech is 3.336884).
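
The reported ratio is consistent with dividing the number of NUM types (105) by the number of NUM lemmas (20) given in the treebank statistics. The short check below is only an arithmetic sketch under that assumption; the variable names are illustrative.

```python
num_types = 105   # distinct NUM word forms reported for this treebank
num_lemmas = 20   # distinct NUM lemmas reported for this treebank

ratio = num_types / num_lemmas
formatted = f"{ratio:.6f}"  # six decimal places, as on this page
print(formatted)            # 5.250000
```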

The highest number of forms (20) was observed with the lemma “ѥдинъ”: Е҅динꙑ, едина, единааго, едини, едино, единого, единои, единомоу, единомъ, единомь, единоуемоу, единоѩ, единъ, единь, единѣмъ, единѣмь, единѫ, единѫѭ, единѫѭ҄, единꙑ.

The second-highest number of forms (13) was observed with the lemma “десѧть”: десеⷮ҇, десѧте, десѧтемъ, десѧти, десѧтии, десѧтиѭ, десѧтъ, десѧтъма, десѧть, десѧтьѭ, десѧтѣ, десѧтꙑ, ꙇ҃.

The third-highest number of forms (10) was observed with the lemma “дъва”: б҃, два, дъва, дъвою, дъвоѭ, дъвѣ, дъвѣма, дьва, дьвѣ, дьвѣма.

NUM occurs with 3 features: Case (672; 100% instances), Number (672; 100% instances), Gender (596; 89% instances)

NUM occurs with 16 feature-value pairs (separated here by semicolons, because some values themselves contain commas): Case=Acc; Case=Dat; Case=Dat,Gen; Case=Gen; Case=Ins; Case=Loc; Case=Nom; Gender=Fem; Gender=Fem,Masc; Gender=Fem,Neut; Gender=Masc; Gender=Masc,Neut; Gender=Neut; Number=Dual; Number=Plur; Number=Sing

NUM occurs with 65 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (75 tokens). Examples: единъ, единь, Е҅динꙑ
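
The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file. The sketch below assumes the FEATS strings of the relevant tokens have already been collected; `feature_stats` and the sample values are hypothetical. Multi-valued entries such as `Case=Dat,Gen` are kept as a single value, which is how they appear in the list above.

```python
from collections import Counter


def feature_stats(feats_values):
    """Count features, feature-value pairs and whole feature combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        if feats == "_":
            continue  # "_" marks a token without morphological features
        combos[feats] += 1
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Hypothetical FEATS values, not taken from the treebank.
sample_feats = [
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Gen|Number=Plur",
    "Case=Dat,Gen|Gender=Fem,Masc|Number=Dual",
    "_",
]

features, pairs, combos = feature_stats(sample_feats)
print(features["Gender"])     # 3: one sample token has no Gender
print(combos.most_common(1))  # [('Case=Nom|Gender=Masc|Number=Sing', 2)]
```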

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (262; 39% instances), dobj (84; 12% instances), nsubj (77; 11% instances), nmod (73; 11% instances), appos (29; 4% instances), xcomp (29; 4% instances), remnant (28; 4% instances), advmod (26; 4% instances), iobj (26; 4% instances), conj (25; 4% instances), root (10; 1% instances), dep (3; 0% instances), advcl (1; 0% instances)
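
As a consistency check, the relation counts above sum to the 673 NUM tokens, and the percentages follow from rounding each share to the nearest integer. The snippet below is a sketch of that arithmetic using the counts from this page; the rounding rule is an assumption that happens to reproduce every listed percentage.

```python
# Counts of incoming relations for NUM nodes, copied from the list above.
deprel_counts = {
    "nummod": 262, "dobj": 84, "nsubj": 77, "nmod": 73, "appos": 29,
    "xcomp": 29, "remnant": 28, "advmod": 26, "iobj": 26, "conj": 25,
    "root": 10, "dep": 3, "advcl": 1,
}

total = sum(deprel_counts.values())
# Assumed rule: round each share to the nearest whole percent.
percentages = {rel: round(100 * n / total) for rel, n in deprel_counts.items()}

print(total)                  # 673
print(percentages["nummod"])  # 39
print(percentages["dep"])     # 0
```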

Parents of NUM nodes belong to 10 different parts of speech: VERB (275; 41% instances), NOUN (204; 30% instances), NUM (138; 21% instances), ADJ (13; 2% instances), PRON (12; 2% instances), ROOT (10; 1% instances), PROPN (9; 1% instances), ADV (8; 1% instances), ADP (3; 0% instances), SCONJ (1; 0% instances)

309 (46%) NUM nodes are leaves.

250 (37%) NUM nodes have one child.

74 (11%) NUM nodes have two children.

40 (6%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
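
The child degree of a node is the number of tokens whose HEAD points to it. A minimal sketch of how the leaf and child-count figures above could be computed for one sentence follows; `child_degrees` and the sample sentence are hypothetical, and each row is reduced to an (ID, UPOS, HEAD) triple.

```python
from collections import Counter


def child_degrees(rows, upos="NUM"):
    """rows: (id, upos, head) triples for ONE sentence.

    Returns the number of children of each node tagged with `upos`,
    in sentence order.
    """
    children = Counter(head for _, _, head in rows)
    return [children[node_id] for node_id, tag, _ in rows if tag == upos]


# Hypothetical five-token sentence, not taken from the treebank.
rows = [
    (1, "NUM", 2),   # leaf: nothing points to token 1
    (2, "NOUN", 0),  # root of the sentence
    (3, "ADP", 4),
    (4, "NUM", 2),   # two children: tokens 3 and 5
    (5, "NOUN", 4),
]

degrees = child_degrees(rows)
print(degrees)       # [0, 2]
print(max(degrees))  # 2
```

Aggregating these per-sentence lists over the whole treebank with a `Counter` would give the distribution of leaves, one-child nodes and so on.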

Children of NUM nodes are attached using 19 different relations: nmod (199; 37% instances), case (111; 21% instances), nummod (74; 14% instances), cc (34; 6% instances), conj (32; 6% instances), advmod (19; 4% instances), neg (13; 2% instances), remnant (13; 2% instances), acl (9; 2% instances), amod (8; 2% instances), appos (7; 1% instances), discourse (5; 1% instances), aux (2; 0% instances), ccomp (2; 0% instances), det (1; 0% instances), iobj (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), parataxis (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NUM (138; 26% instances), NOUN (137; 26% instances), ADP (112; 21% instances), ADV (43; 8% instances), CONJ (34; 6% instances), PRON (26; 5% instances), VERB (19; 4% instances), ADJ (16; 3% instances), PROPN (4; 1% instances), SCONJ (3; 1% instances), DET (1; 0% instances)

