Treebank Statistics: UD_Old_East_Slavic-RNC: Features: NumForm
This feature is language-specific.
It occurs with 5 different values: Combi, Cyril, Digit, Roman, Word.
5062 tokens (3%) have a non-empty value of NumForm.
1046 types (3%) occur at least once with a non-empty value of NumForm.
594 lemmas (5%) occur at least once with a non-empty value of NumForm.
The feature is used with 2 part-of-speech tags: NUM (3813; 2% instances), ADJ (1249; 1% instances).
NUM
3813 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (3601; 94%), Case=Nom (2380; 62%), Gender=EMPTY (2301; 60%).
NUM tokens may have the following values of NumForm:
Combi(34; 1% of non-emptyNumForm): 3-х, 10-ти, 20-ти, 5-ти, 10-и, 11-ти, 12-ти, 13-ти, 14-ти, 15-тиCyril(19; 0% of non-emptyNumForm): Г, Ѕ, І, В, Е, РПЅ, #АΨВ, ΨѲ, ИІ, КЅDigit(2176; 57% of non-emptyNumForm): 3, 2, 4, 5, 10, 6, 7, 8, 9, 12Roman(10; 0% of non-emptyNumForm): I, II, III, IV, IX, V, VI, VII, VIII, XWord(1574; 41% of non-emptyNumForm): два, три, две, один, дву, четыре, много, трехъ, полтора, сто
| Paradigm 3 | Combi | Cyril | Digit |
|---|---|---|---|
| Animacy=Anim|Case=Acc|Gender=Masc | 3-х | ||
| Case=Acc|Gender=Masc | Г | 3 | |
| Case=Acc|Gender=Fem | Г | 3 | |
| Case=Acc|Gender=Neut | 3 | ||
| Case=Acc | 3 | ||
| Case=Dat|Gender=Masc | 3 | ||
| Case=Gen|Gender=Masc | 3-х | 3 | |
| Case=Gen|Gender=Fem | 3-х | ||
| Case=Gen | 3 | ||
| Case=Ins|Gender=Masc | 3-ма | ||
| Case=Loc|Gender=Masc | 3-х | ||
| Case=Loc|Gender=Fem | 3-х | ||
| Case=Nom|Gender=Masc | 3, 3] | ||
| Case=Nom|Gender=Fem | 3 | ||
| Case=Nom|Gender=Neut | 3 | ||
| Case=Nom | 3 |
NumForm seems to be lexical feature of NUM. 95% lemmas (369) occur only with one value of NumForm.
ADJ
1249 ADJ tokens (9% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Variant=EMPTY (1240; 99%), Number=Sing (1225; 98%), Degree=EMPTY (1184; 95%), Gender=Masc (957; 77%).
ADJ tokens may have the following values of NumForm:
Combi(288; 23% of non-emptyNumForm): 178-г(о), 160-го, 133-го, 177-г(о), 1-го, 153-го, 154-го, 177-м, 3-го, 6-иCyril(7; 1% of non-emptyNumForm): КЗ, ГІ, КВ, КГ, КИ, РЛѲDigit(738; 59% of non-emptyNumForm): 1, 205, 23, 3, 2, 21, 29, 4, 18, 22Roman(3; 0% of non-emptyNumForm): I, II, XVWord(213; 17% of non-emptyNumForm): третеи, трети, перваго, первой, первом, третей, первое, первои, четвертои, вторая
| Paradigm 23 | Cyril | Digit |
|---|---|---|
| Case=Acc | КГ | 23 |
| Case=Loc | 23 |
NumForm seems to be lexical feature of ADJ. 98% lemmas (255) occur only with one value of NumForm.
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (171; 98%),
NUM –[compound]–> NUM (59; 100%),
NUM –[nsubj]–> NUM (25; 93%),
ADJ –[conj]–> ADJ (16; 84%),
NUM –[nummod:gov]–> NUM (12; 92%),
NUM –[nmod]–> NUM (9; 90%),
NUM –[flat]–> NUM (5; 83%),
NUM –[nummod]–> NUM (4; 80%),
NUM –[conj]–> ADJ (2; 100%),
NUM –[orphan]–> NUM (2; 100%).