Treebank Statistics: UD_Russian-Taiga: Features: NumForm
This feature is language-specific.
It occurs with 5 different values: Combi, Cyril, Digit, Roman, Word.
22559 tokens (1%) have a non-empty value of NumForm.
1740 types (1%) occur at least once with a non-empty value of NumForm.
1305 lemmas (2%) occur at least once with a non-empty value of NumForm.
The feature is used with 2 part-of-speech tags: NUM (12848; 1% instances), ADJ (9711; 1% instances).
NUM
12848 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (12068; 94%), Number=EMPTY (10947; 85%), Gender=EMPTY (8704; 68%).
NUM tokens may have the following values of NumForm:
Combi(23; 0% of non-emptyNumForm): 2-х, 12-ти, 3-х, 3х, 4-х, 11-ти, 13-ти, 14-ти, 17-ти, 18-тиCyril(4; 0% of non-emptyNumForm): a҃, в҃, г҃, д҃Digit(3547; 28% of non-emptyNumForm): 2, 1, 3, 5, 4, 10, 6, 20, 7, 30Roman(3; 0% of non-emptyNumForm): I, VWord(9271; 72% of non-emptyNumForm): два, много, несколько, три, один, двух, две, одной, сколько, одного
| Paradigm 2 | Combi | Digit |
|---|---|---|
| Case=Gen|Gender=Masc|NumType=Card | 2-х | |
| Case=Gen|NumType=Card | 2-х, 2х | |
| NumType=Card | 2 | |
| NumType=Sets | 2 |
NumForm seems to be lexical feature of NUM. 98% lemmas (531) occur only with one value of NumForm.
ADJ
9711 ADJ tokens (6% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Degree=EMPTY (9711; 100%), Gender=EMPTY (6709; 69%), Case=EMPTY (5839; 60%), Number=EMPTY (5839; 60%).
ADJ tokens may have the following values of NumForm:
Combi(567; 6% of non-emptyNumForm): 20-х, 60-х, 30-х, 30-е, 50-х, 1-го, 40-х, 2-й, 90-х, 20-еDigit(3099; 32% of non-emptyNumForm): 1905, 1918, 2, 1917, 1812, 1907, 1, 1880, 1920, 3Roman(2738; 28% of non-emptyNumForm): XIX, XVIII, XX, XVII, XVI, XV, I, XIV, II, XIIWord(3307; 34% of non-emptyNumForm): первый, второй, первой, первые, первая, первого, первых, первую, первое, первым
| Paradigm I | Roman | Word |
|---|---|---|
| Case=Nom|Gender=Masc|Number=Sing | I | |
| I | I |
NumForm seems to be lexical feature of ADJ. 99% lemmas (838) occur only with one value of NumForm.
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
ADJ –[nmod]–> ADJ (805; 92%),
NUM –[nmod]–> NUM (405; 100%),
NUM –[conj]–> NUM (302; 100%),
ADJ –[conj]–> ADJ (162; 72%),
NUM –[compound]–> NUM (133; 100%),
ADJ –[compound]–> NUM (36; 62%),
NUM –[list]–> NUM (21; 84%),
NUM –[parataxis]–> NUM (16; 84%),
NUM –[nummod:gov]–> NUM (13; 100%),
ADJ –[nmod]–> NUM (6; 86%).