Treebank Statistics: UD_Estonian-EDT: Features: NumForm
This feature is language-specific.
It occurs with 3 different values: Digit, Roman, Word.
11429 tokens (3%) have a non-empty value of NumForm.
2115 types (3%) occur at least once with a non-empty value of NumForm.
1780 lemmas (4%) occur at least once with a non-empty value of NumForm.
The feature is used with 4 part-of-speech tags: NUM (8912; 2% instances), ADJ (2487; 1% instances), PROPN (24; 0% instances), SYM (6; 0% instances).
NUM
8912 NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (8250; 93%), Case=EMPTY (5516; 62%), Number=EMPTY (5516; 62%).
NUM tokens may have the following values of NumForm:
Digit(5548; 62% of non-emptyNumForm): 1, 2, 10, 3, 4, 5, 15, 20, 6, 12Roman(3; 0% of non-emptyNumForm): I, IX, XIIWord(3361; 38% of non-emptyNumForm): kaks, üks, kolm, kahe, ühe, miljonit, viis, miljoni, neli, kolme
NumForm seems to be lexical feature of NUM. 100% lemmas (1435) occur only with one value of NumForm.
ADJ
2487 ADJ tokens (7% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Tense=EMPTY (2487; 100%), VerbForm=EMPTY (2487; 100%), Voice=EMPTY (2487; 100%), Degree=EMPTY (2477; 100%), Case=EMPTY (1531; 62%), Number=EMPTY (1531; 62%).
ADJ tokens may have the following values of NumForm:
Digit(1457; 59% of non-emptyNumForm): 1., 2000., 2., 1997., 1999., 3., 1996., 1998., 1992., 1995.Roman(114; 5% of non-emptyNumForm): II, I, III, XI, VII, XX, VI, XII, IV, MDCXXXIIWord(916; 37% of non-emptyNumForm): esimene, esimest, esimese, teine, teise, esimesel, esimesed, esimeses, teisel, kolmas
NumForm seems to be lexical feature of ADJ. 100% lemmas (358) occur only with one value of NumForm.
PROPN
24 PROPN tokens (0% of all PROPN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which PROPN and NumForm co-occurred: Number=Sing (21; 88%).
PROPN tokens may have the following values of NumForm:
Digit(1; 4% of non-emptyNumForm): 8Roman(4; 17% of non-emptyNumForm): ADV, CX, M, XMWord(19; 79% of non-emptyNumForm): Teist, Teise, Kolmanda, Esimene, Esimese, Kolme, Neljanda, Neljandal, Teisel
NumForm seems to be lexical feature of PROPN. 100% lemmas (10) occur only with one value of NumForm.
SYM
6 SYM tokens (1% of all SYM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which SYM and NumForm co-occurred: Abbr=EMPTY (6; 100%).
SYM tokens may have the following values of NumForm:
Digit(6; 100% of non-emptyNumForm): %
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (374; 98%),
NUM –[flat]–> NUM (96; 94%),
ADJ –[conj]–> ADJ (58; 83%),
NUM –[nummod]–> NUM (41; 85%),
NUM –[parataxis]–> NUM (7; 100%),
NUM –[orphan]–> NUM (6; 100%),
NUM –[obl]–> NUM (3; 100%),
ADJ –[compound]–> NUM (2; 100%),
ADJ –[flat]–> NUM (1; 100%),
ADJ –[orphan]–> ADJ (1; 100%).