Treebank Statistics: UD_Estonian-EDT: Features: NumForm
This feature is language-specific.
It occurs with 3 different values: Digit, Roman, Word.
11436 tokens (3%) have a non-empty value of NumForm.
2116 types (3%) occur at least once with a non-empty value of NumForm.
1779 lemmas (4%) occur at least once with a non-empty value of NumForm.
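The three counts above (tokens, types, lemmas) can be reproduced approximately from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names and the sample rows are hypothetical, and it assumes that types are compared case-sensitively and that multiword-token ranges and empty nodes are excluded.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'NumForm=Digit|NumType=Card' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def count_feature(conllu_text, feature="NumForm"):
    """Return (token count, type count, lemma count) for words carrying `feature`."""
    tokens = 0
    types = set()
    lemmas = set()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular words: skip ranges like "1-2" and empty nodes like "1.1".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if feature in parse_feats(cols[5]):
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return tokens, len(types), len(lemmas)


# Made-up sample rows for illustration only (not taken from UD_Estonian-EDT).
SAMPLE = "\n".join([
    "# hypothetical sentence",
    "1\tkaks\tkaks\tNUM\t_\tNumForm=Word|NumType=Card\t0\troot\t_\t_",
    "2\tkahe\tkaks\tNUM\t_\tCase=Gen|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
    "3\t10\t10\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
    "4\tmaja\tmaja\tNOUN\t_\tCase=Nom|Number=Sing\t1\tdep\t_\t_",
])

print(count_feature(SAMPLE))
```

Dividing each count by the corresponding total over the whole treebank would give the percentages reported above.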
The feature is used with 4 part-of-speech tags: NUM (8933; 2% instances), ADJ (2472; 1% instances), PROPN (25; 0% instances), SYM (6; 0% instances).
NUM
8933 NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (8290; 93%), Case=EMPTY (5527; 62%), Number=EMPTY (5527; 62%).
NUM tokens may have the following values of NumForm:
Digit (5577; 62% of non-empty NumForm): 1, 2, 10, 3, 4, 5, 15, 20, 6, 12
Roman (3; 0% of non-empty NumForm): III, IX, VII
Word (3353; 38% of non-empty NumForm): kaks, üks, kolm, kahe, ühe, miljonit, viis, miljoni, neli, kolme
NumForm seems to be a lexical feature of NUM. 100% of lemmas (1434) occur with only one value of NumForm.
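The "lexical feature" observation rests on how many lemmas of a given part of speech occur with exactly one value of NumForm. A minimal sketch of that check follows; it is an illustration under assumptions, not the page's own generator. The function names and sample rows are hypothetical.

```python
from collections import defaultdict


def values_per_lemma(conllu_text, upos, feature="NumForm"):
    """Map each lemma tagged `upos` to the set of `feature` values it occurs with."""
    values = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular words with the requested UPOS tag.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        if cols[5] == "_":
            continue
        feats = dict(pair.split("=", 1) for pair in cols[5].split("|"))
        if feature in feats:
            values[cols[2]].add(feats[feature])
    return dict(values)


def share_single_valued(values):
    """Fraction of lemmas that occur with exactly one value of the feature."""
    if not values:
        return 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single / len(values)


# Made-up sample rows for illustration only (not taken from UD_Estonian-EDT).
SAMPLE = "\n".join([
    "1\tkaks\tkaks\tNUM\t_\tNumForm=Word|NumType=Card\t0\troot\t_\t_",
    "2\tkahe\tkaks\tNUM\t_\tCase=Gen|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
    "3\t10\t10\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
])

per_lemma = values_per_lemma(SAMPLE, "NUM")
print(per_lemma, share_single_valued(per_lemma))
```

A share of 1.0 corresponds to the "100% of lemmas" statement: the value is fixed by the lemma rather than varying by inflected form.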
ADJ
2472 ADJ tokens (7% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Tense=EMPTY (2472; 100%), VerbForm=EMPTY (2472; 100%), Voice=EMPTY (2472; 100%), Degree=EMPTY (2463; 100%), Case=EMPTY (1451; 59%), Number=EMPTY (1451; 59%).
ADJ tokens may have the following values of NumForm:
Digit (1457; 59% of non-empty NumForm): 1., 2000., 2., 1997., 1999., 3., 1996., 1998., 1992., 1995.
Roman (109; 4% of non-empty NumForm): II, I, III, XI, VII, XX, VI, XII, IV, MDCXXXII
Word (906; 37% of non-empty NumForm): esimene, esimest, esimese, teine, teise, esimesel, esimesed, esimeses, teisel, kolmas
NumForm seems to be a lexical feature of ADJ. 100% of lemmas (359) occur with only one value of NumForm.
PROPN
25 PROPN tokens (0% of all PROPN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which PROPN and NumForm co-occurred: Number=Sing (22; 88%).
PROPN tokens may have the following values of NumForm:
Digit (1; 4% of non-empty NumForm): 8
Roman (5; 20% of non-empty NumForm): M, ADV, CX, XM
Word (19; 76% of non-empty NumForm): Teist, Teise, Kolmanda, Esimene, Esimese, Kolme, Neljanda, Neljandal, Teisel
NumForm seems to be a lexical feature of PROPN. 100% of lemmas (10) occur with only one value of NumForm.
SYM
6 SYM tokens (1% of all SYM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which SYM and NumForm co-occurred: Abbr=EMPTY (6; 100%).
SYM tokens may have the following values of NumForm:
Digit (6; 100% of non-empty NumForm): %
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (374; 98%),
NUM –[flat]–> NUM (96; 94%),
ADJ –[conj]–> ADJ (58; 83%),
NUM –[nummod]–> NUM (40; 85%),
NUM –[orphan]–> NUM (6; 100%),
NUM –[parataxis]–> NUM (6; 100%),
NUM –[obl]–> NUM (4; 100%),
ADJ –[compound]–> NUM (2; 100%),
ADJ –[flat]–> ADJ (1; 100%),
ADJ –[flat]–> NUM (1; 100%).
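The agreement figures in the list above can be approximated by walking each dependency edge and comparing the NumForm value of parent and child. The sketch below is a hedged illustration: the function names and sample rows are hypothetical, and it assumes the percentage is computed over edges where both nodes carry a non-empty NumForm, which this page does not state explicitly.

```python
from collections import Counter


def read_sentences(conllu_text):
    """Yield each sentence as a dict: word ID -> (upos, feats dict, head ID, deprel)."""
    sentence = {}
    for line in conllu_text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = {}
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular words: skip ranges like "1-2" and empty nodes like "1.1".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        feats = {}
        if cols[5] != "_":
            feats = dict(pair.split("=", 1) for pair in cols[5].split("|"))
        sentence[cols[0]] = (cols[3], feats, cols[6], cols[7])
    if sentence:
        yield sentence


def agreement(conllu_text, feature="NumForm"):
    """Map (parent UPOS, deprel, child UPOS) -> (agreeing edges, edges with both values)."""
    agree = Counter()
    both = Counter()
    for sentence in read_sentences(conllu_text):
        for upos, feats, head, deprel in sentence.values():
            parent = sentence.get(head)  # None for the root, whose head is "0"
            if parent is None or feature not in feats or feature not in parent[1]:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if feats[feature] == parent[1][feature]:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Made-up sample rows for illustration only (not taken from UD_Estonian-EDT).
SAMPLE = "\n".join([
    "1\tkaks\tkaks\tNUM\t_\tNumForm=Word|NumType=Card\t0\troot\t_\t_",
    "2\tkahe\tkaks\tNUM\t_\tCase=Gen|NumForm=Word|NumType=Card\t1\tconj\t_\t_",
    "3\t10\t10\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tconj\t_\t_",
    "4\tmaja\tmaja\tNOUN\t_\tCase=Nom|Number=Sing\t1\tdep\t_\t_",
])

print(agreement(SAMPLE))
```

In the sample, one of the two NUM –[conj]–> NUM edges agrees (Word with Word) and the other does not (Word with Digit), so under the stated assumption the relation would be reported as 1 agreeing edge out of 2.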