Treebank Statistics: UD_English-GUM: Features: NumForm
This feature is language-specific.
It occurs with 4 different values: Combi
, Digit
, Roman
, Word
.
5064 tokens (2%) have a non-empty value of NumForm
.
832 types (5%) occur at least once with a non-empty value of NumForm
.
792 lemmas (6%) occur at least once with a non-empty value of NumForm
.
The feature is used with 6 part-of-speech tags: NUM (4272; 2% instances), ADJ (498; 0% instances), ADV (163; 0% instances), NOUN (117; 0% instances), DET (12; 0% instances), PROPN (2; 0% instances).
NUM
4272 NUM tokens (100% of all NUM
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NUM
and NumForm
co-occurred: NumType=Card (4147; 97%).
NUM
tokens may have the following values of NumForm
:
Digit
(2767; 65% of non-emptyNumForm
): 1, 2, 3, 4, 10, 6, 20, 5, 15, 7Roman
(28; 1% of non-emptyNumForm
): II, I, IV, III, VI, XIV, XV, XVIIWord
(1477; 35% of non-emptyNumForm
): one, two, three, four, five, six, ten, million, twenty, hundred
NumForm
seems to be lexical feature of NUM
. 100% lemmas (706) occur only with one value of NumForm
.
ADJ
498 ADJ tokens (3% of all ADJ
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which ADJ
and NumForm
co-occurred: Degree=Pos (498; 100%).
ADJ
tokens may have the following values of NumForm
:
Combi
(110; 22% of non-emptyNumForm
): 19th, 20th, 30th, 3rd, 10th, 17th, 21st, 25th, 29th, 2ndWord
(388; 78% of non-emptyNumForm
): first, second, third, fourth, fifth, ninth, seventh, sixth, tenth
NumForm
seems to be lexical feature of ADJ
. 100% lemmas (64) occur only with one value of NumForm
.
ADV
163 ADV tokens (1% of all ADV
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which ADV
and NumForm
co-occurred: PronType=EMPTY (163; 100%), Degree=Pos (112; 69%).
ADV
tokens may have the following values of NumForm
:
Combi
(3; 2% of non-emptyNumForm
): 135th, 15thWord
(160; 98% of non-emptyNumForm
): first, once, second, twice, half, third, Fifth, Fourth, sixth
NumForm
seems to be lexical feature of ADV
. 100% lemmas (11) occur only with one value of NumForm
.
NOUN
117 NOUN tokens (0% of all NOUN
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NOUN
and NumForm
co-occurred: Number=Sing (70; 60%).
NOUN
tokens may have the following values of NumForm
:
Combi
(37; 32% of non-emptyNumForm
): 1960s, 1970s, 1830s, 1950s, 1980s, 1990s, 1920s, 1930s, 1940s, 2000sWord
(80; 68% of non-emptyNumForm
): half, quarter, third, thirds, quarters, fifths, halves, hundredths, millionth, tenth
NumForm
seems to be lexical feature of NOUN
. 100% lemmas (22) occur only with one value of NumForm
.
DET
12 DET tokens (0% of all DET
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which DET
and NumForm
co-occurred: Definite=EMPTY (12; 100%), PronType=Ind (12; 100%).
DET
tokens may have the following values of NumForm
:
Word
(12; 100% of non-emptyNumForm
): half
PROPN
2 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which PROPN
and NumForm
co-occurred: Number=Sing (2; 100%).
PROPN
tokens may have the following values of NumForm
:
Word
(2; 100% of non-emptyNumForm
): EIGHT, One
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm
:
NUM –[conj]–> NUM (211; 100%),
NUM –[nmod:unmarked]–> NUM (139; 100%),
NUM –[nmod]–> NUM (135; 99%),
NUM –[compound]–> NUM (78; 70%),
NUM –[flat]–> NUM (13; 100%),
NUM –[nummod]–> NUM (13; 68%),
NUM –[parataxis]–> NUM (9; 100%),
ADJ –[conj]–> ADJ (6; 67%),
NUM –[dep]–> NUM (4; 80%),
NUM –[det:predet]–> DET (2; 100%).