Treebank Statistics: UD_English-GUM: Features: NumForm
This feature is language-specific.
It occurs with 4 different values: Combi, Digit, Roman, Word.
4716 tokens (2%) have a non-empty value of NumForm.
811 types (5%) occur at least once with a non-empty value of NumForm.
772 lemmas (6%) occur at least once with a non-empty value of NumForm.
The feature is used with 6 part-of-speech tags: NUM (3993; 2% instances), ADJ (450; 0% instances), ADV (149; 0% instances), NOUN (111; 0% instances), DET (11; 0% instances), PROPN (2; 0% instances).
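Per-tag counts like the ones above can be reproduced from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout, and the helper names `parse_feats` and `numform_by_upos` as well as the toy sentence are invented for illustration.

```python
from collections import Counter


def parse_feats(feats):
    """Parse a CoNLL-U FEATS string such as 'NumForm=Digit|NumType=Card'."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def numform_by_upos(conllu_text):
    """Count (UPOS, NumForm) pairs over the word lines of a CoNLL-U string."""
    counts = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2), empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        value = parse_feats(cols[5]).get("NumForm")
        if value is not None:
            counts[(cols[3], value)] += 1
    return counts


# Toy sentence invented for illustration; it is not taken from GUM.
sample = "\n".join([
    "# text = Two of the 3 came first",
    "1\tTwo\ttwo\tNUM\tCD\tNumForm=Word|NumType=Card\t5\tnsubj\t_\t_",
    "2\tof\tof\tADP\tIN\t_\t4\tcase\t_\t_",
    "3\tthe\tthe\tDET\tDT\tDefinite=Def|PronType=Art\t4\tdet\t_\t_",
    "4\t3\t3\tNUM\tCD\tNumForm=Digit|NumType=Card\t1\tnmod\t_\t_",
    "5\tcame\tcome\tVERB\tVBD\tMood=Ind|Tense=Past|VerbForm=Fin\t0\troot\t_\t_",
    "6\tfirst\tfirst\tADV\tRB\tNumForm=Word|NumType=Ord\t5\tadvmod\t_\t_",
])

counts = numform_by_upos(sample)
for (upos, value), n in sorted(counts.items()):
    print(upos, value, n)
```

Summing the counter over all values for one tag gives the per-tag token count reported in the sentence above.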
NUM
3993 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (3871; 97%).
NUM tokens may have the following values of NumForm:
Digit (2635; 66% of non-empty NumForm): 1, 2, 3, 4, 10, 6, 5, 20, 15, 7
Roman (27; 1% of non-empty NumForm): II, I, IV, III, VI, XIV, XV, XVII
Word (1331; 33% of non-empty NumForm): one, two, three, four, five, six, ten, million, twenty, seven
NumForm seems to be a lexical feature of NUM. 100% of lemmas (686) occur with only one value of NumForm.
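The "lexical feature" observation rests on checking, for each lemma of a given tag, how many distinct NumForm values it takes. A rough sketch of that check follows; it assumes the tokens have already been reduced to hypothetical `(upos, lemma, numform)` triples, and the function name `lemmas_with_single_value` and the sample records are illustrative only.

```python
from collections import defaultdict


def lemmas_with_single_value(records, upos):
    """Return (lemmas with exactly one NumForm value, lemmas with any value)."""
    values = defaultdict(set)
    for tag, lemma, numform in records:
        if tag == upos and numform:
            values[lemma].add(numform)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Invented records for illustration; they are not GUM counts.
records = [
    ("NUM", "one", "Word"),
    ("NUM", "1", "Digit"),
    ("NUM", "one", "Word"),
    ("NUM", "II", "Roman"),
    ("ADJ", "first", "Word"),
    ("VERB", "count", None),
]

single, total = lemmas_with_single_value(records, "NUM")
print(f"{single} of {total} NUM lemmas occur with only one NumForm value")
```

When the two numbers are equal, every lemma is tied to a single value, which is the situation the page describes as lexical.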
ADJ
450 ADJ tokens (3% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Degree=Pos (450; 100%).
ADJ tokens may have the following values of NumForm:
Combi (101; 22% of non-empty NumForm): 19th, 20th, 30th, 3rd, 10th, 21st, 17th, 2nd, 33rd, 50th
Word (349; 78% of non-empty NumForm): first, second, third, fourth, fifth, seventh, ninth, sixth, tenth
NumForm seems to be a lexical feature of ADJ. 100% of lemmas (64) occur with only one value of NumForm.
ADV
149 ADV tokens (1% of all ADV tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADV and NumForm co-occurred: PronType=EMPTY (149; 100%), Degree=Pos (101; 68%).
ADV tokens may have the following values of NumForm:
Combi (3; 2% of non-empty NumForm): 135th, 15th
Word (146; 98% of non-empty NumForm): first, once, second, twice, half, third, Fifth, Fourth, sixth
NumForm seems to be a lexical feature of ADV. 100% of lemmas (11) occur with only one value of NumForm.
NOUN
111 NOUN tokens (0% of all NOUN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NOUN and NumForm co-occurred: Number=Sing (65; 59%).
NOUN tokens may have the following values of NumForm:
Combi (36; 32% of non-empty NumForm): 1960s, 1970s, 1950s, 1980s, 1990s, 1830s, 1920s, 1930s, 1940s, 2000s
Word (75; 68% of non-empty NumForm): half, quarter, third, thirds, quarters, fifths, halves, hundredths, millionth, tenth
NumForm seems to be a lexical feature of NOUN. 100% of lemmas (22) occur with only one value of NumForm.
DET
11 DET tokens (0% of all DET tokens) have a non-empty value of NumForm.
The most frequent other feature values with which DET and NumForm co-occurred: Definite=EMPTY (11; 100%), PronType=Ind (11; 100%).
DET tokens may have the following values of NumForm:
Word (11; 100% of non-empty NumForm): half
PROPN
2 PROPN tokens (0% of all PROPN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which PROPN and NumForm co-occurred: Number=Sing (2; 100%).
PROPN tokens may have the following values of NumForm:
Word (2; 100% of non-empty NumForm): EIGHT, One
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (199; 100%),
NUM –[nmod:unmarked]–> NUM (131; 100%),
NUM –[nmod]–> NUM (128; 100%),
NUM –[compound]–> NUM (71; 70%),
NUM –[flat]–> NUM (13; 100%),
NUM –[nummod]–> NUM (13; 72%),
NUM –[dep]–> NUM (7; 88%),
NUM –[parataxis]–> NUM (7; 100%),
NUM –[det:predet]–> DET (2; 100%),
NUM –[reparandum]–> NUM (2; 100%).
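
Agreement figures of this kind can be approximated by walking each dependency and comparing the NumForm of parent and child. The sketch below is one possible reading, not the page's own method: it assumes the percentage is taken over pairs in which both nodes carry a NumForm value, which the page does not state. The token dictionaries, the function name `agreement_counts`, and the sample sentence are hypothetical.

```python
from collections import Counter


def agreement_counts(tokens):
    """Count parent-child pairs per (parent UPOS, deprel, child UPOS).

    Returns (agree, total); only pairs where both nodes have a NumForm
    value are counted (an assumption about the denominator).
    """
    by_id = {tok["id"]: tok for tok in tokens}
    agree, total = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])
        if parent is None:  # head 0 is the artificial root
            continue
        if child["numform"] is None or parent["numform"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if child["numform"] == parent["numform"]:
            agree[key] += 1
    return agree, total


# Invented sentence "1 , 2 or three"; not taken from GUM.
tokens = [
    {"id": 1, "head": 0, "upos": "NUM", "deprel": "root", "numform": "Digit"},
    {"id": 2, "head": 3, "upos": "PUNCT", "deprel": "punct", "numform": None},
    {"id": 3, "head": 1, "upos": "NUM", "deprel": "conj", "numform": "Digit"},
    {"id": 4, "head": 5, "upos": "CCONJ", "deprel": "cc", "numform": None},
    {"id": 5, "head": 1, "upos": "NUM", "deprel": "conj", "numform": "Word"},
]

agree, total = agreement_counts(tokens)
for (parent, rel, child), n in total.items():
    share = 100 * agree[(parent, rel, child)] / n
    print(f"{parent} -[{rel}]-> {child} ({agree[(parent, rel, child)]}; {share:.0f}%)")
```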