Treebank Statistics: UD_English-GUM: Features: NumForm
This feature is language-specific.
It occurs with 4 different values: Combi
, Digit
, Roman
, Word
.
4307 tokens (2%) have a non-empty value of NumForm
.
758 types (5%) occur at least once with a non-empty value of NumForm
.
725 lemmas (6%) occur at least once with a non-empty value of NumForm
.
The feature is used with 6 part-of-speech tags: NUM (3685; 2% instances), ADJ (419; 0% instances), ADV (121; 0% instances), NOUN (70; 0% instances), DET (10; 0% instances), PROPN (2; 0% instances).
NUM
3685 NUM tokens (100% of all NUM
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NUM
and NumForm
co-occurred: NumType=Card (3573; 97%).
NUM
tokens may have the following values of NumForm
:
Digit
(2491; 68% of non-emptyNumForm
): 1, 2, 3, 10, 4, 6, 5, 15, 7, 20Roman
(19; 1% of non-emptyNumForm
): II, I, III, VI, XV, XVIIWord
(1175; 32% of non-emptyNumForm
): one, two, three, four, five, six, million, ten, eight, seven
NumForm
seems to be lexical feature of NUM
. 100% lemmas (654) occur only with one value of NumForm
.
ADJ
419 ADJ tokens (3% of all ADJ
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which ADJ
and NumForm
co-occurred: Degree=Pos (419; 100%).
ADJ
tokens may have the following values of NumForm
:
Combi
(94; 22% of non-emptyNumForm
): 19th, 20th, 10th, 30th, 17th, 21st, 2nd, 33rd, 3rd, 50thWord
(325; 78% of non-emptyNumForm
): first, second, third, fourth, fifth, seventh, ninth, sixth, tenth
NumForm
seems to be lexical feature of ADJ
. 100% lemmas (64) occur only with one value of NumForm
.
ADV
121 ADV tokens (1% of all ADV
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which ADV
and NumForm
co-occurred: PronType=EMPTY (121; 100%), Degree=Pos (79; 65%).
ADV
tokens may have the following values of NumForm
:
Combi
(3; 2% of non-emptyNumForm
): 135th, 15thWord
(118; 98% of non-emptyNumForm
): first, once, second, twice, half, third, sixth
NOUN
70 NOUN tokens (0% of all NOUN
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which NOUN
and NumForm
co-occurred: Number=Sing (60; 86%).
NOUN
tokens may have the following values of NumForm
:
Word
(70; 100% of non-emptyNumForm
): half, quarter, third, thirds, quarters, fifths, halves, hundredths, millionth, tenth
DET
10 DET tokens (0% of all DET
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which DET
and NumForm
co-occurred: Definite=EMPTY (10; 100%), PronType=Ind (10; 100%).
DET
tokens may have the following values of NumForm
:
Word
(10; 100% of non-emptyNumForm
): half
PROPN
2 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of NumForm
.
The most frequent other feature values with which PROPN
and NumForm
co-occurred: Number=Sing (2; 100%).
PROPN
tokens may have the following values of NumForm
:
Word
(2; 100% of non-emptyNumForm
): EIGHT, One
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm
:
NUM –[conj]–> NUM (181; 100%),
NUM –[nmod:tmod]–> NUM (126; 100%),
NUM –[nmod]–> NUM (123; 100%),
NUM –[compound]–> NUM (65; 69%),
NUM –[flat]–> NUM (11; 100%),
NUM –[nummod]–> NUM (9; 64%),
NUM –[dep]–> NUM (7; 88%),
NUM –[parataxis]–> NUM (7; 100%),
NUM –[reparandum]–> NUM (2; 100%),
ADJ –[appos]–> ADJ (1; 100%).