Treebank Statistics: UD_English-GUM: Features: NumForm
This feature is language-specific.
It occurs with 4 different values: Combi, Digit, Roman, Word.
5502 tokens (2%) have a non-empty value of NumForm.
865 types (5%) occur at least once with a non-empty value of NumForm.
824 lemmas (5%) occur at least once with a non-empty value of NumForm.
The feature is used with 6 part-of-speech tags: NUM (4624; 2% instances), ADJ (535; 0% instances), ADV (200; 0% instances), NOUN (128; 0% instances), DET (13; 0% instances), PROPN (2; 0% instances).
NUM
4624 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (4478; 97%).
NUM tokens may have the following values of NumForm:
Digit(2993; 65% of non-emptyNumForm): 1, 2, 3, 4, 10, 20, 6, 5, 15, 7Roman(31; 1% of non-emptyNumForm): II, I, IV, III, VI, XIV, XV, XVIIWord(1600; 35% of non-emptyNumForm): one, two, three, five, four, six, million, ten, twenty, hundred
NumForm seems to be lexical feature of NUM. 100% lemmas (739) occur only with one value of NumForm.
ADJ
535 ADJ tokens (3% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Degree=Pos (535; 100%).
ADJ tokens may have the following values of NumForm:
Combi(113; 21% of non-emptyNumForm): 19th, 20th, 30th, 3rd, 10th, 17th, 21st, 13th, 15th, 25thWord(422; 79% of non-emptyNumForm): first, second, third, fourth, fifth, ninth, seventh, sixth, tenth
NumForm seems to be lexical feature of ADJ. 100% lemmas (64) occur only with one value of NumForm.
ADV
200 ADV tokens (2% of all ADV tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADV and NumForm co-occurred: PronType=EMPTY (200; 100%), Degree=Pos (131; 66%).
ADV tokens may have the following values of NumForm:
Combi(3; 2% of non-emptyNumForm): 135th, 15thWord(197; 99% of non-emptyNumForm): first, once, twice, second, third, half, Fifth, Fourth, sixth
NumForm seems to be lexical feature of ADV. 100% lemmas (11) occur only with one value of NumForm.
NOUN
128 NOUN tokens (0% of all NOUN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NOUN and NumForm co-occurred: Number=Sing (77; 60%).
NOUN tokens may have the following values of NumForm:
Combi(41; 32% of non-emptyNumForm): 1960s, 1970s, 1980s, 1990s, 1830s, 1950s, 1920s, 1930s, 1940s, 2000sWord(87; 68% of non-emptyNumForm): half, quarter, third, thirds, quarters, fifth, fifths, halves, hundredths, millionth
NumForm seems to be lexical feature of NOUN. 100% lemmas (22) occur only with one value of NumForm.
DET
13 DET tokens (0% of all DET tokens) have a non-empty value of NumForm.
The most frequent other feature values with which DET and NumForm co-occurred: Definite=EMPTY (13; 100%), PronType=Ind (13; 100%).
DET tokens may have the following values of NumForm:
Word(13; 100% of non-emptyNumForm): half
PROPN
2 PROPN tokens (0% of all PROPN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which PROPN and NumForm co-occurred: Number=Sing (2; 100%).
PROPN tokens may have the following values of NumForm:
Word(2; 100% of non-emptyNumForm): EIGHT, One
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[conj]–> NUM (252; 99%),
NUM –[nmod:unmarked]–> NUM (147; 100%),
NUM –[nmod]–> NUM (139; 99%),
NUM –[compound]–> NUM (90; 62%),
NUM –[flat]–> NUM (17; 100%),
NUM –[parataxis]–> NUM (13; 93%),
ADJ –[conj]–> ADJ (8; 67%),
NUM –[det:predet]–> DET (3; 100%),
NUM –[reparandum]–> NUM (2; 100%),
ADJ –[appos]–> ADJ (1; 100%).