Treebank Statistics: UD_English-EWT: Features: NumForm
This feature is language-specific.
It occurs with 4 different values: Combi, Digit, Roman, Word.
5608 tokens (2%) have a non-empty value of NumForm.
1347 types (7%) occur at least once with a non-empty value of NumForm.
1299 lemmas (8%) occur at least once with a non-empty value of NumForm.
The feature is used with 5 part-of-speech tags: NUM (5036; 2% of instances), ADJ (257; 0% of instances), ADV (155; 0% of instances), NOUN (151; 0% of instances), DET (9; 0% of instances).
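Counts like the ones above can be reproduced from the treebank's CoNLL-U files, where features sit in the sixth column (FEATS) as `Name=Value` pairs joined by `|`. The following is a minimal sketch, not the script that generated this page; the function name `numform_counts` and the three-token sample are illustrative assumptions.

```python
from collections import Counter

def numform_counts(conllu_text):
    """Count (UPOS, NumForm value) pairs in CoNLL-U text (hypothetical helper)."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank lines and sentence-level comments.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        upos, feats = cols[3], cols[5]
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == "NumForm":
                counts[(upos, value)] += 1
    return counts

# Made-up sample, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = two of 10",
    "1\ttwo\ttwo\tNUM\tCD\tNumForm=Word|NumType=Card\t0\troot\t_\t_",
    "2\tof\tof\tADP\tIN\t_\t3\tcase\t_\t_",
    "3\t10\t10\tNUM\tCD\tNumForm=Digit|NumType=Card\t1\tnmod\t_\t_",
])

print(numform_counts(SAMPLE))
```

Summing the counter over all values for one tag gives the per-tag totals; dividing by the total token count gives the percentages.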
NUM
5036 NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (4885; 97%).
NUM tokens may have the following values of NumForm:
Digit (3927; 78% of non-empty NumForm): 2, 1, 3, 5, 4, 10, 20, 6, 2005, 2003
Roman (52; 1% of non-empty NumForm): ii, VI, iii, i, v, XIII, iv, VII, VIII
Word (1057; 21% of non-empty NumForm): one, two, three, four, m, million, five, six, k, billion
NumForm seems to be a lexical feature of NUM. 100% of lemmas (1231) occur with only one value of NumForm.
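The "lexical feature" observation rests on how many lemmas are ever seen with more than one NumForm value. A minimal sketch of that check, assuming the (lemma, value) pairs have already been extracted for one part-of-speech tag; `single_value_share` is a hypothetical name and the pairs below are made up for illustration.

```python
from collections import defaultdict

def single_value_share(pairs):
    """Return (lemmas with exactly one NumForm value, all lemmas seen)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in pairs:
        values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, total

# Illustrative input: every lemma keeps a single value.
pairs = [("one", "Word"), ("one", "Word"), ("2", "Digit"), ("ii", "Roman")]
single, total = single_value_share(pairs)
print(f"{single} of {total} lemmas occur with only one value")
```

If `single` equals `total`, the value is fully predictable from the lemma, which is what the 100% figure reports.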
ADJ
257 ADJ tokens (2% of all ADJ tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADJ and NumForm co-occurred: Degree=Pos (257; 100%).
ADJ tokens may have the following values of NumForm:
Combi (41; 16% of non-empty NumForm): 17th, 5th, 19th, 21st, 2nd, 7th, 10th, 14th, 1st, 20th
Word (216; 84% of non-empty NumForm): first, second, third, fourth, half, sixth, fifth
NumForm seems to be a lexical feature of ADJ. 100% of lemmas (31) occur with only one value of NumForm.
ADV
155 ADV tokens (1% of all ADV tokens) have a non-empty value of NumForm.
The most frequent other feature values with which ADV and NumForm co-occurred: PronType=EMPTY (155; 100%).
ADV tokens may have the following values of NumForm:
Word (155; 100% of non-empty NumForm): first, once, twice, second, Third, fifth, half
NOUN
151 NOUN tokens (0% of all NOUN tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NOUN and NumForm co-occurred: Number=Sing (103; 68%).
NOUN tokens may have the following values of NumForm:
Combi (117; 77% of non-empty NumForm): 1970s, 23rd, 26th, 30th, 80’s, 15th, 1980s, 20th, 22nd, 13th
Digit (1; 1% of non-empty NumForm): 22s
Word (33; 22% of non-empty NumForm): half, first, third, Sixties, eighties, fifteenth, fifth, fourth, mid-nineties, sixth
NumForm seems to be a lexical feature of NOUN. 100% of lemmas (58) occur with only one value of NumForm.
DET
9 DET tokens (0% of all DET tokens) have a non-empty value of NumForm.
The most frequent other feature values with which DET and NumForm co-occurred: Definite=EMPTY (9; 100%), PronType=Ind (9; 100%).
DET tokens may have the following values of NumForm:
Word (9; 100% of non-empty NumForm): half
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[list]–> NUM (210; 100%),
NUM –[nmod]–> NUM (149; 99%),
NUM –[nmod:unmarked]–> NUM (96; 100%),
NUM –[flat]–> NUM (82; 98%),
NUM –[conj]–> NUM (71; 97%),
NUM –[appos]–> NUM (10; 83%),
NOUN –[conj]–> NOUN (6; 60%),
NUM –[acl:relcl]–> NUM (4; 100%),
NUM –[obl]–> NUM (2; 100%),
ADV –[advcl]–> ADV (1; 100%).
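
The agreement figures can be approximated by walking each dependency edge and comparing the NumForm values of parent and child. The sketch below assumes that the percentage is taken over edges where both nodes carry a non-empty NumForm; that denominator is an assumption, not something this page states. The token dictionaries and the name `agreement_by_relation` are hypothetical.

```python
from collections import Counter

def agreement_by_relation(sentence):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, edges with both values).

    `sentence` is a list of dicts with keys id, head, upos, deprel, numform
    (numform is None when the token has no NumForm feature).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None:
            continue
        # Assumption: only edges where both ends have NumForm are counted.
        if child["numform"] is None or parent["numform"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent["numform"] == child["numform"]:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}

# Made-up three-token sentence for illustration.
sentence = [
    {"id": 1, "head": 0, "upos": "NUM", "deprel": "root", "numform": "Digit"},
    {"id": 2, "head": 1, "upos": "NUM", "deprel": "list", "numform": "Digit"},
    {"id": 3, "head": 1, "upos": "NUM", "deprel": "conj", "numform": "Word"},
]
print(agreement_by_relation(sentence))
```

Accumulating these pairs over all sentences and dividing the first number by the second would give a percentage per relation.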