Treebank Statistics: UD_Czech-PDTC: Features: NumForm
This feature is language-specific.
It occurs with 3 different values: Digit, Roman, Word.
93184 tokens (3%) have a non-empty value of NumForm.
7225 types (4%) occur at least once with a non-empty value of NumForm.
7071 lemmas (8%) occur at least once with a non-empty value of NumForm.
The feature is used with 1 part-of-speech tags: NUM (93184; 3% instances).
NUM
93184 NUM tokens (89% of all NUM tokens) have a non-empty value of NumForm.
The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (93184; 100%), Gender=EMPTY (82739; 89%), Case=EMPTY (69042; 74%), Number=EMPTY (69042; 74%).
NUM tokens may have the following values of NumForm:
Digit(67981; 73% of non-emptyNumForm): 1, 2, 3, 4, 8, 10, 30, 5, 15, 20Roman(435; 0% of non-emptyNumForm): II, I, III, IV, V, VI, VII, IX, XX, VIIIWord(24768; 27% of non-emptyNumForm): dva, tři, jeden, dvě, dvou, čtyři, pět, jedna, jednoho, jedné
NumForm seems to be lexical feature of NUM. 100% lemmas (7071) occur only with one value of NumForm.
Relations with Agreement in NumForm
The 10 most frequent relations where parent and child node agree in NumForm:
NUM –[compound]–> NUM (5837; 67%),
NUM –[conj]–> NUM (5698; 99%),
NUM –[orphan]–> NUM (34; 89%),
NUM –[dep]–> NUM (30; 83%),
NUM –[parataxis]–> NUM (10; 100%),
NUM –[nsubj]–> NUM (5; 71%),
NUM –[appos]–> NUM (2; 100%),
NUM –[nmod]–> NUM (1; 100%).