Statistics of NumForm in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Czech-PDTC: Features: `NumForm`

This feature is language-specific. It occurs with 3 different values: Digit, Roman, Word.

93184 tokens (3%) have a non-empty value of NumForm. 7225 types (4%) occur at least once with a non-empty value of NumForm. 7071 lemmas (8%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: NUM (93184; 3% instances).

`NUM`

93184 NUM tokens (89% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (93184; 100%), Gender=EMPTY (82739; 89%), Case=EMPTY (69042; 74%), Number=EMPTY (69042; 74%).

NUM tokens may have the following values of NumForm:

Digit (67981; 73% of non-empty NumForm): 1, 2, 3, 4, 8, 10, 30, 5, 15, 20
Roman (435; 0% of non-empty NumForm): II, I, III, IV, V, VI, VII, IX, XX, VIII
Word (24768; 27% of non-empty NumForm): dva, tři, jeden, dvě, dvou, čtyři, pět, jedna, jednoho, jedné

NumForm seems to be lexical feature of NUM. 100% lemmas (7071) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[compound]–> NUM (5837; 67%), NUM –[conj]–> NUM (5698; 99%), NUM –[orphan]–> NUM (34; 89%), NUM –[dep]–> NUM (30; 83%), NUM –[parataxis]–> NUM (10; 100%), NUM –[nsubj]–> NUM (5; 71%), NUM –[appos]–> NUM (2; 100%), NUM –[nmod]–> NUM (1; 100%).

Treebank Statistics: UD_Czech-PDTC: Features: NumForm

NUM

Relations with Agreement in NumForm

Treebank Statistics: UD_Czech-PDTC: Features: `NumForm`

`NUM`

Relations with Agreement in `NumForm`