home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDTC: Features: NumForm

This feature is language-specific. It occurs with 3 different values: Digit, Roman, Word.

93184 tokens (3%) have a non-empty value of NumForm. 7225 types (4%) occur at least once with a non-empty value of NumForm. 7071 lemmas (8%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: NUM (93184; 3% instances).

NUM

93184 NUM tokens (89% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (93184; 100%), Gender=EMPTY (82739; 89%), Case=EMPTY (69042; 74%), Number=EMPTY (69042; 74%).

NUM tokens may have the following values of NumForm:

NumForm seems to be lexical feature of NUM. 100% lemmas (7071) occur only with one value of NumForm.

Relations with Agreement in NumForm

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[compound]–> NUM (5837; 67%), NUM –[conj]–> NUM (5698; 99%), NUM –[orphan]–> NUM (34; 89%), NUM –[dep]–> NUM (30; 83%), NUM –[parataxis]–> NUM (10; 100%), NUM –[nsubj]–> NUM (5; 71%), NUM –[appos]–> NUM (2; 100%), NUM –[nmod]–> NUM (1; 100%).