
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: Features: NumForm

This feature is language-specific. It occurs with 3 different values: Digit, Roman, Word.

41165 tokens (3%) have a non-empty value of NumForm. 3589 types (3%) occur at least once with a non-empty value of NumForm. 3428 lemmas (6%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tag: NUM (41165; 3% of instances).
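Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `read_tokens`, `feature_stats`) and the tiny sample sentence are invented for illustration, and it assumes that a "type" is a distinct word form compared case-sensitively.

```python
from collections import Counter

# Tiny invented CoNLL-U sample (not taken from UD_Czech-PDT).
SAMPLE = (
    "# text = V roce 1990 a 1991\n"
    "1\tV\tv\tADP\t_\tAdpType=Prep|Case=Loc\t2\tcase\t_\t_\n"
    "2\troce\trok\tNOUN\t_\tCase=Loc|Number=Sing\t0\troot\t_\t_\n"
    "3\t1990\t1990\tNUM\t_\tNumForm=Digit|NumType=Card\t2\tnummod\t_\t_\n"
    "4\ta\ta\tCCONJ\t_\t_\t5\tcc\t_\t_\n"
    "5\t1991\t1991\tNUM\t_\tNumForm=Digit|NumType=Card\t3\tconj\t_\t_\n"
)


def parse_feats(feats):
    """Turn a FEATS string such as 'A=1|B=2' into a dict ('_' means empty)."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(text):
    """Yield (form, lemma, upos, feats) for each regular word line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_stats(text, feature="NumForm"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    total = 0
    tokens = 0
    types, lemmas = set(), set()
    by_upos = Counter()
    for form, lemma, upos, feats in read_tokens(text):
        total += 1
        if feature in feats:
            tokens += 1
            types.add(form)
            lemmas.add(lemma)
            by_upos[upos] += 1
    return {
        "total": total,
        "tokens": tokens,
        "types": len(types),
        "lemmas": len(lemmas),
        "by_upos": dict(by_upos),
    }


stats = feature_stats(SAMPLE)
print(stats)
```

Dividing `tokens` by `total` gives the token percentage; the type and lemma percentages would need the corresponding totals over the whole treebank.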

NUM

41165 NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (41165; 100%), Gender=EMPTY (36748; 89%), NumValue=EMPTY (33115; 80%), Case=EMPTY (29884; 73%), Number=EMPTY (29858; 73%).
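A co-occurrence list like the one above can be sketched as follows. This is an illustrative assumption about how such counts might be computed, not the page's actual procedure: the helper name `cooccurrence` is hypothetical, and a feature that is absent on a token is counted as `EMPTY` for every other feature name seen anywhere in the input.

```python
from collections import Counter


def cooccurrence(feature_dicts, target="NumForm"):
    """Count other feature values on tokens that have a value of `target`.

    `feature_dicts` holds one dict of features per token of a single
    part-of-speech tag. Returns (counts, n), where n is the number of
    tokens with a non-empty `target` and counts maps
    (feature name, value or 'EMPTY') to a frequency.
    """
    feature_dicts = list(feature_dicts)
    # Every feature name other than the target that occurs in the input.
    names = {name for feats in feature_dicts for name in feats} - {target}
    counts = Counter()
    n = 0
    for feats in feature_dicts:
        if target not in feats:
            continue
        n += 1
        for name in names:
            counts[(name, feats.get(name, "EMPTY"))] += 1
    return counts, n


# Invented example tokens, all assumed to be tagged NUM.
tokens = [
    {"NumForm": "Digit", "NumType": "Card"},
    {"NumForm": "Word", "NumType": "Card", "Case": "Nom"},
    {"NumType": "Card"},  # no NumForm, so it is ignored
]
counts, n = cooccurrence(tokens)
for (name, value), freq in counts.most_common():
    print(f"{name}={value} ({freq}; {100 * freq // n}%)")
```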

NUM tokens may have the following values of NumForm: Digit, Roman, Word.

NumForm seems to be a lexical feature of NUM: 100% of lemmas (3428) occur with only one value of NumForm.
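The "lexical feature" test amounts to checking how many lemmas are ever seen with more than one value. A minimal sketch, assuming the input is a list of (lemma, value) pairs already extracted from the treebank; the function name `single_value_share` and the example pairs are invented for illustration.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (number of lemmas, lemmas with exactly one value, percentage)."""
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    n_lemmas = len(values)
    n_single = sum(1 for seen in values.values() if len(seen) == 1)
    percent = 100.0 * n_single / n_lemmas if n_lemmas else 0.0
    return n_lemmas, n_single, percent


# Invented (lemma, NumForm) pairs for NUM tokens.
pairs = [
    ("1990", "Digit"),
    ("1990", "Digit"),
    ("dva", "Word"),
    ("XIV", "Roman"),
]
print(single_value_share(pairs))
```

A share of 100% means that no lemma varies in its value, which is what suggests the feature is a property of the lexeme rather than of the inflected form.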

Relations with Agreement in NumForm

The most frequent relations in which the parent and child nodes agree in NumForm: NUM –[conj]–> NUM (3247; 100%), NUM –[compound]–> NUM (2671; 95%), NUM –[orphan]–> NUM (79; 98%), NUM –[dep]–> NUM (50; 96%).
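Agreement counts of this kind can be sketched as below. The code is an illustration under stated assumptions, not the generator of this page: the function name `agreement_counts` and both sample sentences are invented, and the percentage is taken to be the share of agreeing pairs among parent-child pairs in which both nodes have a non-empty value of the feature.

```python
from collections import Counter

# Two invented CoNLL-U sentences (not taken from UD_Czech-PDT).
SAMPLE = (
    "1\trok\trok\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "2\t1990\t1990\tNUM\t_\tNumForm=Digit|NumType=Card\t1\tnummod\t_\t_\n"
    "3\t1991\t1991\tNUM\t_\tNumForm=Digit|NumType=Card\t2\tconj\t_\t_\n"
    "\n"
    "1\t5\t5\tNUM\t_\tNumForm=Digit|NumType=Card\t0\troot\t_\t_\n"
    "2\ttisíc\ttisíc\tNUM\t_\tNumForm=Word|NumType=Card\t1\tcompound\t_\t_\n"
)


def agreement_counts(text, feature="NumForm"):
    """Count parent-child pairs that both carry `feature`, and those that agree.

    Keys are (parent UPOS, relation, child UPOS).
    """
    agree, total = Counter(), Counter()
    for block in text.strip().split("\n\n"):
        nodes = {}
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            if not cols[0].isdigit():  # skip ranges and empty nodes
                continue
            feats = {} if cols[5] == "_" else dict(
                pair.split("=", 1) for pair in cols[5].split("|")
            )
            # id -> (upos, feature value or None, head id, relation)
            nodes[cols[0]] = (cols[3], feats.get(feature), cols[6], cols[7])
        for upos, value, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (head id "0")
            if parent is None or value is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            total[key] += 1
            if value == parent[1]:
                agree[key] += 1
    return agree, total


agree, total = agreement_counts(SAMPLE)
for (parent, rel, child), n in total.most_common():
    share = 100 * agree[(parent, rel, child)] // n
    print(f"{parent} -[{rel}]-> {child} ({agree[(parent, rel, child)]}; {share}%)")
```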