NumForm

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

home cs/feat issue tracker

`NumForm`: numeral form

Feature of cardinal and ordinal numbers. Is the number expressed by digits or as a word?

Word: number expressed as word

Examples

jeden “one”, dva “two”, tři “three”

Digit: number expressed using digits

Examples

1, 2, 3

Roman: roman numeral

Examples

I, II, III

Treebank Statistics (UD_Czech)

This feature is language-specific. It occurs with 3 different values: Digit, Roman, Word.

41165 tokens (3%) have a non-empty value of NumForm. 3589 types (3%) occur at least once with a non-empty value of NumForm. 3428 lemmas (6%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: cs-pos/NUM (41165; 3% instances).

`NUM`

41165 cs-pos/NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (41165; 100%), Gender=EMPTY (36748; 89%), NumValue=EMPTY (33115; 80%), Case=EMPTY (29884; 73%), Number=EMPTY (29858; 73%).

NUM tokens may have the following values of NumForm:

Digit (29481; 72% of non-empty NumForm): 1, 2, 3, 4, 6, 5, 1992, 10, 1994, 1993
Roman (376; 1% of non-empty NumForm): II, I, III, IV, V, VI, XX, D, C, IX
Word (11308; 27% of non-empty NumForm): dva, tři, jeden, dvě, tisíc, dvou, pět, čtyři, obou, jednoho

NumForm seems to be lexical feature of NUM. 100% lemmas (3428) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (3365; 100%), NUM –[compound]–> NUM (2671; 95%), NUM –[dep]–> NUM (50; 96%).

Treebank Statistics (UD_Czech-CAC)

This feature is language-specific. It occurs with 2 different values: Digit, Word.

7247 tokens (1%) have a non-empty value of NumForm. 124 types (0%) occur at least once with a non-empty value of NumForm. 50 lemmas (0%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: cs-pos/NUM (7247; 1% instances).

`NUM`

7247 cs-pos/NUM tokens (99% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (7247; 100%), Gender=EMPTY (6108; 84%), NumValue=EMPTY (5285; 73%), Number=EMPTY (4836; 67%), Case=EMPTY (4836; 67%).

NUM tokens may have the following values of NumForm:

Digit (4836; 67% of non-empty NumForm): #
Word (2411; 33% of non-empty NumForm): dvou, jeden, dvě, tři, dva, obou, jedné, jednoho, jedním, dvěma

NumForm seems to be lexical feature of NUM. 100% lemmas (50) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (315; 100%), NUM –[compound]–> NUM (31; 74%).

Treebank Statistics (UD_Czech-CLTT)

This feature is language-specific. It occurs with 2 different values: Roman, Word.

440 tokens (1%) have a non-empty value of NumForm. 97 types (2%) occur at least once with a non-empty value of NumForm. 83 lemmas (3%) occur at least once with a non-empty value of NumForm. The feature is used with 1 part-of-speech tags: cs-pos/NUM (440; 1% instances).

`NUM`

440 cs-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of NumForm.

The most frequent other feature values with which NUM and NumForm co-occurred: NumType=Card (440; 100%), Gender=EMPTY (394; 90%), NumValue=EMPTY (382; 87%), Number=EMPTY (371; 84%), Case=EMPTY (371; 84%).

NUM tokens may have the following values of NumForm:

Roman (371; 84% of non-empty NumForm): 1, 3, 2, 4, 5, 41, 7, 10, 2004, 2008
Word (69; 16% of non-empty NumForm): jeden, jedné, tří, dvanáct, dvě, dvanácti, jedno, jednoho, jednou, obě

NumForm seems to be lexical feature of NUM. 100% lemmas (83) occur only with one value of NumForm.

Relations with Agreement in `NumForm`

The 10 most frequent relations where parent and child node agree in NumForm: NUM –[conj]–> NUM (39; 100%), NUM –[compound]–> NUM (1; 100%).

NumForm: numeral form

Word: number expressed as word

Examples

Digit: number expressed using digits

Examples

Roman: roman numeral

Examples

Treebank Statistics (UD_Czech)

NUM

Relations with Agreement in NumForm

Treebank Statistics (UD_Czech-CAC)

NUM

Relations with Agreement in NumForm

Treebank Statistics (UD_Czech-CLTT)

NUM

Relations with Agreement in NumForm

`NumForm`: numeral form

`NUM`

Relations with Agreement in `NumForm`

`NUM`

Relations with Agreement in `NumForm`

`NUM`

Relations with Agreement in `NumForm`