NumValue
: numeric value
In Czech, number “one” agrees with the counted noun in Gender, Number and Case. Number “two” agrees in gender and case and numbers “three” and “four” agree in case. These numerals behave similarly to adjectives. Numbers “five”, “six” etc. behave differently. If the case of the counted phrase is genitive, dative, locative or instrumental, the numeral agrees in case with the noun. However, if the case of the whole phrase is nominative, accusative or vocative, then the numeral dictates that the noun is in genitive. This behavior is similar to nouns modified by other nouns in genitive. (Note that this is why in the Czech PDT some numeral nodes are annotated as governing nouns instead of modifying them.) In addition, the whole phrase (number + counted noun) together behaves as neuter singular (this is important for subject-verb agreement).
Specific behavior of low-value numerals is the reason why there is a separate feature to mark these numerals.
1
: numeric value 1
- jeden, jedna, jedno “one”
2
: numeric value 2
- dva, dvě “two”
3
: numeric value 3 or 4
- tři “three”, čtyři “four”
Treebank Statistics (UD_Czech)
This feature is language-specific.
It occurs with 3 different values: 1
, 2
, 3
.
Some words have combined values of the feature; 1 combinations have been observed: 1|2|3
.
8080 tokens (1%) have a non-empty value of NumValue
.
90 types (0%) occur at least once with a non-empty value of NumValue
.
18 lemmas (0%) occur at least once with a non-empty value of NumValue
.
The feature is used with 2 part-of-speech tags: cs-pos/NUM (8050; 1% instances), cs-pos/ADJ (30; 0% instances).
NUM
8050 cs-pos/NUM tokens (19% of all NUM
tokens) have a non-empty value of NumValue
.
The most frequent other feature values with which NUM
and NumValue
co-occurred: NumType=Card (8050; 100%), NumForm=Word (8050; 100%), Number=Plur (4885; 61%).
NUM
tokens may have the following values of NumValue
:
1,2,3
(8050; 100% of non-emptyNumValue
): dva, tři, jeden, dvě, tisíc, dvou, čtyři, obou, jednoho, jedné
NumValue
seems to be lexical feature of NUM
. 100% lemmas (18) occur only with one value of NumValue
.
ADJ
30 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of NumValue
.
The most frequent other feature values with which ADJ
and NumValue
co-occurred: Degree=EMPTY (30; 100%), Negative=EMPTY (30; 100%), Number=Plur (30; 100%), Animacy=EMPTY (19; 63%).
ADJ
tokens may have the following values of NumValue
:
1
(30; 100% of non-emptyNumValue
): jedny, jedni, jedněch, jedněm, jedněmi
Relations with Agreement in NumValue
The 10 most frequent relations where parent and child node agree in NumValue
:
NUM –[conj]–> NUM (99; 69%).
Treebank Statistics (UD_Czech-CAC)
This feature is language-specific.
It occurs with 3 different values: 1
, 2
, 3
.
Some words have combined values of the feature; 1 combinations have been observed: 1|2|3
.
1972 tokens (0%) have a non-empty value of NumValue
.
55 types (0%) occur at least once with a non-empty value of NumValue
.
9 lemmas (0%) occur at least once with a non-empty value of NumValue
.
The feature is used with 2 part-of-speech tags: cs-pos/NUM (1962; 0% instances), cs-pos/ADJ (10; 0% instances).
NUM
1962 cs-pos/NUM tokens (27% of all NUM
tokens) have a non-empty value of NumValue
.
The most frequent other feature values with which NUM
and NumValue
co-occurred: NumForm=Word (1962; 100%), NumType=Card (1962; 100%), Number=Plur (1120; 57%).
NUM
tokens may have the following values of NumValue
:
1,2,3
(1962; 100% of non-emptyNumValue
): dvou, jeden, dvě, tři, dva, obou, jedné, jednoho, jedním, dvěma
ADJ
10 cs-pos/ADJ tokens (0% of all ADJ
tokens) have a non-empty value of NumValue
.
The most frequent other feature values with which ADJ
and NumValue
co-occurred: Degree=EMPTY (10; 100%), Number=Plur (10; 100%), Negative=EMPTY (10; 100%), Animacy=EMPTY (7; 70%), Gender=EMPTY (6; 60%).
ADJ
tokens may have the following values of NumValue
:
1
(10; 100% of non-emptyNumValue
): jedněch, jedni, jedny
Relations with Agreement in NumValue
The 10 most frequent relations where parent and child node agree in NumValue
:
NUM –[conj]–> NUM (22; 81%).
Treebank Statistics (UD_Czech-CLTT)
This feature is language-specific.
It occurs with 3 different values: 1
, 2
, 3
.
Some words have combined values of the feature; 1 combinations have been observed: 1|2|3
.
58 tokens (0%) have a non-empty value of NumValue
.
17 types (0%) occur at least once with a non-empty value of NumValue
.
5 lemmas (0%) occur at least once with a non-empty value of NumValue
.
The feature is used with 1 part-of-speech tags: cs-pos/NUM (58; 0% instances).
NUM
58 cs-pos/NUM tokens (13% of all NUM
tokens) have a non-empty value of NumValue
.
The most frequent other feature values with which NUM
and NumValue
co-occurred: NumType=Card (58; 100%), NumForm=Word (58; 100%), Number=Sing (38; 66%).
NUM
tokens may have the following values of NumValue
:
1,2,3
(58; 100% of non-emptyNumValue
): jeden, jedné, tří, dvě, jedno, jednoho, jednou, obě, dvou, dvěma