
NumType: numeral type

Czech has a complex system of numerals. In the school grammar of Czech, “numeral” is one of the main parts of speech; it covers almost everything that involves counting and has various subtypes. It also includes interrogative, relative, indefinite and demonstrative quantifiers (words like kolik “how many”, tolik “so many”, několik “several”), so a word may carry a non-empty value of PronType in addition to NumType.
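To make the co-occurrence of NumType and PronType concrete, here is a minimal sketch that splits a CoNLL-U FEATS string into features and checks whether both are present. The helper name `parse_feats` and the sample FEATS string are illustrative assumptions, not taken from the treebank.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into a dict {feature name: [values]}."""
    if feats in ("", "_"):
        return {}  # "_" marks an empty FEATS column in CoNLL-U
    parsed = {}
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        # Multiple values of one feature are comma-separated, e.g. Int,Rel
        parsed[name] = value.split(",")
    return parsed


# Illustrative FEATS string for a quantifier such as "kolik" (an assumption).
sample = "Case=Acc|NumType=Card|PronType=Int,Rel"
feats = parse_feats(sample)
has_both = "NumType" in feats and "PronType" in feats
print(feats, has_both)
```

Keeping the values as lists means a multi-valued feature such as PronType can be tested with a plain membership check.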

From the syntactic point of view, some numeral types behave like adjectives and some behave like adverbs. We tag them cs-pos/ADJ and cs-pos/ADV respectively. Thus the NumType feature applies to several different parts of speech. Its values are as follows.

Card: cardinal number or corresponding interrogative / relative / indefinite / demonstrative word

Examples

Ord: ordinal number or corresponding interrogative / relative / indefinite / demonstrative word

This is a subtype of adjective or adverb.

Adjectival examples

Adverbial examples

Mult: multiplicative numeral or corresponding interrogative / relative / indefinite / demonstrative word

This is a subtype of adverb.

Examples

Frac: fraction

This is a subtype of cardinal numbers. It may denote a fraction or just the denominator of the fraction.

Examples

Sets: number of sets of things

Morphologically distinct class of numerals used to count sets of things, or nouns that are pluralia tantum.

Examples

Gen: generic numeral, i.e. a numeral that is none of the above

Czech school grammar distinguishes this subclass, which is why it appears in Czech tagsets. (Note that “generic numerals” in Czech grammar also include the Sets subclass mentioned above.)

Examples


Treebank Statistics (UD_Czech)

This feature is universal. It occurs with 6 different values: Card, Frac, Gen, Mult, Ord, Sets.

49212 tokens (3%) have a non-empty value of NumType. 4024 types (3%) occur at least once with a non-empty value of NumType. 3572 lemmas (6%) occur at least once with a non-empty value of NumType. The feature is used with 5 part-of-speech tags: cs-pos/NUM (41510; 3% instances), cs-pos/ADJ (4990; 0% instances), cs-pos/DET (1552; 0% instances), cs-pos/ADV (741; 0% instances), cs-pos/PRON (419; 0% instances).
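Counts like these can be approximated from a CoNLL-U file. The following is a rough sketch, assuming the standard 10-column CoNLL-U layout; the function name `numtype_by_upos` and the made-up sample rows are hypothetical, and this is not the script that produced the statistics on this page.

```python
from collections import Counter


def numtype_by_upos(conllu_text):
    """Count (UPOS, NumType value) pairs over regular tokens in CoNLL-U text."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], cols[5]
        for pair in feats.split("|"):
            name, _, value = pair.partition("=")
            if name == "NumType":
                counts[(upos, value)] += 1
    return counts


# Made-up sample rows for illustration only (not real treebank data).
SAMPLE = "\n".join([
    "# text = hypothetical sample",
    "1\tdva\tdva\tNUM\t_\tCase=Nom|NumType=Card\t3\tnummod\t_\t_",
    "2\tprvní\tprvní\tADJ\t_\tCase=Nom|NumType=Ord\t3\tamod\t_\t_",
    "3\tdomy\tdům\tNOUN\t_\tCase=Nom\t0\troot\t_\t_",
    "4\tněkolik\tněkolik\tDET\t_\tNumType=Card|PronType=Ind\t3\tdet\t_\t_",
])

counts = numtype_by_upos(SAMPLE)
for (upos, value), n in sorted(counts.items()):
    print(upos, value, n)
```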

NUM

41510 cs-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of NumType.

The most frequent other feature values with which NUM and NumType co-occurred: Gender=EMPTY (36751; 89%), NumValue=EMPTY (33460; 81%), Case=EMPTY (29887; 72%), Number=EMPTY (29861; 72%), NumForm=Digit (29484; 71%).

NUM tokens may have the following values of NumType:

NumType seems to be a lexical feature of NUM. 100% of lemmas (3436) occur with only one value of NumType.
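One plausible reading of this statistic is the share of lemmas that occur with exactly one NumType value. The sketch below computes that share under this assumption; the function name `single_value_share` and the observation list are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def single_value_share(observations):
    """Return the fraction of lemmas seen with exactly one NumType value.

    observations: iterable of (lemma, numtype_value) pairs for one UPOS tag.
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    if not values_by_lemma:
        return 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single / len(values_by_lemma)


# Made-up observations: two lemmas have one value, one lemma has two values.
obs = [
    ("dva", "Card"),
    ("dva", "Card"),
    ("tři", "Card"),
    ("dvojí", "Sets"),
    ("dvojí", "Gen"),
]
share = single_value_share(obs)
print(f"{share:.0%} of lemmas occur with only one value")
```

A share close to 100% suggests the value is determined by the lemma rather than by inflection.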

ADJ

4990 cs-pos/ADJ tokens (3% of all ADJ tokens) have a non-empty value of NumType.

The most frequent other feature values with which ADJ and NumType co-occurred: Degree=EMPTY (4990; 100%), Negative=EMPTY (4990; 100%), Number=Sing (4215; 84%), Animacy=EMPTY (3246; 65%).

ADJ tokens may have the following values of NumType:

Paradigm dvojí (Sets, Gen)
Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing: dvojí
Case=Acc|Gender=Fem|Number=Sing: dvojí
Case=Acc|Gender=Neut|Number=Sing: dvojí
Case=Acc|Number=Plur: dvoje
Case=Gen|Gender=Fem|Number=Sing: dvojí
Case=Gen|Gender=Neut|Number=Sing: dvojího
Case=Ins|Gender=Masc|Number=Sing: dvojím
Case=Ins|Gender=Fem|Number=Sing: dvojí
Case=Ins|Gender=Neut|Number=Sing: dvojím
Case=Ins|Number=Plur: dvojími
Case=Loc|Gender=Neut|Number=Sing: dvojím
Case=Nom|Number=Sing: dvojí

NumType seems to be a lexical feature of ADJ. 96% of lemmas (64) occur with only one value of NumType.

DET

1552 cs-pos/DET tokens (6% of all DET tokens) have a non-empty value of NumType.

The most frequent other feature values with which DET and NumType co-occurred: Number[psor]=EMPTY (1552; 100%), Gender[psor]=EMPTY (1552; 100%), Poss=EMPTY (1552; 100%), Reflex=EMPTY (1552; 100%), Person=EMPTY (1552; 100%), Gender=EMPTY (1542; 99%), Number=EMPTY (1542; 99%), PronType=Dem,Ind (1454; 94%).

DET tokens may have the following values of NumType:

NumType seems to be a lexical feature of DET. 100% of lemmas (13) occur with only one value of NumType.

ADV

741 cs-pos/ADV tokens (1% of all ADV tokens) have a non-empty value of NumType.

The most frequent other feature values with which ADV and NumType co-occurred: Degree=EMPTY (741; 100%), Negative=EMPTY (741; 100%).

ADV tokens may have the following values of NumType:

NumType seems to be a lexical feature of ADV. 100% of lemmas (49) occur with only one value of NumType.

PRON

419 cs-pos/PRON tokens (1% of all PRON tokens) have a non-empty value of NumType.

The most frequent other feature values with which PRON and NumType co-occurred: Variant=EMPTY (419; 100%), Reflex=EMPTY (419; 100%), Person=EMPTY (419; 100%), Gender=EMPTY (413; 99%), Number=EMPTY (413; 99%), PronType=Dem,Ind (318; 76%).

PRON tokens may have the following values of NumType:

NumType seems to be a lexical feature of PRON. 100% of lemmas (19) occur with only one value of NumType.

Relations with Agreement in NumType

The 10 most frequent relations where parent and child node agree in NumType: NUM –[conj]–> NUM (3378; 100%), NUM –[compound]–> NUM (2797; 100%), ADJ –[conj]–> ADJ (75; 56%), NUM –[dep]–> NUM (52; 100%), NUM –[det:nummod]–> DET (16; 100%), DET –[conj]–> PRON (4; 80%), PRON –[conj]–> PRON (3; 100%), DET –[appos]–> NUM (3; 100%), DET –[det:nummod]–> DET (2; 100%), DET –[dep]–> NUM (1; 100%).
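Agreement counts of this kind can be sketched as follows. This is only an illustration: it assumes the percentage is taken over parent-child pairs in which both nodes carry NumType, which the page does not state. The names `numtype_of` and `agreement_counts` and the sample rows are hypothetical.

```python
from collections import Counter


def numtype_of(feats):
    """Return the NumType value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "NumType":
            return value
    return None


def agreement_counts(sentence_rows):
    """Count relations whose parent and child both have NumType.

    sentence_rows: list of 10-column CoNLL-U token rows for one sentence.
    Returns (agree, total), both keyed by (parent UPOS, deprel, child UPOS).
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, total = Counter(), Counter()
    for row in sentence_rows:
        parent = by_id.get(row[6])
        if parent is None:
            continue  # root token (head 0) or head not in this sentence
        child_val, parent_val = numtype_of(row[5]), numtype_of(parent[5])
        if child_val is None or parent_val is None:
            continue  # assumption: only pairs where both carry NumType count
        key = (parent[3], row[7], row[3])
        total[key] += 1
        if child_val == parent_val:
            agree[key] += 1
    return agree, total


# Made-up sentence rows for illustration only.
rows = [
    ["1", "dva", "dva", "NUM", "_", "NumType=Card", "0", "root", "_", "_"],
    ["2", "nebo", "nebo", "CCONJ", "_", "_", "3", "cc", "_", "_"],
    ["3", "tři", "tři", "NUM", "_", "NumType=Card", "1", "conj", "_", "_"],
    ["4", "první", "první", "ADJ", "_", "NumType=Ord", "1", "dep", "_", "_"],
]
agree, total = agreement_counts(rows)
for key in total:
    print(key, agree[key], total[key])
```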


NumType in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]