
Number: number

Number is an inflectional feature of nouns and other parts of speech (adjectives, verbs) that mark agreement with nouns.

Sing: singular number

A singular noun denotes one person, animal or thing.

Examples

Plur: plural number

A plural noun denotes several persons, animals or things.

Examples

Dual: dual number

A dual noun denotes two objects. The dual number has almost vanished from Czech with the exception of special instrumental case suffixes for body parts that occur in pairs, and any adjectives that modify them.

Examples

The noun noha means “leg”, either of a human or of a table. The dual is used for the former and the plural for the latter:

The numeral sto “hundred” also has a special plural form that is actually the dual:

Ptan: plurale tantum

Some nouns appear only in the plural form even though they denote one thing (semantic singular); some tagsets mark this distinction. Grammatically they behave like plurals, so Plur is the natural back-off value here; however, because no singular form exists, the gender is sometimes unknown. In Czech, a special type of numeral (NumType=Sets) is used when counting nouns that are pluralia tantum.

Examples

Coll: collective / mass / singulare tantum

Collective, mass, or singulare tantum is a special case of the singular. It applies to words that use the grammatical singular to describe sets of objects, i.e. a semantic plural. Although in theory they might be able to form a plural, in practice it would rarely be semantically plausible. Sometimes the plural form exists and means “several sorts of” or “several packages of”.

Examples

Diffs

Prague Dependency Treebank

The PDT tagset does not distinguish Ptan from Plur or Coll from Sing, so the converted data do not make this distinction either.
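
The back-off relation described above (Ptan falls back to Plur, Coll to Sing) can be sketched as a small lookup. This is only an illustration, not the actual PDT conversion code; the names `BACKOFF` and `back_off_number` are hypothetical.

```python
# Hypothetical helper: collapse the fine-grained Number values onto the
# coarser ones that a tagset without Ptan/Coll can express.
BACKOFF = {
    "Ptan": "Plur",  # plurale tantum behaves grammatically like a plural
    "Coll": "Sing",  # collective / mass / singulare tantum is a special singular
}

def back_off_number(value):
    """Return the coarse Number value; other values are returned unchanged."""
    return BACKOFF.get(value, value)

print([back_off_number(v) for v in ["Sing", "Plur", "Dual", "Ptan", "Coll"]])
```

Values that have no back-off (Sing, Plur, Dual) pass through untouched, so the function is safe to apply to every token.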


Treebank Statistics (UD_Czech)

This feature is universal. It occurs with 3 different values: Dual, Plur, Sing. Some words have combined values of the feature; 1 combination has been observed: Plur|Sing.

This is a layered feature with the following layers: Number, Number[psor].
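
To make the notions of layers and combined values concrete, here is a minimal sketch of reading a feature string. It assumes the usual pipe-separated `Name=Value` notation, with combined values joined by commas as in `Gender=Fem,Neut`; the function name `parse_feats` and the example string are invented for illustration.

```python
def parse_feats(feats):
    """Parse a 'Name=Value|Name=Value' string into {name: [values]}."""
    result = {}
    if feats in ("", "_"):  # "_" is assumed to mark an empty feature set
        return result
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        result[name] = value.split(",")  # combined values become a list
    return result

# Invented example: a combined Number value plus the possessor layer.
feats = parse_feats("Case=Nom|Number=Plur,Sing|Number[psor]=Sing")
print(feats["Number"])
print(feats["Number[psor]"])
```

Because the layer name is kept as part of the key, `Number` and `Number[psor]` never collide.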

834248 tokens (55%) have a non-empty value of Number. 129500 types (101%) occur at least once with a non-empty value of Number. 48834 lemmas (84%) occur at least once with a non-empty value of Number. The feature is used with 8 part-of-speech tags: cs-pos/NOUN (363302; 24% instances), cs-pos/ADJ (176213; 12% instances), cs-pos/VERB (140097; 9% instances), cs-pos/PROPN (68761; 5% instances), cs-pos/PRON (40779; 3% instances), cs-pos/DET (21264; 1% instances), cs-pos/AUX (12183; 1% instances), cs-pos/NUM (11649; 1% instances).
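
Counts of this kind can be reproduced in a few lines. The sketch below uses an invented toy sample rather than UD_Czech data, and the helper names `has_number` and `number_coverage` are hypothetical; it only illustrates the idea of counting tokens with a non-empty Number value per part-of-speech tag.

```python
from collections import Counter

# Invented toy sample of (form, UPOS, FEATS) triples; not taken from UD_Czech.
TOKENS = [
    ("ruka", "NOUN", "Case=Nom|Gender=Fem|Number=Sing"),
    ("rukama", "NOUN", "Case=Ins|Gender=Fem|Number=Dual"),
    ("české", "ADJ", "Case=Nom|Gender=Fem|Number=Plur"),
    ("a", "CONJ", "_"),
    ("dnes", "ADV", "_"),
]

def has_number(feats):
    """True if the feature string carries a value in the base Number layer."""
    # "Number[psor]=..." does not match, so only the base layer is counted.
    return any(pair.startswith("Number=") for pair in feats.split("|"))

def number_coverage(tokens):
    """Return (share of tokens with Number, Counter of UPOS among them)."""
    with_number = [upos for _, upos, feats in tokens if has_number(feats)]
    return len(with_number) / len(tokens), Counter(with_number)

share, by_pos = number_coverage(TOKENS)
print(f"{share:.0%}", dict(by_pos))
```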

NOUN

363302 cs-pos/NOUN tokens (98% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Negative=Pos (362738; 100%), Animacy=EMPTY (203009; 56%).

NOUN tokens may have the following values of Number:

Paradigm ruka  Sing   Plur           Dual
Case=Acc       ruku   ruce           -
Case=Dat       ruce   -              -
Case=Gen       ruky   rukou          -
Case=Ins       rukou  -              rukama
Case=Loc       ruce   rukou, rukách  -
Case=Nom       ruka   ruce           -
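
A paradigm table like the one above can be built by grouping observed forms by their remaining features and spreading them over one column per Number value. The following is a sketch under stated assumptions: the sample pairs are invented, every token is assumed to carry a Number value, and the function name `paradigm` is hypothetical.

```python
from collections import defaultdict

# Invented toy sample of (form, FEATS) pairs for the lemma "ruka".
OBSERVED = [
    ("ruku", "Case=Acc|Number=Sing"),
    ("ruce", "Case=Acc|Number=Plur"),
    ("rukou", "Case=Ins|Number=Sing"),
    ("rukama", "Case=Ins|Number=Dual"),
]

def paradigm(observed):
    """Map the non-Number features to {Number value: sorted list of forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, feats in observed:
        pairs = feats.split("|")
        # Assumes exactly one base-layer Number value per token.
        number = next(p.split("=", 1)[1] for p in pairs if p.startswith("Number="))
        rest = "|".join(p for p in pairs if not p.startswith("Number="))
        table[rest][number].add(form)
    return {row: {n: sorted(forms) for n, forms in cols.items()}
            for row, cols in table.items()}

for row, cols in sorted(paradigm(OBSERVED).items()):
    print(row, cols)
```

Each printed row corresponds to one line of the table, and missing Number values simply do not appear in the row's dictionary.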

ADJ

176213 cs-pos/ADJ tokens (97% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Negative=Pos (164160; 93%), Degree=Pos (155101; 88%), Animacy=EMPTY (102299; 58%).

ADJ tokens may have the following values of Number:

Paradigm český                                  Sing      Plur             Dual
Animacy=Anim|Case=Acc|Gender=Masc|Negative=Pos  českého   české            -
Animacy=Anim|Case=Dat|Gender=Masc|Negative=Pos  českému   českým, českých  -
Animacy=Anim|Case=Gen|Gender=Masc|Negative=Pos  českého   českých          -
Animacy=Anim|Case=Ins|Gender=Masc|Negative=Pos  českým    českými          -
Animacy=Anim|Case=Loc|Gender=Masc|Negative=Pos  českém    českých          -
Animacy=Anim|Case=Nom|Gender=Masc|Negative=Pos  český     čeští            -
Animacy=Inan|Case=Acc|Gender=Masc|Negative=Pos  český     české            -
Animacy=Inan|Case=Dat|Gender=Masc|Negative=Pos  českému   českým           -
Animacy=Inan|Case=Gen|Gender=Masc|Negative=Pos  českého   českých          -
Animacy=Inan|Case=Ins|Gender=Masc|Negative=Pos  českým    českými          -
Animacy=Inan|Case=Loc|Gender=Masc|Negative=Pos  českém    českých          -
Animacy=Inan|Case=Nom|Gender=Masc|Negative=Pos  český     české            -
Case=Acc|Gender=Fem|Negative=Neg                nečeskou  -                -
Case=Acc|Gender=Fem|Negative=Pos                českou    české            -
Case=Acc|Gender=Neut|Negative=Pos               české     česká            -
Case=Dat|Gender=Fem|Negative=Pos                české     českým           -
Case=Dat|Gender=Neut|Negative=Pos               českému   -                -
Case=Gen|Gender=Fem|Negative=Pos                české     českých          -
Case=Gen|Gender=Neut|Negative=Pos               českého   českých          -
Case=Ins|Gender=Fem|Negative=Pos                českou    českými          českýma
Case=Ins|Gender=Neut|Negative=Pos               českým    českými          -
Case=Loc|Gender=Fem|Negative=Pos                české     českých          -
Case=Loc|Gender=Neut|Negative=Pos               českém    českých          -
Case=Nom|Gender=Fem|Negative=Pos                česká     české            -
Case=Nom|Gender=Fem|Negative=Pos|Style=Coll     -         český            -
Case=Nom|Gender=Neut|Negative=Pos               české     česká            -

VERB

140097 cs-pos/VERB tokens (85% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Voice=Act (129612; 93%), Negative=Pos (125939; 90%), Gender=EMPTY (76702; 55%), VerbForm=Fin (76681; 55%), Mood=Ind (75722; 54%), Tense=Pres (74325; 53%).

VERB tokens may have the following values of Number:

Paradigm být                                                                  Plur,Sing  Sing     Plur
Abbr=Yes|Mood=Ind|Negative=Pos|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act     -          j        -
Animacy=Anim|Gender=Masc|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act      -          -        nebyli
Animacy=Anim|Gender=Masc|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act      -          -        byli
Animacy=Inan|Gender=Fem,Masc|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act  -          -        nebyly
Animacy=Inan|Gender=Fem,Masc|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act  -          -        byly
Aspect=Imp|Gender=Masc|Negative=Pos|Tense=Pres|VerbForm=Trans|Voice=Act       -          jsa      -
Aspect=Imp|Gender=Fem,Neut|Negative=Pos|Tense=Pres|VerbForm=Trans|Voice=Act   -          jsouc    -
Aspect=Imp|Negative=Pos|Tense=Pres|VerbForm=Trans|Voice=Act                   -          -        jsouce
Foreign=Foreign|Gender=Neut|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act   -          bolo     -
Gender=Masc|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act                   -          nebyl    -
Gender=Masc|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act                   -          byl      -
Gender=Fem,Neut|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act               nebyla     -        -
Gender=Fem,Neut|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act               byla       -        -
Gender=Neut|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act                   -          nebylo   -
Gender=Neut|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act                   -          bylo     -
Mood=Cnd|Person=1|Style=Coll|VerbForm=Fin                                     -          -        bysme
Mood=Cnd|Person=1|VerbForm=Fin                                                -          bych     bychom
Mood=Imp|Negative=Neg|Person=2|VerbForm=Fin                                   -          -        Nebuďte
Mood=Imp|Negative=Pos|Person=1|VerbForm=Fin                                   -          -        Buďme
Mood=Imp|Negative=Pos|Person=2|VerbForm=Fin                                   -          buď      buďte
Mood=Imp|Negative=Pos|Person=3|Style=Arch|VerbForm=Fin                        -          buď      -
Mood=Imp|Negative=Pos|Person=3|VerbForm=Fin                                   -          budiž    -
Mood=Ind|Negative=Neg|Person=1|Tense=Fut|VerbForm=Fin|Voice=Act               -          nebudu   nebudeme
Mood=Ind|Negative=Neg|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act              -          nejsem   nejsme
Mood=Ind|Negative=Neg|Person=2|Tense=Fut|VerbForm=Fin|Voice=Act               -          nebudeš  nebudete
Mood=Ind|Negative=Neg|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act              -          -        nejste
Mood=Ind|Negative=Neg|Person=3|Style=Arch|Tense=Pres|VerbForm=Fin|Voice=Act   -          Není     nésó
Mood=Ind|Negative=Neg|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act               -          nebude   nebudou
Mood=Ind|Negative=Neg|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act              -          není     nejsou
Mood=Ind|Negative=Pos|Person=1|Tense=Fut|VerbForm=Fin|Voice=Act               -          budu     budeme
Mood=Ind|Negative=Pos|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act              -          jsem     jsme
Mood=Ind|Negative=Pos|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act   -          si       -
Mood=Ind|Negative=Pos|Person=2|Tense=Fut|VerbForm=Fin|Voice=Act               -          -        budete
Mood=Ind|Negative=Pos|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act              -          jsi      jste
Mood=Ind|Negative=Pos|Person=3|Style=Arch|Tense=Pres|VerbForm=Fin|Voice=Act   -          jest     -
Mood=Ind|Negative=Pos|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act               -          bude     budou
Mood=Ind|Negative=Pos|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act              -          je       jsou

PROPN

68761 cs-pos/PROPN tokens (82% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Negative=Pos (68761; 100%), Abbr=EMPTY (67938; 99%), Gender=Masc (44970; 65%), Case=Nom (37791; 55%), Animacy=Anim (34755; 51%).

PROPN tokens may have the following values of Number:

Paradigm Jan  Sing          Plur
Case=Acc      Jana          Jany
Case=Dat      Janu, Janovi  -
Case=Gen      Jana, JANA    Janů
Case=Ins      Janem         -
Case=Loc      Janu, Janovi  -
Case=Nom      Jan, JAN      Janové

Number seems to be a lexical feature of PROPN. 98% of lemmas (12981) occur with only one value of Number.
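
The "lexical feature" test above amounts to checking what share of lemmas is ever seen with just one Number value. A minimal sketch follows; the observations are invented and the function name `single_value_share` is hypothetical.

```python
from collections import defaultdict

# Invented toy sample of (lemma, Number value) observations.
OBSERVATIONS = [
    ("Jan", "Sing"), ("Jan", "Plur"),
    ("Praha", "Sing"), ("Praha", "Sing"),
    ("Čechy", "Plur"),
]

def single_value_share(observations):
    """Share of lemmas that occur with exactly one Number value."""
    values = defaultdict(set)
    for lemma, number in observations:
        values[lemma].add(number)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single / len(values)

print(f"{single_value_share(OBSERVATIONS):.0%}")
```

In the toy sample, two of the three lemmas qualify; a share close to 100% on real data is what suggests that the feature is lexical rather than inflectional.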

PRON

40779 cs-pos/PRON tokens (56% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (40664; 100%), Variant=EMPTY (38761; 95%), Person=EMPTY (29549; 72%).

PRON tokens may have the following values of Number:

Paradigm ten                       Sing     Plur
Abbr=Yes|Case=Nom|Gender=Neut      t        -
Animacy=Anim|Case=Acc|Gender=Masc  toho     ty
Animacy=Anim|Case=Nom|Gender=Masc  -        ti
Animacy=Inan|Case=Acc|Gender=Masc  ten      ty
Animacy=Inan|Case=Nom|Gender=Masc  -        ty
Case=Acc|Gender=Fem                tu       ty
Case=Acc|Gender=Neut               to       ta
Case=Dat|Gender=Masc,Neut          tomu     -
Case=Dat|Gender=Fem
Case=Dat                           -        těm
Case=Gen|Gender=Masc,Neut          toho     -
Case=Gen|Gender=Fem
Case=Gen                           -        těch
Case=Ins|Gender=Masc,Neut          tím      -
Case=Ins|Gender=Fem                tou      -
Case=Ins                           -        těmi
Case=Loc|Gender=Masc,Neut          tom      -
Case=Loc|Gender=Fem
Case=Loc                           -        těch
Case=Nom|Gender=Masc               ten      -
Case=Nom|Gender=Fem                ta       ty
Case=Nom|Gender=Neut               to, ten  ta
Case=Nom|Gender=Neut|Style=Coll    -        Ty

DET

21264 cs-pos/DET tokens (76% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Gender[psor]=EMPTY (19819; 93%), Number[psor]=EMPTY (16981; 80%), Person=EMPTY (16981; 80%), Reflex=EMPTY (16496; 78%), Poss=EMPTY (12213; 57%).

DET tokens may have the following values of Number:

Paradigm jeho                      Sing     Plur     Dual
Animacy=Anim|Case=Acc|Gender=Masc  jejího   -        -
Animacy=Inan|Case=Acc|Gender=Masc  její     -        -
Case=Acc|Gender=Neut               její     -        -
Case=Acc                           -        její     -
Case=Dat|Gender=Masc,Neut          jejímu   -        -
Case=Dat                           -        jejím    -
Case=Gen|Gender=Masc,Neut          jejího   -        -
Case=Gen                           -        jejích   -
Case=Ins|Gender=Masc,Neut          jejím    -        -
Case=Ins|Gender=Fem                -        -        jejíma
Case=Ins                           -        jejími   -
Case=Loc|Gender=Masc,Neut          jejím    -        -
Case=Loc                           -        jejích   -
Case=Nom|Gender=Masc,Neut          její     -        -
Case=Nom                           -        její     -
Gender=Fem                         její     -        -

AUX

12183 cs-pos/AUX tokens (59% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Voice=Act (11146; 91%), Negative=Pos (10360; 85%), Gender=EMPTY (8931; 73%), VerbForm=Fin (8930; 73%), Mood=Ind (7893; 65%).

AUX tokens may have the following values of Number:

Paradigm být                                                                  Plur,Sing  Sing     Plur
Animacy=Anim|Gender=Masc|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act      -          -        nebyli
Animacy=Anim|Gender=Masc|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act      -          -        byli
Animacy=Inan|Gender=Fem,Masc|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act  -          -        nebyly
Animacy=Inan|Gender=Fem,Masc|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act  -          -        byly
Aspect=Imp|Negative=Pos|Tense=Pres|VerbForm=Trans|Voice=Act                   -          -        jsouce
Gender=Masc|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act                   -          nebyl    -
Gender=Masc|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act                   -          byl      -
Gender=Fem,Neut|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act               nebyla     -        -
Gender=Fem,Neut|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act               byla       -        -
Gender=Neut|Negative=Neg|Tense=Past|VerbForm=Part|Voice=Act                   -          nebylo   -
Gender=Neut|Negative=Pos|Tense=Past|VerbForm=Part|Voice=Act                   -          bylo     -
Mood=Cnd|Person=1|VerbForm=Fin                                                -          bych     bychom
Mood=Cnd|Person=2|VerbForm=Fin                                                -          bys      byste
Mood=Imp|Negative=Pos|Person=3|Style=Arch|VerbForm=Fin                        -          budiž    -
Mood=Imp|Negative=Pos|Person=3|VerbForm=Fin                                   -          budiž    -
Mood=Ind|Negative=Neg|Person=1|Tense=Fut|VerbForm=Fin|Voice=Act               -          nebudu   nebudeme
Mood=Ind|Negative=Neg|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act              -          -        nejsme
Mood=Ind|Negative=Neg|Person=2|Tense=Fut|VerbForm=Fin|Voice=Act               -          Nebudeš  nebudete
Mood=Ind|Negative=Neg|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act               -          nebude   nebudou
Mood=Ind|Negative=Neg|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act              -          není     nejsou
Mood=Ind|Negative=Pos|Person=1|Style=Coll|Tense=Fut|VerbForm=Fin|Voice=Act    -          -        budem
Mood=Ind|Negative=Pos|Person=1|Tense=Fut|VerbForm=Fin|Voice=Act               -          budu     budeme
Mood=Ind|Negative=Pos|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act              -          jsem     jsme
Mood=Ind|Negative=Pos|Person=2|Style=Coll|Tense=Pres|VerbForm=Fin|Voice=Act   -          si       -
Mood=Ind|Negative=Pos|Person=2|Tense=Fut|VerbForm=Fin|Voice=Act               -          budeš    budete
Mood=Ind|Negative=Pos|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act              -          jsi      jste
Mood=Ind|Negative=Pos|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act               -          bude     budou
Mood=Ind|Negative=Pos|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act              -          je       jsou

NUM

11649 cs-pos/NUM tokens (28% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (11307; 97%), NumForm=Word (11307; 97%), NumValue=1,2,3 (8050; 69%), Gender=EMPTY (6890; 59%).

NUM tokens may have the following values of Number:

Paradigm dva              Plur   Dual
Case=Acc|Gender=Masc      dva    -
Case=Acc|Gender=Fem,Neut  dvě    -
Case=Dat                  dvěma  -
Case=Gen                  dvou   -
Case=Ins|Gender=Fem       -      dvěma
Case=Ins                  dvěma  -
Case=Loc                  dvou   -
Case=Nom|Gender=Masc      dva    -
Case=Nom|Gender=Fem,Neut  dvě    -

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (148090; 99%), NOUN –[nmod]–> NOUN (67827; 61%), VERB –[nsubj]–> NOUN (38774; 80%), NOUN –[det]–> DET (20902; 81%), NOUN –[conj]–> NOUN (17248; 78%), NOUN –[nmod]–> PROPN (12057; 53%), PROPN –[name]–> PROPN (10797; 82%), VERB –[conj]–> VERB (10770; 70%), VERB –[nsubj]–> PROPN (10258; 75%), VERB –[nsubj]–> PRON (9854; 74%).
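
Agreement figures of this kind can be approximated by walking over parent-child pairs and comparing their Number values. The sketch below rests on assumptions not confirmed by the text: the toy tree is invented, only pairs where both nodes have a Number value are counted, and the function name `agreement_by_relation` is hypothetical.

```python
from collections import Counter

# Invented toy tree: id -> (UPOS, Number, head id, relation); head 0 is the root.
NODES = {
    1: ("ADJ", "Sing", 2, "amod"),
    2: ("NOUN", "Sing", 3, "nsubj"),
    3: ("VERB", "Sing", 0, "root"),
    4: ("NOUN", "Plur", 3, "dobj"),
}

def agreement_by_relation(nodes):
    """Return {(parent UPOS, relation, child UPOS): (agreeing, total)}."""
    agree, total = Counter(), Counter()
    for upos, number, head, rel in nodes.values():
        if head == 0 or number is None:
            continue  # skip the root and children without Number
        parent_upos, parent_number, _, _ = nodes[head]
        if parent_number is None:
            continue  # assumption: pairs lacking Number on either side are ignored
        key = (parent_upos, rel, upos)
        total[key] += 1
        agree[key] += int(number == parent_number)
    return {key: (agree[key], total[key]) for key in total}

for (parent, rel, child), (ok, n) in agreement_by_relation(NODES).items():
    print(f"{parent} -[{rel}]-> {child}: {ok}/{n}")
```

Dividing the first number by the second gives an agreement percentage per relation; whether the published percentages use the same denominator is not stated in the text.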

