Number: number
Number is an inflectional feature of nouns and
other parts of speech (adjectives,
verbs) that mark agreement with nouns.
Sing: singular number
A singular noun denotes one person, animal or thing.
Examples
- starý muž přišel “an old man came”
- mladá žena přišla “a young woman came”
- malé kuře přišlo “a small chicken came”
Plur: plural number
A plural noun denotes several persons, animals or things.
Examples
- staří muži přišli “old men came”
- mladé ženy přišly “young women came”
- malá kuřata přišla “small chickens came”
Dual: dual number
A dual noun denotes two objects. The dual number has almost vanished from Czech with the exception of special instrumental case suffixes for body parts that occur in pairs, and any adjectives that modify them.
Examples
The noun noha can mean the “leg” of a human or of a table. The dual is used for the former and the plural for the latter:
- holka s dlouhýma nohama “a girl with long legs”
- stůl s dlouhými nohami “a table with long legs”
The numeral sto “hundred” also has a special plural form that is actually the dual:
- dvě stě “two hundred”
- tři sta “three hundred”
Ptan: plurale tantum
Some nouns appear only in the plural form even though they denote one
thing (semantic singular); some tagsets mark this distinction.
Grammatically they behave like plurals, so Plur is the
back-off value here; however, the
non-existence of a singular form sometimes means that the gender is
unknown. In Czech, a special type of numeral is used when counting
nouns that are plurale tantum (NumType=Sets).
Examples
- nůžky, kalhoty “scissors, pants”
Coll: collective / mass / singulare tantum
Collective, mass, or singulare tantum is a special case of the singular. It applies to words that use the grammatical singular to describe sets of objects, i.e. a semantic plural. Although in theory they might be able to form a plural, in practice it would rarely be semantically plausible. Sometimes the plural form exists and means “several sorts of” or “several packages of”.
Examples
- lidstvo “mankind”
Diffs
Prague Dependency Treebank
The PDT tagset does not distinguish Ptan from Plur and Coll from Sing,
therefore this distinction is not being made in the converted data.
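The back-off relations described above (Ptan behaves like Plur, Coll is a special case of Sing) can be sketched as a small lookup. This is only an illustration: the names `BACKOFF` and `back_off_number` are hypothetical and are not part of any UD or PDT tool.

```python
# Back-off targets as described in this section:
# Ptan backs off to Plur, Coll backs off to Sing.
# BACKOFF and back_off_number are hypothetical names used for illustration.
BACKOFF = {"Ptan": "Plur", "Coll": "Sing"}


def back_off_number(value):
    """Map a fine-grained Number value to the coarser value it backs off to."""
    return BACKOFF.get(value, value)


collapsed = [back_off_number(v) for v in ["Sing", "Plur", "Dual", "Ptan", "Coll"]]
print(collapsed)
```

Values that have no back-off entry (Sing, Plur, Dual) pass through unchanged, which mirrors the converted data where only those three values occur.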
Treebank Statistics (UD_Czech)
This feature is universal.
It occurs with 3 different values: Dual, Plur, Sing.
Some words have combined values of the feature; 1 combination has been observed: Plur|Sing.
This is a layered feature with the following layers: Number, Number[psor].
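Combined values and layers are easier to see on a concrete feature string. The sketch below assumes the CoNLL-U convention for the FEATS column: features are separated by `|`, multiple values of one feature by `,`, and a layer is written in square brackets after the feature name. The function name `parse_feats` and the sample strings are illustrative assumptions, not treebank data.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into a dict: feature name -> list of values."""
    if feats == "_":  # "_" marks an empty feature set
        return {}
    parsed = {}
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        parsed[name] = value.split(",")  # "Plur,Sing" becomes ["Plur", "Sing"]
    return parsed


# Illustrative strings (assumed, not copied from the treebank).
combined = parse_feats("Gender=Fem,Neut|Number=Plur,Sing|VerbForm=Part")
layered = parse_feats("Number=Sing|Number[psor]=Plur|Poss=Yes")

print(combined["Number"])       # both values of the combined feature
print(layered["Number"])        # number of the possessed noun
print(layered["Number[psor]"])  # number of the possessor (separate layer)
```

Keeping the layer as part of the key means `Number` and `Number[psor]` never overwrite each other.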
834248 tokens (55%) have a non-empty value of Number.
129500 types (101%) occur at least once with a non-empty value of Number.
48834 lemmas (84%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: cs-pos/NOUN (363302; 24% instances), cs-pos/ADJ (176213; 12% instances), cs-pos/VERB (140097; 9% instances), cs-pos/PROPN (68761; 5% instances), cs-pos/PRON (40779; 3% instances), cs-pos/DET (21264; 1% instances), cs-pos/AUX (12183; 1% instances), cs-pos/NUM (11649; 1% instances).
NOUN
363302 cs-pos/NOUN tokens (98% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Negative=Pos (362738; 100%), Animacy=EMPTY (203009; 56%).
NOUN tokens may have the following values of Number:
- Dual (81; 0% of non-empty Number): očima, rukama, nohama, ušima
- Plur (103375; 28% of non-empty Number): korun, let, procent, lidí, letech, lidé, milionů, miliónů, zemí, dolarů
- Sing (259846; 72% of non-empty Number): roku, roce, době, případě, společnosti, zákona, rok, ministr, vláda, strany
- EMPTY (9064): Kč, s, r, p, m, tel, c, č, km, b
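The percentages for NOUN can be recomputed from the token counts reported above. This is a minimal arithmetic check, assuming each share is the value's count divided by the number of NOUN tokens with a non-empty Number and rounded to a whole percent.

```python
# Token counts for NOUN as reported in this section.
counts = {"Dual": 81, "Plur": 103375, "Sing": 259846}

# Tokens with a non-empty Number; matches the 363302 NOUN tokens reported above.
total = sum(counts.values())

# Share of each value, rounded to whole percent (assumed rounding).
shares = {value: round(100 * n / total) for value, n in counts.items()}

print(total)
print(shares)
```

The 81 dual tokens round down to 0%, which is why Dual appears with a zero share even though it is attested.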
| Paradigm ruka | Sing | Plur | Dual |
|---|---|---|---|
| Case=Acc | ruku | ruce | |
| Case=Dat | ruce | | |
| Case=Gen | ruky | rukou | |
| Case=Ins | rukou | | rukama |
| Case=Loc | ruce | rukou, rukách | |
| Case=Nom | ruka | ruce | |
ADJ
176213 cs-pos/ADJ tokens (97% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Negative=Pos (164160; 93%), Degree=Pos (155101; 88%), Animacy=EMPTY (102299; 58%).
ADJ tokens may have the following values of Number:
- Dual (24; 0% of non-empty Number): zavřenýma, otevřenýma, Sudetoněmeckýma, dlouhýma, filmovýma, holýma, odřenýma, oteklýma, plnýma, prázdnýma
- Plur (53953; 31% of non-empty Number): další, dalších, českých, posledních, nové, jiných, nových, jednotlivých, různých, zahraničních
- Plur,Sing (196; 0% of non-empty Number): schopna, ráda, ochotna, známa, povinna, vědoma, spokojena, přítomna, spjata, hotova
- Sing (122040; 69% of non-empty Number): první, české, další, druhé, poslední, státní, každý, možné, třeba, česká
- EMPTY (4598): tzv, New, a, the, čs, česko, open, sv, RM, US
| Paradigm český | Sing | Plur | Dual |
|---|---|---|---|
| Animacy=Anim\|Case=Acc\|Gender=Masc\|Negative=Pos | českého | české | |
| Animacy=Anim\|Case=Dat\|Gender=Masc\|Negative=Pos | českému | českým, českých | |
| Animacy=Anim\|Case=Gen\|Gender=Masc\|Negative=Pos | českého | českých | |
| Animacy=Anim\|Case=Ins\|Gender=Masc\|Negative=Pos | českým | českými | |
| Animacy=Anim\|Case=Loc\|Gender=Masc\|Negative=Pos | českém | českých | |
| Animacy=Anim\|Case=Nom\|Gender=Masc\|Negative=Pos | český | čeští | |
| Animacy=Inan\|Case=Acc\|Gender=Masc\|Negative=Pos | český | české | |
| Animacy=Inan\|Case=Dat\|Gender=Masc\|Negative=Pos | českému | českým | |
| Animacy=Inan\|Case=Gen\|Gender=Masc\|Negative=Pos | českého | českých | |
| Animacy=Inan\|Case=Ins\|Gender=Masc\|Negative=Pos | českým | českými | |
| Animacy=Inan\|Case=Loc\|Gender=Masc\|Negative=Pos | českém | českých | |
| Animacy=Inan\|Case=Nom\|Gender=Masc\|Negative=Pos | český | české | |
| Case=Acc\|Gender=Fem\|Negative=Neg | nečeskou | | |
| Case=Acc\|Gender=Fem\|Negative=Pos | českou | české | |
| Case=Acc\|Gender=Neut\|Negative=Pos | české | česká | |
| Case=Dat\|Gender=Fem\|Negative=Pos | české | českým | |
| Case=Dat\|Gender=Neut\|Negative=Pos | českému | | |
| Case=Gen\|Gender=Fem\|Negative=Pos | české | českých | |
| Case=Gen\|Gender=Neut\|Negative=Pos | českého | českých | |
| Case=Ins\|Gender=Fem\|Negative=Pos | českou | českými | českýma |
| Case=Ins\|Gender=Neut\|Negative=Pos | českým | českými | |
| Case=Loc\|Gender=Fem\|Negative=Pos | české | českých | |
| Case=Loc\|Gender=Neut\|Negative=Pos | českém | českých | |
| Case=Nom\|Gender=Fem\|Negative=Pos | česká | české | |
| Case=Nom\|Gender=Fem\|Negative=Pos\|Style=Coll | | český | |
| Case=Nom\|Gender=Neut\|Negative=Pos | české | česká | |
VERB
140097 cs-pos/VERB tokens (85% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Voice=Act (129612; 93%), Negative=Pos (125939; 90%), Gender=EMPTY (76702; 55%), VerbForm=Fin (76681; 55%), Mood=Ind (75722; 54%), Tense=Pres (74325; 53%).
VERB tokens may have the following values of Number:
- Plur (37812; 27% of non-empty Number): jsou, mají, mohou, měli, byly, měly, nejsou, byli, máme, mohli
- Plur,Sing (12651; 9% of non-empty Number): byla, měla, mohla, stala, začala, získala, nebyla, musela, vznikla, oznámila
- Sing (89634; 64% of non-empty Number): je, má, není, byl, může, bylo, řekl, měl, bude, jde
- EMPTY (25537): být, mít, získat, stát, hrát, říci, platit, muset, dělat, dostat
| Paradigm být | Plur,Sing | Sing | Plur |
|---|---|---|---|
| Abbr=Yes\|Mood=Ind\|Negative=Pos\|Person=3\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | j | |
| Animacy=Anim\|Gender=Masc\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | | nebyli |
| Animacy=Anim\|Gender=Masc\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | | byli |
| Animacy=Inan\|Gender=Fem,Masc\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | | nebyly |
| Animacy=Inan\|Gender=Fem,Masc\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | | byly |
| Aspect=Imp\|Gender=Masc\|Negative=Pos\|Tense=Pres\|VerbForm=Trans\|Voice=Act | | jsa | |
| Aspect=Imp\|Gender=Fem,Neut\|Negative=Pos\|Tense=Pres\|VerbForm=Trans\|Voice=Act | | jsouc | |
| Aspect=Imp\|Negative=Pos\|Tense=Pres\|VerbForm=Trans\|Voice=Act | | | jsouce |
| Foreign=Foreign\|Gender=Neut\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | bolo | |
| Gender=Masc\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | nebyl | |
| Gender=Masc\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | byl | |
| Gender=Fem,Neut\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | nebyla | | |
| Gender=Fem,Neut\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | byla | | |
| Gender=Neut\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | nebylo | |
| Gender=Neut\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | bylo | |
| Mood=Cnd\|Person=1\|Style=Coll\|VerbForm=Fin | | | bysme |
| Mood=Cnd\|Person=1\|VerbForm=Fin | | bych | bychom |
| Mood=Imp\|Negative=Neg\|Person=2\|VerbForm=Fin | | | Nebuďte |
| Mood=Imp\|Negative=Pos\|Person=1\|VerbForm=Fin | | | Buďme |
| Mood=Imp\|Negative=Pos\|Person=2\|VerbForm=Fin | | buď | buďte |
| Mood=Imp\|Negative=Pos\|Person=3\|Style=Arch\|VerbForm=Fin | | buď | |
| Mood=Imp\|Negative=Pos\|Person=3\|VerbForm=Fin | | budiž | |
| Mood=Ind\|Negative=Neg\|Person=1\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | nebudu | nebudeme |
| Mood=Ind\|Negative=Neg\|Person=1\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | nejsem | nejsme |
| Mood=Ind\|Negative=Neg\|Person=2\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | nebudeš | nebudete |
| Mood=Ind\|Negative=Neg\|Person=2\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | | nejste |
| Mood=Ind\|Negative=Neg\|Person=3\|Style=Arch\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | Není | nésó |
| Mood=Ind\|Negative=Neg\|Person=3\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | nebude | nebudou |
| Mood=Ind\|Negative=Neg\|Person=3\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | není | nejsou |
| Mood=Ind\|Negative=Pos\|Person=1\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | budu | budeme |
| Mood=Ind\|Negative=Pos\|Person=1\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | jsem | jsme |
| Mood=Ind\|Negative=Pos\|Person=2\|Style=Coll\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | si | |
| Mood=Ind\|Negative=Pos\|Person=2\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | | budete |
| Mood=Ind\|Negative=Pos\|Person=2\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | jsi | jste |
| Mood=Ind\|Negative=Pos\|Person=3\|Style=Arch\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | jest | |
| Mood=Ind\|Negative=Pos\|Person=3\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | bude | budou |
| Mood=Ind\|Negative=Pos\|Person=3\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | je | jsou |
PROPN
68761 cs-pos/PROPN tokens (82% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Negative=Pos (68761; 100%), Abbr=EMPTY (67938; 99%), Gender=Masc (44970; 65%), Case=Nom (37791; 55%), Animacy=Anim (34755; 51%).
PROPN tokens may have the following values of Number:
- Plur (5579; 8% of non-empty Number): LN, USA, Čechách, Němci, Čech, Češi, ČEZ, Němců, Vítkovice, Budějovice
- Sing (63182; 92% of non-empty Number): Praha, Praze, Jiří, Jan, Evropy, Brno, Prahy, Václav, Jana, Petr
- EMPTY (15270): ČR, ODS, J, OSN, ODA, M, ČSFR, V, A, SR
| Paradigm Jan | Sing | Plur |
|---|---|---|
| Case=Acc | Jana | Jany |
| Case=Dat | Janu, Janovi | |
| Case=Gen | Jana, JANA | Janů |
| Case=Ins | Janem | |
| Case=Loc | Janu, Janovi | |
| Case=Nom | Jan, JAN | Janové |
Number seems to be a lexical feature of PROPN: 98% of lemmas (12981) occur with only one value of Number.
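The observation above amounts to counting how many lemmas are seen with exactly one Number value. The following is a rough sketch on invented toy data (only the word forms are borrowed from the lists above; the observation list itself is an assumption), and `share_single_valued` is a hypothetical helper name.

```python
from collections import defaultdict


def share_single_valued(observations):
    """Return the fraction of lemmas that occur with exactly one Number value.

    observations: iterable of (lemma, number_value) pairs.
    """
    values_by_lemma = defaultdict(set)
    for lemma, number in observations:
        values_by_lemma[lemma].add(number)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single / len(values_by_lemma)


# Toy observations, invented for illustration only.
toy = [
    ("Praha", "Sing"), ("Praha", "Sing"),
    ("Jan", "Sing"), ("Jan", "Plur"),
    ("Budějovice", "Plur"),
    ("Brno", "Sing"),
]

share = share_single_valued(toy)
print(share)
```

In this toy sample three of the four lemmas occur with a single value; a high share of this kind is what suggests that Number behaves lexically for proper nouns.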
PRON
40779 cs-pos/PRON tokens (56% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (40664; 100%), Variant=EMPTY (38761; 95%), Person=EMPTY (29549; 72%).
PRON tokens may have the following values of Number:
- Plur (13317; 33% of non-empty Number): které, kteří, nás, je, nám, nich, všechny, všech, jim, nichž
- Sing (27462; 67% of non-empty Number): to, který, která, tím, tom, které, tomu, toho, mu, kterou
- EMPTY (31769): se, si, co, kdo, což, nic, něco, nikdo, sebe, někdo
| Paradigm ten | Sing | Plur |
|---|---|---|
| Abbr=Yes\|Case=Nom\|Gender=Neut | t | |
| Animacy=Anim\|Case=Acc\|Gender=Masc | toho | ty |
| Animacy=Anim\|Case=Nom\|Gender=Masc | | ti |
| Animacy=Inan\|Case=Acc\|Gender=Masc | ten | ty |
| Animacy=Inan\|Case=Nom\|Gender=Masc | | ty |
| Case=Acc\|Gender=Fem | tu | ty |
| Case=Acc\|Gender=Neut | to | ta |
| Case=Dat\|Gender=Masc,Neut | tomu | |
| Case=Dat\|Gender=Fem | té | |
| Case=Dat | | těm |
| Case=Gen\|Gender=Masc,Neut | toho | |
| Case=Gen\|Gender=Fem | té | |
| Case=Gen | | těch |
| Case=Ins\|Gender=Masc,Neut | tím | |
| Case=Ins\|Gender=Fem | tou | |
| Case=Ins | | těmi |
| Case=Loc\|Gender=Masc,Neut | tom | |
| Case=Loc\|Gender=Fem | té | |
| Case=Loc | | těch |
| Case=Nom\|Gender=Masc | ten | |
| Case=Nom\|Gender=Fem | ta | ty |
| Case=Nom\|Gender=Neut | to, ten | ta |
| Case=Nom\|Gender=Neut\|Style=Coll | | Ty |
DET
21264 cs-pos/DET tokens (76% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Gender[psor]=EMPTY (19819; 93%), Number[psor]=EMPTY (16981; 80%), Person=EMPTY (16981; 80%), Reflex=EMPTY (16496; 78%), Poss=EMPTY (12213; 57%).
DET tokens may have the following values of Number:
- Dual (4; 0% of non-empty Number): jejíma, svýma, těma
- Plur (6267; 29% of non-empty Number): těchto, své, tyto, svých, některých, některé, našich, někteří, naše, její
- Sing (14993; 71% of non-empty Number): této, tento, své, tohoto, její, svou, svého, tato, tomto, svůj
- EMPTY (6549): jeho, jejich, několik, několika, jejichž, mnoho, mnoha, jehož, kolik, málo
| Paradigm jeho | Sing | Plur | Dual |
|---|---|---|---|
| Animacy=Anim\|Case=Acc\|Gender=Masc | jejího | | |
| Animacy=Inan\|Case=Acc\|Gender=Masc | její | | |
| Case=Acc\|Gender=Neut | její | | |
| Case=Acc | | její | |
| Case=Dat\|Gender=Masc,Neut | jejímu | | |
| Case=Dat | | jejím | |
| Case=Gen\|Gender=Masc,Neut | jejího | | |
| Case=Gen | | jejích | |
| Case=Ins\|Gender=Masc,Neut | jejím | | |
| Case=Ins\|Gender=Fem | | | jejíma |
| Case=Ins | | jejími | |
| Case=Loc\|Gender=Masc,Neut | jejím | | |
| Case=Loc | | jejích | |
| Case=Nom\|Gender=Masc,Neut | její | | |
| Case=Nom | | její | |
| Gender=Fem | její | | |
AUX
12183 cs-pos/AUX tokens (59% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Voice=Act (11146; 91%), Negative=Pos (10360; 85%), Gender=EMPTY (8931; 73%), VerbForm=Fin (8930; 73%), Mood=Ind (7893; 65%).
AUX tokens may have the following values of Number:
- Plur (4691; 39% of non-empty Number): jsme, budou, bychom, byly, jsou, jste, budeme, byli, nebudou, byste
- Plur,Sing (837; 7% of non-empty Number): byla, nebyla, bývala
- Sing (6655; 55% of non-empty Number): bude, jsem, byl, je, bylo, bych, nebude, nebyl, budu, není
- EMPTY (8612): by, být, býti
| Paradigm být | Plur,Sing | Sing | Plur |
|---|---|---|---|
| Animacy=Anim\|Gender=Masc\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | | nebyli |
| Animacy=Anim\|Gender=Masc\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | | byli |
| Animacy=Inan\|Gender=Fem,Masc\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | | nebyly |
| Animacy=Inan\|Gender=Fem,Masc\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | | byly |
| Aspect=Imp\|Negative=Pos\|Tense=Pres\|VerbForm=Trans\|Voice=Act | | | jsouce |
| Gender=Masc\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | nebyl | |
| Gender=Masc\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | byl | |
| Gender=Fem,Neut\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | nebyla | | |
| Gender=Fem,Neut\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | byla | | |
| Gender=Neut\|Negative=Neg\|Tense=Past\|VerbForm=Part\|Voice=Act | | nebylo | |
| Gender=Neut\|Negative=Pos\|Tense=Past\|VerbForm=Part\|Voice=Act | | bylo | |
| Mood=Cnd\|Person=1\|VerbForm=Fin | | bych | bychom |
| Mood=Cnd\|Person=2\|VerbForm=Fin | | bys | byste |
| Mood=Imp\|Negative=Pos\|Person=3\|Style=Arch\|VerbForm=Fin | | budiž | |
| Mood=Imp\|Negative=Pos\|Person=3\|VerbForm=Fin | | budiž | |
| Mood=Ind\|Negative=Neg\|Person=1\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | nebudu | nebudeme |
| Mood=Ind\|Negative=Neg\|Person=1\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | | nejsme |
| Mood=Ind\|Negative=Neg\|Person=2\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | Nebudeš | nebudete |
| Mood=Ind\|Negative=Neg\|Person=3\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | nebude | nebudou |
| Mood=Ind\|Negative=Neg\|Person=3\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | není | nejsou |
| Mood=Ind\|Negative=Pos\|Person=1\|Style=Coll\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | | budem |
| Mood=Ind\|Negative=Pos\|Person=1\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | budu | budeme |
| Mood=Ind\|Negative=Pos\|Person=1\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | jsem | jsme |
| Mood=Ind\|Negative=Pos\|Person=2\|Style=Coll\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | si | |
| Mood=Ind\|Negative=Pos\|Person=2\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | budeš | budete |
| Mood=Ind\|Negative=Pos\|Person=2\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | jsi | jste |
| Mood=Ind\|Negative=Pos\|Person=3\|Tense=Fut\|VerbForm=Fin\|Voice=Act | | bude | budou |
| Mood=Ind\|Negative=Pos\|Person=3\|Tense=Pres\|VerbForm=Fin\|Voice=Act | | je | jsou |
NUM
11649 cs-pos/NUM tokens (28% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (11307; 97%), NumForm=Word (11307; 97%), NumValue=1,2,3 (8050; 69%), Gender=EMPTY (6890; 59%).
NUM tokens may have the following values of Number:
- Dual (27; 0% of non-empty Number): oběma, dvěma, čtyřma
- Plur (6148; 53% of non-empty Number): dva, tři, dvě, dvou, čtyři, obou, oba, tří, pěti, obě
- Sing (5474; 47% of non-empty Number): jeden, tisíc, pět, jednoho, jedné, jedna, jednu, deset, jedním, šest
- EMPTY (29861): 1, 2, 3, 4, 6, 5, 1992, 10, 1994, 1993
| Paradigm dva | Plur | Dual |
|---|---|---|
| Case=Acc\|Gender=Masc | dva | |
| Case=Acc\|Gender=Fem,Neut | dvě | |
| Case=Dat | dvěma | |
| Case=Gen | dvou | |
| Case=Ins\|Gender=Fem | | dvěma |
| Case=Ins | dvěma | |
| Case=Loc | dvou | |
| Case=Nom\|Gender=Masc | dva | |
| Case=Nom\|Gender=Fem,Neut | dvě | |
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (148090; 99%),
NOUN –[nmod]–> NOUN (67827; 61%),
VERB –[nsubj]–> NOUN (38774; 80%),
NOUN –[det]–> DET (20902; 81%),
NOUN –[conj]–> NOUN (17248; 78%),
NOUN –[nmod]–> PROPN (12057; 53%),
PROPN –[name]–> PROPN (10797; 82%),
VERB –[conj]–> VERB (10770; 70%),
VERB –[nsubj]–> PROPN (10258; 75%),
VERB –[nsubj]–> PRON (9854; 74%).
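Counts of this kind can be approximated by walking over dependency edges and comparing the Number values of parent and child. The sketch below is an assumption about the procedure, not the script that produced the figures above: it takes the denominator to be edges where both nodes have a Number value, compares values by exact string equality (so a combined value such as Plur,Sing only matches itself), and runs on a few invented edges. The name `agreement_by_relation` is hypothetical.

```python
from collections import Counter


def agreement_by_relation(edges):
    """Count Number agreement per (parent POS, relation, child POS).

    edges: iterable of (parent_pos, relation, child_pos, parent_number, child_number);
    a Number of None means the node has no Number value and the edge is skipped.
    Returns {key: (agreeing_edges, agreeing_edges / comparable_edges)}.
    """
    total, agree = Counter(), Counter()
    for parent_pos, relation, child_pos, parent_number, child_number in edges:
        if parent_number is None or child_number is None:
            continue
        key = (parent_pos, relation, child_pos)
        total[key] += 1
        if parent_number == child_number:
            agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total}


# Invented toy edges, for illustration only.
toy_edges = [
    ("NOUN", "amod", "ADJ", "Sing", "Sing"),
    ("NOUN", "amod", "ADJ", "Plur", "Plur"),
    ("VERB", "nsubj", "NOUN", "Sing", "Sing"),
    ("NOUN", "nmod", "NOUN", "Sing", "Plur"),
    ("NOUN", "nmod", "NOUN", "Sing", "Sing"),
    ("VERB", "obj", "NOUN", None, "Sing"),  # skipped: parent has no Number
]

stats = agreement_by_relation(toy_edges)
print(stats)
```

Relations such as amod, where agreement is grammatical, come out near 100% in a table like the one above, while nmod links two nouns whose number is independent and so agrees far less often.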