Number
: number
In Slovenian, Number
is an inflectional feature of nouns and proper nouns, and other parts of speech (adjectives, auxiliaries, determiners, numerals, pronouns, verbs) that mark agreement with nouns.
Slovenian distinguishes three Number
values: singular
, dual
and plural
. Plurale tantum and Singulare tantum are not explicitly marked and are tagged as plural
or singular
, respectively.
Sing
: singular number
Examples
- kolo “one bicycle”
- en gospod “one gentleman”
- moja pisarna “my office”
- sem “I am”
Dual
: dual number
Examples
- kolesi “two bicycles”
- dva gospoda “two gentlemen”
- moji pisarni “my two offices”
- sva “we (two) are”
Plur
: plural number
Examples
- kolesa “two bicycles”
- trije gospodi “three gentlemen”
- moje pisarne “my (three or more) offices”
- smo “we (three or four) are”
Conversion from JOS
All tokens with feature Number=singular are converted to Number=Sing
, all tokens with Number=dual are converted to Number=Dual
and all tokens with Number=plural are converted to Number=Plur
.
Treebank Statistics (UD_Slovenian)
This feature is universal.
It occurs with 3 different values: Dual
, Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
80134 tokens (57%) have a non-empty value of Number
.
31840 types (101%) occur at least once with a non-empty value of Number
.
15210 lemmas (90%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: NOUN (30139; 21% instances), VERB (15900; 11% instances), ADJ (15027; 11% instances), AUX (6261; 4% instances), PROPN (4682; 3% instances), PRON (4508; 3% instances), DET (2877; 2% instances), NUM (740; 1% instances).
NOUN
30139 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Dual
(310; 1% of non-emptyNumber
): letoma, leti, strani, meseca, otroka, primerih, starša, letih, partnerja, policistaPlur
(8484; 28% of non-emptyNumber
): let, letih, ljudi, tolarjev, dni, milijonov, oči, odstotkov, podatkov, letiSing
(21345; 71% of non-emptyNumber
): leta, dan, leto, čas, življenje, strani, del, delu, način, dela
Paradigm leto | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | leto | leti | leta |
Case=Dat | letom | ||
Case=Gen | leta | let | |
Case=Ins | letom | letoma | leti |
Case=Loc | letu | letih | letih |
Case=Nom | leto | leta |
VERB
15900 VERB tokens (92% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Negative=EMPTY (12778; 80%), Gender=EMPTY (8256; 52%), VerbForm=Fin (8256; 52%), Mood=Ind (7982; 50%).
VERB
tokens may have the following values of Number
:
Dual
(464; 3% of non-emptyNumber
): sta, bila, imata, imela, morala, odšla, bosta, hotela, srečala, začelaPlur
(5238; 33% of non-emptyNumber
): so, bili, imajo, bile, niso, smo, moramo, bodo, imeli, moraliSing
(10198; 64% of non-emptyNumber
): je, bil, bilo, ni, bila, bo, ima, gre, mora, imelEMPTY
(1410): biti, videti, imeti, reči, narediti, vedeti, slišati, dobiti, najti, povedati
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | bijejo | ||
Gender=Masc|VerbForm=Part | bil | bila, bla | bili |
Gender=Fem|VerbForm=Part | bila | bili | bile |
Gender=Neut|VerbForm=Part | bilo, blo | bila | |
Mood=Imp|Person=2|VerbForm=Fin | Bodi | bodite | |
Mood=Ind|Negative=Neg|Person=1|Tense=Pres|VerbForm=Fin | nisem | nismo | |
Mood=Ind|Negative=Neg|Person=2|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Negative=Neg|Person=3|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Negative=Pos|Person=1|Tense=Fut|VerbForm=Fin | bom | bomo | |
Mood=Ind|Negative=Pos|Person=1|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Negative=Pos|Person=2|Tense=Fut|VerbForm=Fin | boš | bosta | boste |
Mood=Ind|Negative=Pos|Person=2|Tense=Pres|VerbForm=Fin | si | ste | |
Mood=Ind|Negative=Pos|Person=3|Tense=Fut|VerbForm=Fin | bo | bosta | bodo |
Mood=Ind|Negative=Pos|Person=3|Tense=Pres|VerbForm=Fin | je | sta | so |
ADJ
15027 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (13768; 92%), VerbForm=EMPTY (13098; 87%), Definite=EMPTY (12959; 86%).
ADJ
tokens may have the following values of Number
:
Dual
(126; 1% of non-emptyNumber
): desnima, drugačna, najboljša, nemških, obsojena, predstavljena, slovenska, soodvisni, srečna, trebušniPlur
(4770; 32% of non-emptyNumber
): druge, drugih, različnih, zadnjih, nove, slovenskih, novih, sami, drugi, drugimiSing
(10131; 67% of non-emptyNumber
): mogoče, prvi, sam, drugi, novo, veliko, drugo, novega, pomembno, drugega
Paradigm drug | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Definite=Def|Gender=Masc | drugi | ||
Case=Acc|Definite=Ind|Gender=Masc | drug | ||
Case=Acc|Gender=Masc | drugega | druge | |
Case=Acc|Gender=Fem | drugo | druge | |
Case=Acc|Gender=Neut | drugo | druga | |
Case=Dat|Gender=Masc | drugemu | drugim | |
Case=Dat|Gender=Fem | drugi | ||
Case=Gen|Gender=Masc | drugega | drugih | |
Case=Gen|Gender=Fem | druge | drugih | |
Case=Gen|Gender=Neut | drugega | ||
Case=Ins|Gender=Masc | drugim | drugimi | |
Case=Ins|Gender=Fem | drugo | drugimi | |
Case=Ins|Gender=Neut | drugim | ||
Case=Loc|Gender=Masc | drugem | drugih | |
Case=Loc|Gender=Fem | drugi | drugih | drugih |
Case=Loc|Gender=Neut | drugem | drugih | |
Case=Nom|Definite=Def|Gender=Masc | drugi | ||
Case=Nom|Definite=Ind|Gender=Masc | drug | ||
Case=Nom|Gender=Masc | drugi | ||
Case=Nom|Gender=Fem | druga | druge | |
Case=Nom|Gender=Neut | drugo | druga |
AUX
6261 AUX tokens (88% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (6240; 100%), Mood=Ind (6240; 100%), Negative=Pos (5809; 93%), Tense=Pres (5426; 87%), Person=3 (5345; 85%).
AUX
tokens may have the following values of Number
:
Dual
(239; 4% of non-emptyNumber
): sta, sva, bosta, nista, nisva, bova, bilaPlur
(1806; 29% of non-emptyNumber
): so, bodo, smo, niso, bomo, boste, ste, nismo, niste, biliSing
(4216; 67% of non-emptyNumber
): je, bo, sem, ni, bom, nisem, si, bil, bila, nisiEMPTY
(886): bi, b
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | ||
Mood=Ind|Negative=Neg|Person=1|Tense=Pres|VerbForm=Fin | nisem | nisva | nismo |
Mood=Ind|Negative=Neg|Person=2|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Negative=Neg|Person=3|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Negative=Pos|Person=1|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Negative=Pos|Person=1|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Negative=Pos|Person=2|Tense=Fut|VerbForm=Fin | boste | ||
Mood=Ind|Negative=Pos|Person=2|Tense=Pres|VerbForm=Fin | si, as | sta | ste |
Mood=Ind|Negative=Pos|Person=3|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Negative=Pos|Person=3|Tense=Pres|VerbForm=Fin | je | sta | so |
PROPN
4682 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Gender=Masc (2922; 62%), Case=Nom (2427; 52%).
PROPN
tokens may have the following values of Number
:
Dual
(4; 0% of non-emptyNumber
): Belokranjca, Egipčana, Francoza, LitijanaPlur
(270; 6% of non-emptyNumber
): ZDA, Slovenci, Slovencev, Francozi, Nemcev, Nemci, Slovence, Američani, Atenah, BrežiceSing
(4408; 94% of non-emptyNumber
): Slovenije, Sloveniji, Slovenija, EU, Ljubljani, Slovenijo, Evropi, Mariboru, LJUBLJANA, Ljubljana
Paradigm Francoz | Sing | Dual | Plur |
---|---|---|---|
Case=Dat | Francozom | ||
Case=Gen | Francozov | ||
Case=Ins | Francozom | ||
Case=Nom | Francoz | Francoza | Francozi |
Number
seems to be lexical feature of PROPN
. 99% lemmas (2573) occur only with one value of Number
.
PRON
4508 PRON tokens (65% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Variant=EMPTY (2919; 65%), PronType=Prs (2342; 52%).
PRON
tokens may have the following values of Number
:
Dual
(95; 2% of non-emptyNumber
): ju, oba, nama, jima, njima, naju, njiju, midva, obeh, vajuPlur
(1100; 24% of non-emptyNumber
): jih, nas, nam, vam, jim, katerih, vas, vsi, njih, njimiSing
(3313; 73% of non-emptyNumber
): to, ga, jo, tem, mu, kaj, mi, kar, tega, jiEMPTY
(2421): se, si, sebi, sebe, seboj, zase, sabo, nase, vase, čigar
Paradigm on | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | njega | njiju | njih, nje |
Case=Acc|Gender=Masc|Variant=Short | ga | ju | jih |
Case=Acc|Gender=Fem | njo | ||
Case=Acc|Gender=Fem|Variant=Short | jo | ju | jih |
Case=Acc|Gender=Neut|Variant=Short | ga | ju | jih |
Case=Dat|Gender=Masc | njemu | njima | njim |
Case=Dat|Gender=Masc|Variant=Short | mu | jima | jim |
Case=Dat|Gender=Fem | njej | njim | |
Case=Dat|Gender=Fem|Variant=Short | ji | jima | jim |
Case=Dat|Gender=Neut|Variant=Short | mu | jim | |
Case=Gen|Gender=Masc | njega | njiju | njih |
Case=Gen|Gender=Masc|Variant=Short | ga | jih | |
Case=Gen|Gender=Fem | nje | njih | |
Case=Gen|Gender=Fem|Variant=Short | je | ju | jih |
Case=Gen|Gender=Neut | njega | njih | |
Case=Gen|Gender=Neut|Variant=Short | ga | jih | |
Case=Ins|Gender=Masc | njim | njima | njimi |
Case=Ins|Gender=Fem | njo | njima | njimi |
Case=Ins|Gender=Neut | njim | njimi | |
Case=Loc|Gender=Masc | njem | njiju | njih |
Case=Loc|Gender=Fem | njej | njih | |
Case=Loc|Gender=Neut | njem | njih | |
Case=Nom|Gender=Masc | on | onadva | oni |
Case=Nom|Gender=Fem | ona |
DET
2877 DET tokens (86% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Degree=EMPTY (2877; 100%), Gender[psor]=EMPTY (2498; 87%), Number[psor]=EMPTY (2073; 72%), Poss=EMPTY (2073; 72%), Person=EMPTY (2073; 72%).
DET
tokens may have the following values of Number
:
Dual
(70; 2% of non-emptyNumber
): obeh, oba, obe, obema, svoja, njuni, ta, Moja, Njegova, NjeniPlur
(881; 31% of non-emptyNumber
): vse, vseh, teh, svoje, te, svojih, nekatere, vsi, naših, njegovihSing
(1926; 67% of non-emptyNumber
): ta, tem, svojo, to, svoje, svoj, vsak, tega, te, njegovEMPTY
(455): nekaj, več, veliko, dovolj, manj, malo, največ, pol, toliko, mnogo
Paradigm ta | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | ta, tega | te | |
Case=Acc|Gender=Fem | to | ti | te |
Case=Acc|Gender=Neut | to | ta | |
Case=Dat|Gender=Masc | temu | ||
Case=Dat|Gender=Fem | tej | tem | |
Case=Dat|Gender=Neut | temu | ||
Case=Gen|Gender=Masc | tega | teh | teh |
Case=Gen|Gender=Fem | te | teh | |
Case=Gen|Gender=Neut | tega | teh | |
Case=Ins|Gender=Masc | tem | temi | |
Case=Ins|Gender=Fem | to | temi | |
Case=Ins|Gender=Neut | tem | ||
Case=Loc|Gender=Masc | tem | teh | |
Case=Loc|Gender=Fem | tej | teh | |
Case=Loc|Gender=Neut | tem | teh | |
Case=Nom|Gender=Masc | ta | ta | ti |
Case=Nom|Gender=Fem | ta | te | |
Case=Nom|Gender=Neut | to | ta |
NUM
740 NUM tokens (38% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (740; 100%), NumType=Card (735; 99%).
NUM
tokens may have the following values of Number
:
Dual
(119; 16% of non-emptyNumber
): dveh, dva, dve, dvemaPlur
(415; 56% of non-emptyNumber
): tri, tisoč, štiri, štirih, pet, sto, treh, deset, šest, sedemSing
(206; 28% of non-emptyNumber
): eno, ena, eden, en, enega, enem, eni, ene, enim, dvojeEMPTY
(1187): 10, 15, 2000, 1., 50, 30, 3, 20, 20., 6
Paradigm en | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | en, enega | |
Case=Acc|Gender=Fem | eno | |
Case=Acc|Gender=Neut | eno | |
Case=Dat|Gender=Masc | enemu | |
Case=Dat|Gender=Fem | eni | |
Case=Gen|Gender=Masc | enega | |
Case=Gen|Gender=Fem | ene | |
Case=Gen|Gender=Neut | enega | |
Case=Ins|Gender=Masc | enim | |
Case=Ins|Gender=Fem | eno | |
Case=Ins|Gender=Neut | enim | |
Case=Loc|Gender=Masc | enem | |
Case=Loc|Gender=Fem | eni | enih |
Case=Loc|Gender=Neut | enem | |
Case=Nom|Gender=Masc | en | eni |
Case=Nom|Gender=Fem | ena | |
Case=Nom|Gender=Neut | eno |
Number
seems to be lexical feature of NUM
. 97% lemmas (38) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (11346; 99%),
VERB –[aux]–> AUX (5667; 88%),
NOUN –[nmod]–> NOUN (5138; 65%),
VERB –[nsubj]–> NOUN (3670; 92%),
VERB –[nmod]–> NOUN (3543; 51%),
NOUN –[det]–> DET (2824; 86%),
ADJ –[cop]–> VERB (1844; 97%),
NOUN –[conj]–> NOUN (1666; 78%),
NOUN –[nmod]–> PROPN (1230; 81%),
NOUN –[acl]–> VERB (1225; 72%).
Number in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]