Treebank Statistics: UD_Slovenian-SSJ: Features: Number
This feature is universal.
It occurs with 3 different values: Dual, Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
148225 tokens (55%) have a non-empty value of Number.
48958 types (101%) occur at least once with a non-empty value of Number.
22215 lemmas (87%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: NOUN (56865; 21% instances), ADJ (28426; 11% instances), VERB (22411; 8% instances), AUX (15812; 6% instances), PROPN (10239; 4% instances), DET (7978; 3% instances), PRON (5026; 2% instances), NUM (1468; 1% instances).
NOUN
56865 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Dual(599; 1% of non-emptyNumber): leti, letoma, meseca, letih, otroka, policista, primerih, starša, strani, državiPlur(15865; 28% of non-emptyNumber): let, ljudi, letih, dni, tolarjev, milijonov, ljudje, odstotkov, oči, podatkovSing(40401; 71% of non-emptyNumber): leta, strani, delo, primer, dan, leto, čas, del, mesto, času
| Paradigm leto | Sing | Dual | Plur |
|---|---|---|---|
| Case=Acc | leto | leti | leta |
| Case=Dat | letom | ||
| Case=Gen | leta | let | let |
| Case=Ins | letom | letoma | leti |
| Case=Loc | letu | letih | letih |
| Case=Nom | leto | leta |
ADJ
28426 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (25970; 91%), VerbForm=EMPTY (24781; 87%), Definite=EMPTY (24330; 86%).
ADJ tokens may have the following values of Number:
Dual(280; 1% of non-emptyNumber): trebušni, izgubljeni, novi, posamični, zadnja, desnima, dodatna, drugačna, drugih, ediniPlur(8805; 31% of non-emptyNumber): drugih, druge, različnih, nove, drugi, novih, sami, drugimi, zadnjih, slovenskihSing(19341; 68% of non-emptyNumber): prvi, mogoče, drugi, sam, novo, veliko, prva, drugo, potrebno, drugega
| Paradigm drug | Sing | Dual | Plur |
|---|---|---|---|
| Case=Acc|Definite=Def|Gender=Masc | drugi | ||
| Case=Acc|Definite=Ind|Gender=Masc | drug | ||
| Case=Acc|Gender=Masc | drugega | druge | |
| Case=Acc|Gender=Fem | drugo | druge | |
| Case=Acc|Gender=Neut | drugo | druga | |
| Case=Dat|Gender=Masc | drugemu | drugim | |
| Case=Dat|Gender=Fem | drugi | ||
| Case=Gen|Gender=Masc | drugega | drugih | |
| Case=Gen|Gender=Fem | druge | drugih | |
| Case=Gen|Gender=Neut | drugega | drugih | |
| Case=Ins|Gender=Masc | drugim | drugimi | |
| Case=Ins|Gender=Fem | drugo | drugimi | |
| Case=Ins|Gender=Neut | drugim | drugimi | |
| Case=Loc|Gender=Masc | drugem | drugih | drugih |
| Case=Loc|Gender=Fem | drugi | drugih | drugih |
| Case=Loc|Gender=Neut | drugem | drugih | |
| Case=Nom|Definite=Def|Gender=Masc | drugi | ||
| Case=Nom|Definite=Ind|Gender=Masc | drug | ||
| Case=Nom|Gender=Masc | drugi | ||
| Case=Nom|Gender=Fem | druga | drugi | druge |
| Case=Nom|Gender=Neut | drugo | druga |
VERB
22411 VERB tokens (91% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Tense=EMPTY (11880; 53%), Mood=EMPTY (11423; 51%), Person=EMPTY (11423; 51%), VerbForm=Part (11423; 51%).
VERB tokens may have the following values of Number:
Dual(650; 3% of non-emptyNumber): imata, sta, imela, morala, bila, odšla, bili, dobila, začela, morataPlur(7663; 34% of non-emptyNumber): so, imajo, imeli, morali, moramo, morajo, začeli, imamo, dobili, biliSing(14098; 63% of non-emptyNumber): je, ima, bilo, ni, gre, bo, mora, imel, pomeni, praviEMPTY(2181): videti, imeti, biti, vedeti, dobiti, narediti, povedati, reči, sprejeti, najti
| Paradigm biti | Sing | Dual | Plur |
|---|---|---|---|
| Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | bijejo | ||
| Gender=Masc|VerbForm=Part | bil | bila, bla | bili |
| Gender=Fem|VerbForm=Part | bila | bili | bile |
| Gender=Neut|VerbForm=Part | bilo, blo | bili | |
| Mood=Imp|Person=2|VerbForm=Fin | bodi | ||
| Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nismo | |
| Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bomo | |
| Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
| Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
| Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | boste | |
| Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si, s | sta | ste |
| Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | niso | |
| Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo |
| Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
AUX
15812 AUX tokens (91% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (14371; 91%), Mood=Ind (14359; 91%), Polarity=Pos (13271; 84%), Tense=Pres (12883; 81%), Person=3 (12848; 81%).
AUX tokens may have the following values of Number:
Dual(487; 3% of non-emptyNumber): sta, sva, bila, bosta, nista, nisva, bili, bovaPlur(4221; 27% of non-emptyNumber): so, bodo, smo, niso, bili, bomo, boste, ste, bile, nismoSing(11104; 70% of non-emptyNumber): je, bo, ni, sem, bil, bila, bilo, bom, nisem, siEMPTY(1515): bi, biti, b
| Paradigm biti | Sing | Dual | Plur |
|---|---|---|---|
| Gender=Masc|VerbForm=Part | bil | bila | bili, bli |
| Gender=Fem|VerbForm=Part | bila, bla | bili | bile |
| Gender=Neut|VerbForm=Part | bilo, blo | bili | bila |
| Mood=Imp|Person=2|VerbForm=Fin | bodi | bodite | |
| Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nisva | nismo |
| Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
| Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
| Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
| Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | bosta | boste |
| Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si, as | sta | ste |
| Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | nista | niso |
| Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
| Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
PROPN
10239 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (6668; 65%), Case=Nom (5770; 56%).
PROPN tokens may have the following values of Number:
Dual(7; 0% of non-emptyNumber): Francoza, Belokranjca, Egipčana, Francozov, Litijana, MakedoncaPlur(530; 5% of non-emptyNumber): ZDA, Slovenci, Slovencev, Nemci, Francozi, Rusi, Slovence, Američani, Aten, AtenahSing(9702; 95% of non-emptyNumber): Slovenije, Sloveniji, EU, Slovenija, Evropi, Ljubljana, Ljubljani, Evrope, Slovenijo, Maribor
| Paradigm Francoz | Sing | Dual | Plur |
|---|---|---|---|
| Case=Acc | Francoza | Francoze | |
| Case=Dat | Francozom | ||
| Case=Gen | Francozov | Francozov | |
| Case=Ins | Francozom | ||
| Case=Nom | Francoz | Francoza | Francozi |
Number seems to be lexical feature of PROPN. 99% lemmas (5013) occur only with one value of Number.
DET
7978 DET tokens (85% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Number[psor]=EMPTY (6502; 81%), Person=EMPTY (6502; 81%), Poss=EMPTY (5684; 71%).
DET tokens may have the following values of Number:
Dual(155; 2% of non-emptyNumber): oba, obeh, obe, obema, ti, svoja, ta, katerih, katerima, njuniPlur(2131; 27% of non-emptyNumber): vse, vseh, vsi, teh, katerih, te, svojih, svoje, nekatere, nekateriSing(5692; 71% of non-emptyNumber): to, tem, tega, ta, vse, svojo, svoje, vsak, katerem, svojEMPTY(1374): več, nekaj, veliko, manj, dovolj, malo, toliko, pol, preveč, največ
| Paradigm ta | Sing | Dual | Plur |
|---|---|---|---|
| Case=Acc|Gender=Masc | ta, tega | te | |
| Case=Acc|Gender=Fem | to | ti | te |
| Case=Acc|Gender=Neut | to | ta | |
| Case=Dat|Gender=Masc | temu | tem | |
| Case=Dat|Gender=Fem | tej | tem | |
| Case=Dat|Gender=Neut | temu | tem | |
| Case=Gen|Gender=Masc | tega | teh | teh |
| Case=Gen|Gender=Fem | te | teh | |
| Case=Gen|Gender=Neut | tega | teh | |
| Case=Ins|Gender=Masc | tem | tema | temi |
| Case=Ins|Gender=Fem | to | temi | |
| Case=Ins|Gender=Neut | tem | temi | |
| Case=Loc|Gender=Masc | tem | teh | teh |
| Case=Loc|Gender=Fem | tej | teh | |
| Case=Loc|Gender=Neut | tem | teh | |
| Case=Nom|Gender=Masc | ta | ta | ti |
| Case=Nom|Gender=Fem | ta | ti | te |
| Case=Nom|Gender=Neut | to | ta |
PRON
5026 PRON tokens (55% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (5026; 100%), PronType=Prs (3884; 77%), Person=3 (2772; 55%), Variant=Short (2535; 50%).
PRON tokens may have the following values of Number:
Dual(115; 2% of non-emptyNumber): ju, nama, jima, njima, njiju, naju, midva, onadva, vaju, vamaPlur(1367; 27% of non-emptyNumber): jih, nas, jim, nam, vam, njih, njimi, vas, vi, miSing(3544; 71% of non-emptyNumber): ga, jo, kar, mu, kaj, mi, ji, me, nekaj, kdoEMPTY(4080): se, si, seboj, sebi, sebe, zase, sabo, nase, vase, medse
| Paradigm on | Sing | Dual | Plur |
|---|---|---|---|
| Case=Acc|Gender=Masc | njega | njiju, onadva | njih, nje |
| Case=Acc|Gender=Masc|Variant=Short | ga | ju, jih | jih |
| Case=Acc|Gender=Fem | njo | njiju | |
| Case=Acc|Gender=Fem|Variant=Short | jo | ju | jih |
| Case=Acc|Gender=Neut|Variant=Short | ga | ju | jih |
| Case=Dat|Gender=Masc | njemu | njima | njim |
| Case=Dat|Gender=Masc|Variant=Short | mu | jima | jim |
| Case=Dat|Gender=Fem | njej | njim | |
| Case=Dat|Gender=Fem|Variant=Short | ji | jima | jim |
| Case=Dat|Gender=Neut|Variant=Short | mu | jim | |
| Case=Gen|Gender=Masc | njega | njiju | njih |
| Case=Gen|Gender=Masc|Variant=Short | ga | jih | |
| Case=Gen|Gender=Fem | nje | njih | |
| Case=Gen|Gender=Fem|Variant=Short | je | ju | jih |
| Case=Gen|Gender=Neut | njega | njih | |
| Case=Gen|Gender=Neut|Variant=Short | ga | jih | |
| Case=Ins|Gender=Masc | njim | njima | njimi |
| Case=Ins|Gender=Fem | njo | njima | njimi |
| Case=Ins|Gender=Neut | njim | njimi | |
| Case=Loc|Gender=Masc | njem | njiju | njih |
| Case=Loc|Gender=Fem | njej | njima | njih |
| Case=Loc|Gender=Neut | njem | njih | |
| Case=Nom|Gender=Masc | on | onadva | oni |
| Case=Nom|Gender=Fem | ona |
NUM
1468 NUM tokens (26% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (1468; 100%), NumType=Card (1462; 100%).
NUM tokens may have the following values of Number:
Dual(278; 19% of non-emptyNumber): dve, dva, dveh, dvemaPlur(739; 50% of non-emptyNumber): tri, štiri, pet, tisoč, treh, deset, štirih, sto, šest, sedemSing(451; 31% of non-emptyNumber): eno, ena, eden, enega, en, enem, eni, ene, enim, dvojeEMPTY(4117): 2, 1, 10, 3, 6, 30, 1., 20, 4, 2000
| Paradigm en | Sing | Plur |
|---|---|---|
| Case=Acc|Gender=Masc | en, enega, Enga | |
| Case=Acc|Gender=Fem | eno | |
| Case=Acc|Gender=Neut | eno | |
| Case=Dat|Gender=Masc | enemu | |
| Case=Dat|Gender=Fem | eni | |
| Case=Gen|Gender=Masc | enega, enga | enih |
| Case=Gen|Gender=Fem | ene | |
| Case=Gen|Gender=Neut | enega | |
| Case=Ins|Gender=Masc | enim | |
| Case=Ins|Gender=Fem | eno | |
| Case=Ins|Gender=Neut | enim | |
| Case=Loc|Gender=Masc | enem | |
| Case=Loc|Gender=Fem | eni | enih |
| Case=Loc|Gender=Neut | enem | |
| Case=Nom|Gender=Masc | en | eni |
| Case=Nom|Gender=Fem | ena | |
| Case=Nom|Gender=Neut | eno |
Number seems to be lexical feature of NUM. 96% lemmas (43) occur only with one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (21319; 100%),
VERB –[aux]–> AUX (9362; 88%),
NOUN –[nmod]–> NOUN (9311; 64%),
VERB –[nsubj]–> NOUN (6430; 93%),
VERB –[obl]–> NOUN (6111; 52%),
NOUN –[det]–> DET (5007; 88%),
NOUN –[conj]–> NOUN (3583; 79%),
ADJ –[cop]–> AUX (3290; 97%),
NOUN –[nmod]–> PROPN (2458; 82%),
NOUN –[acl]–> VERB (2097; 73%).