Treebank Statistics: UD_Slovenian-SST: Features: Number
This feature is universal.
It occurs with 3 different values: Dual
, Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
14475 tokens (49%) have a non-empty value of Number
.
5267 types (86%) occur at least once with a non-empty value of Number
.
3299 lemmas (84%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: VERB (3662; 12% instances), NOUN (3626; 12% instances), AUX (1790; 6% instances), ADJ (1664; 6% instances), DET (1611; 5% instances), PRON (1179; 4% instances), NUM (499; 2% instances), PROPN (444; 2% instances).
VERB
3662 VERB tokens (93% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Polarity=EMPTY (2918; 80%), Gender=EMPTY (2498; 68%), VerbForm=Fin (2498; 68%), Mood=Ind (2290; 63%), Tense=Pres (2226; 61%).
VERB
tokens may have the following values of Number
:
Dual
(85; 2% of non-emptyNumber
): sta, šla, bila, gledava, imava, prišla, dajva, greva, imata, imelaPlur
(1037; 28% of non-emptyNumber
): recimo, so, imamo, imeli, imajo, imate, gremo, rekli, vemo, smoSing
(2540; 69% of non-emptyNumber
): je, vem, veš, mislim, bilo, ni, ima, bo, bil, praviEMPTY
(271): bi, narediti, reči, biti, delati, imeti, iti, priti, videti, kupiti
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Gender=Masc|VerbForm=Part | bil | ||
Aspect=Imp|Gender=Neut|VerbForm=Part | bilo | ||
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | bile | |
Gender=Neut|VerbForm=Part | bilo | ||
Mood=Imp|Person=2|VerbForm=Fin | bodite | ||
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nismo | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | niste | ||
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | ||
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si | ste | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | niso | |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
NOUN
3626 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Animacy=EMPTY (3245; 89%).
NOUN
tokens may have the following values of Number
:
Dual
(35; 1% of non-emptyNumber
): dni, elementa, milijona, akterja, bivola, brata, datuma, disciplinah, dneva, dogodkaPlur
(855; 24% of non-emptyNumber
): evrov, ljudi, stvari, dni, minut, stopinj, letih, let, razmere, letiSing
(2736; 75% of non-emptyNumber
): bistvu, dan, redu, strani, jutro, leto, čas, koncu, gospod, hvala
Paradigm leto | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | leto | leta | |
Case=Gen | leta | let | |
Case=Ins | letom | leti | |
Case=Loc | letu | letih | |
Case=Nom | leto | leti |
AUX
1790 AUX tokens (92% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (1662; 93%), Mood=Ind (1658; 93%), Polarity=Pos (1538; 86%), Tense=Pres (1423; 79%), Person=3 (1122; 63%).
AUX
tokens may have the following values of Number
:
Dual
(51; 3% of non-emptyNumber
): sta, sva, bova, bila, bosta, nisvaPlur
(448; 25% of non-emptyNumber
): so, smo, bomo, ste, boste, bili, bodo, nismo, niso, bojoSing
(1291; 72% of non-emptyNumber
): je, sem, bo, ni, si, bil, bila, bilo, bom, bošEMPTY
(146): bi, biti
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Gender=Masc|VerbForm=Part | bil | ||
Aspect=Imp|Gender=Neut|VerbForm=Part | bilo | ||
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | bile | |
Gender=Neut|VerbForm=Part | bilo | bila | |
Mood=Imp|Person=2|VerbForm=Fin | bodi | bodite | |
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nisva | nismo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | boste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si | ste | |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | niso | |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je, biti | sta | so |
ADJ
1664 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: VerbForm=EMPTY (1478; 89%), Degree=Pos (1442; 87%), Definite=EMPTY (1350; 81%), Case=Nom (880; 53%).
ADJ
tokens may have the following values of Number
:
Dual
(12; 1% of non-emptyNumber
): blagovni, blagovnih, drugih, grozna, mali, napisana, predvidena, spodnji, sprejeta, upognjenaPlur
(386; 23% of non-emptyNumber
): sami, zadnjih, same, dobri, druge, drugih, ljudske, psihološki, tujih, bogatejšiSing
(1266; 76% of non-emptyNumber
): dobro, drugo, prvi, dober, drugi, zanimivo, druga, drugega, glavnem, lep
Paradigm drug | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Definite=Def|Gender=Masc | drugi | ||
Case=Acc|Gender=Masc | druge | ||
Case=Acc|Gender=Fem | drugo | druge | |
Case=Acc|Gender=Neut | drugo | ||
Case=Dat|Gender=Masc | drugemu | ||
Case=Gen|Gender=Masc | drugega | drugih | |
Case=Gen|Gender=Fem | druge | drugih | |
Case=Gen|Gender=Neut | drugega | ||
Case=Ins|Gender=Fem | drugo | drugimi | |
Case=Ins|Gender=Neut | drugim | ||
Case=Loc|Gender=Fem | drugi | drugih | |
Case=Loc|Gender=Neut | drugem | ||
Case=Nom|Definite=Def|Gender=Masc | drugi | ||
Case=Nom|Definite=Ind|Gender=Masc | drug | ||
Case=Nom|Gender=Masc | drugi | ||
Case=Nom|Gender=Fem | druga | druge | |
Case=Nom|Gender=Neut | drugo |
DET
1611 DET tokens (87% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Dem (1055; 65%), Gender=Neut (833; 52%).
DET
tokens may have the following values of Number
:
Dual
(14; 1% of non-emptyNumber
): obe, oba, obeh, moja, obadva, ona, onih, takšni, tiPlur
(265; 16% of non-emptyNumber
): te, teh, vsi, take, ti, tistih, vse, naših, vseh, kakšneSing
(1332; 83% of non-emptyNumber
): to, ta, vse, tem, tega, nič, tisto, nekaj, tole, tistiEMPTY
(233): malo, nekaj, več, koliko, dosti, toliko, veliko, pol, manj, preveč
Paradigm ta | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | ta, tega | te | |
Case=Acc|Gender=Fem | to | te | |
Case=Acc|Gender=Neut | to | ta | |
Case=Dat|Gender=Masc | temu | tem | |
Case=Dat|Gender=Fem | tej | tem | |
Case=Dat|Gender=Neut | temu | tem | |
Case=Gen|Gender=Masc | tega | teh | |
Case=Gen|Gender=Fem | te | teh | |
Case=Gen|Gender=Neut | tega | teh | |
Case=Ins|Gender=Masc | tem | temi | |
Case=Ins|Gender=Fem | to | temi | |
Case=Ins|Gender=Neut | tem | ||
Case=Loc|Gender=Masc | tem | ||
Case=Loc|Gender=Fem | tej | teh | |
Case=Loc|Gender=Neut | tem | teh | |
Case=Nom|Gender=Masc | ta | ti | |
Case=Nom|Gender=Fem | ta | ti | te |
Case=Nom|Gender=Neut | to | ta |
PRON
1179 PRON tokens (72% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (1179; 100%), PronType=Prs (894; 76%), Variant=EMPTY (823; 70%).
PRON
tokens may have the following values of Number
:
Dual
(17; 1% of non-emptyNumber
): midve, onadva, vidva, midva, nama, vidve, njima, vajuPlur
(261; 22% of non-emptyNumber
): jih, mi, nas, vi, vam, jim, vas, oni, nam, namiSing
(901; 76% of non-emptyNumber
): kaj, jaz, ti, mi, ga, jo, kar, kdo, on, onaEMPTY
(462): se, si, sabo, sebi, sebe, seboj, zase
Paradigm jaz | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | mene | nas | |
Case=Acc|Variant=Short | me | ||
Case=Dat | meni | nama | nam |
Case=Dat|Variant=Short | mi | ||
Case=Gen | mene | nas | |
Case=Gen|Variant=Short | me | ||
Case=Ins | mano | nami | |
Case=Loc | meni | nas | |
Case=Nom|Gender=Masc | midva | mi | |
Case=Nom|Gender=Fem | midve | me | |
Case=Nom | jaz |
NUM
499 NUM tokens (100% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (499; 100%), NumType=Card (498; 100%).
NUM
tokens may have the following values of Number
:
Dual
(59; 12% of non-emptyNumber
): dva, dve, dvehPlur
(287; 58% of non-emptyNumber
): tri, tisoč, dvajset, pet, petnajst, štiri, sto, šest, deset, petdesetSing
(153; 31% of non-emptyNumber
): eno, en, ena, enega, ene, eden, enim, eni, enemu
Paradigm en | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | en, enega | |
Case=Acc|Gender=Fem | eno | ene |
Case=Acc|Gender=Neut | eno | |
Case=Dat|Gender=Masc | enemu | |
Case=Gen|Gender=Masc | enega | enih |
Case=Gen|Gender=Fem | ene | |
Case=Ins|Gender=Masc | enim | |
Case=Ins|Gender=Fem | eno | |
Case=Ins|Gender=Neut | enim | |
Case=Loc|Gender=Fem | eni | |
Case=Nom|Gender=Masc | en | eni |
Case=Nom|Gender=Fem | ena | |
Case=Nom|Gender=Neut | eno | ena |
Number
seems to be lexical feature of NUM
. 98% lemmas (52) occur only with one value of Number
.
PROPN
444 PROPN tokens (59% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Gender=Masc (267; 60%), Case=Nom (233; 52%).
PROPN
tokens may have the following values of Number
:
Dual
(3; 1% of non-emptyNumber
): američanki, italijanki, štajerPlur
(38; 9% of non-emptyNumber
): božjah, karavanke, ledinah, triestini, zrečah, alpe, američanov, beatlese, benetke, božjeSing
(403; 91% of non-emptyNumber
): slovenija, sloveniji, jones, slovenije, tom, david, healy, iraku, jezus, bistricaEMPTY
(314): [name:personal], [name:surname], [name:address], [name:organisation], [name:place]
Paradigm Herman | Sing | Plur |
---|---|---|
herman | hermani |
Number
seems to be lexical feature of PROPN
. 99% lemmas (304) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
VERB –[aux]–> AUX (953; 88%),
NOUN –[amod]–> ADJ (945; 100%),
NOUN –[det]–> DET (591; 92%),
VERB –[obl]–> NOUN (464; 55%),
VERB –[nsubj]–> NOUN (370; 89%),
VERB –[nsubj]–> PRON (351; 96%),
VERB –[parataxis]–> VERB (319; 70%),
ADJ –[cop]–> AUX (302; 96%),
VERB –[obj]–> PRON (249; 62%),
NOUN –[nmod]–> NOUN (244; 65%).