Treebank Statistics: UD_Slovenian-SST: Features: Number
This feature is universal.
It occurs with 3 different values: Dual
, Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
40541 tokens (41%) have a non-empty value of Number
.
11957 types (90%) occur at least once with a non-empty value of Number
.
6389 lemmas (84%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: NOUN (11395; 12% instances), VERB (9311; 9% instances), ADJ (5272; 5% instances), AUX (4799; 5% instances), DET (4585; 5% instances), PRON (2860; 3% instances), PROPN (1271; 1% instances), NUM (1048; 1% instances).
NOUN
11395 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Dual
(83; 1% of non-emptyNumber
): leti, brata, otroka, dni, elementa, fanta, kovčka, meseca, milijona, oddelkaPlur
(3070; 27% of non-emptyNumber
): let, stvari, ljudi, ljudje, otrok, evrov, leta, letih, dni, otrokeSing
(8242; 72% of non-emptyNumber
): bistvu, strani, dan, čas, leto, način, hvala, primer, redu, koncu
Paradigm leto | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | leto | leti | leta |
Case=Gen | leta | let | |
Case=Ins | letom | leti | |
Case=Loc | letu | letih | |
Case=Nom | leto | leti | leta |
VERB
9311 VERB tokens (93% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Polarity=EMPTY (7622; 82%), Gender=EMPTY (6263; 67%), VerbForm=Fin (6263; 67%), Mood=Ind (5838; 63%), Tense=Pres (5714; 61%).
VERB
tokens may have the following values of Number
:
Dual
(204; 2% of non-emptyNumber
): šla, imela, sta, bila, sva, prišla, gledala, imata, imava, delavaPlur
(2802; 30% of non-emptyNumber
): recimo, so, imamo, imeli, imajo, imate, vemo, rekli, moramo, šliSing
(6305; 68% of non-emptyNumber
): je, vem, veš, mislim, bilo, ni, ima, pravi, gre, rekelEMPTY
(727): bi, biti, narediti, reči, iti, imeti, povedati, priti, govoriti, kupiti
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Gender=Masc|VerbForm=Part | bil | ||
Aspect=Imp|Gender=Neut|VerbForm=Part | bilo | ||
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | bili | bile |
Gender=Neut|VerbForm=Part | bilo | ||
Mood=Imp|Person=2|VerbForm=Fin | bodite | ||
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nismo | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | niste | ||
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | boste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si | sta | ste |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
ADJ
5272 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (4663; 88%), VerbForm=EMPTY (4609; 87%), Definite=EMPTY (4425; 84%).
ADJ
tokens may have the following values of Number
:
Dual
(25; 0% of non-emptyNumber
): polna, blagovni, blagovnih, bolezenski, drugih, fer, grozna, ločeni, mali, medicinskiPlur
(1471; 28% of non-emptyNumber
): različne, sami, različnih, drugih, pozdravljeni, zadnjih, določene, nove, socialnih, dobriSing
(3776; 72% of non-emptyNumber
): drugi, dobro, drugo, prvi, zanimivo, dober, sam, lepa, pomembno, druga
Paradigm drug | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Definite=Def|Gender=Masc | drugi | ||
Case=Acc|Definite=Ind|Gender=Masc | drug | ||
Case=Acc|Gender=Masc | drugega | druge | |
Case=Acc|Gender=Fem | drugo | druge | |
Case=Acc|Gender=Neut | drugo | druga | |
Case=Dat|Gender=Masc | drugemu | drugim | |
Case=Dat|Gender=Neut | drugim | ||
Case=Gen|Gender=Masc | drugega | drugih | |
Case=Gen|Gender=Fem | druge | drugih | |
Case=Gen|Gender=Neut | drugega | ||
Case=Ins|Gender=Masc | drugimi | ||
Case=Ins|Gender=Fem | drugo | drugimi | |
Case=Ins|Gender=Neut | drugim | drugimi | |
Case=Loc|Gender=Masc | drugem | drugih | |
Case=Loc|Gender=Fem | drugi | drugih | drugih |
Case=Loc|Gender=Neut | drugem | ||
Case=Nom|Definite=Def|Gender=Masc | drugi | ||
Case=Nom|Definite=Ind|Gender=Masc | drug | ||
Case=Nom|Gender=Masc | drugi | ||
Case=Nom|Gender=Fem | druga | druge | |
Case=Nom|Gender=Neut | drugo | druga |
AUX
4799 AUX tokens (92% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (4465; 93%), Mood=Ind (4459; 93%), Polarity=Pos (4118; 86%), Tense=Pres (3930; 82%), Person=3 (3010; 63%).
AUX
tokens may have the following values of Number
:
Dual
(130; 3% of non-emptyNumber
): sta, sva, bova, bosta, bila, nisva, bodita, nistaPlur
(1243; 26% of non-emptyNumber
): so, smo, ste, bomo, bili, boste, bodo, niso, nismo, bojoSing
(3426; 71% of non-emptyNumber
): je, sem, ni, bo, si, bilo, bom, bila, bil, nisemEMPTY
(438): bi, biti
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Gender=Masc|VerbForm=Part | bil | ||
Aspect=Imp|Gender=Neut|VerbForm=Part | bilo | ||
Aspect=Imp|Mood=Imp|Person=2|VerbForm=Fin | bodita | ||
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | bile | |
Gender=Neut|VerbForm=Part | bilo | bila | |
Mood=Imp|Person=2|VerbForm=Fin | bodi | bodite | |
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nisva | nismo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | bosta | boste |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si | sta | ste |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|Typo=Yes|VerbForm=Fin | ni | ||
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je, biti | sta | so, sa |
DET
4585 DET tokens (83% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: PronType=Dem (2802; 61%).
DET
tokens may have the following values of Number
:
Dual
(31; 1% of non-emptyNumber
): oba, obe, ta, obeh, moja, ona, ena, naša, obadva, onihPlur
(967; 21% of non-emptyNumber
): te, teh, vsi, ti, vse, vseh, tiste, tistih, tisti, katerihSing
(3587; 78% of non-emptyNumber
): to, ta, tega, vse, tem, tisto, neko, en, neki, tejEMPTY
(942): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
Paradigm ta | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | ta, tega | ta | te |
Case=Acc|Gender=Fem | to | te | |
Case=Acc|Gender=Neut | to | ta | |
Case=Dat|Gender=Masc | temu | tem | |
Case=Dat|Gender=Fem | tej | tem | |
Case=Dat|Gender=Neut | temu | tem | |
Case=Gen|Gender=Masc | tega | teh | |
Case=Gen|Gender=Fem | te | teh | |
Case=Gen|Gender=Neut | tega | teh | |
Case=Ins|Gender=Masc | tem | temi | |
Case=Ins|Gender=Fem | to | temi | |
Case=Ins|Gender=Neut | tem | temi | |
Case=Loc|Gender=Masc | tem | teh | |
Case=Loc|Gender=Fem | tej | teh | |
Case=Loc|Gender=Neut | tem | teh | |
Case=Nom|Gender=Masc | ta | ta | ti |
Case=Nom|Gender=Fem | ta | ti | te |
Case=Nom|Gender=Neut | to | ta | |
Case=Nom|Gender=Neut|Typo=Yes | ta |
PRON
2860 PRON tokens (65% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (2860; 100%), PronType=Prs (2124; 74%), Variant=EMPTY (1891; 66%).
PRON
tokens may have the following values of Number
:
Dual
(45; 2% of non-emptyNumber
): midva, naju, onadva, vidva, midve, nama, ju, njima, jima, vidvePlur
(638; 22% of non-emptyNumber
): jih, mi, nas, nam, vi, vam, jim, vas, oni, namiSing
(2177; 76% of non-emptyNumber
): kaj, jaz, mi, ti, ga, kar, jo, me, meni, kdoEMPTY
(1524): se, si, sabo, sebe, sebi, seboj, zase
Paradigm jaz | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | mene | naju | nas |
Case=Acc|Variant=Short | me | ||
Case=Dat | meni | nama | nam |
Case=Dat|Variant=Short | mi | ||
Case=Gen | mene | nas | |
Case=Gen|Variant=Short | me | ||
Case=Ins | mano | nama | nami |
Case=Loc | meni | nas | |
Case=Nom|Gender=Masc | midva | mi | |
Case=Nom|Gender=Fem | midve | me | |
Case=Nom | jaz |
PROPN
1271 PROPN tokens (73% of all PROPN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PROPN
and Number
co-occurred: Gender=Masc (693; 55%).
PROPN
tokens may have the following values of Number
:
Dual
(5; 0% of non-emptyNumber
): Afganistanca, Američanki, Italijanki, štajerPlur
(101; 8% of non-emptyNumber
): Romov, Božjah, Karavanke, slovenci, Italijani, Romi, Abitanti, Afganistanci, Izlake, JeseniceSing
(1165; 92% of non-emptyNumber
): Sloveniji, Slovenija, Slovenije, Ljubljani, Ljubljane, Mariboru, Agropop, Ljubljana, rtv, CeljaEMPTY
(467): [name:personal], [name:surname], [name:organisation], [name:address], si, ngl, [name:place], al, kk
Paradigm Slovenec | Sing | Plur |
---|---|---|
Case=Acc | Slovence | |
Case=Gen | Slovenca | Slovencev |
Case=Nom | Slovenec | slovenci |
Number
seems to be lexical feature of PROPN
. 99% lemmas (639) occur only with one value of Number
.
NUM
1048 NUM tokens (100% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (1047; 100%), NumType=Card (1046; 100%), Gender=EMPTY (552; 53%).
NUM
tokens may have the following values of Number
:
Dual
(137; 13% of non-emptyNumber
): dva, dve, dveh, dvemaPlur
(682; 65% of non-emptyNumber
): tri, tisoč, pet, dvajset, trideset, deset, petnajst, štiri, sto, petdesetSing
(229; 22% of non-emptyNumber
): ena, en, eno, eden, enega, eni, ene, enem, enim, drugem
Paradigm en | Sing | Plur |
---|---|---|
Case=Acc|Gender=Masc | en, enega, een | |
Case=Acc|Gender=Fem | eno | ene |
Case=Acc|Gender=Neut | eno | |
Case=Dat|Gender=Fem | eni | |
Case=Gen|Gender=Masc | enega | enih |
Case=Gen|Gender=Fem | ene | |
Case=Gen|Gender=Neut | enega | enih |
Case=Ins|Gender=Masc | enim | |
Case=Ins|Gender=Fem | eno | |
Case=Ins|Gender=Neut | enim | |
Case=Loc|Gender=Masc | enem | |
Case=Loc|Gender=Fem | eni | |
Case=Loc|Gender=Neut | enem | |
Case=Nom|Gender=Masc | en | eni |
Case=Nom|Gender=Fem | ena | ene |
Case=Nom|Gender=Neut | eno | ena |
Number
seems to be lexical feature of NUM
. 99% lemmas (77) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (3234; 100%),
VERB –[aux]–> AUX (2504; 87%),
NOUN –[det]–> DET (2059; 90%),
VERB –[obl]–> NOUN (1397; 53%),
VERB –[nsubj]–> NOUN (1189; 91%),
NOUN –[nmod]–> NOUN (1074; 64%),
ADJ –[cop]–> AUX (908; 96%),
VERB –[parataxis]–> VERB (830; 74%),
VERB –[nsubj]–> PRON (760; 95%),
VERB –[conj]–> VERB (656; 76%).