Treebank Statistics: UD_Slovenian-SST: Features: Number
This feature is universal.
It occurs with 3 different values: Dual, Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
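A layered feature name carries its layer in square brackets after the base name, as in Number[psor]. The following is a minimal sketch of separating the two parts; the function name `split_layer` and the regular expression are illustrative assumptions, not part of the tooling that produced this page.

```python
import re

# Matches a feature name with an optional layer in square brackets,
# e.g. "Number" or "Number[psor]". The exact character classes are an
# assumption made for this sketch.
_LAYERED = re.compile(r"^(?P<base>[A-Za-z0-9]+)(?:\[(?P<layer>[a-z0-9]+)\])?$")


def split_layer(feature_name):
    """Return (base, layer); layer is None for an unlayered feature."""
    match = _LAYERED.match(feature_name)
    if match is None:
        raise ValueError(f"not a feature name: {feature_name!r}")
    return match.group("base"), match.group("layer")
```

With this helper, both layers listed above map to the same base name Number, which makes it possible to count them separately or together.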
40573 tokens (53%) have a non-empty value of Number.
11988 types (90%) occur at least once with a non-empty value of Number.
6420 lemmas (84%) occur at least once with a non-empty value of Number.
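The three coverage figures above can be approximated from the treebank's CoNLL-U data. The sketch below assumes the data is available as a CoNLL-U string, treats a "type" as a distinct word form (case-sensitive), and skips multiword token ranges and empty nodes. These choices and the names `parse_feats`, `iter_tokens` and `number_coverage` are assumptions, so the results may not match the page exactly.

```python
def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def number_coverage(conllu_text):
    """Return (with Number, total) pairs for tokens, types and lemmas."""
    tokens = list(iter_tokens(conllu_text))
    with_number = [t for t in tokens if "Number" in t[3]]
    return {
        "tokens": (len(with_number), len(tokens)),
        "types": (len({t[0] for t in with_number}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in with_number}), len({t[1] for t in tokens})),
    }
```

Dividing the first element of each pair by the second gives the percentage in parentheses.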
The feature is used with 8 part-of-speech tags: NOUN (11411; 15% instances), VERB (9322; 12% instances), ADJ (5271; 7% instances), AUX (4792; 6% instances), DET (4438; 6% instances), PRON (2862; 4% instances), PROPN (1290; 2% instances), NUM (1187; 2% instances).
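A per-tag breakdown like the one above, and the per-value counts in the sections that follow, could be gathered in a single pass over the data. This is a sketch under the same assumptions as before (CoNLL-U input as a string, regular token lines only); `number_by_upos` is a hypothetical name, and tokens without the feature are counted under EMPTY to mirror the page's wording.

```python
from collections import Counter, defaultdict


def number_by_upos(conllu_text):
    """Count values of Number per UPOS tag; missing values count as 'EMPTY'."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue  # comments, blank lines, multiword ranges, empty nodes
        upos, feats = cols[3], cols[5]
        value = "EMPTY"
        for item in feats.split("|"):
            name, _, val = item.partition("=")
            if name == "Number":
                value = val
        counts[upos][value] += 1
    return counts
```

For a given tag, the sum over Dual, Plur and Sing is the number of tokens with a non-empty value, and each value's share of that sum is the percentage "of non-empty Number".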
NOUN
11411 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Dual (83; 1% of non-empty Number): leti, brata, otroka, dni, elementa, fanta, kovčka, meseca, milijona, oddelka
Plur (3072; 27% of non-empty Number): let, stvari, ljudi, ljudje, otrok, evrov, leta, letih, dni, otroke
Sing (8256; 72% of non-empty Number): bistvu, strani, dan, čas, način, leto, hvala, primer, redu, koncu
Paradigm leto | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | leto | leti | leta |
Case=Gen | leta | let | |
Case=Ins | letom | leti | |
Case=Loc | letu | letih | |
Case=Nom | leto | leti | leta |
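A paradigm table of this kind groups the attested forms of one lemma by their value of Number and by their remaining features. The sketch below is one possible way to build such a grouping from CoNLL-U text. It keys each row on all other features of the token, which is an assumption: the page appears to show only a subset of features per row, and its selection rule is not stated here. The name `paradigm` is hypothetical.

```python
from collections import defaultdict


def paradigm(conllu_text, lemma, upos):
    """Map each remaining-feature string to {Number value: set of forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue  # comments, blank lines, multiword ranges, empty nodes
        if cols[2] != lemma or cols[3] != upos or cols[5] == "_":
            continue
        feats = dict(item.split("=", 1) for item in cols[5].split("|"))
        number = feats.pop("Number", None)
        if number is None:
            continue  # forms without Number do not belong to any column
        row_key = "|".join(f"{k}={v}" for k, v in sorted(feats.items()))
        table[row_key][number].add(cols[1].lower())
    return table
```

Each key of the result corresponds to a table row, and the Sing, Dual and Plur entries under it correspond to the columns; a missing entry means the combination is not attested in the data.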
VERB
9322 VERB tokens (93% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Polarity=EMPTY (7622; 82%), Gender=EMPTY (6273; 67%), VerbForm=Fin (6273; 67%), Mood=Ind (5847; 63%), Tense=Pres (5722; 61%).
VERB tokens may have the following values of Number:
Dual (204; 2% of non-empty Number): šla, imela, sta, bila, sva, prišla, gledala, imata, imava, delava
Plur (2805; 30% of non-empty Number): recimo, so, imamo, imeli, imajo, imate, rekli, vemo, moramo, šli
Sing (6313; 68% of non-empty Number): je, vem, veš, mislim, bilo, ni, ima, pravi, gre, rekel
EMPTY (726): bi, narediti, biti, reči, iti, imeti, povedati, priti, govoriti, delati
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Gender=Masc|VerbForm=Part | bil | ||
Aspect=Imp|Gender=Neut|VerbForm=Part | bilo | ||
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | bili | bile |
Gender=Neut|VerbForm=Part | bilo | ||
Mood=Imp|Person=2|VerbForm=Fin | bodite | ||
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nismo | |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | niste | ||
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | boste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si | sta | ste |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je | sta | so |
ADJ
5271 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (4661; 88%), VerbForm=EMPTY (4608; 87%), Definite=EMPTY (4424; 84%).
ADJ tokens may have the following values of Number:
Dual (25; 0% of non-empty Number): polna, blagovni, blagovnih, bolezenski, drugih, fer, grozna, ločeni, mali, medicinski
Plur (1469; 28% of non-empty Number): različne, sami, različnih, drugih, pozdravljeni, določene, zadnjih, nove, socialnih, dobri
Sing (3777; 72% of non-empty Number): drugi, dobro, drugo, prvi, sam, zanimivo, dober, lepa, pomembno, druga
Paradigm drug | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Definite=Def|Gender=Masc | drugi | ||
Case=Acc|Definite=Ind|Gender=Masc | drug | ||
Case=Acc|Gender=Masc | drugega | druge | |
Case=Acc|Gender=Fem | drugo | druge | |
Case=Acc|Gender=Neut | drugo | druga | |
Case=Dat|Gender=Masc | drugemu | drugim | |
Case=Dat|Gender=Neut | drugim | ||
Case=Gen|Gender=Masc | drugega | drugih | |
Case=Gen|Gender=Fem | druge | drugih | |
Case=Gen|Gender=Neut | drugega | ||
Case=Ins|Gender=Masc | drugimi | ||
Case=Ins|Gender=Fem | drugo | drugimi | |
Case=Ins|Gender=Neut | drugim | drugimi | |
Case=Loc|Gender=Masc | drugem | drugih | |
Case=Loc|Gender=Fem | drugi | drugih | drugih |
Case=Loc|Gender=Neut | drugem | ||
Case=Nom|Definite=Def|Gender=Masc | drugi | ||
Case=Nom|Definite=Ind|Gender=Masc | drug | ||
Case=Nom|Gender=Masc | drugi | ||
Case=Nom|Gender=Fem | druga | druge | |
Case=Nom|Gender=Neut | drugo | druga |
AUX
4792 AUX tokens (92% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (4458; 93%), Mood=Ind (4452; 93%), Polarity=Pos (4111; 86%), Tense=Pres (3925; 82%), Person=3 (3005; 63%).
AUX tokens may have the following values of Number:
Dual (130; 3% of non-empty Number): sta, sva, bova, bosta, bila, nisva, bodita, nista
Plur (1242; 26% of non-empty Number): so, smo, ste, bomo, bili, boste, bodo, niso, nismo, bojo
Sing (3420; 71% of non-empty Number): je, sem, ni, bo, si, bilo, bom, bila, bil, nisem
EMPTY (437): bi, biti
Paradigm biti | Sing | Dual | Plur |
---|---|---|---|
Aspect=Imp|Gender=Masc|VerbForm=Part | bil | ||
Aspect=Imp|Gender=Neut|VerbForm=Part | bilo | ||
Aspect=Imp|Mood=Imp|Person=2|VerbForm=Fin | bodita | ||
Gender=Masc|VerbForm=Part | bil | bila | bili |
Gender=Fem|VerbForm=Part | bila | bile | |
Gender=Neut|VerbForm=Part | bilo | bila | |
Mood=Imp|Person=2|VerbForm=Fin | bodi | bodite | |
Mood=Ind|Person=1|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisem | nisva | nismo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Fut|VerbForm=Fin | bom | bova | bomo |
Mood=Ind|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin | sem | sva | smo |
Mood=Ind|Person=2|Polarity=Neg|Tense=Pres|VerbForm=Fin | nisi | niste | |
Mood=Ind|Person=2|Polarity=Pos|Tense=Fut|VerbForm=Fin | boš | bosta | boste |
Mood=Ind|Person=2|Polarity=Pos|Tense=Pres|VerbForm=Fin | si | sta | ste |
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|Typo=Yes|VerbForm=Fin | ni | ||
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | ni | nista | niso |
Mood=Ind|Person=3|Polarity=Pos|Tense=Fut|VerbForm=Fin | bo | bosta | bodo, bojo |
Mood=Ind|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin | je, biti | sta | so, sa |
DET
4438 DET tokens (82% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Dem (2799; 63%).
DET tokens may have the following values of Number:
Dual (30; 1% of non-empty Number): oba, obe, ta, obeh, moja, ona, naša, obadva, onih, onima
Plur (958; 22% of non-empty Number): te, teh, vsi, ti, vse, vseh, tiste, tistih, tisti, katerih
Sing (3450; 78% of non-empty Number): to, ta, tega, vse, tem, tisto, neki, neko, tej, temu
EMPTY (945): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
Paradigm ta | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | ta, tega | ta | te |
Case=Acc|Gender=Fem | to | te | |
Case=Acc|Gender=Neut | to | ta | |
Case=Dat|Gender=Masc | temu | tem | |
Case=Dat|Gender=Fem | tej | tem | |
Case=Dat|Gender=Neut | temu | tem | |
Case=Gen|Gender=Masc | tega | teh | |
Case=Gen|Gender=Fem | te | teh | |
Case=Gen|Gender=Neut | tega | teh | |
Case=Ins|Gender=Masc | tem | temi | |
Case=Ins|Gender=Fem | to | temi | |
Case=Ins|Gender=Neut | tem | temi | |
Case=Loc|Gender=Masc | tem | teh | |
Case=Loc|Gender=Fem | tej | teh | |
Case=Loc|Gender=Neut | tem | teh | |
Case=Nom|Gender=Masc | ta | ta | ti |
Case=Nom|Gender=Fem | ta | ti | te |
Case=Nom|Gender=Neut | to | ta | |
Case=Nom|Gender=Neut|Typo=Yes | ta |
PRON
2862 PRON tokens (65% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (2862; 100%), PronType=Prs (2125; 74%), Variant=EMPTY (1893; 66%).
PRON tokens may have the following values of Number:
Dual (45; 2% of non-empty Number): midva, naju, onadva, vidva, midve, nama, ju, njima, jima, vidve
Plur (640; 22% of non-empty Number): jih, mi, nas, nam, vi, vam, jim, vas, oni, nami
Sing (2177; 76% of non-empty Number): kaj, jaz, mi, ti, ga, kar, jo, me, meni, kdo
EMPTY (1525): se, si, sabo, sebe, sebi, seboj, zase
Paradigm jaz | Sing | Dual | Plur |
---|---|---|---|
Case=Acc | mene | naju | nas |
Case=Acc|Variant=Short | me | ||
Case=Dat | meni | nama | nam |
Case=Dat|Variant=Short | mi | ||
Case=Gen | mene | nas | |
Case=Gen|Variant=Short | me | ||
Case=Ins | mano | nama | nami |
Case=Loc | meni | nas | |
Case=Nom|Gender=Masc | midva | mi | |
Case=Nom|Gender=Fem | midve | me | |
Case=Nom | jaz |
PROPN
1290 PROPN tokens (74% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (711; 55%).
PROPN tokens may have the following values of Number:
Dual (5; 0% of non-empty Number): Afganistanca, Američanki, Italijanki, štajer
Plur (98; 8% of non-empty Number): Romov, Božjah, Karavanke, slovenci, Italijani, Romi, Afganistanci, Izlake, Jesenice, Julijcih
Sing (1187; 92% of non-empty Number): Sloveniji, Slovenija, Slovenije, Ljubljani, Ljubljane, Mariboru, Agropop, Ljubljana, rtv, Celja
EMPTY (459): [name:personal], [name:surname], [name:organisation], [name:address], [name:place]
Paradigm Slovenec | Sing | Plur |
---|---|---|
Case=Acc | Slovence | |
Case=Gen | Slovenca | Slovencev |
Case=Nom | Slovenec | slovenci |
Number seems to be a lexical feature of PROPN. 99% of lemmas (657) occur with only one value of Number.
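The "lexical feature" observation rests on how many lemmas of a tag are attested with exactly one value of Number. A minimal sketch of that check follows, assuming CoNLL-U input as a string and considering only lemmas that occur at least once with a non-empty value; the page's exact denominator is not stated, so this is an assumption, and `single_value_share` is a hypothetical name.

```python
from collections import defaultdict


def single_value_share(conllu_text, upos):
    """Return (lemmas with exactly one Number value, lemmas with any value)."""
    values_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue  # comments, blank lines, multiword ranges, empty nodes
        if cols[3] != upos:
            continue
        for item in cols[5].split("|"):
            name, _, val = item.partition("=")
            if name == "Number":
                values_by_lemma[cols[2]].add(val)
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, len(values_by_lemma)
```

A ratio close to 1 suggests that Number is fixed per lemma for that tag rather than inflectional.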
NUM
1187 NUM tokens (100% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (1186; 100%), NumType=Card (1185; 100%).
NUM tokens may have the following values of Number:
Dual (138; 12% of non-empty Number): dva, dve, dveh, dvema, ena
Plur (691; 58% of non-empty Number): tri, tisoč, pet, dvajset, trideset, deset, petnajst, štiri, sto, petdeset
Sing (358; 30% of non-empty Number): en, eno, ena, enega, eden, ene, eni, enem, enim, enemu
Paradigm en | Sing | Dual | Plur |
---|---|---|---|
Case=Acc|Gender=Masc | en, enega, een | ||
Case=Acc|Gender=Fem | eno | ene | |
Case=Acc|Gender=Neut | eno | ||
Case=Dat|Gender=Masc | enemu | ||
Case=Dat|Gender=Fem | eni | ||
Case=Gen|Gender=Masc | enega | enih | |
Case=Gen|Gender=Fem | ene | enih | |
Case=Gen|Gender=Neut | enega | enih | |
Case=Ins|Gender=Masc | enim | ||
Case=Ins|Gender=Fem | eno | ||
Case=Ins|Gender=Neut | enim | ||
Case=Loc|Gender=Masc | enem | ||
Case=Loc|Gender=Fem | eni | ||
Case=Loc|Gender=Neut | enem | ||
Case=Nom|Gender=Masc | en | ena | eni |
Case=Nom|Gender=Fem | ena | ene | |
Case=Nom|Gender=Neut | eno | ena |
Number seems to be a lexical feature of NUM. 99% of lemmas (76) occur with only one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (3229; 100%),
VERB –[aux]–> AUX (2499; 87%),
NOUN –[det]–> DET (1925; 90%),
VERB –[obl]–> NOUN (1395; 54%),
VERB –[nsubj]–> NOUN (1189; 91%),
NOUN –[nmod]–> NOUN (1077; 64%),
ADJ –[cop]–> AUX (906; 96%),
VERB –[parataxis]–> VERB (820; 73%),
VERB –[nsubj]–> PRON (760; 95%),
VERB –[obj]–> PRON (688; 62%).
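Agreement counts of this kind can be sketched by comparing the value of Number on each node with that of its parent. The version below assumes CoNLL-U input as a string with sentences separated by blank lines, and it counts only pairs in which both nodes have a non-empty value. How the percentages on this page are normalized is not stated, so the denominator used here is an assumption, and `agreement_by_relation` is a hypothetical name.

```python
from collections import Counter


def _number(feats):
    """Return the value of Number in a FEATS string, or None if absent."""
    for item in feats.split("|"):
        name, _, val = item.partition("=")
        if name == "Number":
            return val
    return None


def agreement_by_relation(conllu_text):
    """Return (agree, total) Counters keyed by (parent UPOS, deprel, child UPOS)."""
    agree, total = Counter(), Counter()
    for block in conllu_text.split("\n\n"):  # one block per sentence
        nodes = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue  # comments, multiword ranges, empty nodes
            nodes[cols[0]] = (cols[3], _number(cols[5]), cols[6], cols[7])
        for upos, number, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (head 0)
            if parent is None or number is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            total[key] += 1
            if number == parent[1]:
                agree[key] += 1
    return agree, total
```

Sorting the `agree` counter by count and dividing each entry by the matching `total` entry gives a ranked list in the same shape as the one above.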