Treebank Statistics: UD_Slovenian-SST: Features: Case
This feature is universal.
It occurs with 6 different values: Acc
, Dat
, Gen
, Ins
, Loc
, Nom
.
32289 tokens (42%) have a non-empty value of Case
.
9335 types (70%) occur at least once with a non-empty value of Case
.
5251 lemmas (69%) occur at least once with a non-empty value of Case
.
The feature is used with 7 part-of-speech tags: NOUN (11411; 15% instances), ADP (5648; 7% instances), ADJ (5271; 7% instances), DET (4438; 6% instances), PRON (3044; 4% instances), PROPN (1290; 2% instances), NUM (1187; 2% instances).
NOUN
11411 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Number=Sing (8256; 72%).
NOUN
tokens may have the following values of Case
:
Acc
(3117; 27% of non-emptyCase
): dan, način, leto, primer, čas, leta, otroke, šolo, teden, deloDat
(209; 2% of non-emptyCase
): ljudem, bolniku, bogu, boleznim, bolnikom, otrokom, očetu, covidu, državam, gostomGen
(2447; 21% of non-emptyCase
): let, leta, otrok, evrov, časa, ljudi, dni, strani, dela, minutIns
(543; 5% of non-emptyCase
): leti, ljudmi, stresom, boleznimi, debelostjo, avtobusom, letom, pomočjo, avtom, besedamiLoc
(1737; 15% of non-emptyCase
): bistvu, strani, redu, koncu, času, letih, mestu, šoli, področju, primeruNom
(3358; 29% of non-emptyCase
): hvala, ljudje, gospod, del, stvar, otroci, pot, država, gospa, zgodba
Paradigm človek | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Animacy=Anim|Number=Sing | človeka | |||||
Number=Sing | človek | človeku | človeka | človeku | človekom | |
Number=Plur | ljudje | ljudi | ljudem | ljudi | ljudmi |
ADP
5648 ADP tokens (100% of all ADP
tokens) have a non-empty value of Case
.
ADP
tokens may have the following values of Case
:
Acc
(1686; 30% of non-emptyCase
): za, na, v, po, čez, skozi, med, nad, pod, predDat
(79; 1% of non-emptyCase
): proti, k, kljub, h, blizu, navkljub, preblizuGen
(854; 15% of non-emptyCase
): od, do, iz, zaradi, brez, z, preko, s, poleg, znotrajIns
(771; 14% of non-emptyCase
): z, s, med, pred, pod, za, nadLoc
(2258; 40% of non-emptyCase
): v, na, po, pri, o, ob, za
Paradigm za | Acc | Loc | Ins |
---|---|---|---|
za | za | za |
ADJ
5271 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Degree=Pos (4661; 88%), VerbForm=EMPTY (4608; 87%), Definite=EMPTY (4424; 84%), Number=Sing (3777; 72%).
ADJ
tokens may have the following values of Case
:
Acc
(1086; 21% of non-emptyCase
): drugo, različne, celo, dobro, dober, drugi, lep, novo, prvo, noveDat
(66; 1% of non-emptyCase
): novim, drugemu, ostalim, drugim, zaposlenim, zdravniški, zdravniškim, Evropski, Svetemu, celovitiGen
(822; 16% of non-emptyCase
): drugega, različnih, drugih, prve, slovenske, socialnih, javnega, novih, parlamentarne, prvegaIns
(216; 4% of non-emptyCase
): drugim, drugimi, drugo, kratkim, strokovno, porodniško, različnimi, tretjo, vremenskimi, SlovenskoLoc
(581; 11% of non-emptyCase
): drugi, glavnem, prvi, zadnjem, prvem, osnovni, sami, zadnjih, akademskem, drugemNom
(2500; 47% of non-emptyCase
): sam, zanimivo, lepa, dobro, drugi, pomembno, druga, sami, dober, sama
Paradigm drug | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Definite=Def|Gender=Masc|Number=Sing | drugi | drugi | ||||
Definite=Ind|Gender=Masc|Number=Sing | drug | drug | ||||
Gender=Masc|Number=Sing | drugega | drugemu | drugega | drugem | ||
Gender=Masc|Number=Plur | drugi | druge | drugim | drugih | drugih | drugimi |
Gender=Fem|Number=Sing | druga | drugo | druge | drugi | drugo | |
Gender=Fem|Number=Dual | drugih | |||||
Gender=Fem|Number=Plur | druge | druge | drugih | drugih | drugimi | |
Gender=Neut|Number=Sing | drugo | drugo | drugega | drugem | drugim | |
Gender=Neut|Number=Plur | druga | druga | drugim | drugimi |
DET
4438 DET tokens (82% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: Number=Sing (3450; 78%), PronType=Dem (2799; 63%).
DET
tokens may have the following values of Case
:
Acc
(1338; 30% of non-emptyCase
): to, ta, vse, te, tisto, neko, svoje, neki, tiste, kakšenDat
(106; 2% of non-emptyCase
): temu, vsem, tem, vsakemu, tej, našim, kateremu, mojemu, nekaterim, svojimGen
(577; 13% of non-emptyCase
): tega, teh, vseh, tistih, te, takega, nekega, neke, nekih, takihIns
(164; 4% of non-emptyCase
): tem, temi, katerimi, neko, vsemi, to, svojimi, takimi, katerim, tistimLoc
(373; 8% of non-emptyCase
): tem, tej, teh, katerih, vseh, nekem, katerem, naši, tistem, tistihNom
(1880; 42% of non-emptyCase
): to, ta, vse, tisti, vsi, te, ti, tisto, tak, takaEMPTY
(945): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
Paradigm ta | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | ta | ta, tega | temu | tega | tem | tem |
Gender=Masc|Number=Dual | ta | ta | ||||
Gender=Masc|Number=Plur | ti | te | tem | teh | teh | temi |
Gender=Fem|Number=Sing | ta | to | tej | te | tej | to |
Gender=Fem|Number=Dual | ti | |||||
Gender=Fem|Number=Plur | te | te | tem | teh | teh | temi |
Gender=Neut|Number=Sing | to | to | temu | tega | tem | tem |
Gender=Neut|Number=Plur | ta | ta | tem | teh | teh | temi |
Gender=Neut|Number=Plur|Typo=Yes | ta |
PRON
3044 PRON tokens (69% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Reflex=EMPTY (2862; 94%), PronType=Prs (2307; 76%), Number=Sing (2177; 72%), Variant=EMPTY (1938; 64%).
PRON
tokens may have the following values of Case
:
Acc
(885; 29% of non-emptyCase
): kaj, ga, jih, jo, kar, me, nas, nekaj, te, vasDat
(725; 24% of non-emptyCase
): mi, si, ti, nam, meni, vam, jim, mu, ji, njemuGen
(142; 5% of non-emptyCase
): jih, ga, je, mene, česa, nas, vas, nje, njih, tebeIns
(88; 3% of non-emptyCase
): sabo, nami, njimi, mano, njo, seboj, vami, njim, čim, njimaLoc
(61; 2% of non-emptyCase
): nas, sebi, njej, njem, njih, čemer, vas, kom, meni, tebiNom
(1143; 38% of non-emptyCase
): jaz, kaj, ti, mi, kar, kdo, on, vi, ona, oniEMPTY
(1343): se
Paradigm jaz | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Dual | midva | |||||
Gender=Masc|Number=Plur | mi | |||||
Gender=Fem|Number=Dual | midve | |||||
Gender=Fem|Number=Plur | me | |||||
Number=Sing | jaz | mene | meni | mene | meni | mano |
Number=Sing|Variant=Short | me | mi | me | |||
Number=Dual | naju | nama | nama | |||
Number=Plur | nas | nam | nas | nas | nami |
PROPN
1290 PROPN tokens (74% of all PROPN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PROPN
and Case
co-occurred: Number=Sing (1187; 92%), Gender=Masc (711; 55%).
PROPN
tokens may have the following values of Case
:
Acc
(155; 12% of non-emptyCase
): Nemčijo, Slovenijo, Ljubljano, Triglav, Ameriko, Bruselj, Harvard, Maribor, Paranoid, CeljeDat
(17; 1% of non-emptyCase
): Ljubljani, Andreju, Antonu, Belvedurju, Dragonji, HPV-ju, Kamniku, Konjičanu, Luciji, LutahrjuGen
(228; 18% of non-emptyCase
): Slovenije, Ljubljane, Celja, Evrope, Romov, Antona, Avstrije, Dunaja, Maribora, KranjaIns
(50; 4% of non-emptyCase
): Branetom, Špelo, Štefko, Alenko, Alešem, Andersonom, Antoličičem, Avstrijci, Avstrijo, BennyjemLoc
(256; 20% of non-emptyCase
): Sloveniji, Ljubljani, Mariboru, Evropi, Nemčiji, Netflixu, Avstriji, Božjah, Bruslju, IrakuNom
(584; 45% of non-emptyCase
): Slovenija, Agropop, Ljubljana, Jones, Nigerija, Tom, Bistrica, David, Healy, AlenkaEMPTY
(459): [name:personal], [name:surname], [name:organisation], [name:address], [name:place]
Paradigm Ljubljana | Nom | Acc | Dat | Gen | Loc |
---|---|---|---|---|---|
Ljubljana | Ljubljano | Ljubljani | Ljubljane | Ljubljani |
NUM
1187 NUM tokens (100% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumForm=Word (1186; 100%), NumType=Card (1185; 100%), Number=Plur (691; 58%).
NUM
tokens may have the following values of Case
:
Acc
(578; 49% of non-emptyCase
): eno, en, dva, tri, pet, dve, dvajset, tisoč, trideset, štiriDat
(5; 0% of non-emptyCase
): enemu, devetim, eni, štirimGen
(62; 5% of non-emptyCase
): ene, enega, dveh, enih, petih, treh, dvajsetih, dvanajstih, osmih, sedmihIns
(26; 2% of non-emptyCase
): enim, eno, sedmimi, tremi, dvema, dvanajstimi, enaindvajsetimi, enainpetdesetimi, petdesetimi, sedemnajstimiLoc
(54; 5% of non-emptyCase
): eni, enem, dveh, desetih, štirih, treh, devetnajstih, enajstih, osemnajstih, petihNom
(462; 39% of non-emptyCase
): ena, en, dva, tisoč, pet, eden, tri, devet, dvajset, eni
Paradigm en | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | en | en, enega, een | enemu | enega | enem | enim |
Gender=Masc|Number=Dual | ena | |||||
Gender=Masc|Number=Plur | eni | enih | ||||
Gender=Fem|Number=Sing | ena | eno | eni | ene | eni | eno |
Gender=Fem|Number=Plur | ene | ene | enih | |||
Gender=Neut|Number=Sing | eno | eno | enega | enem | enim | |
Gender=Neut|Number=Plur | ena | enih |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[case]–> ADP (3803; 98%),
NOUN –[amod]–> ADJ (3190; 98%),
NOUN –[det]–> DET (1901; 88%),
NOUN –[conj]–> NOUN (727; 92%),
PROPN –[case]–> ADP (456; 92%),
NOUN –[nummod]–> NUM (407; 61%),
DET –[case]–> ADP (360; 95%),
ADJ –[nsubj]–> NOUN (275; 98%),
PRON –[case]–> ADP (229; 97%),
NOUN –[nsubj]–> DET (206; 98%).