Treebank Statistics: UD_Slovenian-SST: Features: Case
This feature is universal.
It occurs with 6 different values: Acc, Dat, Gen, Ins, Loc, Nom.
32259 tokens (33%) have a non-empty value of Case.
9309 types (70%) occur at least once with a non-empty value of Case.
5219 lemmas (68%) occur at least once with a non-empty value of Case.
The feature is used with 7 part-of-speech tags: NOUN (11395; 12% instances), ADP (5646; 6% instances), ADJ (5272; 5% instances), DET (4585; 5% instances), PRON (3042; 3% instances), PROPN (1271; 1% instances), NUM (1048; 1% instances).
NOUN
11395 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (8242; 72%).
NOUN tokens may have the following values of Case:
Acc(3119; 27% of non-emptyCase): dan, način, leto, primer, čas, leta, otroke, šolo, teden, deloDat(209; 2% of non-emptyCase): ljudem, bolniku, bogu, boleznim, bolnikom, otrokom, očetu, covidu, državam, gostomGen(2445; 21% of non-emptyCase): let, leta, otrok, evrov, časa, ljudi, dni, strani, dela, minutIns(543; 5% of non-emptyCase): leti, ljudmi, stresom, boleznimi, debelostjo, avtobusom, letom, pomočjo, avtom, besedamiLoc(1735; 15% of non-emptyCase): bistvu, strani, redu, koncu, času, letih, mestu, šoli, področju, primeruNom(3344; 29% of non-emptyCase): hvala, ljudje, gospod, del, stvar, otroci, pot, država, gospa, zgodba
| Paradigm človek | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Animacy=Anim|Number=Sing | človeka | |||||
| Number=Sing | človek | človeku | človeka | človeku | človekom | |
| Number=Plur | ljudje | ljudi | ljudem | ljudi | ljudmi |
ADP
5646 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.
ADP tokens may have the following values of Case:
Acc(1688; 30% of non-emptyCase): za, na, v, po, čez, skozi, med, nad, pod, predDat(78; 1% of non-emptyCase): proti, k, kljub, h, blizu, navkljub, preblizuGen(854; 15% of non-emptyCase): od, do, iz, zaradi, brez, z, s, preko, poleg, znotrajIns(768; 14% of non-emptyCase): z, s, med, pred, pod, za, nadLoc(2258; 40% of non-emptyCase): v, na, po, pri, o, ob, za
| Paradigm za | Acc | Loc | Ins |
|---|---|---|---|
| za | za | za |
ADJ
5272 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (4663; 88%), VerbForm=EMPTY (4609; 87%), Definite=EMPTY (4425; 84%), Number=Sing (3776; 72%).
ADJ tokens may have the following values of Case:
Acc(1086; 21% of non-emptyCase): drugo, različne, celo, dobro, dober, drugi, lep, novo, prvo, noveDat(66; 1% of non-emptyCase): novim, drugemu, ostalim, drugim, zaposlenim, zdravniški, zdravniškim, Evropski, Svetemu, celovitiGen(822; 16% of non-emptyCase): drugega, različnih, drugih, prve, slovenske, socialnih, javnega, novih, parlamentarne, prvegaIns(216; 4% of non-emptyCase): drugimi, drugim, drugo, kratkim, strokovno, porodniško, različnimi, tretjo, vremenskimi, SlovenskoLoc(582; 11% of non-emptyCase): drugi, glavnem, prvi, zadnjem, prvem, osnovni, zadnjih, sami, akademskem, drugemNom(2500; 47% of non-emptyCase): sam, zanimivo, lepa, dobro, drugi, pomembno, druga, sami, dober, sama
| Paradigm drug | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | drugi | drugi | ||||
| Definite=Ind|Gender=Masc|Number=Sing | drug | drug | ||||
| Gender=Masc|Number=Sing | drugega | drugemu | drugega | drugem | ||
| Gender=Masc|Number=Plur | drugi | druge | drugim | drugih | drugih | drugimi |
| Gender=Fem|Number=Sing | druga | drugo | druge | drugi | drugo | |
| Gender=Fem|Number=Dual | drugih | |||||
| Gender=Fem|Number=Plur | druge | druge | drugih | drugih | drugimi | |
| Gender=Neut|Number=Sing | drugo | drugo | drugega | drugem | drugim | |
| Gender=Neut|Number=Plur | druga | druga | drugim | drugimi |
DET
4585 DET tokens (83% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Number=Sing (3587; 78%), PronType=Dem (2802; 61%).
DET tokens may have the following values of Case:
Acc(1402; 31% of non-emptyCase): to, ta, vse, te, tisto, neko, eno, svoje, neki, tisteDat(106; 2% of non-emptyCase): temu, vsem, tem, vsakemu, našim, tej, enemu, kateremu, mojemu, nekaterimGen(590; 13% of non-emptyCase): tega, teh, vseh, tistih, te, takega, nekega, nekih, takih, nekeIns(170; 4% of non-emptyCase): tem, temi, katerimi, neko, vsemi, to, svojimi, takimi, katerim, tistimLoc(377; 8% of non-emptyCase): tem, tej, teh, katerih, vseh, nekem, katerem, naši, tistem, kateriNom(1940; 42% of non-emptyCase): to, ta, vse, tisti, vsi, te, ti, tisto, en, takEMPTY(942): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
| Paradigm ta | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Gender=Masc|Number=Sing | ta | ta, tega | temu | tega | tem | tem |
| Gender=Masc|Number=Dual | ta | ta | ||||
| Gender=Masc|Number=Plur | ti | te | tem | teh | teh | temi |
| Gender=Fem|Number=Sing | ta | to | tej | te | tej | to |
| Gender=Fem|Number=Dual | ti | |||||
| Gender=Fem|Number=Plur | te | te | tem | teh | teh | temi |
| Gender=Neut|Number=Sing | to | to | temu | tega | tem | tem |
| Gender=Neut|Number=Plur | ta | ta | tem | teh | teh | temi |
| Gender=Neut|Number=Plur|Typo=Yes | ta |
PRON
3042 PRON tokens (69% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Reflex=EMPTY (2860; 94%), PronType=Prs (2306; 76%), Number=Sing (2177; 72%), Variant=EMPTY (1936; 64%).
PRON tokens may have the following values of Case:
Acc(882; 29% of non-emptyCase): kaj, ga, jih, jo, kar, me, nas, te, nekaj, vasDat(726; 24% of non-emptyCase): mi, si, ti, nam, meni, vam, jim, mu, ji, njemuGen(141; 5% of non-emptyCase): jih, ga, je, mene, česa, nas, vas, nje, njih, tebeIns(88; 3% of non-emptyCase): sabo, nami, njimi, mano, njo, seboj, vami, njim, čim, njimaLoc(61; 2% of non-emptyCase): nas, sebi, njej, njem, njih, čemer, vas, kom, meni, tebiNom(1144; 38% of non-emptyCase): jaz, kaj, ti, mi, kar, kdo, on, vi, ona, oniEMPTY(1342): se
| Paradigm jaz | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Gender=Masc|Number=Dual | midva | |||||
| Gender=Masc|Number=Plur | mi | |||||
| Gender=Fem|Number=Dual | midve | |||||
| Gender=Fem|Number=Plur | me | |||||
| Number=Sing | jaz | mene | meni | mene | meni | mano |
| Number=Sing|Variant=Short | me | mi | me | |||
| Number=Dual | naju | nama | nama | |||
| Number=Plur | nas | nam | nas | nas | nami |
PROPN
1271 PROPN tokens (73% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (1165; 92%), Gender=Masc (693; 55%).
PROPN tokens may have the following values of Case:
Acc(150; 12% of non-emptyCase): Nemčijo, Slovenijo, Ljubljano, Triglav, Ameriko, Bruselj, Harvard, Maribor, Paranoid, CeljeDat(17; 1% of non-emptyCase): Ljubljani, Andreju, Antonu, Belvedurju, Dragonji, HPV-ju, Kamniku, Konjičanu, Luciji, LutahrjuGen(229; 18% of non-emptyCase): Slovenije, Ljubljane, Celja, Evrope, Romov, Antona, Avstrije, Dunaja, Maribora, KranjaIns(49; 4% of non-emptyCase): Branetom, Špelo, Štefko, Alenko, Alešem, Andersonom, Antoličičem, Avstrijci, Avstrijo, BennyjemLoc(258; 20% of non-emptyCase): Sloveniji, Ljubljani, Mariboru, Evropi, Nemčiji, Netflixu, Avstriji, Božjah, Bruslju, IrakuNom(568; 45% of non-emptyCase): Slovenija, Agropop, Ljubljana, Jones, Nigerija, Tom, Bistrica, David, Healy, AlenkaEMPTY(467): [name:personal], [name:surname], [name:organisation], [name:address], si, ngl, [name:place], al, kk
| Paradigm Ljubljana | Nom | Acc | Dat | Gen | Loc |
|---|---|---|---|---|---|
| Ljubljana | Ljubljano | Ljubljani | Ljubljane | Ljubljani |
NUM
1048 NUM tokens (100% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumForm=Word (1047; 100%), NumType=Card (1046; 100%), Number=Plur (682; 65%), Gender=EMPTY (552; 53%).
NUM tokens may have the following values of Case:
Acc(520; 50% of non-emptyCase): eno, dva, tri, pet, en, dve, dvajset, tisoč, trideset, štiriDat(3; 0% of non-emptyCase): devetim, eni, štirimGen(48; 5% of non-emptyCase): ene, dveh, petih, treh, enega, dvajsetih, dvanajstih, enih, osmih, sedmihIns(21; 2% of non-emptyCase): enim, sedmimi, tremi, dvema, eno, dvanajstimi, enaindvajsetimi, enainpetdesetimi, petdesetimi, sedemnajstimiLoc(50; 5% of non-emptyCase): eni, dveh, enem, desetih, štirih, treh, devetnajstih, drugem, enajstih, osemnajstihNom(406; 39% of non-emptyCase): ena, dva, en, tisoč, pet, eden, tri, devet, dvajset, trije
| Paradigm en | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Gender=Masc|Number=Sing | en | en, enega, een | enega | enem | enim | |
| Gender=Masc|Number=Plur | eni | enih | ||||
| Gender=Fem|Number=Sing | ena | eno | eni | ene | eni | eno |
| Gender=Fem|Number=Plur | ene | ene | ||||
| Gender=Neut|Number=Sing | eno | eno | enega | enem | enim | |
| Gender=Neut|Number=Plur | ena | enih |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[case]–> ADP (3806; 98%),
NOUN –[amod]–> ADJ (3196; 98%),
NOUN –[det]–> DET (2033; 89%),
NOUN –[conj]–> NOUN (728; 92%),
PROPN –[case]–> ADP (460; 91%),
DET –[case]–> ADP (370; 95%),
NOUN –[nummod]–> NUM (285; 52%),
ADJ –[nsubj]–> NOUN (276; 98%),
PRON –[case]–> ADP (229; 97%),
ADJ –[conj]–> ADJ (206; 98%).