Treebank Statistics: UD_Slovenian-SST: Features: Case
This feature is universal.
It occurs with 6 different values: Acc
, Dat
, Gen
, Ins
, Loc
, Nom
.
10889 tokens (37%) have a non-empty value of Case
.
4009 types (65%) occur at least once with a non-empty value of Case
.
2670 lemmas (68%) occur at least once with a non-empty value of Case
.
The feature is used with 7 part-of-speech tags: NOUN (3626; 12% instances), ADP (1802; 6% instances), ADJ (1664; 6% instances), DET (1611; 5% instances), PRON (1243; 4% instances), NUM (499; 2% instances), PROPN (444; 2% instances).
NOUN
3626 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Animacy=EMPTY (3245; 89%), Number=Sing (2736; 75%).
NOUN
tokens may have the following values of Case
:
Acc
(1109; 31% of non-emptyCase
): dan, jutro, leto, način, petek, denar, izraz, teden, primer, stranDat
(39; 1% of non-emptyCase
): bogu, očetu, analizam, bližnjici, borcu, familijam, gospodu, gostom, hiši, izobraževanjuGen
(663; 18% of non-emptyCase
): evrov, leta, dni, ljudi, minut, stopinj, časa, let, veze, stvariIns
(152; 4% of non-emptyCase
): leti, copati, pinceto, pojavi, stresom, avtom, bajto, besedami, dnevi, gospodomLoc
(484; 13% of non-emptyCase
): bistvu, redu, strani, koncu, letih, mestu, primeru, nadaljevanju, začetku, trenutkuNom
(1179; 33% of non-emptyCase
): gospod, hvala, gospa, oče, problem, vprašanje, čas, človek, del, a
Paradigm hiša | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
hiša | hišo | hiši | hiše | hiši | hišo |
ADP
1802 ADP tokens (100% of all ADP
tokens) have a non-empty value of Case
.
ADP
tokens may have the following values of Case
:
Acc
(584; 32% of non-emptyCase
): za, na, v, po, čez, med, skozi, skoz, nad, podDat
(18; 1% of non-emptyCase
): k, proti, h, kljub, navkljubGen
(282; 16% of non-emptyCase
): od, do, iz, brez, z, zaradi, s, poleg, preko, blizuIns
(254; 14% of non-emptyCase
): z, s, pred, med, nad, pod, zaLoc
(664; 37% of non-emptyCase
): v, na, pri, po, o, ob
Paradigm v | Acc | Loc |
---|---|---|
v | v |
ADJ
1664 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: VerbForm=EMPTY (1478; 89%), Degree=Pos (1442; 87%), Definite=EMPTY (1350; 81%), Number=Sing (1266; 76%).
ADJ
tokens may have the following values of Case
:
Acc
(358; 22% of non-emptyCase
): dobro, drugo, celo, dober, prvi, drugi, cel, lep, prvo, drugeDat
(15; 1% of non-emptyCase
): drugemu, državni, javnim, krompirjevi, krompirjevim, levi, meteorološki, meteorološkim, neumnemu, novimGen
(185; 11% of non-emptyCase
): drugega, drugih, hudega, novega, druge, slovenske, drobnih, finančnih, iraških, logarskeIns
(57; 3% of non-emptyCase
): drugo, tretjo, vremenskimi, kratkim, pravim, aktivnim, belo, bivšim, debelim, dobrimiLoc
(169; 10% of non-emptyCase
): glavnem, zadnjih, drugi, prvi, spletni, laični, majhni, beli, bolniški, delovnihNom
(880; 53% of non-emptyCase
): druga, lepa, rdeča, sam, stari, zanimivo, dober, mali, drugi, prvi
Paradigm drug | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Definite=Def|Gender=Masc|Number=Sing | drugi | drugi | ||||
Definite=Ind|Gender=Masc|Number=Sing | drug | |||||
Gender=Masc|Number=Sing | drugemu | drugega | ||||
Gender=Masc|Number=Plur | drugi | druge | drugih | |||
Gender=Fem|Number=Sing | druga | drugo | druge | drugi | drugo | |
Gender=Fem|Number=Dual | drugih | |||||
Gender=Fem|Number=Plur | druge | druge | drugih | drugimi | ||
Gender=Neut|Number=Sing | drugo | drugo | drugega | drugem | drugim |
DET
1611 DET tokens (87% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: Number=Sing (1332; 83%), PronType=Dem (1055; 65%), Gender=Neut (833; 52%).
DET
tokens may have the following values of Case
:
Acc
(491; 30% of non-emptyCase
): to, vse, ta, te, nič, nekaj, tisto, neko, tole, kakšenDat
(26; 2% of non-emptyCase
): temu, tem, vsakemu, mojemu, nobenemu, onemu, svojemu, svojim, tej, vsemGen
(161; 10% of non-emptyCase
): tega, teh, tistih, naše, neke, te, nobene, nobenega, takega, takšnegaIns
(47; 3% of non-emptyCase
): tem, katerimi, neko, tisto, njihovo, onim, temi, tistim, kakšnimi, katerimLoc
(112; 7% of non-emptyCase
): tem, tej, teh, tistem, naših, katerih, naši, nekem, neki, vsakemNom
(774; 48% of non-emptyCase
): to, ta, vse, nič, tisti, vsi, ti, te, nekaj, tistoEMPTY
(233): malo, nekaj, več, koliko, dosti, toliko, veliko, pol, manj, preveč
Paradigm ta | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | ta | ta, tega | temu | tega | tem | tem |
Gender=Masc|Number=Plur | ti | te | tem | teh | temi | |
Gender=Fem|Number=Sing | ta | to | tej | te | tej | to |
Gender=Fem|Number=Dual | ti | |||||
Gender=Fem|Number=Plur | te | te | tem | teh | teh | temi |
Gender=Neut|Number=Sing | to | to | temu | tega | tem | tem |
Gender=Neut|Number=Plur | ta | ta | tem | teh | teh |
PRON
1243 PRON tokens (76% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Reflex=EMPTY (1179; 95%), PronType=Prs (958; 77%), Number=Sing (901; 72%), Variant=EMPTY (837; 67%).
PRON
tokens may have the following values of Case
:
Acc
(315; 25% of non-emptyCase
): kaj, ga, jih, jo, me, kar, nas, te, vas, meneDat
(279; 22% of non-emptyCase
): mi, si, ti, vam, meni, jim, mu, ji, nam, njemuGen
(62; 5% of non-emptyCase
): jih, ga, mene, česa, je, me, nas, nje, te, tebeIns
(45; 4% of non-emptyCase
): nami, mano, njimi, sabo, vami, njo, čim, njim, seboj, taboLoc
(22; 2% of non-emptyCase
): nas, njej, meni, njih, njem, sebi, vas, čem, čemerNom
(520; 42% of non-emptyCase
): jaz, kaj, ti, mi, kdo, vi, on, ona, kar, oniEMPTY
(398): se
Paradigm jaz | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Dual | midva | |||||
Gender=Masc|Number=Plur | mi | |||||
Gender=Fem|Number=Dual | midve | |||||
Gender=Fem|Number=Plur | me | |||||
Number=Sing | jaz | mene | meni | mene | meni | mano |
Number=Sing|Variant=Short | me | mi | me | |||
Number=Dual | nama | |||||
Number=Plur | nas | nam | nas | nas | nami |
NUM
499 NUM tokens (100% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumForm=Word (499; 100%), NumType=Card (498; 100%), Number=Plur (287; 58%).
NUM
tokens may have the following values of Case
:
Acc
(245; 49% of non-emptyCase
): eno, dva, en, tri, dvajset, sto, tisoč, dve, enega, štiriDat
(2; 0% of non-emptyCase
): enemu, štirimGen
(24; 5% of non-emptyCase
): ene, enega, osmih, petih, sedmih, dvajsetih, dvanajstih, enih, osemdesetih, trehIns
(12; 2% of non-emptyCase
): enim, eno, dvanajstimi, enaindvajsetimi, petdesetimi, sedemnajstimi, tremi, štirinajstimiLoc
(16; 3% of non-emptyCase
): dveh, eni, štirih, desetih, devetnajstih, enajstih, petih, trehNom
(200; 40% of non-emptyCase
): ena, en, dva, tisoč, tri, šest, devet, eden, pet, dve
Paradigm en | Nom | Acc | Dat | Gen | Loc | Ins |
---|---|---|---|---|---|---|
Gender=Masc|Number=Sing | en | en, enega | enemu | enega | enim | |
Gender=Masc|Number=Plur | eni | enih | ||||
Gender=Fem|Number=Sing | ena | eno | ene | eni | eno | |
Gender=Fem|Number=Plur | ene | |||||
Gender=Neut|Number=Sing | eno | eno | enim | |||
Gender=Neut|Number=Plur | ena |
PROPN
444 PROPN tokens (59% of all PROPN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PROPN
and Case
co-occurred: Number=Sing (403; 91%), Gender=Masc (267; 60%).
PROPN
tokens may have the following values of Case
:
Acc
(51; 11% of non-emptyCase
): paranoid, ameriko, rodik, triglav, albanijo, ano, arturja, avstralijo, beatlese, benetkeDat
(3; 1% of non-emptyCase
): robertu, savianu, turnškuGen
(63; 14% of non-emptyCase
): slovenije, pohorja, viktorije, iraka, mure, afrike, američanov, borna, camorre, celjaIns
(24; 5% of non-emptyCase
): [name:personal], [name:surname], andersonom, avstrijo, bennyjem, bojanom, dimitrijem, dimom, istrabenzom, jezerskimLoc
(70; 16% of non-emptyCase
): sloveniji, božjah, iraku, evropi, jugoslaviji, gazi, ledinah, ljubljani, zrečah, aktualuNom
(233; 52% of non-emptyCase
): slovenija, jones, tom, david, healy, jezus, karavanke, bistrica, herman, orsaEMPTY
(314): [name:personal], [name:surname], [name:address], [name:organisation], [name:place]
Paradigm Ljubljana | Nom | Acc | Gen | Loc |
---|---|---|---|---|
ljubljana | ljubljano | ljubljane | ljubljani |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[case]–> ADP (1149; 97%),
NOUN –[amod]–> ADJ (939; 99%),
NOUN –[det]–> DET (583; 90%),
NOUN –[conj]–> NOUN (161; 91%),
NOUN –[nummod]–> NUM (156; 59%),
PROPN –[case]–> ADP (135; 85%),
DET –[case]–> ADP (96; 90%),
PRON –[case]–> ADP (96; 94%),
ADJ –[nsubj]–> NOUN (76; 97%),
PROPN –[flat:name]–> PROPN (75; 100%).