Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Case
This feature is universal.
It occurs with 6 different values: Acc, Dat, Gen, Ins, Loc, Nom.
5092 tokens (46%) have a non-empty value of Case.
3273 types (76%) occur at least once with a non-empty value of Case.
2082 lemmas (68%) occur at least once with a non-empty value of Case.
The feature is used with 6 part-of-speech tags: NOUN (2518; 23% instances), ADJ (1403; 13% instances), PROPN (529; 5% instances), PRON (334; 3% instances), DET (274; 2% instances), NUM (34; 0% instances).
NOUN
2518 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (1680; 67%), Animacy=EMPTY (1382; 55%).
NOUN tokens may have the following values of Case:
Acc(493; 20% of non-emptyCase): př, rěč, nastawki, wobrazy, přikład, čas, dataje, lisćinu, móc, mócnarstwoDat(64; 3% of non-emptyCase): akademiji, dispoziciji, rostlinam, wotrjadam, Wopytowarjam, delće, dnjej, drohoćinkam, ekliptice, embryofytamGen(614; 24% of non-emptyCase): rěčow, lěta, kilometrow, wody, kraja, lěttysaca, lět, časa, biblioteki, institutaIns(164; 7% of non-emptyCase): l, pomocu, ablawtom, družinami, hamorom, krajemi, kralom, ličakami, mjenom, rostlinamiLoc(329; 13% of non-emptyCase): lěće, času, rěči, běhu, dobje, formje, lětstotku, stronje, wodźe, zemiNom(854; 34% of non-emptyCase): město, woda, stolica, rostliny, institut, pismo, rěč, stat, dołhosć, dźeńEMPTY(19): km, m, CEST, centrum, jan, přir, raz, t, thumb, čas
| Paradigm rěč | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Number=Sing | rěč | rěč, rěc | rěči | rěče | rěči, rěče | rěču |
| Number=Plur | rěče | rěče | rěčow | rěčach |
ADJ
1403 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Animacy=EMPTY (1238; 88%), Voice=EMPTY (1214; 87%), VerbForm=EMPTY (1213; 86%), Degree=EMPTY (891; 64%), Number=Sing (883; 63%).
ADJ tokens may have the following values of Case:
Acc(265; 19% of non-emptyCase): druhe, wotpowědne, wulku, prěni, prěnje, wikowanske, wulke, Klemenowu, bohatu, cyłeDat(39; 3% of non-emptyCase): němskej, Delnjej, Hornjej, Indusowej, Jednotliwym, Ludowemu, Persiskemu, Popłatkowemu, ablawtowym, definowanymGen(288; 21% of non-emptyCase): druhich, serbskeje, Serbskeho, ablawtowych, wědomostnych, Třećeho, Zjednoćenych, delnjeho, mjenowanych, persiskehoIns(57; 4% of non-emptyCase): druhim, druhimi, jednotliwymi, nowymi, přiběracu, samsnym, Baltiskim, Kapadociskej, Persiskim, PrěnjuLoc(146; 10% of non-emptyCase): Serbskim, cyłym, sewjernej, babylonskej, chemiskich, druhej, historiskim, hornim, hornjej, južnejNom(608; 43% of non-emptyCase): najwjetše, Serbski, wulki, klinowe, prěnje, serbska, wuznamne, Ekscelentny, dalše, druheEMPTY(16): němsko, Awstro, Tibeto, al, d, dołho, duchowno, hornjo, krótko, online
| Paradigm serbski | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Animacy=Inan|Degree=Pos|Gender=Masc|Number=Dual | serbskej | |||||
| Degree=Pos|Gender=Masc|Number=Sing | Serbski, SERBSKI | serbski | Serbskim | |||
| Degree=Pos|Gender=Fem|Number=Sing | serbska | serbskeje | ||||
| Degree=Pos|Gender=Neut|Number=Sing | serbske | |||||
| Gender=Masc|Number=Sing | serbskemu | Serbskeho | Serbskim | |||
| Gender=Masc|Number=Plur | serbskich | |||||
| Gender=Fem|Number=Sing | serbska | serbsku | serbskeje | serbskej | serbskej, serbsku | |
| Gender=Fem|Number=Plur | serbskim | |||||
| Gender=Neut|Number=Plur | serbske |
PROPN
529 PROPN tokens (89% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (470; 89%), Gender=Masc (280; 53%).
PROPN tokens may have the following values of Case:
Acc(16; 3% of non-emptyCase): Esperanto, Mezopotamisku, Aziju, Babylon, Babylonsku, Fenicisku, Institut, Israel, Mnichow, PalestinuDat(9; 2% of non-emptyCase): Ešarrje, Francoskej, Hetitam, Leidenčanam, Mezopotamiskej, Serbam, Łužicy, Španiskej, ŠpaničanamGen(127; 24% of non-emptyCase): Mezopotamiskeje, Sumeričanow, Němskeje, Aramejčanow, Assyriskeje, Serbow, Syriskeje, Tigrisa, Łužicy, AkkadaIns(34; 6% of non-emptyCase): Babylonom, Eufratom, Iranom, Solawu, Wódru, Łobjom, Anatolskej, Andrapradešom, Assyriskej, AwstriskejLoc(72; 14% of non-emptyCase): Europje, Budyšinje, Mezopotamiskej, Africe, Americe, Babylonje, Berlinje, Indiskej, Litawskej, NižozemskejNom(271; 51% of non-emptyCase): Mezopotamiska, Assur, Assyriska, Aššur, Hammurabi, Jakub, Ur, Wikipedija, Assyričenjo, BabylonEMPTY(67): Wikimedia, Aššur, C, Commons, Adl, Angeles, Gasche, Los, Tamil, Tlustulimu
| Paradigm Mezopotamiska | Nom | Acc | Dat | Gen | Loc |
|---|---|---|---|---|---|
| Animacy=Inan | Mezopotamiskeje | ||||
| Mezopotamiska | Mezopotamisku | Mezopotamiskej | Mezopotamiskeje, Mezopotamiskej | Mezopotamiskej |
PRON
334 PRON tokens (100% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Person=EMPTY (278; 83%), PronType=Prs (255; 76%), Gender=EMPTY (215; 64%), Number=EMPTY (204; 61%), Reflex=Yes (199; 60%).
PRON tokens may have the following values of Case:
Acc(217; 65% of non-emptyCase): so, to, je, jeho, jón, něšto, ju, ničo, nju, wšitkoDat(12; 4% of non-emptyCase): sej, tomu, nam, Jej, jeje, njej, sebiGen(21; 6% of non-emptyCase): toho, nich, njejeIns(11; 3% of non-emptyCase): tym, sobu, nimiLoc(14; 4% of non-emptyCase): tym, čimž, nimNom(59; 18% of non-emptyCase): to, wona, wón, kiž, wone, wono, Woni, ty, štož, WonejEMPTY(1): jón
| Paradigm to | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Abbr=Yes|ExtPos=CCONJ | t | |||||
| to | to | tomu | toho | tym | tym |
DET
274 DET tokens (84% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Abbr=EMPTY (238; 87%), Number[psor]=EMPTY (229; 84%), Person=EMPTY (229; 84%), Poss=EMPTY (200; 73%), Animacy=EMPTY (184; 67%), Number=Sing (166; 61%).
DET tokens may have the following values of Case:
Acc(48; 18% of non-emptyCase): swoje, swoju, tutu, tute, swój, kóžde, kóždy, wšě, wšěch, žaneDat(4; 1% of non-emptyCase): kotrymž, swojemu, wšemu, wšitkimGen(25; 9% of non-emptyCase): tutych, tutoho, kotrychž, tych, kotrehož, kotrejež, kóždychžkuli, někajkeho, někotrych, swojehoIns(42; 15% of non-emptyCase): n, swojimi, kotrymiž, swojej, tymLoc(39; 14% of non-emptyCase): někotrych, tutej, tutym, kotrejž, kotrychž, swojich, twojim, wšěch, kotrymž, kóždymNom(116; 42% of non-emptyCase): kotrež, kotraž, kotryž, tute, tutón, tuta, někotre, wšě, někotři, NašEMPTY(52): jeho, jich, wjele, jeje, mnoho, mjenje, n, najwjace, tróšku, tójšto
| Paradigm kotryž | Nom | Acc | Dat | Gen | Loc | Ins |
|---|---|---|---|---|---|---|
| Animacy=Anim|Gender=Masc|Number=Sing | kotryž | |||||
| Animacy=Anim|Gender=Masc|Number=Plur | kotřiž | kotrymž | ||||
| Animacy=Inan|Gender=Masc|Number=Sing | kotryž, kotrež | kotryž | ||||
| Animacy=Inan|Gender=Masc|Number=Plur | kotrež | kotrychž | kotrychž | |||
| Gender=Masc|Number=Sing | kotryž | kotrehož | kotrymž | |||
| Gender=Masc|Number=Plur | kotrež | |||||
| Gender=Fem|Number=Sing | kotraž | kotrejež | kotrejž | |||
| Gender=Fem|Number=Plur | kotrež | kotrychž | kotrymiž | |||
| Gender=Neut|Number=Sing | kotrež | |||||
| Gender=Neut|Number=Dual | kotrejž | |||||
| Gender=Neut|Number=Plur | kotrež | kotrychž |
NUM
34 NUM tokens (9% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (33; 97%).
NUM tokens may have the following values of Case:
Acc(7; 21% of non-emptyCase): dwaj, jednu, jedynGen(8; 24% of non-emptyCase): Mio, štyrjoch, dweju, jedneho, miliardowIns(1; 3% of non-emptyCase): dwěmajLoc(5; 15% of non-emptyCase): dwěmaj, jednym, woběmajNom(13; 38% of non-emptyCase): jedyn, dwaj, jedna, dwě, jednyEMPTY(348): 2, 1, 6, 4, 3, 5, 7, I, 000, 10
| Paradigm dwaj | Nom | Acc | Gen | Loc | Ins |
|---|---|---|---|---|---|
| Animacy=Inan|Gender=Masc | dwaj | dwaj | dweju | ||
| Gender=Fem | dwaj, dwě | dwěmaj | |||
| Gender=Neut | dwěmaj | ||||
| dwaj |
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (1076; 99%),
NOUN –[conj]–> NOUN (224; 94%),
NOUN –[det]–> DET (176; 82%),
PROPN –[conj]–> PROPN (85; 97%),
ADJ –[nsubj]–> NOUN (76; 90%),
ADJ –[conj]–> ADJ (63; 97%),
PROPN –[flat]–> PROPN (52; 79%),
PROPN –[amod]–> ADJ (43; 100%),
NOUN –[nsubj]–> NOUN (41; 87%),
NOUN –[appos]–> NOUN (33; 69%).