Treebank Statistics: UD_Gheg-GPS: Features: Case
This feature is universal.
It occurs with 5 different values: Abl, Acc, Dat, Gen, Nom.
5314 tokens (33%) have a non-empty value of Case.
998 types (39%) occur at least once with a non-empty value of Case.
440 lemmas (45%) occur at least once with a non-empty value of Case.
The feature is used with 5 part-of-speech tags: PRON (2483; 16% instances), NOUN (2446; 15% instances), DET (223; 1% instances), ADJ (161; 1% instances), NUM (1; 0% instances).
PRON
2483 PRON tokens (86% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Number=Sing (1785; 72%), Person=3 (1707; 69%), Gender=EMPTY (1562; 63%), PronType=EMPTY (1514; 61%).
PRON tokens may have the following values of Case:
Abl(16; 1% of non-emptyCase): ati, ty:re, asa:jna, asajna, atina, aty:nve, atyne, atynve, kësa:j, qasajAcc(1216; 49% of non-emptyCase): e, i, a, at, ato, ata, u, a:, i:, ato:Dat(565; 23% of non-emptyCase): i, j, m, ati, e, ati:, kti, atyne, atina, i:Gen(23; 1% of non-emptyCase): tij, ksi, ti, ti:j, veten, ijit, tina, tina:, ty:re, tyneNom(663; 27% of non-emptyCase): aj, ata, ky, kta, ajo, ato, ai, ai:, ata:, kiEMPTY(415): qe, që, krejt, do, vet, qi, qysh, qka, kejt, disa
| Paradigm ti | Nom | Acc | Dat | Gen | Abl |
|---|---|---|---|---|---|
| Gender=Masc|Person=3|PronType=Prs | ti | ti | ti | ||
| Gender=Masc|Person=3|PronType=Prs|Reflex=Yes | ti | ||||
| Gender=Masc|PronType=Dem | ti | ||||
| Gender=Masc|PronType=Prs | ti | ||||
| Person=2|PronType=Prs | ti |
NOUN
2446 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (1747; 71%), Gender=Fem (1439; 59%), Definite=Ind (1300; 53%).
NOUN tokens may have the following values of Case:
Abl(112; 5% of non-emptyCase): rru:gës, lisit, rruges, biciklete, biciklles, dardhës, pemes, rrugës, bicikletes, da:rdhesAcc(1383; 57% of non-emptyCase): da:rdha, dardha, dardhat, da:rdhat, bicikell, dardh, korp, tok, dardha:, bicikle:tDat(116; 5% of non-emptyCase): djalit, dja:lit, djemve, bicikles, da:rdhave, filmit, gurit, moshës, biciklles, biqikletësGen(15; 1% of non-emptyCase): dardhes, kohës, rru:gës, da:rdhave, dardha:ve, dardhës, dja:lit, kohes:, naty:rës, shpo:rteveNom(820; 34% of non-emptyCase): djem, djali, djal, djemt, njeri, fmi:, burr, bujk, burri, da:rdhatEMPTY(27): birne, kappe, baum, belo:nig, bode, bujk, cowboy, djali, fi:lmi, jungs
| Paradigm dardhë | Nom | Acc | Dat | Gen | Abl |
|---|---|---|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | da:rdhen | ||||
| Definite=Def|Gender=Masc|Number=Plur | dardhat | dardhat | |||
| Definite=Def|Gender=Fem|Number=Sing | da:rdha, dardha, dardha: | da:rdhën, dardh, dardhen, dardhën, da:rdh, da:rdhat, da:rdhin, dardhat, dardhët | da:rdhes | dardhes, dardhës | dardhës, da:rdhes, da:rdhës, dardhes |
| Definite=Def|Gender=Fem|Number=Plur | da:rdhat, dardhat, dardha:t, da:rdhatë, da:rdhet, da:rdhët, dardha, dardhët, dordhat, ta:rdhat | dardhat, da:rdhat, dardha:t, dardhët, dordhat, dardha, dardhat:, da:rdha, da:rthat, dordha:t | da:rdhave, dardha:ve, dardhave | da:rdhave, dardha:ve | da:rdha |
| Definite=Ind|Gender=Masc|Number=Sing | da:rdh | ||||
| Definite=Ind|Gender=Masc|Number=Plur | da:rdha | da:rdh, da:rdha, dardha, dardha: | |||
| Definite=Ind|Gender=Fem | dardha: | ||||
| Definite=Ind|Gender=Fem|Number=Sing | da:rdh, dardh, da:rdha, dardha: | dardh, da:rdh, da:rdha, da:rdhë, dardha, da:rdha:, dardha:, dardhë, dardë/ | |||
| Definite=Ind|Gender=Fem|Number=Plur | dardha, dardha:, dardhat, dordha | da:rdha, dardha, dardha:, da:rdha:, dardh, dordha, d:ardha, da:dha, da:rdhat, da:rdhe | da:rdhave, dardha: |
DET
223 DET tokens (30% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Number=Sing (177; 79%), Gender=EMPTY (128; 57%).
DET tokens may have the following values of Case:
Abl(1; 0% of non-emptyCase): sëAcc(99; 44% of non-emptyCase): e, t, të, e:, te, të:, e/, i, i/, i:Dat(29; 13% of non-emptyCase): t, i, të, e, sëGen(8; 4% of non-emptyCase): e, t, i, ëNom(86; 39% of non-emptyCase): i, e, t, të, /i, e:, i:, teEMPTY(527): ni, një, ni:, nji, të, nje, t, një:, s, e
| Paradigm e | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Gender=Masc|Number=Sing | e | |||
| Gender=Masc|Number=Plur | e | ë | ||
| Gender=Fem|Number=Sing | e | e | ||
| Gender=Fem|Number=Plur | e | e | e | |
| Number=Sing | e, e: | e, e:, e/, ë:h:h: | e | e, ë |
| Number=Plur | e | e | e |
ADJ
161 ADJ tokens (68% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (112; 70%), Gender=Masc (92; 57%).
ADJ tokens may have the following values of Case:
Abl(4; 2% of non-emptyCase): tjeter, kaotike, tjetresAcc(58; 36% of non-emptyCase): tjeter, tje:r, vogël, gjermanisht, mbu:shura, njejtën, tje:tër, tjetër, bishtale:ca, buku:rDat(5; 3% of non-emptyCase): elementa:re, shku:rt, tjer, tjetrin, tjetritNom(94; 58% of non-emptyCase): tjer, tjeter, tjetër, tje:r, vogël, vogel, ri, vjeter, anonym, mbu:shurEMPTY(75): pak, pa, shum, herët, normal, pa:, plot, quditshme, shu:m, vo:n
| Paradigm tjetër | Nom | Acc | Dat | Abl |
|---|---|---|---|---|
| Definite=Def|Gender=Masc|Number=Sing | tjetri | |||
| Definite=Ind | tjetër | |||
| Definite=Ind|Gender=Masc|Number=Sing | tjeter, tjetri | tjetrin, tjetrit | ||
| Definite=Ind|Gender=Fem|Number=Plur | tjera: | |||
| Definite=Ind|Number=Sing | tjeter | tje:trën | ||
| Gender=Masc | tjer | |||
| Gender=Masc|Number=Sing | tjetër, tjeter, tje:r, tjer | tjeter, tje:r, tjetër, qeter, tje:tër | tjeter | |
| Gender=Masc|Number=Plur | tjer, tje:r | tje:r | tjer | |
| Gender=Fem|Number=Sing | tjeter | tjeter, tje:tër | tjeter, tjetres | |
| Gender=Fem|Number=Plur | tjera, tjerat | |||
| Number=Sing | tjeter |
NUM
1 NUM tokens (0% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: Gender=Fem (1; 100%), NumType=Ord (1; 100%).
NUM tokens may have the following values of Case:
Acc(1; 100% of non-emptyCase): tretënEMPTY(321): tre, tri, ni, dy, tri:, një, dy:, nja, njo, tre:
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[det]–> PRON (439; 84%),
NOUN –[amod]–> ADJ (105; 81%),
ADJ –[det]–> DET (76; 75%),
NOUN –[reparandum]–> NOUN (32; 78%),
NOUN –[conj]–> NOUN (22; 85%),
PRON –[reparandum]–> PRON (9; 64%),
PRON –[amod]–> ADJ (6; 75%),
NOUN –[reparandum]–> PRON (5; 71%),
PRON –[det]–> PRON (5; 71%),
PRON –[reparandum]–> DET (4; 67%).