Treebank Statistics: UD_Albanian-TSA: Features: Case
This feature is universal.
It occurs with 5 different values: Abl, Acc, Dat, Gen, Nom.
Some words have combined values of the feature; 1 combination has been observed: Acc|Nom.
303 tokens (33%) have a non-empty value of Case.
261 types (55%) occur at least once with a non-empty value of Case.
208 lemmas (51%) occur at least once with a non-empty value of Case.
The feature is used with 3 part-of-speech tags: NOUN (235; 25% of instances), PRON (53; 6% of instances), PROPN (15; 2% of instances).
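Counts like the ones above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `case_summary`, `row`) are hypothetical, and the sample rows are a toy fragment whose lemmas and feature strings are illustrative guesses rather than annotations taken from the treebank.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def case_summary(conllu_text, feature="Case"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    tokens = list(read_tokens(conllu_text))
    with_feat = [t for t in tokens if feature in t[3]]
    return {
        "tokens": len(with_feat),
        "token_share": len(with_feat) / len(tokens) if tokens else 0.0,
        "types": len({t[0] for t in with_feat}),   # distinct word forms
        "lemmas": len({t[1] for t in with_feat}),  # distinct lemmas
        "by_upos": Counter(t[2] for t in with_feat),
        # A combined value such as 'Acc,Nom' is kept as one string.
        "values": Counter(t[3][feature] for t in with_feat),
    }

def row(idx, form, lemma, upos, feats):
    # Fill the columns this sketch does not use with "_" placeholders.
    return "\t".join([str(idx), form, lemma, upos, "_", feats, "_", "_", "_", "_"])

# Toy fragment (assumed annotations), not a sentence from the treebank.
SAMPLE = "\n".join([
    "# toy fragment for illustration",
    row(1, "Djui", "Dju", "PROPN", "Case=Nom|Number=Sing"),
    row(2, "këtë", "ky", "PRON", "Case=Acc"),
    row(3, "kohës", "kohë", "NOUN", "Case=Gen|Definite=Def"),
    row(4, "etj", "etj", "NOUN", "_"),
])

summary = case_summary(SAMPLE)
print(summary["tokens"], summary["token_share"], dict(summary["by_upos"]))
```

Types are counted case-sensitively here, which matches the way capitalized and lowercase forms are listed separately on this page.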
NOUN
235 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: NounType=EMPTY (205; 87%), Definite=Def (161; 69%), Number=Sing (159; 68%), Gender=Fem (140; 60%).
NOUN tokens may have the following values of Case:
Abl (6; 3% of non-empty Case): gjinisë, komunikimit, ndryshimesh, person, problemi, shekujve
Acc (95; 40% of non-empty Case): drejtimet, mënyrë, shkencat, shtete, tregtinë, administrim, anë, armë, artikujt, bashkim
Acc,Nom (1; 0% of non-empty Case): karakteri
Dat (5; 2% of non-empty Case): formave, informacionit, procesit, përbërësit, përvojave
Gen (50; 21% of non-empty Case): kohës, marrëdhënieve, njeriut, qytetit, sjelljes, ushqimit, anëtarëve, djegies, edukimit, ekonomie
Nom (78; 33% of non-empty Case): Dashuria, Evolucioni, Ishulli, dramaturgu, Bujqësia, Buka, Familja, Forcat, Format, Frutat
EMPTY (3): botëkuptim, etj, lloj
| Paradigm njeri | Nom | Acc | Gen |
|---|---|---|---|
| Definite=Def\|Gender=Masc\|Number=Sing | | | njeriut |
| Definite=Def\|Gender=Masc\|Number=Plur | njerëzit | | |
| Definite=Def\|Gender=Fem\|NounType=Het\|Number=Plur | njerëzit | | |
| Definite=Ind\|Gender=Masc\|Number=Plur | njerëz | njerëz | |
PRON
53 PRON tokens (100% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Poss=EMPTY (47; 89%), Gender=Fem (28; 53%), Number=Sing (28; 53%).
PRON tokens may have the following values of Case:
Abl (1; 2% of non-empty Case): cilitdo
Acc (15; 28% of non-empty Case): e, këtë, atë, cilat, cilën, gjitha, i, këto, tillë, tjera
Dat (5; 9% of non-empty Case): i, atyre, na, u
Gen (11; 21% of non-empty Case): tij, saj, cilitdo, gjitha, kësaj, këtyre, tjetër, tyre
Nom (21; 40% of non-empty Case): disa, Ata, këto, Kjo, Ky, ai, Cilat, ato, cila, gjitha
| Paradigm ai | Nom | Acc | Gen |
|---|---|---|---|
| Gender=Masc\|Number=Sing\|Person=3\|PronType=Dem | ai | | |
| Gender=Masc\|Number=Sing\|Poss=Yes\|PronType=Prs | | | tij |
| Gender=Masc\|Number=Sing\|PronType=Prs | Ai | | |
| Gender=Masc\|Number=Plur\|PronType=Prs | Ata | | |
| Gender=Fem\|Number=Sing\|PronType=Emp | | e | |
| Gender=Fem\|Number=Plur\|PronType=Prs | ato | | |
PROPN
15 PROPN tokens (75% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (14; 93%), Definite=Def (13; 87%), Gender=Masc (8; 53%).
PROPN tokens may have the following values of Case:
Acc (5; 33% of non-empty Case): Shqipëri, Djuin, Japoninë, Korenë
Gen (5; 33% of non-empty Case): Bashkimit, Evropës, Kinës, Manit, Norsëve
Nom (5; 33% of non-empty Case): Britania, Djui, Ruso, Zhak, Zhan
EMPTY (5): Homo, Shakespeare, Shpëtim, William, Çuçka
| Paradigm Dju | Nom | Acc |
|---|---|---|
| | Djui | Djuin |
Case seems to be a lexical feature of PROPN: 92% of lemmas (12) occur with only one value of Case.
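The "lexical feature" observation rests on a simple per-lemma count: collect the set of Case values seen with each lemma and report how many lemmas have exactly one. A minimal sketch of that check follows, assuming this is how the share is computed. The function name `single_value_share` is hypothetical, and apart from Dju the lemmas in the toy data are assumed, not taken from the treebank.

```python
from collections import defaultdict

def single_value_share(observations):
    """Given (lemma, value) pairs, return (single, total, share), where
    `single` is the number of lemmas seen with exactly one value."""
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    share = single / total if total else 0.0
    return single, total, share

# Toy observations: Dju occurs with two values (Djui, Djuin);
# the other two lemmas are hypothetical and occur with one value each.
OBSERVATIONS = [
    ("Dju", "Nom"),
    ("Dju", "Acc"),
    ("Britani", "Nom"),
    ("Kinë", "Gen"),
    ("Kinë", "Gen"),
]

single, total, share = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas ({share:.0%}) occur with only one value")
```

With 12 single-value lemmas reported at 92%, the treebank figure is consistent with 13 PROPN lemmas carrying Case in total, one of which (Dju) shows more than one value.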
Relations with Agreement in Case
The most frequent relations where parent and child node agree in Case (9 are attested):
NOUN –[conj]–> NOUN (20; 87%),
NOUN –[det]–> PRON (20; 87%),
NOUN –[nsubj]–> NOUN (7; 100%),
PRON –[nsubj]–> NOUN (2; 100%),
PROPN –[flat]–> PROPN (2; 100%),
NOUN –[acl:relcl]–> NOUN (1; 100%),
PRON –[conj]–> PRON (1; 100%),
PRON –[nmod:poss]–> NOUN (1; 100%),
PROPN –[conj]–> PROPN (1; 100%).
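The agreement figures above can be approximated by walking each dependency edge and comparing the Case value of the child with that of its parent. The sketch below makes one assumption that this page does not state: the percentage is taken over edges where both nodes have a non-empty Case. The names `case_of` and `agreement_by_relation` are hypothetical, and the rows are a toy sentence, not treebank data.

```python
from collections import Counter

def case_of(feats):
    """Return the Case value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Case":
            return value
    return None

def agreement_by_relation(sentence_rows):
    """sentence_rows: (id, upos, feats, head, deprel) tuples for one sentence.
    Returns {(parent_upos, deprel, child_upos): (agreeing_edges, share)}."""
    by_id = {r[0]: r for r in sentence_rows}
    agree, both = Counter(), Counter()
    for tok_id, upos, feats, head, deprel in sentence_rows:
        parent = by_id.get(head)
        if parent is None:  # head 0 is the artificial root
            continue
        child_case, parent_case = case_of(feats), case_of(parent[2])
        # Assumption: only edges where both nodes carry Case are counted.
        if child_case is None or parent_case is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if child_case == parent_case:
            agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}

# Toy sentence with made-up annotations.
ROWS = [
    (1, "PRON", "Case=Nom|Number=Sing", 2, "det"),
    (2, "NOUN", "Case=Nom", 3, "nsubj"),
    (3, "VERB", "_", 0, "root"),
    (4, "NOUN", "Case=Acc", 3, "obj"),
    (5, "NOUN", "Case=Gen", 4, "conj"),
]

result = agreement_by_relation(ROWS)
for (parent_upos, deprel, child_upos), (count, share) in sorted(result.items()):
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({count}; {share:.0%})")
```

Edges whose parent has no Case, such as the subject and object attached to the verb here, drop out of both the count and the denominator under this assumption.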