Treebank Statistics: UD_Albanian-STAF: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
678 tokens (19%) have a non-empty value of Definite.
504 types (41%) occur at least once with a non-empty value of Definite.
401 lemmas (41%) occur at least once with a non-empty value of Definite.
The feature is used with 4 part-of-speech tags: NOUN (587; 16% instances), DET (45; 1% instances), PROPN (30; 1% instances), PRON (16; 0% instances).
NOUN
587 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (480; 82%), Gender=Fem (350; 60%).
NOUN tokens may have the following values of Definite:
Def(349; 59% of non-emptyDefinite): gjenerali, sytë, Nëna, gjendjes, prifti, shtëpia, babai, dorën, kohën, mendjenInd(238; 41% of non-emptyDefinite): ditë, shi, fillim, arsye, fund, gjë, grua, herë, krahasim, mendEMPTY(38): gjak, lindur, Mysafiri, Zana, atë, babait, brejtja, dajë, djalë, dëshirës
| Paradigm njeri | Ind | Def |
|---|---|---|
| Case=Acc|Number=Plur | njerëzit | |
| Case=Dat|Number=Sing | njeriu | |
| Case=Dat|Number=Plur | njerëzve | |
| Case=Gen|Number=Plur | njerëzve | |
| Case=Nom|Number=Sing | njeri, njeriu | |
| Case=Nom|Number=Plur | njerëz | njerëzit |
DET
45 DET tokens (15% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=EMPTY (45; 100%), Gender=EMPTY (45; 100%), Number=EMPTY (45; 100%), PronType=EMPTY (45; 100%).
DET tokens may have the following values of Definite:
Ind(45; 100% of non-emptyDefinite): një, NjaEMPTY(254): e, të, i, së, një, nja, pak
PROPN
30 PROPN tokens (77% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (29; 97%), Gender=Masc (20; 67%).
PROPN tokens may have the following values of Definite:
Def(24; 80% of non-emptyDefinite): Ernesti, Ernestit, Linda, Vedati, Dizit, Ervehenë, Hadi, Hadin, Lorin, MargaInd(6; 20% of non-emptyDefinite): Shqipëri, Berti, Ernest, VajazanEMPTY(9): Bamit, Dizi, Dizin, Ernesti, Lindën, Nerminja, Odise, Varrit, shtunë
| Paradigm Ernest | Ind | Def |
|---|---|---|
| Case=Dat | Ernestit | |
| Case=Gen | Ernestit | |
| Case=Nom | Ernest | Ernesti |
PRON
16 PRON tokens (4% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Person=EMPTY (16; 100%), Gender=Masc (13; 81%), Number=Sing (11; 69%), PronType=EMPTY (9; 56%).
PRON tokens may have the following values of Definite:
Def(10; 63% of non-emptyDefinite): tjerë, Ç’, ka, mi, njena, sajin, tjerash, tjerëveInd(6; 38% of non-emptyDefinite): tjetër, më, AsnjeriEMPTY(415): e, i, më, që, unë, ai, kjo, tij, ky, ajo
| Paradigm tjetër | Ind | Def |
|---|---|---|
| Case=Acc|Gender=Masc|Number=Sing | tjetër | |
| Case=Acc|Gender=Masc|Number=Plur | tjerëve | |
| Case=Acc|Gender=Fem|Number=Sing | tjetër | |
| Case=Nom|Gender=Masc|Number=Plur | tjerë |
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[nmod:poss]–> NOUN (30; 67%),
NOUN –[nmod]–> NOUN (29; 58%),
NOUN –[conj]–> NOUN (23; 77%),
NOUN –[nmod:poss]–> PROPN (5; 83%),
NOUN –[nsubj]–> NOUN (2; 67%),
NOUN –[obl]–> NOUN (2; 67%),
NOUN –[amod]–> PRON (1; 100%),
PROPN –[nmod:poss]–> NOUN (1; 100%).