Treebank Statistics: UD_Gheg-GPS: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
3876 tokens (24%) have a non-empty value of Gender.
962 types (37%) occur at least once with a non-empty value of Gender.
415 lemmas (43%) occur at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: NOUN (2451; 15% of all tokens), PRON (966; 6% of all tokens), NUM (213; 1% of all tokens), ADJ (149; 1% of all tokens), DET (97; 1% of all tokens).
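Counts like the ones above can be reproduced from a CoNLL-U file by checking the FEATS column of every token. The following is a minimal sketch, not the script that generated this page: the function names and the four-token sample are hypothetical, and the annotations in the sample are invented for illustration.

```python
from collections import Counter

# Made-up CoNLL-U sample (invented annotations, not taken from the treebank).
SAMPLE = (
    "# text = hypothetical example\n"
    "1\tky\tky\tPRON\t_\tGender=Masc|Number=Sing\t2\tdet\t_\t_\n"
    "2\tdjali\tdjalë\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tnsubj\t_\t_\n"
    "3\tha\tha\tVERB\t_\t_\t0\troot\t_\t_\n"
    "4\tdardha\tdardhë\tNOUN\t_\tCase=Acc|Gender=Fem|Number=Plur\t3\tobj\t_\t_\n"
)


def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means the token has no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each basic token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_coverage(conllu_text, feature="Gender"):
    """Count tokens carrying `feature`, overall, per UPOS tag and per value."""
    total = 0
    with_feat = 0
    per_upos = Counter()
    values = Counter()
    for _form, _lemma, upos, feats in read_tokens(conllu_text):
        total += 1
        if feature in feats:
            with_feat += 1
            per_upos[upos] += 1
            values[feats[feature]] += 1
    percent = round(100 * with_feat / total) if total else 0
    return {
        "tokens": with_feat,
        "percent": percent,
        "per_upos": dict(per_upos),
        "values": dict(values),
    }


stats = feature_coverage(SAMPLE)
print(stats)
```

Type and lemma coverage could be obtained the same way by collecting the distinct forms or lemmas into sets instead of counting tokens.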
NOUN
2451 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1752; 71%), Case=Acc (1383; 56%), Definite=Ind (1303; 53%).
NOUN tokens may have the following values of Gender:
Fem (1441; 59% of non-empty Gender): da:rdha, dardha, dardhat, da:rdhat, korp, dardh, bicikell, dardha:, tok, dardha:t
Masc (1010; 41% of non-empty Gender): djem, djali, djal, djemt, njeri, kapuqin, burr, bujk, burri, fmi:
EMPTY (22): birne, kappe, baum, belo:nig, bode, cowboy, djali, fi:lmi, jungs, koh
| Paradigm dardhë | Masc | Fem |
|---|---|---|
| Case=Abl|Definite=Def|Number=Sing | dardhës, da:rdhes, da:rdhës, dardhes | |
| Case=Abl|Definite=Def|Number=Plur | da:rdha | |
| Case=Acc|Definite=Def|Number=Sing | da:rdhen | da:rdhën, dardh, dardhen, dardhën, da:rdh, da:rdhat, da:rdhin, dardhat, dardhët |
| Case=Acc|Definite=Def|Number=Plur | dardhat | dardhat, da:rdhat, dardha:t, dardhët, dordhat, dardha, dardhat:, da:rdha, da:rthat, dordha:t |
| Case=Acc|Definite=Ind | dardha: | |
| Case=Acc|Definite=Ind|Number=Sing | da:rdh | dardh, da:rdh, da:rdha, da:rdhë, dardha, da:rdha:, dardha:, dardhë, dardë/ |
| Case=Acc|Definite=Ind|Number=Plur | da:rdh, da:rdha, dardha, dardha: | da:rdha, dardha, dardha:, da:rdha:, dardh, dordha, d:ardha, da:dha, da:rdhat, da:rdhe |
| Case=Dat|Definite=Def|Number=Sing | da:rdhes | |
| Case=Dat|Definite=Def|Number=Plur | da:rdhave, dardha:ve, dardhave | |
| Case=Dat|Definite=Ind|Number=Plur | da:rdhave, dardha: | |
| Case=Gen|Definite=Def|Number=Sing | dardhes, dardhës | |
| Case=Gen|Definite=Def|Number=Plur | da:rdhave, dardha:ve | |
| Case=Nom|Definite=Def|Number=Sing | da:rdha, dardha, dardha: | |
| Case=Nom|Definite=Def|Number=Plur | dardhat | da:rdhat, dardhat, dardha:t, da:rdhatë, da:rdhet, da:rdhët, dardha, dardhët, dordhat, ta:rdhat |
| Case=Nom|Definite=Ind|Number=Sing | da:rdh, dardh, da:rdha, dardha: | |
| Case=Nom|Definite=Ind|Number=Plur | da:rdha | dardha, dardha:, dardhat, dordha |
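A paradigm table of this kind groups the word forms of one lemma by their remaining features (rows) and by the value of Gender (columns). The sketch below shows one way to build such a grouping; `build_paradigm` and the `TOKENS` list are hypothetical names, and the four entries are a hand-picked subset that mirrors rows of the table above rather than real treebank output.

```python
from collections import defaultdict

# (form, lemma, FEATS) triples chosen by hand for illustration.
TOKENS = [
    ("dardha", "dardhë", "Case=Acc|Definite=Ind|Gender=Fem|Number=Plur"),
    ("da:rdha", "dardhë", "Case=Acc|Definite=Ind|Gender=Fem|Number=Plur"),
    ("dardhat", "dardhë", "Case=Acc|Definite=Def|Gender=Fem|Number=Plur"),
    ("dardhat", "dardhë", "Case=Acc|Definite=Def|Gender=Masc|Number=Plur"),
]


def build_paradigm(tokens, lemma, feature="Gender"):
    """Map each remaining-features string to {feature value: unique forms}."""
    table = defaultdict(lambda: defaultdict(list))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma:
            continue
        value = None
        rest = []
        for pair in feats.split("|"):
            name, _, val = pair.partition("=")
            if name == feature:
                value = val
            elif pair != "_":
                rest.append(pair)
        if value is None:
            continue  # tokens without the feature do not enter the table
        forms = table["|".join(rest)][value]
        if form not in forms:
            forms.append(form)  # keep first-seen order, drop duplicates
    return {row: dict(cols) for row, cols in table.items()}


paradigm = build_paradigm(TOKENS, "dardhë")
for row, cols in paradigm.items():
    print(row, cols)
```

A form such as dardhat ends up in both columns of the same row when it is annotated with different Gender values in different tokens, which is the pattern visible in several rows above.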
PRON
966 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (663; 69%), Person=EMPTY (646; 67%), Case=Nom (596; 62%), PronType=Dem (562; 58%).
PRON tokens may have the following values of Gender:
Fem (224; 23% of non-empty Gender): ato, at, ajo, ato:, kjo, asaj, kto, gjitha, njanen, a:t
Masc (742; 77% of non-empty Gender): aj, ata, ky, kta, at, ati, ai, ata:, ai:, ki
EMPTY (1932): i, e, j, a, qe, që, m, u, do, krejt
| Paradigm i | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Plur | i | |
| Case=Acc|Number=Plur|Person=3 | i | |
| Case=Nom|Number=Sing|Person=3 | i |
NUM
213 NUM tokens (66% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (204; 96%).
NUM tokens may have the following values of Gender:
Fem (85; 40% of non-empty Gender): tri, tri:, dy:, treta, tretën, tretës
Masc (128; 60% of non-empty Gender): tre, tre:, treve, tret, /tre, tra
EMPTY (109): ni, dy, një, nja, njo, dy:, nji, nje, kater, ni:
| Paradigm tre | Masc | Fem |
|---|---|---|
| NumType=Card | tre, tre:, /tre, tra | tri, tri: |
| NumType=Ord | | treta |
ADJ
149 ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (104; 70%), Case=Nom (84; 56%).
ADJ tokens may have the following values of Gender:
Fem (54; 36% of non-empty Gender): tjeter, mbu:shura, vogël, mush, njejtën, re:, vogel, bardh, bishtale:ca, bu:kur
Masc (95; 64% of non-empty Gender): tjer, tje:r, tjeter, tjetër, vogël, tjetri, vjeter, vogel, ri, sjellshëm
EMPTY (87): pak, pa, shum, herët, mushta, normal, pa:, plot, quditshme, shqip
| Paradigm tjetër | Masc | Fem |
|---|---|---|
| Case=Abl|Number=Sing | tjeter | tjeter, tjetres |
| Case=Acc|Definite=Ind|Number=Plur | tjera: | |
| Case=Acc|Number=Sing | tjeter, tje:r, tjetër, qeter, tje:tër | tjeter, tje:tër |
| Case=Acc|Number=Plur | tje:r | tjera, tjerat |
| Case=Dat|Definite=Ind|Number=Sing | tjetrin, tjetrit | |
| Case=Dat|Number=Plur | tjer | |
| Case=Nom | tjer | |
| Case=Nom|Definite=Def|Number=Sing | tjetri | |
| Case=Nom|Definite=Ind|Number=Sing | tjeter, tjetri | |
| Case=Nom|Number=Sing | tjetër, tjeter, tje:r, tjer | tjeter |
| Case=Nom|Number=Plur | tjer, tje:r | |
| Definite=Def | tjetri |
DET
97 DET tokens (13% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (75; 77%), Case=Nom (50; 52%).
DET tokens may have the following values of Gender:
Fem (43; 44% of non-empty Gender): e, t, të
Masc (54; 56% of non-empty Gender): i, e, t, të, /i, i:, te, të:, ë
EMPTY (653): ni, një, e, ni:, t, i, të, nji, nje, një:
| Paradigm e | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Sing | e | e |
| Case=Acc|Number=Plur | ë | e |
| Case=Gen|Number=Plur | e | |
| Case=Nom|Number=Sing | e | |
| Case=Nom|Number=Plur | e | e |
| Number=Sing | e |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> PRON (400; 76%),
NOUN –[nummod]–> NUM (134; 55%),
NOUN –[nmod]–> NOUN (117; 51%),
NOUN –[amod]–> ADJ (100; 78%),
ADJ –[det]–> DET (46; 57%),
NOUN –[reparandum]–> NOUN (35; 85%),
NOUN –[conj]–> NOUN (22; 85%),
PRON –[nmod]–> NOUN (15; 68%),
NUM –[det]–> PRON (10; 91%),
NOUN –[acl]–> NOUN (8; 80%).
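Agreement figures of this kind can be approximated by walking each dependency tree and comparing the Gender value of every child with that of its parent. The sketch below is an illustration only. It assumes the percentage is taken over parent-child pairs in which both nodes have a non-empty Gender, which this page does not state explicitly. The function names and the sample sentence are hypothetical, with invented annotations.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_counts(sentence):
    """Count agreeing pairs per (parent UPOS, relation, child UPOS).

    `sentence` is a list of (id, upos, feats, head, deprel) tuples for one
    tree. Returns {key: (agreeing pairs, pairs where both have Gender)}.
    """
    by_id = {tok[0]: tok for tok in sentence}
    both = Counter()
    agree = Counter()
    for _tok_id, upos, feats, head, deprel in sentence:
        if head == 0:
            continue  # the root has no parent to agree with
        parent = by_id[head]
        child_gender = gender_of(feats)
        parent_gender = gender_of(parent[2])
        if child_gender is None or parent_gender is None:
            continue  # assumption: only pairs with Gender on both sides count
        key = (parent[1], deprel, upos)
        both[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Invented sentence: one agreeing det pair, one disagreeing nummod pair.
SENTENCE = [
    (1, "PRON", "Gender=Masc|Number=Sing", 2, "det"),
    (2, "NOUN", "Gender=Masc|Number=Sing", 3, "nsubj"),
    (3, "VERB", "_", 0, "root"),
    (4, "NUM", "Gender=Masc|NumType=Card", 5, "nummod"),
    (5, "NOUN", "Gender=Fem|Number=Plur", 3, "obj"),
]

for (parent_upos, rel, child_upos), (n_agree, n_both) in agreement_counts(SENTENCE).items():
    print(f"{parent_upos} -[{rel}]-> {child_upos} ({n_agree}; {round(100 * n_agree / n_both)}%)")
```

Summing these per-sentence counters over a whole treebank and sorting by the number of agreeing pairs would give a ranking in the same shape as the list above.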