Treebank Statistics: UD_Gheg-GPS: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
3876 tokens (24%) have a non-empty value of Gender
.
962 types (37%) occur at least once with a non-empty value of Gender
.
415 lemmas (43%) occur at least once with a non-empty value of Gender
.
The feature is used with 5 part-of-speech tags: NOUN (2451; 15% instances), PRON (966; 6% instances), NUM (213; 1% instances), ADJ (149; 1% instances), DET (97; 1% instances).
NOUN
2451 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (1752; 71%), Case=Acc (1383; 56%), Definite=Ind (1303; 53%).
NOUN
tokens may have the following values of Gender
:
Fem
(1441; 59% of non-emptyGender
): da:rdha, dardha, dardhat, da:rdhat, korp, dardh, bicikell, dardha:, tok, dardha:tMasc
(1010; 41% of non-emptyGender
): djem, djali, djal, djemt, njeri, kapuqin, burr, bujk, burri, fmi:EMPTY
(22): birne, kappe, baum, belo:nig, bode, cowboy, djali, fi:lmi, jungs, koh
Paradigm dardhë | Masc | Fem |
---|---|---|
Case=Abl|Definite=Def|Number=Sing | dardhës, da:rdhes, da:rdhës, dardhes | |
Case=Abl|Definite=Def|Number=Plur | da:rdha | |
Case=Acc|Definite=Def|Number=Sing | da:rdhen | da:rdhën, dardh, dardhen, dardhën, da:rdh, da:rdhat, da:rdhin, dardhat, dardhët |
Case=Acc|Definite=Def|Number=Plur | dardhat | dardhat, da:rdhat, dardha:t, dardhët, dordhat, dardha, dardhat:, da:rdha, da:rthat, dordha:t |
Case=Acc|Definite=Ind | dardha: | |
Case=Acc|Definite=Ind|Number=Sing | da:rdh | dardh, da:rdh, da:rdha, da:rdhë, dardha, da:rdha:, dardha:, dardhë, dardë/ |
Case=Acc|Definite=Ind|Number=Plur | da:rdh, da:rdha, dardha, dardha: | da:rdha, dardha, dardha:, da:rdha:, dardh, dordha, d:ardha, da:dha, da:rdhat, da:rdhe |
Case=Dat|Definite=Def|Number=Sing | da:rdhes | |
Case=Dat|Definite=Def|Number=Plur | da:rdhave, dardha:ve, dardhave | |
Case=Dat|Definite=Ind|Number=Plur | da:rdhave, dardha: | |
Case=Gen|Definite=Def|Number=Sing | dardhes, dardhës | |
Case=Gen|Definite=Def|Number=Plur | da:rdhave, dardha:ve | |
Case=Nom|Definite=Def|Number=Sing | da:rdha, dardha, dardha: | |
Case=Nom|Definite=Def|Number=Plur | dardhat | da:rdhat, dardhat, dardha:t, da:rdhatë, da:rdhet, da:rdhët, dardha, dardhët, dordhat, ta:rdhat |
Case=Nom|Definite=Ind|Number=Sing | da:rdh, dardh, da:rdha, dardha: | |
Case=Nom|Definite=Ind|Number=Plur | da:rdha | dardha, dardha:, dardhat, dordha |
PRON
966 PRON tokens (33% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (663; 69%), Person=EMPTY (646; 67%), Case=Nom (596; 62%), PronType=Dem (562; 58%).
PRON
tokens may have the following values of Gender
:
Fem
(224; 23% of non-emptyGender
): ato, at, ajo, ato:, kjo, asaj, kto, gjitha, njanen, a:tMasc
(742; 77% of non-emptyGender
): aj, ata, ky, kta, at, ati, ai, ata:, ai:, kiEMPTY
(1932): i, e, j, a, qe, që, m, u, do, krejt
Paradigm i | Masc | Fem |
---|---|---|
Case=Acc|Number=Plur | i | |
Case=Acc|Number=Plur|Person=3 | i | |
Case=Nom|Number=Sing|Person=3 | i |
NUM
213 NUM tokens (66% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (204; 96%).
NUM
tokens may have the following values of Gender
:
Fem
(85; 40% of non-emptyGender
): tri, tri:, dy:, treta, tretën, tretësMasc
(128; 60% of non-emptyGender
): tre, tre:, treve, tret, /tre, traEMPTY
(109): ni, dy, një, nja, njo, dy:, nji, nje, kater, ni:
Paradigm tre | Masc | Fem |
---|---|---|
NumType=Card | tre, tre:, /tre, tra | tri, tri: |
NumType=Ord | treta |
ADJ
149 ADJ tokens (63% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (104; 70%), Case=Nom (84; 56%).
ADJ
tokens may have the following values of Gender
:
Fem
(54; 36% of non-emptyGender
): tjeter, mbu:shura, vogël, mush, njejtën, re:, vogel, bardh, bishtale:ca, bu:kurMasc
(95; 64% of non-emptyGender
): tjer, tje:r, tjeter, tjetër, vogël, tjetri, vjeter, vogel, ri, sjellshëmEMPTY
(87): pak, pa, shum, herët, mushta, normal, pa:, plot, quditshme, shqip
Paradigm tjetër | Masc | Fem |
---|---|---|
Case=Abl|Number=Sing | tjeter | tjeter, tjetres |
Case=Acc|Definite=Ind|Number=Plur | tjera: | |
Case=Acc|Number=Sing | tjeter, tje:r, tjetër, qeter, tje:tër | tjeter, tje:tër |
Case=Acc|Number=Plur | tje:r | tjera, tjerat |
Case=Dat|Definite=Ind|Number=Sing | tjetrin, tjetrit | |
Case=Dat|Number=Plur | tjer | |
Case=Nom | tjer | |
Case=Nom|Definite=Def|Number=Sing | tjetri | |
Case=Nom|Definite=Ind|Number=Sing | tjeter, tjetri | |
Case=Nom|Number=Sing | tjetër, tjeter, tje:r, tjer | tjeter |
Case=Nom|Number=Plur | tjer, tje:r | |
Definite=Def | tjetri |
DET
97 DET tokens (13% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (75; 77%), Case=Nom (50; 52%).
DET
tokens may have the following values of Gender
:
Fem
(43; 44% of non-emptyGender
): e, t, tëMasc
(54; 56% of non-emptyGender
): i, e, t, të, /i, i:, te, të:, ëEMPTY
(653): ni, një, e, ni:, t, i, të, nji, nje, një:
Paradigm e | Masc | Fem |
---|---|---|
Case=Acc|Number=Sing | e | e |
Case=Acc|Number=Plur | ë | e |
Case=Gen|Number=Plur | e | |
Case=Nom|Number=Sing | e | |
Case=Nom|Number=Plur | e | e |
Number=Sing | e |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> PRON (400; 76%),
NOUN –[nummod]–> NUM (134; 55%),
NOUN –[nmod]–> NOUN (117; 51%),
NOUN –[amod]–> ADJ (100; 78%),
ADJ –[det]–> DET (46; 57%),
NOUN –[reparandum]–> NOUN (35; 85%),
NOUN –[conj]–> NOUN (22; 85%),
PRON –[nmod]–> NOUN (15; 68%),
NUM –[det]–> PRON (10; 91%),
NOUN –[acl]–> NOUN (8; 80%).