Treebank Statistics: UD_Albanian-TSA: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
500 tokens (54%) have a non-empty value of Gender.
339 types (71%) occur at least once with a non-empty value of Gender.
279 lemmas (68%) occur at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: NOUN (235; 25% instances), DET (115; 12% instances), ADJ (83; 9% instances), PRON (52; 6% instances), PROPN (15; 2% instances).
NOUN
235 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: NounType=EMPTY (205; 87%), Definite=Def (161; 69%), Number=Sing (159; 68%).
NOUN tokens may have the following values of Gender:
Fem(140; 60% of non-emptyGender): Dashuria, kohës, marrëdhënieve, mënyrë, politikat, shkencat, shoqëri, sjelljes, tregtinë, BujqësiaMasc(95; 40% of non-emptyGender): Evolucioni, Ishulli, dramaturgu, njeriut, njerëz, qytetit, shtete, ushqimit, vend, InteresiEMPTY(3): botëkuptim, etj, lloj
| Paradigm njeri | Masc | Fem |
|---|---|---|
| Case=Acc|Definite=Def|NounType=Het|Number=Plur | njerëzit | |
| Case=Acc|Definite=Ind|Number=Plur | njerëz | |
| Case=Gen|Definite=Def|Number=Sing | njeriut | |
| Case=Nom|Definite=Def|Number=Plur | njerëzit | |
| Case=Nom|Definite=Ind|Number=Plur | njerëz |
Gender seems to be lexical feature of NOUN. 96% lemmas (174) occur only with one value of Gender.
DET
115 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
DET tokens may have the following values of Gender:
Fem(71; 62% of non-emptyGender): e, të, një, sëMasc(44; 38% of non-emptyGender): i, të, një, sëEMPTY(1): e
| Paradigm i | Masc | Fem |
|---|---|---|
| _ | i, të, së | të, e, së |
| Number=Plur | e |
ADJ
83 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (71; 86%), Number=Sing (51; 61%).
ADJ tokens may have the following values of Gender:
Fem(49; 59% of non-emptyGender): komplekse, kryesore, njerëzore, sociale, Madhe, aplikuar, avancuara, caktuar, dendura, dixhitaleMasc(34; 41% of non-emptyGender): rëndësishëm, madh, njohur, Anglez, Evropian, abstrakt, caktuara, drejtpërdrejtë, emocionalë, interesuarEMPTY(1): lidhur
| Paradigm kryesor | Masc | Fem |
|---|---|---|
| Number=Sing | kryesor | kryesore |
| Number=Plur | kryesorë |
Gender seems to be lexical feature of ADJ. 93% lemmas (64) occur only with one value of Gender.
PRON
52 PRON tokens (98% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (46; 88%), Number=Sing (28; 54%).
PRON tokens may have the following values of Gender:
Fem(28; 54% of non-emptyGender): disa, e, gjitha, këto, Kjo, cilat, këtë, saj, ato, atëMasc(24; 46% of non-emptyGender): Ata, i, tij, Ky, ai, cilitdo, disa, Këto, atyre, këtëEMPTY(1): u
| Paradigm ai | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Sing|PronType=Emp | e | |
| Case=Gen|Number=Sing|Poss=Yes|PronType=Prs | tij | |
| Case=Nom|Number=Sing|Person=3|PronType=Dem | ai | |
| Case=Nom|Number=Sing|PronType=Prs | Ai | |
| Case=Nom|Number=Plur|PronType=Prs | Ata | ato |
PROPN
15 PROPN tokens (75% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14; 93%), Definite=Def (13; 87%).
PROPN tokens may have the following values of Gender:
Fem(7; 47% of non-emptyGender): Shqipëri, Britania, Evropës, Japoninë, Kinës, KorenëMasc(8; 53% of non-emptyGender): Bashkimit, Djui, Djuin, Manit, Norsëve, Ruso, Zhak, ZhanEMPTY(5): Homo, Shakespeare, Shpëtim, William, Çuçka
Gender seems to be lexical feature of PROPN. 100% lemmas (13) occur only with one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (63; 94%),
NOUN –[det]–> DET (46; 79%),
ADJ –[det:adj]–> DET (32; 94%),
NOUN –[det]–> PRON (17; 74%),
PRON –[det:pron]–> DET (11; 92%),
ADJ –[conj]–> ADJ (6; 100%),
ADJ –[nsubj]–> NOUN (4; 67%),
NOUN –[nmod:poss]–> PROPN (4; 80%),
NOUN –[nsubj]–> NOUN (4; 57%),
ADJ –[nmod]–> NOUN (3; 60%).