Treebank Statistics: UD_Albanian-TSA: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
500 tokens (54%) have a non-empty value of Gender.
339 types (71%) occur at least once with a non-empty value of Gender.
279 lemmas (68%) occur at least once with a non-empty value of Gender.
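The three coverage figures above (tokens, types, lemmas) can be recomputed from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that produced this page: the function names `read_tokens` and `gender_coverage` are hypothetical, the sample rows are invented for illustration, and it assumes that a "type" is a distinct, case-sensitive word form.

```python
# Toy CoNLL-U rows, invented for illustration (not taken from the treebank).
ROWS = [
    ("1", "njerëzit", "njeri", "NOUN", "Gender=Masc|Number=Plur"),
    ("2", "dhe", "dhe", "CCONJ", "_"),
    ("3", "njerëz", "njeri", "NOUN", "Gender=Masc|Number=Plur"),
    ("4", "njerëzit", "njeri", "NOUN", "Gender=Masc|Number=Plur"),
]
# Build the 10 tab-separated CoNLL-U columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC.
SAMPLE = "\n".join(
    "\t".join([i, form, lemma, upos, "_", feats, "0", "dep", "_", "_"])
    for i, form, lemma, upos, feats in ROWS
)


def parse_feats(feats):
    """Turn 'A=1|B=2' into {'A': '1', 'B': '2'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, feats) for regular word lines."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], parse_feats(cols[5])


def gender_coverage(conllu_text):
    """Return {unit: (count with Gender, total count)} for tokens, types, lemmas."""
    tokens = list(read_tokens(conllu_text))
    marked = [t for t in tokens if "Gender" in t[2]]
    result = {"tokens": (len(marked), len(tokens))}
    # Assumption: types are case-sensitive forms (index 0); lemmas are index 1.
    for name, idx in (("types", 0), ("lemmas", 1)):
        result[name] = (len({t[idx] for t in marked}), len({t[idx] for t in tokens}))
    return result


for unit, (with_gender, total) in gender_coverage(SAMPLE).items():
    print(f"{with_gender} of {total} {unit} ({100 * with_gender / total:.0f}%)")
```

A type or lemma counts as covered as soon as one of its tokens carries Gender, which is why the type and lemma percentages can exceed the token percentage.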
The feature is used with 5 part-of-speech tags: NOUN (235; 25% instances), DET (115; 12% instances), ADJ (83; 9% instances), PRON (52; 6% instances), PROPN (15; 2% instances).
NOUN
235 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: NounType=EMPTY (205; 87%), Definite=Def (161; 69%), Number=Sing (159; 68%).
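Co-occurrence counts of this kind, including the EMPTY pseudo-value for a feature that is absent, can be sketched as follows. The function name `cooccurring_values` and the toy tokens are hypothetical, and the list of features to inspect is passed in explicitly, which is an assumption about how the statistic is defined.

```python
from collections import Counter

# Toy (UPOS, features) pairs, invented for illustration.
TOKENS = [
    ("NOUN", {"Gender": "Masc", "Number": "Plur", "Definite": "Def"}),
    ("NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("NOUN", {"Number": "Sing"}),                    # no Gender: ignored
    ("ADJ", {"Gender": "Fem", "Number": "Sing"}),    # wrong UPOS: ignored
]


def cooccurring_values(tokens, upos, others, feature="Gender"):
    """Count values of `others` on tokens of `upos` that carry `feature`.

    A feature missing from a token is counted under the value 'EMPTY'.
    Returns (number of matching tokens, Counter of 'Name=Value' strings).
    """
    counts = Counter()
    n = 0
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for name in others:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return n, counts


n, counts = cooccurring_values(TOKENS, "NOUN", ("Definite", "Number"))
for value, count in counts.most_common():
    print(f"{value} ({count}; {100 * count / n:.0f}%)")
```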
NOUN tokens may have the following values of Gender:
Fem (140; 60% of non-empty Gender): Dashuria, kohës, marrëdhënieve, mënyrë, politikat, shkencat, shoqëri, sjelljes, tregtinë, Bujqësia
Masc (95; 40% of non-empty Gender): Evolucioni, Ishulli, dramaturgu, njeriut, njerëz, qytetit, shtete, ushqimit, vend, Interesi
EMPTY (3): botëkuptim, etj, lloj
Paradigm njeri | Masc | Fem |
---|---|---|
Case=Acc\|Definite=Def\|NounType=Het\|Number=Plur | njerëzit | |
Case=Acc\|Definite=Ind\|Number=Plur | njerëz | |
Case=Gen\|Definite=Def\|Number=Sing | njeriut | |
Case=Nom\|Definite=Def\|Number=Plur | njerëzit | |
Case=Nom\|Definite=Ind\|Number=Plur | njerëz | |
Gender seems to be a lexical feature of NOUN: 96% of lemmas (174) occur with only one value of Gender.
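The "lexical feature" judgment rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows. The function name `single_gender_lemma_share` is hypothetical, the toy data is invented (the two-gender lemma `mik` is only an illustration), and the denominator is assumed to be the lemmas of the given part of speech that occur at least once with a non-empty Gender.

```python
from collections import defaultdict

# Toy (lemma, UPOS, Gender-or-None) triples, invented for illustration.
TOKENS = [
    ("njeri", "NOUN", "Masc"),
    ("njeri", "NOUN", "Masc"),
    ("shkencë", "NOUN", "Fem"),
    ("lloj", "NOUN", None),      # no Gender: not part of the denominator
    ("mik", "NOUN", "Masc"),
    ("mik", "NOUN", "Fem"),      # hypothetical lemma seen with both values
]


def single_gender_lemma_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, total = single_gender_lemma_share(TOKENS, "NOUN")
print(f"{100 * single / total:.0f}% lemmas ({single}) occur with only one value of Gender")
```

A high share suggests that gender is fixed per lemma; a low share suggests that it is inflectional for that part of speech.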
DET
115 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
DET tokens may have the following values of Gender:
Fem (71; 62% of non-empty Gender): e, të, një, së
Masc (44; 38% of non-empty Gender): i, të, një, së
EMPTY (1): e
Paradigm i | Masc | Fem |
---|---|---|
_ | i, të, së | të, e, së |
Number=Plur | e |
ADJ
83 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (71; 86%), Number=Sing (51; 61%).
ADJ tokens may have the following values of Gender:
Fem (49; 59% of non-empty Gender): komplekse, kryesore, njerëzore, sociale, Madhe, aplikuar, avancuara, caktuar, dendura, dixhitale
Masc (34; 41% of non-empty Gender): rëndësishëm, madh, njohur, Anglez, Evropian, abstrakt, caktuara, drejtpërdrejtë, emocionalë, interesuar
EMPTY (1): lidhur
Paradigm kryesor | Masc | Fem |
---|---|---|
Number=Sing | kryesor | kryesore |
Number=Plur | kryesorë | |
Gender seems to be a lexical feature of ADJ: 93% of lemmas (64) occur with only one value of Gender.
PRON
52 PRON tokens (98% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (46; 88%), Number=Sing (28; 54%).
PRON tokens may have the following values of Gender:
Fem (28; 54% of non-empty Gender): disa, e, gjitha, këto, Kjo, cilat, këtë, saj, ato, atë
Masc (24; 46% of non-empty Gender): Ata, i, tij, Ky, ai, cilitdo, disa, Këto, atyre, këtë
EMPTY (1): u
Paradigm ai | Masc | Fem |
---|---|---|
Case=Acc\|Number=Sing\|PronType=Emp | e | |
Case=Gen\|Number=Sing\|Poss=Yes\|PronType=Prs | tij | |
Case=Nom\|Number=Sing\|Person=3\|PronType=Dem | ai | |
Case=Nom\|Number=Sing\|PronType=Prs | Ai | |
Case=Nom\|Number=Plur\|PronType=Prs | Ata | ato |
PROPN
15 PROPN tokens (75% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14; 93%), Definite=Def (13; 87%).
PROPN tokens may have the following values of Gender:
Fem (7; 47% of non-empty Gender): Shqipëri, Britania, Evropës, Japoninë, Kinës, Korenë
Masc (8; 53% of non-empty Gender): Bashkimit, Djui, Djuin, Manit, Norsëve, Ruso, Zhak, Zhan
EMPTY (5): Homo, Shakespeare, Shpëtim, William, Çuçka
Gender seems to be a lexical feature of PROPN: 100% of lemmas (13) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (63; 94%),
NOUN –[det]–> DET (46; 79%),
ADJ –[det:adj]–> DET (32; 94%),
NOUN –[det]–> PRON (17; 74%),
PRON –[det:pron]–> DET (11; 92%),
ADJ –[conj]–> ADJ (6; 100%),
ADJ –[nsubj]–> NOUN (4; 67%),
NOUN –[nmod:poss]–> PROPN (4; 80%),
NOUN –[nsubj]–> NOUN (4; 57%),
ADJ –[nmod]–> NOUN (3; 60%).
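Agreement figures like those listed above can be approximated by walking the dependency edges of each sentence. The sketch below is illustrative only: `agreement_by_relation` is a hypothetical name, the toy sentence is invented, and it assumes that the percentage is the share of agreeing edges among edges of the same pattern where both parent and child carry a Gender value.

```python
from collections import Counter

# Toy sentence, invented for illustration: one head noun with three dependents.
SENTENCE = [
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
    {"id": 4, "head": 1, "upos": "ADP", "deprel": "case", "gender": None},
]


def agreement_by_relation(sentence):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, comparable edges)."""
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


for (parent, rel, child), (ok, n) in agreement_by_relation(SENTENCE).items():
    print(f"{parent} -[{rel}]-> {child} ({ok}; {100 * ok / n:.0f}%)")
```

For a whole treebank, the per-sentence counters would be summed before the percentages are computed.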