home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-TSA: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

500 tokens (54%) have a non-empty value of Gender. 339 types (71%) occur at least once with a non-empty value of Gender. 279 lemmas (68%) occur at least once with a non-empty value of Gender. The feature is used with 5 part-of-speech tags: NOUN (235; 25% instances), DET (115; 12% instances), ADJ (83; 9% instances), PRON (52; 6% instances), PROPN (15; 2% instances).

NOUN

235 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: NounType=EMPTY (205; 87%), Definite=Def (161; 69%), Number=Sing (159; 68%).

NOUN tokens may have the following values of Gender:

Paradigm njeriMascFem
Case=Acc|Definite=Def|NounType=Het|Number=Plurnjerëzit
Case=Acc|Definite=Ind|Number=Plurnjerëz
Case=Gen|Definite=Def|Number=Singnjeriut
Case=Nom|Definite=Def|Number=Plurnjerëzit
Case=Nom|Definite=Ind|Number=Plurnjerëz

Gender seems to be lexical feature of NOUN. 96% lemmas (174) occur only with one value of Gender.

DET

115 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

DET tokens may have the following values of Gender:

Paradigm iMascFem
_i, të, sëtë, e, së
Number=Plure

ADJ

83 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (71; 86%), Number=Sing (51; 61%).

ADJ tokens may have the following values of Gender:

Paradigm kryesorMascFem
Number=Singkryesorkryesore
Number=Plurkryesorë

Gender seems to be lexical feature of ADJ. 93% lemmas (64) occur only with one value of Gender.

PRON

52 PRON tokens (98% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (46; 88%), Number=Sing (28; 54%).

PRON tokens may have the following values of Gender:

Paradigm aiMascFem
Case=Acc|Number=Sing|PronType=Empe
Case=Gen|Number=Sing|Poss=Yes|PronType=Prstij
Case=Nom|Number=Sing|Person=3|PronType=Demai
Case=Nom|Number=Sing|PronType=PrsAi
Case=Nom|Number=Plur|PronType=PrsAtaato

PROPN

15 PROPN tokens (75% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14; 93%), Definite=Def (13; 87%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (13) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (63; 94%), NOUN –[det]–> DET (46; 79%), ADJ –[det:adj]–> DET (32; 94%), NOUN –[det]–> PRON (17; 74%), PRON –[det:pron]–> DET (11; 92%), ADJ –[conj]–> ADJ (6; 100%), ADJ –[nsubj]–> NOUN (4; 67%), NOUN –[nmod:poss]–> PROPN (4; 80%), NOUN –[nsubj]–> NOUN (4; 57%), ADJ –[nmod]–> NOUN (3; 60%).