Treebank Statistics: UD_Latvian-Cairo: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
64 tokens (38%) have a non-empty value of Gender.
56 types (49%) occur at least once with a non-empty value of Gender.
47 lemmas (46%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (26; 15% instances), PROPN (14; 8% instances), PRON (10; 6% instances), DET (9; 5% instances), ADJ (4; 2% instances), VERB (1; 1% instances).
NOUN
26 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (23; 88%).
NOUN tokens may have the following values of Gender:
Fem(14; 54% of non-emptyGender): mašīnu, Meitene, bronzu, draudzenei, dzeršanu, galvaspilsētā, istabu, jausmas, krāsā, smēķēšanuMasc(12; 46% of non-emptyGender): brālis, iemesla, kaimiņi, lietus, logu, matus, sudrabu, tētis, velosipēdu, vīram
Gender seems to be lexical feature of NOUN. 100% lemmas (24) occur only with one value of Gender.
PROPN
14 PROPN tokens (93% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (14; 100%).
PROPN tokens may have the following values of Gender:
Fem(7; 50% of non-emptyGender): Marija, Braunu, Džeina, Francijas, Mariju, ParīzēMasc(7; 50% of non-emptyGender): Pētera, Pēteris, Pēteri, Sem, SmituEMPTY(1): Igvasu
PRON
10 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (10; 100%), PronType=Prs (10; 100%), Number=Sing (9; 90%), Case=Nom (6; 60%).
PRON tokens may have the following values of Gender:
Fem(4; 40% of non-emptyGender): viņa, ViņaiMasc(6; 60% of non-emptyGender): viņš, Viņiem, viņa, viņamEMPTY(7): tu, Es, Man, ko
DET
9 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (9; 100%), Definite=EMPTY (8; 89%), Degree=EMPTY (8; 89%), Person=EMPTY (5; 56%), Poss=EMPTY (5; 56%).
DET tokens may have the following values of Gender:
Fem(2; 22% of non-emptyGender): savai, ŠīMasc(7; 78% of non-emptyGender): to, Mans, kurš, kāda, savam, tavējais
ADJ
4 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Ind (4; 100%), Number=Sing (4; 100%), Case=Nom (3; 75%), Degree=Pos (3; 75%).
ADJ tokens may have the following values of Gender:
Fem(3; 75% of non-emptyGender): liela, maza, sarkanāMasc(1; 25% of non-emptyGender): foršāks
VERB
1 VERB tokens (3% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Evident=EMPTY (1; 100%), Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Polarity=Pos (1; 100%), Reflex=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%), Voice=Pass (1; 100%).
VERB tokens may have the following values of Gender:
Fem(1; 100% of non-emptyGender): piegādātaEMPTY(30): uzrakstīja, Nevarēja, apgriezt, apskāvās, atmest, atnākt, atstāja, atver, centās, domā
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (5; 100%),
NOUN –[amod]–> ADJ (2; 100%),
NOUN –[nmod]–> PROPN (2; 100%),
PROPN –[flat:name]–> PROPN (2; 100%),
ADJ –[advcl]–> DET (1; 100%),
ADJ –[conj]–> ADJ (1; 100%),
ADJ –[nsubj]–> NOUN (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[nmod]–> PRON (1; 100%),
NOUN –[orphan]–> NOUN (1; 100%).