Treebank Statistics: UD_Latvian-Cairo: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
64 tokens (38%) have a non-empty value of Gender
.
56 types (49%) occur at least once with a non-empty value of Gender
.
48 lemmas (47%) occur at least once with a non-empty value of Gender
.
The feature is used with 6 part-of-speech tags: NOUN (27; 16% instances), PRON (13; 8% instances), PROPN (13; 8% instances), ADJ (5; 3% instances), DET (5; 3% instances), VERB (1; 1% instances).
NOUN
27 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (24; 89%).
NOUN
tokens may have the following values of Gender
:
Fem
(15; 56% of non-emptyGender
): mašīnu, Marija, Meitene, bronzu, draudzenei, dzeršanu, galvaspilsētā, istabu, jausmas, krāsāMasc
(12; 44% of non-emptyGender
): brālis, iemesla, kaimiņi, lietus, logu, matus, sudrabu, tētis, velosipēdu, vīram
Gender
seems to be lexical feature of NOUN
. 100% lemmas (25) occur only with one value of Gender
.
PRON
13 PRON tokens (65% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (12; 92%), Person=3 (12; 92%), PronType=Prs (10; 77%), Case=Nom (7; 54%).
PRON
tokens may have the following values of Gender
:
Fem
(4; 31% of non-emptyGender
): viņa, ViņaiMasc
(9; 69% of non-emptyGender
): viņš, to, Viņiem, kurš, viņa, viņamEMPTY
(7): tu, Es, Man, ko
PROPN
13 PROPN tokens (93% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (13; 100%).
PROPN
tokens may have the following values of Gender
:
Fem
(6; 46% of non-emptyGender
): Braunu, Džeina, Francijas, Marija, Mariju, ParīzēMasc
(7; 54% of non-emptyGender
): Pētera, Pēteris, Pēteri, Sem, SmituEMPTY
(1): Igvasu
ADJ
5 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (5; 100%), Case=Nom (4; 80%), Definite=Ind (4; 80%), Degree=Pos (4; 80%).
ADJ
tokens may have the following values of Gender
:
Fem
(3; 60% of non-emptyGender
): liela, maza, sarkanāMasc
(2; 40% of non-emptyGender
): foršāks, tavējais
DET
5 DET tokens (100% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (5; 100%), Person=EMPTY (4; 80%), Poss=Yes (3; 60%), PronType=Prs (3; 60%).
DET
tokens may have the following values of Gender
:
Fem
(2; 40% of non-emptyGender
): savai, ŠīMasc
(3; 60% of non-emptyGender
): Mans, kāda, savam
VERB
1 VERB tokens (3% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Evident=EMPTY (1; 100%), Mood=EMPTY (1; 100%), Number=Sing (1; 100%), Person=EMPTY (1; 100%), Polarity=Pos (1; 100%), Reflex=EMPTY (1; 100%), Tense=Past (1; 100%), VerbForm=Part (1; 100%), Voice=Pass (1; 100%).
VERB
tokens may have the following values of Gender
:
Fem
(1; 100% of non-emptyGender
): piegādātaEMPTY
(31): uzrakstīja, Nevarēja, apgriezt, apskāvās, atmest, atnākt, atstāja, atver, centās, domā
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (5; 100%),
NOUN –[amod]–> ADJ (2; 100%),
NOUN –[nmod]–> PROPN (2; 100%),
PROPN –[flat:name]–> PROPN (2; 100%),
ADJ –[conj]–> ADJ (1; 100%),
ADJ –[nsubj]–> NOUN (1; 100%),
ADJ –[obl]–> ADJ (1; 100%),
NOUN –[conj]–> NOUN (1; 100%),
NOUN –[det]–> PRON (1; 100%),
NOUN –[orphan]–> NOUN (1; 100%).