This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home lv/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Latvian)

This feature is universal. It occurs with 2 different values: Fem, Masc.

10173 tokens (49%) have a non-empty value of Gender. 4875 types (77%) occur at least once with a non-empty value of Gender. 2769 lemmas (71%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (6237; 30% instances), PROPN (1186; 6% instances), ADJ (962; 5% instances), VERB (778; 4% instances), PRON (412; 2% instances), DET (391; 2% instances), NUM (130; 1% instances), SCONJ (77; 0% instances).

NOUN

6237 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (4384; 70%).

NOUN tokens may have the following values of Gender:

Paradigm būvkompānijaMascFem
Case=Acc|Number=Singbūvkompāniju
Case=Gen|Number=PlurBūvkompānijubūvkompāniju

Gender seems to be lexical feature of NOUN. 100% lemmas (1570) occur only with one value of Gender.

PROPN

1186 PROPN tokens (75% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1155; 97%).

PROPN tokens may have the following values of Gender:

Paradigm SeisumsMascFem
Case=GenSeisuma
Case=NomSeisuma

Gender seems to be lexical feature of PROPN. 100% lemmas (424) occur only with one value of Gender.

ADJ

962 ADJ tokens (81% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: NumType=EMPTY (924; 96%), Degree=Pos (864; 90%), Number=Sing (640; 67%).

ADJ tokens may have the following values of Gender:

Paradigm lielaMascFem
Case=Acc|Degree=Pos|Number=Singlielulielu
Case=Acc|Degree=Pos|Number=Plurlielas
Case=Acc|Degree=Cmp|Number=Singlielāko
Case=Dat|Degree=Cmp|Number=Singlielākajai
Case=Loc|Degree=Pos|Number=Singlielajā
Case=Nom|Degree=Pos|Number=Singliela
Case=Nom|Degree=Pos|Number=Plurlielas
Case=Nom|Degree=Cmp|Number=Singlielākā, lielāka

Gender seems to be lexical feature of ADJ. 99% lemmas (386) occur only with one value of Gender.

VERB

778 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (778; 100%), Negative=EMPTY (778; 100%), Person=EMPTY (778; 100%), VerbForm=Part (777; 100%), Degree=Pos (776; 100%), Tense=Past (667; 86%), Aspect=Perf (667; 86%), Voice=EMPTY (667; 86%), Definite=Ind (586; 75%), Case=Nom (585; 75%).

VERB tokens may have the following values of Gender:

Paradigm būtMascFem
Aspect=Imp|Case=Acc|Definite=Def|Number=Sing|Tense=Pres|Voice=Passesošo
Aspect=Imp|Case=Acc|Definite=Def|Number=Plur|Tense=Pres|Voice=Passesošos
Aspect=Imp|Case=Dat|Definite=Ind|Number=Plur|Tense=Pres|Voice=Passesošiem
Aspect=Imp|Case=Gen|Definite=Def|Number=Plur|Tense=Pres|Voice=Passesošo
Aspect=Imp|Case=Loc|Definite=Ind|Number=Plur|Tense=Pres|Voice=Passesošās
Aspect=Imp|Case=Nom|Definite=Def|Number=Sing|Tense=Pres|Voice=Passesošaisesošā
Aspect=Perf|Case=Acc|Definite=Def|Number=Sing|Tense=Pastbijušo
Aspect=Perf|Case=Nom|Definite=Ind|Number=Sing|Tense=Pastbijisbijusi
Aspect=Perf|Case=Nom|Definite=Ind|Number=Plur|Tense=Pastbijušibijušas

PRON

412 PRON tokens (73% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (333; 81%), Person=EMPTY (324; 79%), PronType=Dem (233; 57%), Case=Nom (210; 51%).

PRON tokens may have the following values of Gender:

Gender seems to be lexical feature of PRON. 100% lemmas (31) occur only with one value of Gender.

DET

391 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (351; 90%), Number=Sing (248; 63%), PronType=Dem (201; 51%).

DET tokens may have the following values of Gender:

Paradigm savaMascFem
Case=Acc|Number=Singsavu
Case=Acc|Number=Plursavas
Case=Loc|Number=Singsavāsavā
Case=Loc|Number=PlurSavās

Gender seems to be lexical feature of DET. 95% lemmas (37) occur only with one value of Gender.

NUM

130 NUM tokens (31% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (130; 100%), Number=Plur (72; 55%).

NUM tokens may have the following values of Gender:

Paradigm vienaMascFem
Case=Accvienu
Case=Datvienai
Case=Genvienas
Case=Locvienāvienā
Case=Nomviena

SCONJ

77 SCONJ tokens (13% of all SCONJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which SCONJ and Gender co-occurred: PronType=Rel (77; 100%), Number=Sing (44; 57%).

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (1232; 54%), NOUN –[amod]–> ADJ (776; 79%), NOUN –[det]–> DET (354; 95%), NOUN –[conj]–> NOUN (259; 66%), PROPN –[name]–> PROPN (226; 97%), NOUN –[amod]–> VERB (225; 97%), VERB –[nsubjpass]–> NOUN (156; 97%), PROPN –[nmod]–> NOUN (149; 74%), NOUN –[acl]–> NOUN (90; 53%), PROPN –[conj]–> PROPN (63; 62%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]