This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home es/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Spanish)

This feature is universal. It occurs with 2 different values: Fem, Masc.

158441 tokens (37%) have a non-empty value of Gender. 20664 types (45%) occur at least once with a non-empty value of Gender. 14851 lemmas (41%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: es-pos/NOUN (70392; 16% instances), es-pos/DET (56064; 13% instances), es-pos/ADJ (15362; 4% instances), es-pos/VERB (7518; 2% instances), es-pos/PRON (4451; 1% instances), es-pos/PROPN (3490; 1% instances), es-pos/X (542; 0% instances), es-pos/AUX (292; 0% instances), es-pos/NUM (208; 0% instances), es-pos/SYM (122; 0% instances).

NOUN

70392 es-pos/NOUN tokens (91% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (50573; 72%).

NOUN tokens may have the following values of Gender:

Paradigm parteMascFem
Number=Singparteparte
Number=Plurpartes

Gender seems to be lexical feature of NOUN. 97% lemmas (8872) occur only with one value of Gender.

DET

56064 es-pos/DET tokens (92% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (51173; 91%), Number=Sing (44644; 80%), Definite=Def (43522; 78%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Number=Singella
Number=Plurloslas

ADJ

15362 es-pos/ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (10933; 71%).

ADJ tokens may have the following values of Gender:

Paradigm primeroMascFem
Number=Singprimer, primeroprimera
Number=Sing|NumType=Ordprimer, primeroprimera
Number=Plurprimerosprimeras
Number=Plur|NumType=Ordprimerosprimeras

VERB

7518 es-pos/VERB tokens (18% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (7518; 100%), Person=EMPTY (7517; 100%), VerbForm=Part (6817; 91%), Number=Sing (6067; 81%), Tense=EMPTY (4395; 58%).

VERB tokens may have the following values of Gender:

Paradigm hacerMascFem
Tense=Past|VerbForm=Parthechohecha
VerbForm=Finhacerhaga, hacía
VerbForm=Parthechohecha

PRON

4451 es-pos/PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (4446; 100%), Number=Sing (3344; 75%), PronType=Prs (2879; 65%), Person=3 (2816; 63%), PrepCase=EMPTY (2271; 51%).

PRON tokens may have the following values of Gender:

Paradigm élMascFem
Case=Acc,Nom|Number=Singél, elloella
Case=Acc,Nom|Number=Plurellosellas
Case=Acc|Number=Sing|PrepCase=Nprlola
Case=Acc|Number=Plur|PrepCase=Nprloslas

PROPN

3490 es-pos/PROPN tokens (9% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3002; 86%).

PROPN tokens may have the following values of Gender:

Paradigm theMascFem
thethe

Gender seems to be lexical feature of PROPN. 99% lemmas (2037) occur only with one value of Gender.

X

542 es-pos/X tokens (27% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (433; 80%).

X tokens may have the following values of Gender:

Paradigm 'sMascFem
_'s's
Number=Sing's's
Number=Sing|Person=3's

Gender seems to be lexical feature of X. 96% lemmas (383) occur only with one value of Gender.

AUX

292 es-pos/AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (292; 100%), Person=EMPTY (290; 99%), Number=Sing (272; 93%), VerbForm=Part (225; 77%), Tense=Past (222; 76%).

AUX tokens may have the following values of Gender:

Paradigm haberMascFem
Number=Singhan, haber
Number=Plur|Person=3has

NUM

208 es-pos/NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (208; 100%), Number=Sing (175; 84%).

NUM tokens may have the following values of Gender:

Paradigm unoMascFem
un, unouna

SYM

122 es-pos/SYM tokens (7% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Paradigm $MascFem
Number=Sing$$
Number=Sing|VerbForm=Part$
Number=Plur|VerbForm=Part$

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (42452; 84%), NOUN –[amod]–> ADJ (10281; 56%), NOUN –[conj]–> NOUN (2934; 54%), NOUN –[acl]–> VERB (1856; 82%), NOUN –[nummod]–> ADJ (793; 94%), VERB –[nsubjpass]–> NOUN (697; 89%), ADJ –[nsubj]–> NOUN (548; 57%), PRON –[nmod]–> NOUN (500; 68%), ADJ –[conj]–> ADJ (447; 54%), NOUN –[det]–> PRON (188; 71%).


Treebank Statistics (UD_Spanish-AnCora)

This feature is universal. It occurs with 2 different values: Fem, Masc.

200949 tokens (36%) have a non-empty value of Gender. 17301 types (44%) occur at least once with a non-empty value of Gender. 11645 lemmas (44%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: es-pos/NOUN (87899; 16% instances), es-pos/DET (78722; 14% instances), es-pos/ADJ (24251; 4% instances), es-pos/VERB (4616; 1% instances), es-pos/PRON (4410; 1% instances), es-pos/AUX (627; 0% instances), es-pos/NUM (291; 0% instances), es-pos/ADP (76; 0% instances), es-pos/ADV (57; 0% instances).

NOUN

87899 es-pos/NOUN tokens (87% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (61643; 70%).

NOUN tokens may have the following values of Gender:

Paradigm candidatoMascFem
Number=Singcandidato
Number=PlurcandidatosCANDIDATAS

Gender seems to be lexical feature of NOUN. 99% lemmas (7741) occur only with one value of Gender.

DET

78722 es-pos/DET tokens (92% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (71978; 91%), Definite=Def (62611; 80%), Number=Sing (62048; 79%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Number=Singella
Number=Plurloslas
EL

ADJ

24251 es-pos/ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (17741; 73%), Number=Sing (17384; 72%).

ADJ tokens may have the following values of Gender:

Paradigm primeroMascFem
Number=Singprimer, primeroprimera
Number=Plurprimerosprimeras

VERB

4616 es-pos/VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (4615; 100%), Tense=Past (4615; 100%), Person=EMPTY (4615; 100%), VerbForm=Part (4615; 100%), Number=Sing (4311; 93%).

VERB tokens may have the following values of Gender:

Paradigm hacerMascFem
Number=Singhechohecha
Number=Plurhechos

PRON

4410 es-pos/PRON tokens (18% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (3182; 72%), Number=Sing (2924; 66%), Person=EMPTY (2401; 54%).

PRON tokens may have the following values of Gender:

Paradigm élMascFem
Case=Acc|Number=Singlo, le, Lesla
Case=Acc|Number=Plurlos, leslas
Number=Singélella
Number=Plurellosellas, les

AUX

627 es-pos/AUX tokens (4% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=Part (627; 100%), Person=EMPTY (627; 100%), Mood=EMPTY (627; 100%), Tense=Past (627; 100%), Number=Sing (610; 97%).

AUX tokens may have the following values of Gender:

Paradigm investigarMascFem
Number=Singinvestigado
Number=Plurinvestigadas

Gender seems to be lexical feature of AUX. 98% lemmas (61) occur only with one value of Gender.

NUM

291 es-pos/NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (290; 100%), NumForm=EMPTY (290; 100%), Number=Plur (176; 60%).

NUM tokens may have the following values of Gender:

Paradigm ambosMascFem
ambosambas

ADP

76 es-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Preppron (76; 100%).

ADP tokens may have the following values of Gender:

Gender seems to be lexical feature of ADP. 100% lemmas (12) occur only with one value of Gender.

ADV

57 es-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Negative=EMPTY (57; 100%).

ADV tokens may have the following values of Gender:

Gender seems to be lexical feature of ADV. 100% lemmas (14) occur only with one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (56734; 85%), NOUN –[amod]–> ADJ (16852; 63%), NOUN –[conj]–> NOUN (2460; 54%), DET –[det]–> DET (1076; 82%), ADJ –[det]–> DET (684; 55%), ADJ –[nsubj]–> NOUN (647; 57%), ADJ –[conj]–> ADJ (567; 56%), PRON –[nmod]–> NOUN (442; 72%), NOUN –[nmod]–> PRON (310; 54%), PRON –[amod]–> ADJ (104; 51%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]