home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

158718 tokens (37%) have a non-empty value of Gender. 20356 types (45%) occur at least once with a non-empty value of Gender. 14665 lemmas (43%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (70647; 16% instances), DET (56090; 13% instances), ADJ (15916; 4% instances), VERB (6941; 2% instances), PRON (4602; 1% instances), PROPN (3418; 1% instances), X (506; 0% instances), AUX (269; 0% instances), NUM (209; 0% instances), SYM (120; 0% instances).

NOUN

70647 NOUN tokens (89% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (50738; 72%).

NOUN tokens may have the following values of Gender:

Paradigm parteMascFem
Number=Singparteparte
Number=Plurpartes

Gender seems to be lexical feature of NOUN. 97% lemmas (8762) occur only with one value of Gender.

DET

56090 DET tokens (92% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (51185; 91%), Number=Sing (44763; 80%), Definite=Def (43530; 78%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Definite=Def|Number=Singella, l'
Definite=Def|Number=Sing|Typo=Yesal, en, del, lea, al
Definite=Def|Number=Plurloslas
Number=Sing|Typo=Yesal, ena

ADJ

15916 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (11453; 72%).

ADJ tokens may have the following values of Gender:

Paradigm primeroMascFem
Number=Singprimer, primeroprimera
Number=Plurprimerosprimeras

VERB

6941 VERB tokens (19% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (6940; 100%), Mood=EMPTY (6939; 100%), VerbForm=Part (6934; 100%), Number=Sing (5565; 80%), Tense=EMPTY (3853; 56%).

VERB tokens may have the following values of Gender:

Paradigm tenerMascFem
Number=Singtenido
Number=Plurtenidostenidas

PRON

4602 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (4597; 100%), Number=Sing (3496; 76%), PronType=Prs (2881; 63%), Person=3 (2818; 61%), PrepCase=EMPTY (2421; 53%).

PRON tokens may have the following values of Gender:

Paradigm élMascFem
Case=Acc,Nom|Number=Singél, elloella
Case=Acc,Nom|Number=Plurellosellas
Case=Acc|Number=Sing|PrepCase=Nprlola
Case=Acc|Number=Plur|PrepCase=Nprloslas
Case=Dat|Number=Sing|PrepCase=Npr|Typo=Yesla
Case=Nom|Number=Singél
Number=Sing|Typo=Yesel

PROPN

3418 PROPN tokens (9% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2958; 87%).

PROPN tokens may have the following values of Gender:

Paradigm IslaMascFem
_Islas
Number=SingIsla
Number=PlurIslas

Gender seems to be lexical feature of PROPN. 99% lemmas (2126) occur only with one value of Gender.

X

506 X tokens (28% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (401; 79%).

X tokens may have the following values of Gender:

Paradigm 'sMascFem
_'s's
Number=Sing's's
Number=Sing|Person=3's

Gender seems to be lexical feature of X. 96% lemmas (360) occur only with one value of Gender.

AUX

269 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (269; 100%), Number=Sing (269; 100%), Person=EMPTY (269; 100%), VerbForm=Part (269; 100%), Tense=Past (268; 100%).

AUX tokens may have the following values of Gender:

NUM

209 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (209; 100%), Number=Sing (177; 85%), NumForm=Word (169; 81%).

NUM tokens may have the following values of Gender:

Paradigm unoMascFem
un, unouna

SYM

120 SYM tokens (7% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Paradigm $MascFem
Number=Sing$$
Number=Sing|VerbForm=Part$
Number=Plur|VerbForm=Part$

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (42692; 84%), NOUN –[amod]–> ADJ (11106; 58%), NOUN –[conj]–> NOUN (2943; 54%), NOUN –[acl]–> VERB (1935; 82%), VERB –[nsubj:pass]–> NOUN (696; 86%), PRON –[nmod]–> NOUN (509; 69%), ADJ –[nsubj]–> NOUN (471; 57%), ADJ –[conj]–> ADJ (448; 54%), NOUN –[nsubj]–> NOUN (423; 51%), NOUN –[det]–> PRON (186; 70%).