home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-AnCora: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

204131 tokens (36%) have a non-empty value of Gender. 17341 types (45%) occur at least once with a non-empty value of Gender. 11608 lemmas (45%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (89250; 16% instances), DET (78760; 14% instances), ADJ (24709; 4% instances), PRON (5858; 1% instances), VERB (4775; 1% instances), AUX (480; 0% instances), NUM (290; 0% instances), PROPN (9; 0% instances).

NOUN

89250 NOUN tokens (88% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (62388; 70%).

NOUN tokens may have the following values of Gender:

Paradigm candidatoMascFem
Number=Singcandidato
Number=PlurcandidatosCANDIDATAS

Gender seems to be lexical feature of NOUN. 99% lemmas (7755) occur only with one value of Gender.

DET

78760 DET tokens (93% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (71584; 91%), Number=Sing (62068; 79%), Definite=Def (62012; 79%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Definite=Def|ExtPos=ADV|Number=Sing|PronType=Artla
Definite=Def|ExtPos=SCONJ|Number=Sing|PronType=Artel
Definite=Def|Foreign=Yes|Number=Sing|PronType=Artla
Definite=Def|Foreign=Yes|Number=Plur|PronType=Artlesles
Definite=Def|Number=Sing|PronType=Artella
Definite=Def|Number=Plur|PronType=Artlos, elslas
Number=Sing|PronType=Demella
Number=Plur|PronType=Demloslas

ADJ

24709 ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (18129; 73%), Number=Sing (17769; 72%).

ADJ tokens may have the following values of Gender:

Paradigm primeroMascFem
Number=Singprimer, primeroprimera
Number=Plurprimerosprimeras

PRON

5858 PRON tokens (23% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (5857; 100%), Number=Sing (4408; 75%), Person=3 (3444; 59%), PronType=Prs (3344; 57%), PrepCase=EMPTY (3222; 55%).

PRON tokens may have the following values of Gender:

Paradigm élMascFem
Case=Acc,Nom|Number=Sing|PronType=Prsél, elloella
Case=Acc,Nom|Number=Plur|PronType=Prsellosellas
Case=Acc|Definite=Def|Number=Sing|PrepCase=Npr|PronType=Prslo
Case=Acc|Definite=Ind|Number=Sing|PrepCase=Npr|PronType=PrsLO
Case=Acc|ExtPos=ADV|Number=Sing|PrepCase=Npr|PronType=Prslo
Case=Acc|ExtPos=CCONJ|Number=Sing|PrepCase=Npr|PronType=Prslo
Case=Acc|Number=Sing|PrepCase=Npr|PronType=Demlo
Case=Acc|Number=Sing|PrepCase=Npr|PronType=Prslola
Case=Acc|Number=Plur|PrepCase=Npr|PronType=Prsloslas
Case=Nom|Number=Sing|PronType=PrsElla

VERB

4775 VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (4774; 100%), Person=EMPTY (4774; 100%), VerbForm=Part (4774; 100%), Tense=Past (4772; 100%), Number=Sing (4455; 93%).

VERB tokens may have the following values of Gender:

Paradigm hacerMascFem
Number=Singhechohecha
Number=Plurhechos

AUX

480 AUX tokens (4% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (480; 100%), Number=Sing (480; 100%), Person=EMPTY (480; 100%), Tense=Past (480; 100%), VerbForm=Part (480; 100%).

AUX tokens may have the following values of Gender:

NUM

290 NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (290; 100%), NumForm=Word (289; 100%), Number=Plur (176; 61%).

NUM tokens may have the following values of Gender:

Paradigm ambosMascFem
ambosambas

PROPN

9 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (58636; 86%), NOUN –[amod]–> ADJ (17026; 63%), NOUN –[conj]–> NOUN (2535; 54%), NOUN –[appos]–> NOUN (928; 51%), ADJ –[det]–> DET (732; 66%), ADJ –[nsubj]–> NOUN (613; 57%), ADJ –[conj]–> ADJ (570; 56%), PRON –[nmod]–> NOUN (444; 73%), NOUN –[nmod]–> DET (189; 94%), ADJ –[det]–> PRON (167; 63%).