home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-COSER: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

2052 tokens (25%) have a non-empty value of Gender. 743 types (49%) occur at least once with a non-empty value of Gender. 602 lemmas (58%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (886; 11% instances), DET (692; 9% instances), PRON (292; 4% instances), ADJ (127; 2% instances), VERB (46; 1% instances), AUX (4; 0% instances), NUM (3; 0% instances), PROPN (1; 0% instances), X (1; 0% instances).

NOUN

886 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (648; 73%).

NOUN tokens may have the following values of Gender:

Paradigm hijoMascFem
Number=Singhijohija
Number=Plurhijos

Gender seems to be lexical feature of NOUN. 98% lemmas (441) occur only with one value of Gender.

DET

692 DET tokens (94% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (562; 81%), Number=Sing (533; 77%), Definite=Def (425; 61%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Number=Singel, lla
Number=Plurloslas

PRON

292 PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (292; 100%), Number=Sing (198; 68%), PronType=Prs (187; 64%), Person=3 (176; 60%), PrepCase=Npr (157; 54%), Case=Acc (155; 53%).

PRON tokens may have the following values of Gender:

Paradigm élMascFem
Case=Acc,Nom|Number=Singél, elloella
Case=Acc,Nom|Number=Plurellosellas
Case=Acc|Number=Sing|PrepCase=Nprlola
Case=Acc|Number=Plur|PrepCase=Nprloslas

ADJ

127 ADJ tokens (73% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (104; 82%), VerbForm=EMPTY (100; 79%).

ADJ tokens may have the following values of Gender:

Paradigm primeroMascFem
primera
NumType=Ordprimer, primero

Gender seems to be lexical feature of ADJ. 97% lemmas (88) occur only with one value of Gender.

VERB

46 VERB tokens (5% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (46; 100%), Number=Sing (46; 100%), Person=EMPTY (46; 100%), Tense=Past (46; 100%), VerbForm=Part (46; 100%).

VERB tokens may have the following values of Gender:

Gender seems to be lexical feature of VERB. 100% lemmas (36) occur only with one value of Gender.

AUX

4 AUX tokens (2% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (4; 100%), Number=Sing (4; 100%), Person=EMPTY (4; 100%), Tense=Past (4; 100%), VerbForm=Part (4; 100%).

AUX tokens may have the following values of Gender:

NUM

3 NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (3; 100%), Number=Plur (2; 67%).

NUM tokens may have the following values of Gender:

PROPN

1 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

X

1 X tokens (4% of all X tokens) have a non-empty value of Gender.

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (548; 90%), NOUN –[amod]–> ADJ (42; 68%), NOUN –[nmod]–> NOUN (38; 54%), DET –[det]–> DET (15; 100%), ADJ –[det]–> DET (14; 88%), PRON –[det]–> DET (10; 53%), PRON –[reparandum]–> PRON (9; 100%), NOUN –[obl]–> NOUN (8; 57%), NOUN –[reparandum]–> NOUN (7; 100%), DET –[reparandum]–> DET (6; 55%).