Gender
: gender
This document is a placeholder for the language-specific documentation
for Gender
.
Treebank Statistics (UD_Spanish)
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
158441 tokens (37%) have a non-empty value of Gender
.
20664 types (45%) occur at least once with a non-empty value of Gender
.
14851 lemmas (41%) occur at least once with a non-empty value of Gender
.
The feature is used with 10 part-of-speech tags: es-pos/NOUN (70392; 16% instances), es-pos/DET (56064; 13% instances), es-pos/ADJ (15362; 4% instances), es-pos/VERB (7518; 2% instances), es-pos/PRON (4451; 1% instances), es-pos/PROPN (3490; 1% instances), es-pos/X (542; 0% instances), es-pos/AUX (292; 0% instances), es-pos/NUM (208; 0% instances), es-pos/SYM (122; 0% instances).
NOUN
70392 es-pos/NOUN tokens (91% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (50573; 72%).
NOUN
tokens may have the following values of Gender
:
Fem
(32900; 47% of non-emptyGender
): parte, población, ciudad, personas, familia, vez, forma, vida, agua, regiónMasc
(37492; 53% of non-emptyGender
): años, año, municipio, nombre, lugar, equipo, tiempo, estado, grupo, paísEMPTY
(7138): habitantes, km, Estado, base, euros, frente, Gobierno, Oficina, mar, arte
Paradigm parte | Masc | Fem |
---|---|---|
Number=Sing | parte | parte |
Number=Plur | partes |
Gender
seems to be lexical feature of NOUN
. 97% lemmas (8872) occur only with one value of Gender
.
DET
56064 es-pos/DET tokens (92% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (51173; 91%), Number=Sing (44644; 80%), Definite=Def (43522; 78%).
DET
tokens may have the following values of Gender
:
Fem
(23935; 43% of non-emptyGender
): la, las, una, esta, otras, toda, estas, esa, todas, otraMasc
(32129; 57% of non-emptyGender
): el, los, un, este, otros, ese, estos, todo, todos, unosEMPTY
(4808): su, sus, cada, cualquier, mi, the, tu, qué, mis, a
Paradigm el | Masc | Fem |
---|---|---|
Number=Sing | el | la |
Number=Plur | los | las |
ADJ
15362 es-pos/ADJ tokens (62% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (10933; 71%).
ADJ
tokens may have the following values of Gender
:
Fem
(6682; 43% of non-emptyGender
): primera, nueva, segunda, buena, francesa, misma, alta, pequeña, propia, nuevasMasc
(8680; 57% of non-emptyGender
): primer, mismo, nuevo, junto, segundo, español, buen, propio, primeros, únicoEMPTY
(9533): gran, mayor, estadounidense, mejor, total, nacional, grandes, principal, importante, diferentes
Paradigm primero | Masc | Fem |
---|---|---|
Number=Sing | primer, primero | primera |
Number=Sing|NumType=Ord | primer, primero | primera |
Number=Plur | primeros | primeras |
Number=Plur|NumType=Ord | primeros | primeras |
VERB
7518 es-pos/VERB tokens (18% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (7518; 100%), Person=EMPTY (7517; 100%), VerbForm=Part (6817; 91%), Number=Sing (6067; 81%), Tense=EMPTY (4395; 58%).
VERB
tokens may have the following values of Gender
:
Fem
(2302; 31% of non-emptyGender
): situada, conocida, ubicada, esta, llamada, dirigida, fundada, publicada, realizada, construidaMasc
(5216; 69% of non-emptyGender
): ubicado, conocido, sido, debido, llamado, hecho, nacido, dado, compuesto, lanzadoEMPTY
(33473): es, fue, son, eran, era, tiene, encuentra, ser, está, hacer
Paradigm hacer | Masc | Fem |
---|---|---|
Tense=Past|VerbForm=Part | hecho | hecha |
VerbForm=Fin | hacer | haga, hacía |
VerbForm=Part | hecho | hecha |
PRON
4451 es-pos/PRON tokens (32% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Reflex=EMPTY (4446; 100%), Number=Sing (3344; 75%), PronType=Prs (2879; 65%), Person=3 (2816; 63%), PrepCase=EMPTY (2271; 51%).
PRON
tokens may have the following values of Gender
:
Fem
(1173; 26% of non-emptyGender
): la, una, ella, las, ellas, esta, otra, otras, ésta, muchasMasc
(3278; 74% of non-emptyGender
): lo, uno, los, él, todo, ellos, ello, este, otros, otroEMPTY
(9573): se, que, le, me, cual, nos, quien, esto, les, te
Paradigm él | Masc | Fem |
---|---|---|
Case=Acc,Nom|Number=Sing | él, ello | ella |
Case=Acc,Nom|Number=Plur | ellos | ellas |
Case=Acc|Number=Sing|PrepCase=Npr | lo | la |
Case=Acc|Number=Plur|PrepCase=Npr | los | las |
PROPN
3490 es-pos/PROPN tokens (9% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Number=Sing (3002; 86%).
PROPN
tokens may have the following values of Gender
:
Fem
(997; 29% of non-emptyGender
): guerra, Segunda, Primera, Europea, Ruta, Isla, española, TV, Aérea, batallaMasc
(2493; 71% of non-emptyGender
): Unidos, Estados, Partido, censo, José, of, Club, Diego, País, ríoEMPTY
(36006): san, España, Estados, Unidos, madrid, Juan, septiembre, julio, enero, José
Paradigm the | Masc | Fem |
---|---|---|
the | the |
Gender
seems to be lexical feature of PROPN
. 99% lemmas (2037) occur only with one value of Gender
.
X
542 es-pos/X tokens (27% of all X
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which X
and Gender
co-occurred: Number=Sing (433; 80%).
X
tokens may have the following values of Gender
:
Fem
(115; 21% of non-emptyGender
): ’s, C, b, cápita, i, pre, semi, ta, C., highMasc
(427; 79% of non-emptyGender
): mm, msnm, etc., ‘s, n., of, the, al, co, cisEMPTY
(1443): ex, hab, ya, ‘s, C, etc., ², x, C., d
Paradigm 's | Masc | Fem |
---|---|---|
_ | 's | 's |
Number=Sing | 's | 's |
Number=Sing|Person=3 | 's |
Gender
seems to be lexical feature of X
. 96% lemmas (383) occur only with one value of Gender
.
AUX
292 es-pos/AUX tokens (5% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Mood=EMPTY (292; 100%), Person=EMPTY (290; 99%), Number=Sing (272; 93%), VerbForm=Part (225; 77%), Tense=Past (222; 76%).
AUX
tokens may have the following values of Gender
:
Fem
(33; 11% of non-emptyGender
): esta, estas, pudieras, Acabo, acabas, comienza, continua, estarías, estoy, fuerosMasc
(259; 89% of non-emptyGender
): sido, estado, podido, ido, tenido, ser, vuelto, Acabo, poder, venidoEMPTY
(5751): ha, fue, han, puede, había, está, es, fueron, ser, pueden
Paradigm haber | Masc | Fem |
---|---|---|
Number=Sing | han, haber | |
Number=Plur|Person=3 | has |
NUM
208 es-pos/NUM tokens (2% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (208; 100%), Number=Sing (175; 84%).
NUM
tokens may have the following values of Gender
:
Fem
(75; 36% of non-emptyGender
): una, media, II, pocas, I, IV, XI, ocho, setenta, 2008-09Masc
(133; 64% of non-emptyGender
): un, uno, ciento, II, medio, cero, millones, V, VIII, XXEMPTY
(10809): dos, 2010, tres, 0, 3, cuatro, 1, 2, 10, 4
Paradigm uno | Masc | Fem |
---|---|---|
un, uno | una |
SYM
122 es-pos/SYM tokens (7% of all SYM
tokens) have a non-empty value of Gender
.
SYM
tokens may have the following values of Gender
:
Fem
(36; 30% of non-emptyGender
): h, $, &, m, €, +, http://redsismica.uprm.edu/spanish/informacion/terr1918.php, http://www.rumbo.es/disney/Masc
(86; 70% of non-emptyGender
): km, cm, $, &, m, #, º, mundo.com, www.delnuevo, www.dgt.esEMPTY
(1530): %, ², km, º, $, °, a, €, ª, /
Paradigm $ | Masc | Fem |
---|---|---|
Number=Sing | $ | $ |
Number=Sing|VerbForm=Part | $ | |
Number=Plur|VerbForm=Part | $ |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (42452; 84%),
NOUN –[amod]–> ADJ (10281; 56%),
NOUN –[conj]–> NOUN (2934; 54%),
NOUN –[acl]–> VERB (1856; 82%),
NOUN –[nummod]–> ADJ (793; 94%),
VERB –[nsubjpass]–> NOUN (697; 89%),
ADJ –[nsubj]–> NOUN (548; 57%),
PRON –[nmod]–> NOUN (500; 68%),
ADJ –[conj]–> ADJ (447; 54%),
NOUN –[det]–> PRON (188; 71%).
Treebank Statistics (UD_Spanish-AnCora)
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
200949 tokens (36%) have a non-empty value of Gender
.
17301 types (44%) occur at least once with a non-empty value of Gender
.
11645 lemmas (44%) occur at least once with a non-empty value of Gender
.
The feature is used with 9 part-of-speech tags: es-pos/NOUN (87899; 16% instances), es-pos/DET (78722; 14% instances), es-pos/ADJ (24251; 4% instances), es-pos/VERB (4616; 1% instances), es-pos/PRON (4410; 1% instances), es-pos/AUX (627; 0% instances), es-pos/NUM (291; 0% instances), es-pos/ADP (76; 0% instances), es-pos/ADV (57; 0% instances).
NOUN
87899 es-pos/NOUN tokens (87% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (61643; 70%).
NOUN
tokens may have the following values of Gender
:
Fem
(41003; 47% of non-emptyGender
): personas, parte, vida, situación, vez, forma, elecciones, empresa, horas, decisiónMasc
(46896; 53% of non-emptyGender
): años, presidente, millones, equipo, partido, país, año, ministro, mundo, grupoEMPTY
(12755): pesetas, dólares, frente, parte, portavoz, líder, respecto, vez, pese, policía
Paradigm candidato | Masc | Fem |
---|---|---|
Number=Sing | candidato | |
Number=Plur | candidatos | CANDIDATAS |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (7741) occur only with one value of Gender
.
DET
78722 es-pos/DET tokens (92% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (71978; 91%), Definite=Def (62611; 80%), Number=Sing (62048; 79%).
DET
tokens may have the following values of Gender
:
Fem
(32339; 41% of non-emptyGender
): la, las, una, esta, esa, todas, estas, otras, toda, otraMasc
(46383; 59% of non-emptyGender
): el, los, un, este, todo, ese, todos, otros, estos, unosEMPTY
(6811): su, sus, lo, cada, mi, cualquier, qué, tal, mis, diferentes
Paradigm el | Masc | Fem |
---|---|---|
Number=Sing | el | la |
Number=Plur | los | las |
EL |
ADJ
24251 es-pos/ADJ tokens (67% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: VerbForm=EMPTY (17741; 73%), Number=Sing (17384; 72%).
ADJ
tokens may have the following values of Gender
:
Fem
(10132; 42% of non-emptyGender
): primera, nueva, segunda, política, española, última, nuevas, única, buena, públicaMasc
(14119; 58% of non-emptyGender
): pasado, primer, nuevo, próximo, últimos, español, segundo, último, único, políticoEMPTY
(12193): gran, mayor, mejor, general, posible, ex, grandes, actual, electoral, internacional
Paradigm primero | Masc | Fem |
---|---|---|
Number=Sing | primer, primero | primera |
Number=Plur | primeros | primeras |
VERB
4616 es-pos/VERB tokens (10% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (4615; 100%), Tense=Past (4615; 100%), Person=EMPTY (4615; 100%), VerbForm=Part (4615; 100%), Number=Sing (4311; 93%).
VERB
tokens may have the following values of Gender
:
Fem
(325; 7% of non-emptyGender
): aprobada, considerada, dada, utilizada, dadas, incluida, rechazada, recibida, violada, aprobadasMasc
(4291; 93% of non-emptyGender
): hecho, tenido, dado, visto, conseguido, ganado, pasado, perdido, logrado, puestoEMPTY
(41449): tiene, dijo, está, hacer, tienen, aseguró, están, explicó, dar, afirmó
Paradigm hacer | Masc | Fem |
---|---|---|
Number=Sing | hecho | hecha |
Number=Plur | hechos |
PRON
4410 es-pos/PRON tokens (18% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Case=EMPTY (3182; 72%), Number=Sing (2924; 66%), Person=EMPTY (2401; 54%).
PRON
tokens may have the following values of Gender
:
Fem
(1157; 26% of non-emptyGender
): la, una, ella, las, ellas, otra, cuya, unas, ésta, otrasMasc
(3253; 74% of non-emptyGender
): lo, uno, todo, él, ellos, unos, los, otros, todos, nosotrosEMPTY
(20184): que, se, le, me, donde, nos, quien, les, eso, nada
Paradigm él | Masc | Fem |
---|---|---|
Case=Acc|Number=Sing | lo, le, Les | la |
Case=Acc|Number=Plur | los, les | las |
Number=Sing | él | ella |
Number=Plur | ellos | ellas, les |
AUX
627 es-pos/AUX tokens (4% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: VerbForm=Part (627; 100%), Person=EMPTY (627; 100%), Mood=EMPTY (627; 100%), Tense=Past (627; 100%), Number=Sing (610; 97%).
AUX
tokens may have the following values of Gender
:
Fem
(9; 1% of non-emptyGender
): trasladadas, comprada, demolidas, diseñada, dispuestas, investigadas, promocionadas, registradaMasc
(618; 99% of non-emptyGender
): sido, podido, habido, ido, hecho, llegado, empezado, estado, dejado, vueltoEMPTY
(15096): es, ha, han, fue, ser, son, había, hay, puede, era
Paradigm investigar | Masc | Fem |
---|---|---|
Number=Sing | investigado | |
Number=Plur | investigadas |
Gender
seems to be lexical feature of AUX
. 98% lemmas (61) occur only with one value of Gender
.
NUM
291 es-pos/NUM tokens (3% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (290; 100%), NumForm=EMPTY (290; 100%), Number=Plur (176; 60%).
NUM
tokens may have the following values of Gender
:
Fem
(94; 32% of non-emptyGender
): ambas, media, una, DECENAS, décima, quinientasMasc
(197; 68% of non-emptyGender
): ambos, medio, un, doscientos, uno, miles, quinientos, dois, euros, ochentaEMPTY
(8476): dos, ciento, tres, cinco, cuatro, seis, 20, siete, 30, diez
Paradigm ambos | Masc | Fem |
---|---|---|
ambos | ambas |
ADP
76 es-pos/ADP tokens (0% of all ADP
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADP
and Gender
co-occurred: AdpType=Preppron (76; 100%).
ADP
tokens may have the following values of Gender
:
Fem
(14; 18% of non-emptyGender
): daMasc
(62; 82% of non-emptyGender
): Al, Del, do, dels, DEL, als, de, peloEMPTY
(87973): de, en, a, por, con, para, entre, sobre, sin, desde
Gender
seems to be lexical feature of ADP
. 100% lemmas (12) occur only with one value of Gender
.
ADV
57 es-pos/ADV tokens (0% of all ADV
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADV
and Gender
co-occurred: Negative=EMPTY (57; 100%).
ADV
tokens may have the following values of Gender
:
Masc
(57; 100% of non-emptyGender
): además, debajo, acerca, detrás, encima, después, dentro, lejos, alrededor, delanteEMPTY
(17835): no, más, también, ya, hoy, ayer, muy, sólo, después, ahora
Gender
seems to be lexical feature of ADV
. 100% lemmas (14) occur only with one value of Gender
.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (56734; 85%),
NOUN –[amod]–> ADJ (16852; 63%),
NOUN –[conj]–> NOUN (2460; 54%),
DET –[det]–> DET (1076; 82%),
ADJ –[det]–> DET (684; 55%),
ADJ –[nsubj]–> NOUN (647; 57%),
ADJ –[conj]–> ADJ (567; 56%),
PRON –[nmod]–> NOUN (442; 72%),
NOUN –[nmod]–> PRON (310; 54%),
PRON –[amod]–> ADJ (104; 51%).
Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]