Gender: gender
This document is a placeholder for the language-specific documentation
for Gender.
Treebank Statistics (UD_Galician-TreeGal)
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
12208 tokens (50%) have a non-empty value of Gender.
3498 types (65%) occur at least once with a non-empty value of Gender.
2851 lemmas (73%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: gl-pos/NOUN (4308; 18% instances), gl-pos/DET (3978; 16% instances), gl-pos/ADJ (1564; 6% instances), gl-pos/PRON (1280; 5% instances), gl-pos/PROPN (591; 2% instances), gl-pos/NUM (248; 1% instances), gl-pos/VERB (237; 1% instances), gl-pos/SCONJ (2; 0% instances).
NOUN
4308 gl-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3072; 71%).
NOUN tokens may have the following values of Gender:
Com(27; 1% of non-emptyGender): nacionalistas, socialistas, galeguista, guitarristas, xornalistas, antípoda, babecas, emigrante, finalistas, inmigrantesFem(2094; 49% of non-emptyGender): cidade, música, obra, parte, proposta, vida, arte, empresa, obras, xenteMasc(2187; 51% of non-emptyGender): anos, traballo, goberno, mundo, dereito, proxecto, país, tempo, medios, grupo
| Paradigm galego | Masc | Fem |
|---|---|---|
| Number=Sing | galego | |
| Number=Plur | galegos | galegas |
Gender seems to be lexical feature of NOUN. 98% lemmas (1558) occur only with one value of Gender.
DET
3978 gl-pos/DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3337; 84%), Number=Sing (3066; 77%), Definite=Def (2885; 73%).
DET tokens may have the following values of Gender:
Fem(1838; 46% of non-emptyGender): a, as, unha, súa, esta, la, súas, esa, outra, nosaMasc(2140; 54% of non-emptyGender): o, os, un, seu, lo, este, seus, todos, ese, todo
| Paradigm o | Masc | Fem |
|---|---|---|
| Number=Sing | o, lo, os | a, la |
| Number=Plur | os, los | as, las |
ADJ
1564 gl-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1149; 73%).
ADJ tokens may have the following values of Gender:
Com(15; 1% of non-emptyGender): coincidente, diferentes, bípedes, capaz, conscientes, fráxil, fácil, grave, nacionalistas, pedestreFem(755; 48% of non-emptyGender): galega, política, nova, social, actual, cultural, laboral, mellor, pública, autonómicasMasc(794; 51% of non-emptyGender): novo, galego, español, galegos, difícil, gran, mellor, socialista, distintos, próximo
| Paradigm nacionalista | Masc | Fem | Com |
|---|---|---|---|
| Number=Sing | nacionalista | nacionalista | |
| Number=Plur | nacionalistas | nacionalistas |
PRON
1280 gl-pos/PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (1026; 80%), Person=EMPTY (749; 59%), Number=Sing (713; 56%).
PRON tokens may have the following values of Gender:
Com(388; 30% of non-emptyGender): se, nos, que, me, quen, lle, eu, nós, vostede, alguénFem(282; 22% of non-emptyGender): que, a, lle, unha, elas, as, ela, esta, na, ningunhaMasc(577; 45% of non-emptyGender): que, o, lle, lo, todo, un, algo, algúns, lles, outrosNeut(33; 3% of non-emptyGender): iso, istoEMPTY(11): nada, algo
| Paradigm que | Masc | Fem | Com |
|---|---|---|---|
| Number=Sing|PronType=Int | que | ||
| Number=Sing|PronType=Rel | que | que | que |
| Number=Dual|PronType=Int | que | ||
| Number=Dual|PronType=Rel | que | ||
| Number=Plur|PronType=Rel | que | que | que |
PROPN
591 gl-pos/PROPN tokens (51% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (357; 60%).
PROPN tokens may have the following values of Gender:
Fem(160; 27% of non-emptyGender): Mercedes_Núñez, Unión_Europea, Zona_Franca, UE, Xunta, Casa_da_Auga, Constitución, Bases, Comunidade_Autónoma, GalizaMasc(431; 73% of non-emptyGender): BNG, Estado, Anxo_Quintana, Estatuto, Miguel_Barros, Prestige, PP, Carlos_Príncipe, Quixote, PSOEEMPTY(557): Galiza, Ferrol, Vigo, A_Galiza, A_Nosa_Terra, Pontevedra, Beiras, Abades, Amenábar, ENCE
Gender seems to be lexical feature of PROPN. 100% lemmas (349) occur only with one value of Gender.
NUM
248 gl-pos/NUM tokens (100% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (212; 85%), Number=Sing (141; 57%).
NUM tokens may have the following values of Gender:
Com(2; 1% of non-emptyGender): 13.000, cenFem(53; 21% of non-emptyGender): primeira, dúas, segunda, catro, tres, cinco, terceira, 12, 15.000, 17Masc(193; 78% of non-emptyGender): dous, un, primeiro, catro, dez, 1990, 25, quince, tres, 1917
| Paradigm primeiro | Masc | Fem |
|---|---|---|
| Number=Sing | primeiro | primeira |
| Number=Plur | primeiros | primeiras |
VERB
237 gl-pos/VERB tokens (8% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (237; 100%), Mood=EMPTY (237; 100%), Tense=EMPTY (237; 100%), VerbForm=Part (235; 99%), Number=Sing (172; 73%).
VERB tokens may have the following values of Gender:
Fem(99; 42% of non-emptyGender): recibida, dirixida, destinadas, feita, feitas, prometidas, realizada, vencellada, aceptada, admitidaMasc(138; 58% of non-emptyGender): debido, baseado, elaborado, elixido, afectado, apresentado, atendidos, chamado, considerado, dirixidoEMPTY(2730): é, hai, ten, está, foi, son, ser, pode, facer, teñen
| Paradigm facer | Masc | Fem |
|---|---|---|
| Number=Sing | feito | feita |
| Number=Plur | feitas |
SCONJ
2 gl-pos/SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.
SCONJ tokens may have the following values of Gender:
Com(2; 100% of non-emptyGender): queEMPTY(561): que, pero, como, se, porque, aínda_que, mais, senón, pois, coma
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3322; 100%),
NOUN –[amod]–> ADJ (1172; 99%),
PROPN –[det]–> DET (359; 99%),
NOUN –[conj]–> NOUN (205; 56%),
PRON –[det]–> DET (164; 99%),
NOUN –[nummod]–> NUM (141; 100%),
ADJ –[conj]–> ADJ (113; 97%),
ADJ –[det]–> DET (60; 100%),
ADJ –[nsubj]–> NOUN (44; 94%),
PROPN –[conj]–> PROPN (37; 63%).
Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]