Treebank Statistics: UD_Galician-TreeGal: Features: Gender
This feature is universal.
It occurs with 4 different values: Com
, Fem
, Masc
, Neut
.
12593 tokens (49%) have a non-empty value of Gender
.
3512 types (65%) occur at least once with a non-empty value of Gender
.
2665 lemmas (68%) occur at least once with a non-empty value of Gender
.
The feature is used with 7 part-of-speech tags: NOUN (4837; 19% instances), DET (4110; 16% instances), ADJ (1696; 7% instances), PRON (1359; 5% instances), NUM (261; 1% instances), VERB (237; 1% instances), PROPN (93; 0% instances).
NOUN
4837 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (3514; 73%).
NOUN
tokens may have the following values of Gender
:
Com
(31; 1% of non-emptyGender
): nacionalistas, socialistas, galeguista, guitarristas, prol, xornalistas, Babecas, Finalistas, alcalde, antípodaFem
(2334; 48% of non-emptyGender
): cidade, música, parte, obra, vida, empresa, proposta, arte, bases, obrasMasc
(2472; 51% of non-emptyGender
): anos, goberno, traballo, Estado, estatuto, tempo, dereito, mundo, país, proxectoEMPTY
(29): Celulosas, Discurso, Terra, Bahia, Caixa, Edicións, Escritores, Estado, Galego, Limones
Paradigm alcalde | Masc | Fem | Com |
---|---|---|---|
alcalde | alcaldesa | alcalde |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (1664) occur only with one value of Gender
.
DET
4110 DET tokens (100% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (3451; 84%), Number=Sing (3181; 77%), Definite=Def (2989; 73%).
DET
tokens may have the following values of Gender
:
Fem
(1934; 47% of non-emptyGender
): a, as, unha, súa, esta, la, nosa, súas, esa, outraMasc
(2176; 53% of non-emptyGender
): o, os, un, lo, seu, este, seus, todo, todos, eseEMPTY
(9): Los, la, El
Paradigm o | Masc | Fem |
---|---|---|
Definite=Def|Number=Sing | o, lo, os | a, la |
Definite=Def|Number=Plur | os, los | as, las |
Number=Sing | a | |
Number=Plur|Person=3 | os |
ADJ
1696 ADJ tokens (98% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=Sing (1260; 74%).
ADJ
tokens may have the following values of Gender
:
Com
(21; 1% of non-emptyGender
): obstante, embargante, best, coincidente, diferentes, Simple, Visitábel, bípedes, capaz, conscientesFem
(804; 47% of non-emptyGender
): galega, política, europea, nova, social, Franca, actual, cultural, laboral, mellorMasc
(871; 51% of non-emptyGender
): novo, galego, español, galegos, socialista, constitucional, difícil, gran, mellor, claroEMPTY
(26): xeral, Mellor, galega, Artístico, Barataria, Constituínte, Franca, Profesional, Reservada, Tradicional
Paradigm nacionalista | Masc | Fem | Com |
---|---|---|---|
Number=Sing | nacionalista | nacionalista | |
Number=Plur | nacionalistas | nacionalistas |
PRON
1359 PRON tokens (99% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Case=EMPTY (1104; 81%), Clitic=EMPTY (888; 65%), Number=Sing (774; 57%), Person=EMPTY (724; 53%).
PRON
tokens may have the following values of Gender
:
Com
(390; 29% of non-emptyGender
): se, nos, que, me, quen, lle, eu, nós, vostede, alguénFem
(291; 21% of non-emptyGender
): que, a, lle, as, unha, elas, ela, esta, na, ningunhaMasc
(645; 47% of non-emptyGender
): que, o, lle, lo, os, todo, un, algo, algúns, llesNeut
(33; 2% of non-emptyGender
): iso, istoEMPTY
(13): nada, algo, se, che
Paradigm que | Masc | Fem | Com |
---|---|---|---|
Number=Sing|PronType=Int | que | ||
Number=Sing|PronType=Rel | que | que | que |
Number=Plur|PronType=Rel | que | que | que |
PronType=Int | que | ||
PronType=Rel | que |
NUM
261 NUM tokens (98% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (223; 85%), Number=Sing (148; 57%).
NUM
tokens may have the following values of Gender
:
Com
(2; 1% of non-emptyGender
): 13.000, cenFem
(59; 23% of non-emptyGender
): primeira, dúas, segunda, 21, catro, tres, unha, cinco, terceira, 12Masc
(200; 77% of non-emptyGender
): dous, un, primeiro, catro, dez, 1990, 25, cinco, quince, tresEMPTY
(6): 36, 687 614 874, II, Tres, XII, dos
Paradigm primeiro | Masc | Fem |
---|---|---|
Number=Sing | primeiro | primeira |
Number=Plur | primeiros | primeiras |
VERB
237 VERB tokens (10% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (237; 100%), Person=EMPTY (237; 100%), Tense=EMPTY (237; 100%), VerbForm=Part (237; 100%), Number=Sing (172; 73%).
VERB
tokens may have the following values of Gender
:
Fem
(99; 42% of non-emptyGender
): recibida, dirixida, destinadas, feita, feitas, prometidas, realizada, vencellada, Configurada, NacidaMasc
(138; 58% of non-emptyGender
): debido, baseado, elaborado, elixido, afectado, apresentado, atendidos, chamado, considerado, dirixidoEMPTY
(2142): hai, ten, facer, ter, teñen, ver, fai, falar, dar, fixo
Paradigm facer | Masc | Fem |
---|---|---|
Number=Sing | feito | feita |
Number=Plur | feitas |
PROPN
93 PROPN tokens (8% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(20; 22% of non-emptyGender
): UE, CIG, ONU, CIG-ensino, EMALCSA, ETEA, OTAN, SA, SEPI, TVGMasc
(73; 78% of non-emptyGender
): BNG, PP, PSOE, PSdeG, PSdeG-PSOE, PSC, SXG, FIDAC, IBBY, INEMEMPTY
(1098): Galiza, Ferrol, Prestige, Vigo, Manuel, Touriño, Xosé, Beiras, Galicia, Miguel
Gender
seems to be lexical feature of PROPN
. 100% lemmas (25) occur only with one value of Gender
.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (3648; 99%),
NOUN –[amod]–> ADJ (1300; 98%),
NOUN –[conj]–> NOUN (229; 57%),
NOUN –[nummod]–> NUM (151; 96%),
ADJ –[conj]–> ADJ (114; 97%),
PRON –[det]–> DET (98; 99%),
ADJ –[det]–> DET (64; 100%),
ADJ –[nsubj]–> NOUN (43; 93%),
NUM –[det]–> DET (38; 95%),
NOUN –[nmod]–> PRON (35; 61%).