Treebank Statistics: UD_Galician-TreeGal: Features: Gender
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
12590 tokens (49%) have a non-empty value of Gender.
3509 types (65%) occur at least once with a non-empty value of Gender.
2662 lemmas (68%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags: NOUN (4835; 19% of instances), DET (4110; 16% of instances), ADJ (1695; 7% of instances), PRON (1359; 5% of instances), NUM (261; 1% of instances), VERB (237; 1% of instances), PROPN (93; 0% of instances).
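Counts of this kind can be derived from a CoNLL-U file by reading the UPOS and FEATS columns. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout; the sample sentence and the helper names `parse_feats` and `gender_by_upos` are illustrative and are not part of the treebank tooling.

```python
from collections import Counter

# Tiny hand-written CoNLL-U sample (illustrative, not taken from the treebank).
SAMPLE = """\
1\tA\to\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tcidade\tcidade\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tten\tter\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
4\tun\tun\tDET\t_\tDefinite=Ind|Gender=Masc|Number=Sing|PronType=Art\t5\tdet\t_\t_
5\tgoberno\tgoberno\tNOUN\t_\tGender=Masc|Number=Sing\t3\tobj\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Count, per UPOS tag, tokens with a Gender value and all tokens."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


with_gender, total = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

On a real treebank file, the same function would be applied to the file contents read as UTF-8 text.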
NOUN
4835 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3513; 73%).
NOUN tokens may have the following values of Gender:
Com (30; 1% of non-empty Gender): nacionalistas, socialistas, galeguista, guitarristas, prol, xornalistas, Babecas, Finalistas, alcalde, antípoda
Fem (2333; 48% of non-empty Gender): cidade, música, parte, obra, vida, empresa, proposta, arte, bases, obras
Masc (2472; 51% of non-empty Gender): anos, goberno, traballo, Estado, estatuto, tempo, dereito, mundo, país, proxecto
EMPTY (29): Celulosas, Discurso, Terra, Bahia, Caixa, Edicións, Escritores, Estado, Galego, Limones
| Paradigm alcalde | Masc | Fem | Com |
|---|---|---|---|
| | alcalde | alcaldesa | alcalde |
Gender seems to be a lexical feature of NOUN. 98% of lemmas (1662) occur with only one value of Gender.
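The "lexical feature" observation rests on how many lemmas are seen with exactly one Gender value. A minimal sketch of that check follows; the observation list and the function name `single_value_share` are hypothetical, with the `alcalde` entries loosely mirroring the paradigm shown above.

```python
from collections import defaultdict

# (lemma, Gender) pairs; illustrative data, not extracted from the treebank.
OBSERVATIONS = [
    ("cidade", "Fem"), ("cidade", "Fem"),
    ("goberno", "Masc"),
    ("alcalde", "Masc"), ("alcalde", "Fem"), ("alcalde", "Com"),
    ("ano", "Masc"),
]


def single_value_share(observations):
    """Return (lemmas seen with exactly one Gender value, all lemmas)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, lemmas = single_value_share(OBSERVATIONS)
print(f"{single} of {lemmas} lemmas occur with only one value of Gender")
```

A high share suggests that the value is a property of the lemma, while lemmas such as `alcalde` in the sample show the feature varying within a paradigm.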
DET
4110 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3451; 84%), Number=Sing (3181; 77%), Definite=Def (2989; 73%).
DET tokens may have the following values of Gender:
Fem (1934; 47% of non-empty Gender): a, as, unha, súa, esta, la, nosa, súas, esa, outra
Masc (2176; 53% of non-empty Gender): o, os, un, lo, seu, este, seus, todo, todos, ese
EMPTY (9): Los, la, El
| Paradigm o | Masc | Fem |
|---|---|---|
| Definite=Def\|Number=Sing | o, lo, os | a, la |
| Definite=Def\|Number=Plur | os, los | as, las |
| Number=Sing | a | |
| Number=Plur\|Person=3 | os | |
ADJ
1695 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1260; 74%).
ADJ tokens may have the following values of Gender:
Com (20; 1% of non-empty Gender): obstante, embargante, coincidente, diferentes, Simple, Visitábel, bípedes, capaz, conscientes, fráxil
Fem (804; 47% of non-empty Gender): galega, política, europea, nova, social, Franca, actual, cultural, laboral, mellor
Masc (871; 51% of non-empty Gender): novo, galego, español, galegos, socialista, constitucional, difícil, gran, mellor, claro
EMPTY (26): xeral, Mellor, galega, Artístico, Barataria, Constituínte, Franca, Profesional, Reservada, Tradicional
| Paradigm nacionalista | Masc | Fem | Com |
|---|---|---|---|
| Number=Sing | nacionalista | nacionalista | |
| Number=Plur | nacionalistas | nacionalistas | |
PRON
1359 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (1104; 81%), Clitic=EMPTY (888; 65%), Number=Sing (774; 57%), Person=EMPTY (724; 53%).
PRON tokens may have the following values of Gender:
Com (390; 29% of non-empty Gender): se, nos, que, me, quen, lle, eu, nós, vostede, alguén
Fem (291; 21% of non-empty Gender): que, a, lle, as, unha, elas, ela, esta, na, ningunha
Masc (645; 47% of non-empty Gender): que, o, lle, lo, os, todo, un, algo, algúns, lles
Neut (33; 2% of non-empty Gender): iso, isto
EMPTY (13): nada, algo, se, che
| Paradigm que | Masc | Fem | Com |
|---|---|---|---|
| Number=Sing\|PronType=Int | que | | |
| Number=Sing\|PronType=Rel | que | que | que |
| Number=Plur\|PronType=Rel | que | que | que |
| PronType=Int | que | | |
| PronType=Rel | que | | |
NUM
261 NUM tokens (98% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (223; 85%), Number=Sing (148; 57%).
NUM tokens may have the following values of Gender:
Com (2; 1% of non-empty Gender): 13.000, cen
Fem (59; 23% of non-empty Gender): primeira, dúas, segunda, 21, catro, tres, unha, cinco, terceira, 12
Masc (200; 77% of non-empty Gender): dous, un, primeiro, catro, dez, 1990, 25, cinco, quince, tres
EMPTY (6): 36, 687 614 874, II, Tres, XII, dos
| Paradigm primeiro | Masc | Fem |
|---|---|---|
| Number=Sing | primeiro | primeira |
| Number=Plur | primeiros | primeiras |
VERB
237 VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (237; 100%), Person=EMPTY (237; 100%), Tense=EMPTY (237; 100%), VerbForm=Part (237; 100%), Number=Sing (172; 73%).
VERB tokens may have the following values of Gender:
Fem (99; 42% of non-empty Gender): recibida, dirixida, destinadas, feita, feitas, prometidas, realizada, vencellada, Configurada, Nacida
Masc (138; 58% of non-empty Gender): debido, baseado, elaborado, elixido, afectado, apresentado, atendidos, chamado, considerado, dirixido
EMPTY (2142): hai, ten, facer, ter, teñen, ver, fai, falar, dar, fixo
| Paradigm facer | Masc | Fem |
|---|---|---|
| Number=Sing | feito | feita |
| Number=Plur | feitas | |
PROPN
93 PROPN tokens (8% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (20; 22% of non-empty Gender): UE, CIG, ONU, CIG-ensino, EMALCSA, ETEA, OTAN, SA, SEPI, TVG
Masc (73; 78% of non-empty Gender): BNG, PP, PSOE, PSdeG, PSdeG-PSOE, PSC, SXG, FIDAC, IBBY, INEM
EMPTY (1098): Galiza, Ferrol, Prestige, Vigo, Manuel, Touriño, Xosé, Beiras, Galicia, Miguel
Gender seems to be a lexical feature of PROPN. 100% of lemmas (25) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3648; 99%),
NOUN –[amod]–> ADJ (1300; 98%),
NOUN –[conj]–> NOUN (230; 57%),
NOUN –[nummod]–> NUM (151; 96%),
ADJ –[conj]–> ADJ (114; 97%),
PRON –[det]–> DET (98; 99%),
ADJ –[det]–> DET (64; 100%),
ADJ –[nsubj]–> NOUN (43; 93%),
NUM –[det]–> DET (38; 95%),
NOUN –[nmod]–> PRON (35; 61%).
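
Agreement figures like those listed above can be approximated by comparing the Gender value of each node with that of its parent. The sketch below assumes that the percentage is the number of agreeing edges divided by the number of edges where both nodes carry a Gender value; that reading is an assumption about how the page is computed, and the token tuples and the function name `agreement_counts` are illustrative.

```python
from collections import Counter

# (id, upos, gender or None, head id, deprel); an illustrative sentence only.
TOKENS = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 0, "root"),
    (3, "ADJ", "Fem", 2, "amod"),
    (4, "CCONJ", None, 5, "cc"),
    (5, "NOUN", "Masc", 2, "conj"),
]


def agreement_counts(tokens):
    """Count agreeing and comparable edges per (parent UPOS, deprel, child UPOS)."""
    by_id = {tok[0]: tok for tok in tokens}
    agree, comparable = Counter(), Counter()
    for _id, upos, gender, head, deprel in tokens:
        parent = by_id.get(head)  # None for the root, whose head is 0
        if parent is None or gender is None or parent[2] is None:
            continue  # only edges where both nodes have a Gender value
        key = (parent[1], deprel, upos)
        comparable[key] += 1
        if parent[2] == gender:
            agree[key] += 1
    return agree, comparable


agree, comparable = agreement_counts(TOKENS)
for key in sorted(comparable):
    print(key, agree[key], comparable[key])
```

In the sample, the coordinated nouns differ in Gender, which illustrates why a relation such as NOUN conj NOUN would be expected to show a lower agreement rate than det or amod.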