Treebank Statistics: UD_Galician-TreeGal: Features: Gender
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
13078 tokens (51%) have a non-empty value of Gender.
3791 types (70%) occur at least once with a non-empty value of Gender.
2996 lemmas (77%) occur at least once with a non-empty value of Gender.
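The token, type, and lemma coverage figures above could be reproduced along the following lines. This is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `read_tokens`, `gender_coverage`) and the `SAMPLE` sentence are hypothetical, and it assumes that a "type" is a case-sensitive word form and that multiword-token ranges and empty nodes are not counted.

```python
from collections import defaultdict


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_coverage(conllu_text):
    """Return (with_gender, total) pairs for tokens, types, and lemmas."""
    tokens = with_gender = 0
    types, lemmas = defaultdict(bool), defaultdict(bool)
    for form, lemma, _upos, feats in read_tokens(conllu_text):
        has_gender = "Gender" in feats
        tokens += 1
        with_gender += has_gender
        # A type or lemma counts once it has been seen with Gender at least once.
        types[form] |= has_gender
        lemmas[lemma] |= has_gender
    return {
        "tokens": (with_gender, tokens),
        "types": (sum(types.values()), len(types)),
        "lemmas": (sum(lemmas.values()), len(lemmas)),
    }


# Hypothetical four-token sample, invented for illustration only.
SAMPLE = "\n".join(
    "\t".join(cols)
    for cols in [
        ["1", "a", "o", "DET", "_", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art", "2", "det", "_", "_"],
        ["2", "cidade", "cidade", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"],
        ["3", "nova", "novo", "ADJ", "_", "Gender=Fem|Number=Sing", "2", "amod", "_", "_"],
        ["4", "hai", "haber", "VERB", "_", "_", "2", "dep", "_", "_"],
    ]
)

print(gender_coverage(SAMPLE))
```

Dividing the first number of each pair by the second gives the percentage reported on this page for the corresponding unit.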
The feature is used with 7 part-of-speech tags: NOUN (4495; 18% instances), DET (4109; 16% instances), ADJ (1692; 7% instances), PRON (1359; 5% instances), PROPN (932; 4% instances), NUM (254; 1% instances), VERB (237; 1% instances).
NOUN
4495 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3222; 72%).
NOUN tokens may have the following values of Gender:
Com (31; 1% of non-empty Gender): nacionalistas, socialistas, galeguista, guitarristas, prol, xornalistas, Babecas, Finalistas, alcalde, antípoda
Fem (2178; 48% of non-empty Gender): cidade, música, parte, obra, vida, proposta, arte, empresa, obras, lingua
Masc (2286; 51% of non-empty Gender): anos, traballo, goberno, mundo, tempo, dereito, proxecto, país, medios, grupo
EMPTY (8): Escritores, Galego, Terra, aguas, foot, football, grue, tradutore
| Paradigm alcalde | Masc | Fem | Com |
|---|---|---|---|
| | alcalde | alcaldesa | alcalde |
Gender seems to be a lexical feature of NOUN. 98% of lemmas (1603) occur with only one value of Gender.
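The "lexical feature" judgement rests on how many lemmas are seen with exactly one Gender value. A minimal sketch of that count follows; the function name `single_value_share` and the sample pairs are hypothetical, and the exact criterion used to generate this page is not stated here, so treat this as one plausible reading.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Count lemmas that occur with exactly one non-empty Gender value.

    `pairs` is an iterable of (lemma, gender) tuples, where gender is
    None for tokens without the feature. Returns (single, total), with
    total being the number of lemmas seen with any Gender value.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical observations, loosely modelled on the examples on this page.
PAIRS = [
    ("cidade", "Fem"),
    ("cidade", "Fem"),
    ("traballo", "Masc"),
    ("alcalde", "Masc"),
    ("alcalde", "Fem"),
    ("alcalde", "Com"),
    ("foot", None),
]

single, total = single_value_share(PAIRS)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A lemma such as alcalde, which appears with several values in the paradigm table, falls outside the single-value group; a high share of single-value lemmas is what suggests the feature is lexical rather than inflectional.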
DET
4109 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3450; 84%), Number=Sing (3181; 77%), Definite=Def (2988; 73%).
DET tokens may have the following values of Gender:
Fem (1934; 47% of non-empty Gender): a, as, unha, súa, esta, la, nosa, súas, esa, outra
Masc (2175; 53% of non-empty Gender): o, os, un, lo, seu, este, seus, todo, todos, ese
EMPTY (9): Los, la, El
| Paradigm o | Masc | Fem |
|---|---|---|
| Definite=Def\|Number=Sing | o, lo, os | a, la |
| Definite=Def\|Number=Plur | os, los | as, las |
| Number=Sing | a | |
| Number=Plur\|Person=3 | os | |
ADJ
1692 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1256; 74%).
ADJ tokens may have the following values of Gender:
Com (21; 1% of non-empty Gender): obstante, embargante, best, coincidente, diferentes, Simple, Visitábel, bípedes, capaz, conscientes
Fem (802; 47% of non-empty Gender): galega, política, europea, nova, social, Franca, actual, cultural, laboral, mellor
Masc (869; 51% of non-empty Gender): novo, galego, español, galegos, socialista, constitucional, difícil, gran, mellor, claro
EMPTY (26): xeral, Mellor, galega, Artístico, Barataria, Constituínte, Franca, Profesional, Reservada, Tradicional
| Paradigm nacionalista | Masc | Fem | Com |
|---|---|---|---|
| Number=Sing | nacionalista | nacionalista | |
| Number=Plur | nacionalistas | nacionalistas | |
PRON
1359 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (1104; 81%), Clitic=EMPTY (888; 65%), Number=Sing (774; 57%), Person=EMPTY (724; 53%).
PRON tokens may have the following values of Gender:
Com (390; 29% of non-empty Gender): se, nos, que, me, quen, lle, eu, nós, vostede, alguén
Fem (291; 21% of non-empty Gender): que, a, lle, as, unha, elas, ela, esta, na, ningunha
Masc (645; 47% of non-empty Gender): que, o, lle, lo, os, todo, un, algo, algúns, lles
Neut (33; 2% of non-empty Gender): iso, isto
EMPTY (13): nada, algo, se, che
| Paradigm que | Masc | Fem | Com |
|---|---|---|---|
| Number=Sing\|PronType=Int | que | | |
| Number=Sing\|PronType=Rel | que | que | que |
| Number=Plur\|PronType=Rel | que | que | que |
| PronType=Int | que | | |
| PronType=Rel | que | | |
PROPN
932 PROPN tokens (59% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=EMPTY (474; 51%).
PROPN tokens may have the following values of Gender:
Fem (222; 24% of non-empty Gender): Mercedes, Núñez, Unión, UE, Zona, Bases, Xunta, Constitución, Galiza, Academia
Masc (710; 76% of non-empty Gender): BNG, Estado, Manuel, Xosé, Miguel, Anxo, Carlos, Estatuto, González, Francisco
EMPTY (646): Galiza, Ferrol, Vigo, Pontevedra, Terra, Touriño, Beiras, Coruña, Abades, Amenábar
| Paradigm galiza | Masc | Fem |
|---|---|---|
| | Galiza | Galiza |
Gender seems to be a lexical feature of PROPN. 97% of lemmas (466) occur with only one value of Gender.
NUM
254 NUM tokens (98% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (217; 85%), Number=Sing (146; 57%).
NUM tokens may have the following values of Gender:
Com (2; 1% of non-empty Gender): 13.000, cen
Fem (55; 22% of non-empty Gender): primeira, dúas, segunda, catro, tres, unha, cinco, terceira, 12, 15.000
Masc (197; 78% of non-empty Gender): dous, un, primeiro, catro, dez, 1990, 25, cinco, quince, tres
EMPTY (6): 36, 687 614 874, II, Tres, XII, dos
Paradigm primeiro | Masc | Fem |
---|---|---|
Number=Sing | primeiro | primeira |
Number=Plur | primeiros | primeiras |
VERB
237 VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (237; 100%), Person=EMPTY (237; 100%), Tense=EMPTY (237; 100%), VerbForm=Part (237; 100%), Number=Sing (172; 73%).
VERB tokens may have the following values of Gender:
Fem (99; 42% of non-empty Gender): recibida, dirixida, destinadas, feita, feitas, prometidas, realizada, vencellada, Configurada, Nacida
Masc (138; 58% of non-empty Gender): debido, baseado, elaborado, elixido, afectado, apresentado, atendidos, chamado, considerado, dirixido
EMPTY (2138): hai, ten, facer, ter, teñen, ver, fai, falar, dar, fixo
| Paradigm facer | Masc | Fem |
|---|---|---|
| Number=Sing | feito | feita |
| Number=Plur | | feitas |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3380; 100%),
NOUN –[amod]–> ADJ (1197; 99%),
PROPN –[det]–> DET (375; 80%),
PROPN –[flat:name]–> PROPN (274; 99%),
NOUN –[conj]–> NOUN (211; 57%),
NOUN –[nummod]–> NUM (143; 99%),
ADJ –[conj]–> ADJ (114; 97%),
PRON –[det]–> DET (98; 99%),
PROPN –[amod]–> ADJ (93; 82%),
ADJ –[det]–> DET (63; 100%).
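Agreement counts of this kind could be computed roughly as follows. This is a sketch under stated assumptions, not the procedure that produced the figures above: the names `gender_of`, `agreement_counts`, and `SAMPLE` are hypothetical, and it assumes the percentage is taken over parent-child pairs in which both nodes carry a Gender value, which this page does not spell out.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_counts(conllu_text):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS)."""
    agree, total = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        nodes = {}
        for line in block.splitlines():
            cols = line.split("\t")
            # Skip comments, multiword-token ranges, and empty nodes.
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            nodes[cols[0]] = (cols[3], gender_of(cols[5]), cols[6], cols[7])
        for upos, gender, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (head "0")
            # Assumption: only pairs where both nodes have Gender are compared.
            if parent is None or gender is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            total[key] += 1
            agree[key] += gender == parent[1]
    return agree, total


# Hypothetical sentence, invented for illustration only.
SAMPLE = "\n".join(
    "\t".join(cols)
    for cols in [
        ["1", "a", "o", "DET", "_", "Gender=Fem|Number=Sing", "2", "det", "_", "_"],
        ["2", "cidade", "cidade", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"],
        ["3", "nova", "novo", "ADJ", "_", "Gender=Fem|Number=Sing", "2", "amod", "_", "_"],
        ["4", "e", "e", "CCONJ", "_", "_", "6", "cc", "_", "_"],
        ["5", "o", "o", "DET", "_", "Gender=Masc|Number=Sing", "6", "det", "_", "_"],
        ["6", "traballo", "traballo", "NOUN", "_", "Gender=Masc|Number=Sing", "2", "conj", "_", "_"],
    ]
)

agree, total = agreement_counts(SAMPLE)
for key in total:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {agree[key] / total[key]:.0%})")
```

In the sample, the coordinated nouns differ in Gender, which mirrors why NOUN –[conj]–> NOUN shows a much lower agreement rate than the det and amod relations: conjuncts are not required to share a gender.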