Treebank Statistics: UD_Galician-PUD: Features: Gender
This feature is universal.
It occurs with 4 different values: Com, Fem, Masc, Neut.
9587 tokens (41%) have a non-empty value of Gender.
2603 types (44%) occur at least once with a non-empty value of Gender.
2052 lemmas (46%) occur at least once with a non-empty value of Gender.
The feature is used with 7 part-of-speech tags (token count; share of all tokens in the treebank): NOUN (4545; 19%), DET (3809; 16%), PRON (924; 4%), VERB (244; 1%), ADJ (53; 0%), NUM (9; 0%), PROPN (3; 0%).
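As a rough illustration of how per-tag counts like these could be derived, the sketch below tallies tokens that carry a Gender feature from CoNLL-U text. The function names and the three-token sample sentence are invented for illustration and are not taken from the treebank or from the tool that generated this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender = Counter()
    totals = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        upos, feats = cols[3], parse_feats(cols[5])
        totals[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return with_gender, totals


def row(*cols):
    """Join the ten CoNLL-U columns with tabs."""
    return "\t".join(cols)


# Hypothetical sample sentence, not an excerpt from UD_Galician-PUD.
SAMPLE = "\n".join([
    "# text = a cidade medrou",
    row("1", "a", "o", "DET", "_",
        "Definite=Def|Gender=Fem|Number=Sing|PronType=Art", "2", "det", "_", "_"),
    row("2", "cidade", "cidade", "NOUN", "_",
        "Gender=Fem|Number=Sing", "3", "nsubj", "_", "_"),
    row("3", "medrou", "medrar", "VERB", "_", "_", "0", "root", "_", "_"),
])

with_gender, totals = gender_by_upos(SAMPLE)
# Share of all tokens that carry Gender, as a fraction.
share = sum(with_gender.values()) / sum(totals.values())
```

Dividing each tag's count by the total number of tokens gives the kind of share reported in parentheses above.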
NOUN
4545 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3206; 71%).
NOUN tokens may have the following values of Gender:
Com (85; 2% of non-empty Gender): parte, final, partes, axentes, cápita, defensa, mañá, axente, modelo, C
Fem (2016; 44% of non-empty Gender): persoas, guerra, cidade, vez, rexión, vida, historia, maioría, forma, policía
Masc (2444; 54% of non-empty Gender): anos, lugar, ano, estado, goberno, mar, día, millóns, mundo, século
EMPTY (31): Estados, San, Punta, mil, Asociación, Cidade, Comúns, Escola, Head, Lord
| Paradigm parte | Fem | Com |
|---|---|---|
| Number=Sing | parte | parte |
| Number=Plur | partes | partes |
Gender seems to be a lexical feature of NOUN: 98% of lemmas (1754) occur with only one value of Gender.
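The lexical-feature check can be sketched as follows: group the observed Gender values by lemma and count the lemmas that show exactly one value. This is a minimal sketch under assumptions; the helper name `single_value_share` and the toy token list are hypothetical, and the exact procedure behind the 98% figure is not stated on this page.

```python
from collections import defaultdict


def single_value_share(tokens, upos):
    """Count lemmas of one UPOS tag that occur with exactly one Gender value.

    tokens: iterable of (lemma, upos, gender) tuples, gender is None if absent.
    Returns (lemmas with a single value, lemmas with any Gender value).
    """
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy data: "parte" is listed above with both Fem and Com, so it counts
# as a lemma with more than one value. Lemma assignments are assumptions.
TOKENS = [
    ("parte", "NOUN", "Fem"),
    ("parte", "NOUN", "Com"),
    ("cidade", "NOUN", "Fem"),
    ("ano", "NOUN", "Masc"),
    ("ano", "NOUN", "Masc"),
    ("medrar", "VERB", None),
]

single, total = single_value_share(TOKENS, "NOUN")
```

A share close to 100% suggests that the value is fixed per lemma rather than chosen by inflection.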
DET
3809 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3450; 91%), Definite=Def (3113; 82%), Number=Sing (2908; 76%).
DET tokens may have the following values of Gender:
Fem (1677; 44% of non-empty Gender): a, as, unha, súa, súas, esta, varias, outra, cada, estas
Masc (2132; 56% of non-empty Gender): o, os, un, seu, este, seus, todo, moitos, outros, algúns
EMPTY (6): o, a, cada, calquera, quenquera
| Paradigm o | Masc | Fem |
|---|---|---|
| Number=Sing\|PronType=Art | o, a | a, as, la, o |
| Number=Plur\|Person=3\|PronType=Prs | os | |
| Number=Plur\|PronType=Art | os | as |
PRON
924 PRON tokens (98% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (824; 89%), Number=EMPTY (644; 70%), PronType=Prs (512; 55%).
PRON tokens may have the following values of Gender:
Com (403; 44% of non-empty Gender): se, lle, me, nos, eu, lles, mesmo, min, nós, que
Fem (54; 6% of non-empty Gender): a, ela, unha, esta, moitas, as, estas, que, Outras, algunhas
Masc (137; 15% of non-empty Gender): o, el, un, eles, todo, os, que, ambos, estes, outros
Neut (330; 36% of non-empty Gender): que, isto, iso, cal, alguén, cales, algo, nada, ninguén, aquilo
EMPTY (18): quen, que, Algo, Esta, tales
| Paradigm que | Masc | Fem | Neut | Com |
|---|---|---|---|---|
| Number=Sing\|PronType=Prs | que | | | |
| Number=Sing\|PronType=Rel | que | que | | |
| Number=Plur\|PronType=Rel | que | | | |
| PronType=Rel | que | que | | |
VERB
244 VERB tokens (11% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (244; 100%), Person=EMPTY (244; 100%), Tense=EMPTY (242; 99%), VerbForm=Part (242; 99%), Number=Sing (158; 65%).
VERB tokens may have the following values of Gender:
Fem (99; 41% of non-empty Gender): baseadas, dirixida, localizadas, coñecidas, formada, perdidas, preparada, procedentes, publicadas, seguidas
Masc (145; 59% of non-empty Gender): debido, incluídos, mediados, publicado, acusado, anticipado, deseñado, destruído, feito, formado
EMPTY (2014): hai, dixo, ten, facer, tivo, comezou, ver, fixo, declarou, é
| Paradigm seguir | Masc | Fem |
|---|---|---|
| Number=Sing | seguido | |
| Number=Plur | seguidos | seguidas |
ADJ
53 ADJ tokens (4% of all ADJ tokens) have a non-empty value of Gender.
ADJ tokens may have the following values of Gender:
Com (21; 40% of non-empty Gender): gran, per, anterior, confidenciais, conservacionistas, dixital, escalofriante, especial, habitual, impactante
Fem (14; 26% of non-empty Gender): Buena, aqueménidas, bancaria, centrais, exitosa, inusual, iézidis, meterolóxica, minoristas, pacífica
Masc (18; 34% of non-empty Gender): argumentativo, austro, autosómicos, conservador, distinto, franco, inapropriado, local, medio, monegasco
EMPTY (1378): gran, maior, Unidos, grandes, nacional, nova, novo, últimos, novos, longo
Gender seems to be a lexical feature of ADJ: 100% of lemmas (51) occur with only one value of Gender.
NUM
9 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (9; 100%).
NUM tokens may have the following values of Gender:
Masc (9; 100% of non-empty Gender): mil, 1,4, 1,5, 103,7, 15.000, 16, 2
EMPTY (493): dous, primeira, tres, dúas, primeiro, 1, catro, III, segunda, seis
PROPN
3 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (2; 67% of non-empty Gender): Córsega, Liña
Masc (1; 33% of non-empty Gender): Knott
EMPTY (1364): China, Trump, C., Mediterráneo, Europa, Francia, Hong, Italia, Kong, Albania
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (3292; 97%),
NOUN –[conj]–> NOUN (142; 57%),
NOUN –[amod]–> VERB (49; 92%),
PRON –[nmod]–> NOUN (30; 81%),
NOUN –[appos]–> NOUN (18; 62%),
VERB –[nsubj:pass]–> NOUN (17; 85%),
NOUN –[compound]–> NOUN (5; 71%),
NOUN –[conj]–> PRON (4; 100%),
NUM –[compound]–> NUM (3; 100%),
ADJ –[compound]–> ADJ (2; 100%).
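
A hedged sketch of how agreement counts of this kind might be gathered from a dependency tree: for every parent-child pair in which both nodes carry Gender, record the relation pattern and whether the two values match. The page does not state which denominator its percentages use; the sketch assumes pairs where both nodes have a Gender value. The dictionary layout and the sample tree are hypothetical.

```python
from collections import Counter


def agreement_counts(sentence):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS) pattern.

    sentence: list of dicts with keys id, head, upos, deprel, gender
    (gender is None when the token has no Gender feature).
    Returns (pairs that agree, pairs where both nodes have Gender).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return agree, both


# Hypothetical tree for "a cidade e o mar", invented for illustration.
SENTENCE = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "gender": "Fem"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
    {"id": 3, "head": 5, "upos": "CCONJ", "deprel": "cc", "gender": None},
    {"id": 4, "head": 5, "upos": "DET", "deprel": "det", "gender": "Masc"},
    {"id": 5, "head": 2, "upos": "NOUN", "deprel": "conj", "gender": "Masc"},
]

agree, both = agreement_counts(SENTENCE)
```

In this toy tree both determiners agree with their nouns, while the coordinated nouns differ in Gender, which mirrors the lower agreement rate of the conj relation in the list above.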