Treebank Statistics: UD_Galician-PUD: Features: Gender
This feature is universal.
It occurs with 4 different values: Com
, Fem
, Masc
, Neut
.
9588 tokens (41%) have a non-empty value of Gender
.
2604 types (44%) occur at least once with a non-empty value of Gender
.
2053 lemmas (46%) occur at least once with a non-empty value of Gender
.
The feature is used with 7 part-of-speech tags: NOUN (4544; 19% instances), DET (3812; 16% instances), PRON (923; 4% instances), VERB (244; 1% instances), ADJ (53; 0% instances), NUM (9; 0% instances), PROPN (3; 0% instances).
NOUN
4544 NOUN tokens (99% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=Sing (3204; 71%).
NOUN
tokens may have the following values of Gender
:
Com
(85; 2% of non-emptyGender
): parte, final, partes, axentes, cápita, defensa, mañá, axente, modelo, CFem
(2014; 44% of non-emptyGender
): persoas, guerra, cidade, vez, rexión, vida, historia, maioría, forma, policíaMasc
(2445; 54% of non-emptyGender
): anos, lugar, ano, estado, goberno, mar, día, millóns, mundo, séculoEMPTY
(31): Estados, San, Punta, mil, Asociación, Cidade, Comúns, Escola, Head, Lord
Paradigm parte | Fem | Com |
---|---|---|
Number=Sing | parte | parte |
Number=Plur | partes | partes |
Gender
seems to be lexical feature of NOUN
. 98% lemmas (1755) occur only with one value of Gender
.
DET
3812 DET tokens (100% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: PronType=Art (3453; 91%), Definite=Def (3116; 82%), Number=Sing (2911; 76%).
DET
tokens may have the following values of Gender
:
Fem
(1680; 44% of non-emptyGender
): a, as, unha, súa, súas, esta, varias, outra, cada, estasMasc
(2132; 56% of non-emptyGender
): o, os, un, seu, este, seus, todo, moitos, outros, algúnsEMPTY
(6): o, a, cada, calquera, quenquera
Paradigm o | Masc | Fem |
---|---|---|
Number=Sing|PronType=Art | o, a | a, as, la, o |
Number=Plur|Person=3|PronType=Prs | os | |
Number=Plur|PronType=Art | os | as |
PRON
923 PRON tokens (98% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Case=EMPTY (823; 89%), Number=EMPTY (643; 70%), PronType=Prs (511; 55%).
PRON
tokens may have the following values of Gender
:
Com
(402; 44% of non-emptyGender
): se, lle, me, nos, eu, lles, mesmo, min, nós, queFem
(54; 6% of non-emptyGender
): a, ela, unha, esta, moitas, as, estas, que, Outras, algunhasMasc
(137; 15% of non-emptyGender
): o, el, un, eles, todo, os, que, ambos, estes, outrosNeut
(330; 36% of non-emptyGender
): que, isto, iso, cal, alguén, cales, algo, nada, ninguén, aquiloEMPTY
(18): quen, que, Algo, Esta, tales
Paradigm que | Masc | Fem | Neut | Com |
---|---|---|---|---|
Number=Sing|PronType=Prs | que | |||
Number=Sing|PronType=Rel | que | que | ||
Number=Plur|PronType=Rel | que | |||
PronType=Rel | que | que |
VERB
244 VERB tokens (11% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: Mood=EMPTY (244; 100%), Person=EMPTY (244; 100%), Tense=EMPTY (242; 99%), VerbForm=Part (242; 99%), Number=Sing (158; 65%).
VERB
tokens may have the following values of Gender
:
Fem
(99; 41% of non-emptyGender
): baseadas, dirixida, localizadas, coñecidas, formada, perdidas, preparada, procedentes, publicadas, seguidasMasc
(145; 59% of non-emptyGender
): debido, incluídos, mediados, publicado, acusado, anticipado, deseñado, destruído, feito, formadoEMPTY
(2013): hai, dixo, ten, facer, tivo, comezou, ver, fixo, declarou, é
Paradigm seguir | Masc | Fem |
---|---|---|
Number=Sing | seguido | |
Number=Plur | seguidos | seguidas |
ADJ
53 ADJ tokens (4% of all ADJ
tokens) have a non-empty value of Gender
.
ADJ
tokens may have the following values of Gender
:
Com
(21; 40% of non-emptyGender
): gran, per, anterior, confidenciais, conservacionistas, dixital, escalofriante, especial, habitual, impactanteFem
(14; 26% of non-emptyGender
): Buena, aqueménidas, bancaria, centrais, exitosa, inusual, iézidis, meterolóxica, minoristas, pacíficaMasc
(18; 34% of non-emptyGender
): argumentativo, austro, autosómicos, conservador, distinto, franco, inapropriado, local, medio, monegascoEMPTY
(1380): gran, maior, Unidos, grandes, nacional, nova, novo, últimos, novos, longo
Gender
seems to be lexical feature of ADJ
. 100% lemmas (51) occur only with one value of Gender
.
NUM
9 NUM tokens (2% of all NUM
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NUM
and Gender
co-occurred: NumType=Card (9; 100%).
NUM
tokens may have the following values of Gender
:
Masc
(9; 100% of non-emptyGender
): mil, 1,4, 1,5, 103,7, 15.000, 16, 2EMPTY
(493): dous, primeira, tres, dúas, primeiro, 1, catro, III, segunda, seis
PROPN
3 PROPN tokens (0% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Fem
(2; 67% of non-emptyGender
): Córsega, LiñaMasc
(1; 33% of non-emptyGender
): KnottEMPTY
(1364): China, Trump, C., Mediterráneo, Europa, Francia, Hong, Italia, Kong, Albania
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[det]–> DET (3290; 97%),
NOUN –[conj]–> NOUN (142; 57%),
NOUN –[amod]–> VERB (49; 92%),
PRON –[nmod]–> NOUN (30; 81%),
NOUN –[appos]–> NOUN (18; 62%),
VERB –[nsubj:pass]–> NOUN (17; 85%),
NOUN –[compound]–> NOUN (5; 71%),
DET –[fixed]–> NOUN (4; 100%),
NOUN –[conj]–> PRON (4; 100%),
NUM –[compound]–> NUM (3; 100%).