This page pertains to UD version 2.

Treebank Statistics: UD_Galician-TreeGal: Features: Gender

This feature is universal. It occurs with 4 different values: Com, Fem, Masc, Neut.

12590 tokens (49%) have a non-empty value of Gender. 3509 types (65%) occur at least once with a non-empty value of Gender. 2662 lemmas (68%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (4835; 19% instances), DET (4110; 16% instances), ADJ (1695; 7% instances), PRON (1359; 5% instances), NUM (261; 1% instances), VERB (237; 1% instances), PROPN (93; 0% instances).
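Counts of this kind can be approximated directly from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), skips multiword-token ranges and empty nodes, and uses a made-up two-token sample rather than real TreeGal data. The names `parse_feats`, `gender_counts_by_upos` and `SAMPLE` are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        # Keep only regular word lines: IDs like '1-2' (multiword tokens)
        # and '1.1' (empty nodes) are not plain integers.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Illustrative sample only (assumption, not taken from the treebank).
SAMPLE = "\n".join([
    "# text = a casa",
    "1\ta\to\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tcasa\tcasa\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    share = 100 * with_gender[upos] / total[upos]
    print(f"{upos}: {with_gender[upos]} of {total[upos]} tokens ({share:.0f}%) have Gender")
```

Run over the full treebank files, the per-tag totals from such a function would correspond to the per-tag sections that follow, provided the page counts syntactic words in the same way.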

NOUN

4835 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3513; 73%).

NOUN tokens may have the following values of Gender:

Paradigm alcalde   Masc      Fem         Com
                   alcalde   alcaldesa   alcalde

Gender seems to be a lexical feature of NOUN. 98% of lemmas (1662) occur with only one value of Gender.
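The "lexical feature" judgement rests on how many lemmas are seen with a single Gender value. A rough sketch of that check follows, assuming the input is a list of (lemma, Gender) pairs already extracted for one part-of-speech tag. The function name `single_gender_share` and the sample pairs are hypothetical, and the page does not say exactly which lemmas form the denominator, so the sketch counts only lemmas that occur with some Gender value.

```python
from collections import defaultdict


def single_gender_share(rows):
    """rows: iterable of (lemma, gender) pairs for tokens of one UPOS tag.

    Returns (lemmas seen with exactly one Gender value,
             lemmas seen with any Gender value).
    """
    values = defaultdict(set)
    for lemma, gender in rows:
        if gender:  # ignore tokens without a Gender value
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Illustrative pairs only (assumption); 'alcalde' mirrors the paradigm above,
# where one lemma surfaces with more than one Gender value.
rows = [
    ("casa", "Fem"),
    ("casa", "Fem"),
    ("alcalde", "Masc"),
    ("alcalde", "Fem"),
    ("libro", "Masc"),
]

single, total = single_gender_share(rows)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that Gender is fixed per lemma, while a low share would point to an inflectional feature.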

DET

4110 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3451; 84%), Number=Sing (3181; 77%), Definite=Def (2989; 73%).

DET tokens may have the following values of Gender:

Paradigm o                 Masc        Fem
Definite=Def|Number=Sing   o, lo, os   a, la
Definite=Def|Number=Plur   os, los     as, las
Number=Sing                            a
Number=Plur|Person=3       os

ADJ

1695 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1260; 74%).

ADJ tokens may have the following values of Gender:

Paradigm nacionalistaMascFemCom
Number=Singnacionalistanacionalista
Number=Plurnacionalistasnacionalistas

PRON

1359 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (1104; 81%), Clitic=EMPTY (888; 65%), Number=Sing (774; 57%), Person=EMPTY (724; 53%).

PRON tokens may have the following values of Gender:

Paradigm queMascFemCom
Number=Sing|PronType=Intque
Number=Sing|PronType=Relquequeque
Number=Plur|PronType=Relquequeque
PronType=Intque
PronType=Relque

NUM

261 NUM tokens (98% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (223; 85%), Number=Sing (148; 57%).

NUM tokens may have the following values of Gender:

Paradigm primeiro   Masc        Fem
Number=Sing         primeiro    primeira
Number=Plur         primeiros   primeiras

VERB

237 VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (237; 100%), Person=EMPTY (237; 100%), Tense=EMPTY (237; 100%), VerbForm=Part (237; 100%), Number=Sing (172; 73%).

VERB tokens may have the following values of Gender:

Paradigm facer   Masc    Fem
Number=Sing      feito   feita
Number=Plur              feitas

PROPN

93 PROPN tokens (8% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (25) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (3648; 99%), NOUN –[amod]–> ADJ (1300; 98%), NOUN –[conj]–> NOUN (229; 57%), NOUN –[nummod]–> NUM (151; 96%), ADJ –[conj]–> ADJ (114; 97%), PRON –[det]–> DET (98; 99%), ADJ –[det]–> DET (64; 100%), ADJ –[nsubj]–> NOUN (43; 93%), NUM –[det]–> DET (38; 95%), NOUN –[nmod]–> PRON (35; 61%).
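Agreement figures like these can be sketched by walking each dependency edge and comparing the Gender values of parent and child. The code below is an illustration under stated assumptions, not the page's own method: the page does not state what the percentages are relative to, so the sketch assumes the denominator is edges where both nodes carry Gender. The helper names `gender_of` and `agreement_by_relation`, the tuple layout and the sample sentence are hypothetical.

```python
from collections import Counter


def gender_of(feats):
    """Extract the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair[len("Gender="):]
    return None


def agreement_by_relation(sentence_rows):
    """sentence_rows: list of (id, upos, feats, head, deprel) for one sentence.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    edges where parent and child agree in Gender, and edges where both
    nodes have a Gender value at all.
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, both = Counter(), Counter()
    for _tok_id, upos, feats, head, deprel in sentence_rows:
        parent = by_id.get(head)
        if parent is None:  # the root has head 0, which is not a token
            continue
        child_gender = gender_of(feats)
        parent_gender = gender_of(parent[2])
        if child_gender is None or parent_gender is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return agree, both


# Illustrative sentence only (assumption): "a casa branca".
SENTENCE = [
    (1, "DET", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art", 2, "det"),
    (2, "NOUN", "Gender=Fem|Number=Sing", 0, "root"),
    (3, "ADJ", "Gender=Fem|Number=Sing", 2, "amod"),
]

agree, both = agreement_by_relation(SENTENCE)
for (parent_upos, deprel, child_upos), n in both.most_common():
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[(parent_upos, deprel, child_upos)]} of {n} agree")
```

Summing the two counters over all sentences and sorting by the agreement count would give a ranking comparable in shape to the list above.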