home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Galician-TreeGal: Features: Gender

This feature is universal. It occurs with 4 different values: Com, Fem, Masc, Neut.

13078 tokens (51%) have a non-empty value of Gender. 3791 types (70%) occur at least once with a non-empty value of Gender. 2996 lemmas (77%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (4495; 18% instances), DET (4109; 16% instances), ADJ (1692; 7% instances), PRON (1359; 5% instances), PROPN (932; 4% instances), NUM (254; 1% instances), VERB (237; 1% instances).

NOUN

4495 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3222; 72%).

NOUN tokens may have the following values of Gender:

Paradigm alcaldeMascFemCom
alcaldealcaldesaalcalde

Gender seems to be lexical feature of NOUN. 98% lemmas (1603) occur only with one value of Gender.

DET

4109 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (3450; 84%), Number=Sing (3181; 77%), Definite=Def (2988; 73%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Definite=Def|Number=Singo, lo, osa, la
Definite=Def|Number=Pluros, losas, las
Number=Singa
Number=Plur|Person=3os

ADJ

1692 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1256; 74%).

ADJ tokens may have the following values of Gender:

Paradigm nacionalistaMascFemCom
Number=Singnacionalistanacionalista
Number=Plurnacionalistasnacionalistas

PRON

1359 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Case=EMPTY (1104; 81%), Clitic=EMPTY (888; 65%), Number=Sing (774; 57%), Person=EMPTY (724; 53%).

PRON tokens may have the following values of Gender:

Paradigm queMascFemCom
Number=Sing|PronType=Intque
Number=Sing|PronType=Relquequeque
Number=Plur|PronType=Relquequeque
PronType=Intque
PronType=Relque

PROPN

932 PROPN tokens (59% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=EMPTY (474; 51%).

PROPN tokens may have the following values of Gender:

Paradigm galizaMascFem
GalizaGaliza

Gender seems to be lexical feature of PROPN. 97% lemmas (466) occur only with one value of Gender.

NUM

254 NUM tokens (98% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (217; 85%), Number=Sing (146; 57%).

NUM tokens may have the following values of Gender:

Paradigm primeiroMascFem
Number=Singprimeiroprimeira
Number=Plurprimeirosprimeiras

VERB

237 VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (237; 100%), Person=EMPTY (237; 100%), Tense=EMPTY (237; 100%), VerbForm=Part (237; 100%), Number=Sing (172; 73%).

VERB tokens may have the following values of Gender:

Paradigm facerMascFem
Number=Singfeitofeita
Number=Plurfeitas

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (3380; 100%), NOUN –[amod]–> ADJ (1197; 99%), PROPN –[det]–> DET (376; 80%), PROPN –[flat:name]–> PROPN (274; 99%), NOUN –[conj]–> NOUN (211; 57%), NOUN –[nummod]–> NUM (143; 99%), ADJ –[conj]–> ADJ (114; 97%), PRON –[det]–> DET (98; 99%), PROPN –[amod]–> ADJ (93; 82%), ADJ –[det]–> DET (63; 100%).