home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PetroGold: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

131773 tokens (53%) have a non-empty value of Gender. 11833 types (78%) occur at least once with a non-empty value of Gender. 8286 lemmas (79%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (57531; 23% instances), DET (36327; 14% instances), ADJ (17069; 7% instances), VERB (8783; 4% instances), PROPN (8285; 3% instances), PRON (3508; 1% instances), ADV (214; 0% instances), NUM (51; 0% instances), AUX (4; 0% instances), X (1; 0% instances).

NOUN

57531 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (41495; 72%).

NOUN tokens may have the following values of Gender:

Paradigm óleoMascFem
Number=Singóleoóleo
Number=Pluróleos

Gender seems to be lexical feature of NOUN. 97% lemmas (3590) occur only with one value of Gender.

DET

36327 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (31749; 87%), Definite=Def (29007; 80%), Number=Sing (28112; 77%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Definite=Def|Number=Sing|PronType=Artoa, á
Definite=Def|Number=Plur|PronType=Artosas, A
Number=Singo
Number=Plur|PronType=Artos

ADJ

17069 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (11092; 65%).

ADJ tokens may have the following values of Gender:

Paradigm maiorMascFem
Number=Singmaiormaior
Number=Plurmaioresmaiores

VERB

8783 VERB tokens (43% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (8782; 100%), Person=EMPTY (8782; 100%), Tense=EMPTY (8782; 100%), VerbForm=Part (8775; 100%), Number=Sing (5169; 59%), Voice=EMPTY (4792; 55%).

VERB tokens may have the following values of Gender:

Paradigm utilizarMascFem
Number=Sing|VerbForm=Gerutilizado
Number=Sing|VerbForm=Partutilizadoutilizada, utilizado
Number=Sing|VerbForm=Part|Voice=Passutilizadoutilizada
Number=Plur|VerbForm=Partutilizadosutilizadas
Number=Plur|VerbForm=Part|Voice=Passutilizadosutilizadas

PROPN

8285 PROPN tokens (69% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (8066; 97%).

PROPN tokens may have the following values of Gender:

Paradigm NE-SWMascFem
Number=SingNE-SWNE-SW
Number=PlurNE-SW

PRON

3508 PRON tokens (65% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2396; 68%), PronType=Rel (1987; 57%).

PRON tokens may have the following values of Gender:

Paradigm queMascFem
Number=Singqueque
Number=Plurqueque

ADV

214 ADV tokens (3% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

Paradigm ondeMascFem
Number=Singondeonde
Number=Plurondeonde

NUM

51 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (31; 61%).

NUM tokens may have the following values of Gender:

Gender seems to be lexical feature of NUM. 100% lemmas (50) occur only with one value of Gender.

AUX

4 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (4; 100%), Number=Sing (4; 100%), Person=EMPTY (4; 100%), Tense=EMPTY (4; 100%), VerbForm=Part (4; 100%).

AUX tokens may have the following values of Gender:

X

1 X tokens (0% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (1; 100%).

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (32956; 100%), NOUN –[amod]–> ADJ (14549; 100%), NOUN –[acl]–> VERB (4069; 93%), NOUN –[conj]–> NOUN (2654; 61%), VERB –[nsubj:pass]–> NOUN (2123; 77%), PROPN –[det]–> DET (2098; 99%), NOUN –[nmod]–> PROPN (1914; 61%), ADJ –[obl]–> NOUN (713; 54%), ADJ –[nsubj]–> NOUN (663; 91%), PROPN –[conj]–> PROPN (661; 71%).