home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Bosque: Features: Gender

This feature is universal but the values Unsp are language-specific. It occurs with 4 different values: Fem, Masc, Neut, Unsp.

109209 tokens (48%) have a non-empty value of Gender. 18998 types (74%) occur at least once with a non-empty value of Gender. 14695 lemmas (81%) occur at least once with a non-empty value of Gender. The feature is used with 13 part-of-speech tags: NOUN (41249; 18% instances), DET (33621; 15% instances), PROPN (11878; 5% instances), ADJ (11312; 5% instances), PRON (7393; 3% instances), VERB (3552; 2% instances), NUM (158; 0% instances), X (18; 0% instances), ADV (12; 0% instances), AUX (9; 0% instances), SCONJ (4; 0% instances), ADP (2; 0% instances), PART (1; 0% instances).

NOUN

41249 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (29483; 71%).

NOUN tokens may have the following values of Gender:

Paradigm presidenteMascFemUnsp
Number=SingpresidentepresidentePresidente
Number=Plurpresidentes

Gender seems to be lexical feature of NOUN. 97% lemmas (6603) occur only with one value of Gender.

DET

33621 DET tokens (96% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (29810; 89%), Definite=Def (26514; 79%), Number=Sing (26513; 79%).

DET tokens may have the following values of Gender:

Paradigm qualquerMascFemUnsp
Number=Singqualquerqualquerqualquer
Number=Plurquaisquerquaisquer

PROPN

11878 PROPN tokens (63% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (11457; 96%).

PROPN tokens may have the following values of Gender:

Paradigm SãoMascFemUnsp
_SÃO
Number=SingSão, S., SÃOSãoSão

Gender seems to be lexical feature of PROPN. 94% lemmas (4480) occur only with one value of Gender.

ADJ

11312 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8161; 72%).

ADJ tokens may have the following values of Gender:

Paradigm grandeMascFemUnsp
Number=Singmaior, grande, máximomaior, grande, máxima
Number=Plurgrandes, maiores, máximosgrandes, maioresgrandes

PRON

7393 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (5202; 70%), Person=EMPTY (4985; 67%), Case=EMPTY (4857; 66%).

PRON tokens may have the following values of Gender:

Paradigm queMascFemUnsp
Definite=Def|Number=Sing|PronType=Artque
Number=Sing|PronType=Demque
Number=Sing|PronType=Indqueque
Number=Sing|PronType=Intquequeque
Number=Sing|PronType=Relqueque, quque
Number=Plur|PronType=Indque
Number=Plur|PronType=Intqueque
Number=Plur|PronType=Relquequeque
Number=Unsp|PronType=Indque
Number=Unsp|PronType=Relque

VERB

3552 VERB tokens (17% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Tense=EMPTY (3552; 100%), Mood=EMPTY (3551; 100%), Person=EMPTY (3551; 100%), VerbForm=Part (3541; 100%), Number=Sing (2337; 66%).

VERB tokens may have the following values of Gender:

Paradigm terMascFem
Number=Singtido
Number=Sing|Voice=Passtidotida
Number=Plurtidas

NUM

158 NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Mult (131; 83%).

NUM tokens may have the following values of Gender:

Gender seems to be lexical feature of NUM. 100% lemmas (15) occur only with one value of Gender.

X

18 X tokens (11% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (17; 94%).

X tokens may have the following values of Gender:

Gender seems to be lexical feature of X. 100% lemmas (17) occur only with one value of Gender.

ADV

12 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (11; 92%).

ADV tokens may have the following values of Gender:

Paradigm quantoMascFem
PronType=Indquanto
PronType=Intquanto
PronType=Relquanto

AUX

9 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (9; 100%), Number=Sing (9; 100%), Person=EMPTY (9; 100%), Tense=EMPTY (9; 100%), VerbForm=Part (9; 100%).

AUX tokens may have the following values of Gender:

SCONJ

4 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

ADP

2 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

PART

1 PART tokens (33% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Number=Sing (1; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (27120; 96%), NOUN –[amod]–> ADJ (8843; 99%), PROPN –[det]–> DET (4469; 81%), NOUN –[acl]–> VERB (1604; 67%), NOUN –[conj]–> NOUN (1369; 61%), NOUN –[appos]–> PROPN (1215; 89%), PROPN –[conj]–> PROPN (828; 76%), VERB –[nsubj:pass]–> NOUN (559; 79%), ADJ –[nsubj]–> NOUN (430; 96%), PROPN –[appos]–> NOUN (379; 80%).