This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Bosque: Features: Gender

This feature is universal, but the value Unsp is language-specific. It occurs with 4 different values: Fem, Masc, Neut, Unsp.

109213 tokens (48%) have a non-empty value of Gender. 18999 types (74%) occur at least once with a non-empty value of Gender. 14692 lemmas (81%) occur at least once with a non-empty value of Gender. The feature is used with 13 part-of-speech tags (count; share of all tokens): NOUN (41249; 18%), DET (33625; 15%), PROPN (11876; 5%), ADJ (11314; 5%), PRON (7394; 3%), VERB (3550; 2%), NUM (158; 0%), X (18; 0%), ADV (11; 0%), AUX (9; 0%), SCONJ (5; 0%), ADP (3; 0%), PART (1; 0%).
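As a rough illustration of how per-tag counts like these could be derived, the sketch below reads CoNLL-U text and counts tokens that carry a Gender feature, grouped by UPOS tag. The sample sentence is made up for the example and is not taken from UD_Portuguese-Bosque; the helper names `parse_feats` and `gender_stats` are hypothetical, and this is not the script that generated this page.

```python
from collections import Counter

# Tiny made-up CoNLL-U fragment (NOT taken from UD_Portuguese-Bosque).
SAMPLE = """\
# text = A casa é grande
1\tA\to\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tcasa\tcasa\tNOUN\t_\tGender=Fem|Number=Sing\t4\tnsubj\t_\t_
3\té\tser\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\tcop\t_\t_
4\tgrande\tgrande\tADJ\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means the token has no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_stats(conllu_text):
    """Return (total token count, Counter of tokens with Gender per UPOS)."""
    total = 0
    with_gender = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or comment line
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[cols[3]] += 1
    return total, with_gender


total, by_upos = gender_stats(SAMPLE)
for upos, n in by_upos.most_common():
    print(f"{upos} ({n}; {round(100 * n / total)}% of all tokens)")
```

The percentage here is computed against all tokens in the input, which is an assumption about how the shares above are defined.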

NOUN

41249 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (29483; 71%).

NOUN tokens may have the following values of Gender:

Paradigm presidente (Gender values: Masc, Fem, Unsp)
Number=Sing: presidente (Masc); presidente (Fem); Presidente (Unsp)
Number=Plur: presidentes

Gender seems to be a lexical feature of NOUN: 97% of lemmas (6604) occur with only one value of Gender.
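The "lexical feature" observation amounts to checking how many lemmas are ever seen with more than one Gender value. A minimal sketch of that check follows, assuming the input has already been reduced to (lemma, gender) pairs; the pairs below are invented for illustration and the function name `single_gender_share` is hypothetical.

```python
from collections import defaultdict


def single_gender_share(observations):
    """Return (lemmas seen with exactly one Gender value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Made-up (lemma, Gender) observations, not real treebank counts.
obs = [
    ("casa", "Fem"),
    ("livro", "Masc"),
    ("presidente", "Masc"),
    ("presidente", "Fem"),
    ("casa", "Fem"),
]

single, total_lemmas = single_gender_share(obs)
print(f"{round(100 * single / total_lemmas)}% of lemmas ({single}) "
      "occur with only one value of Gender")
```

A high share suggests the value is fixed per lemma rather than chosen by inflection, which is how the statement above can be read.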

DET

33625 DET tokens (96% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (29811; 89%), Number=Sing (26516; 79%), Definite=Def (26514; 79%).

DET tokens may have the following values of Gender:

Paradigm qualquer (Gender values: Masc, Fem, Unsp)
Number=Sing: qualquer (Masc); qualquer (Fem); qualquer (Unsp)
Number=Plur: quaisquer; quaisquer

PROPN

11876 PROPN tokens (63% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (11456; 96%).

PROPN tokens may have the following values of Gender:

Paradigm São (Gender values: Masc, Fem, Unsp)
_: SÃO
Number=Sing: São, S., SÃO (Masc); São (Fem); São (Unsp)

Gender seems to be a lexical feature of PROPN: 94% of lemmas (4475) occur with only one value of Gender.

ADJ

11314 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8163; 72%).

ADJ tokens may have the following values of Gender:

Paradigm grande (Gender values: Masc, Fem, Unsp)
Number=Sing: maior, grande, máximo (Masc); maior, grande, máxima (Fem)
Number=Plur: grandes, maiores, máximos (Masc); grandes, maiores (Fem); grandes (Unsp)

PRON

7394 PRON tokens (99% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (5202; 70%), Person=EMPTY (4984; 67%), Case=EMPTY (4856; 66%).

PRON tokens may have the following values of Gender:

Paradigm que (Gender values: Masc, Fem, Unsp)
Case=Acc|Number=Sing|Person=3|PronType=Int: Que
Definite=Def|Number=Sing|PronType=Art: que
Number=Sing|PronType=Dem: que
Number=Sing|PronType=Ind: que; que
Number=Sing|PronType=Int: que (Masc); que (Fem); que (Unsp)
Number=Sing|PronType=Rel: que (Masc); que, qu (Fem); que (Unsp)
Number=Plur|PronType=Ind: que
Number=Plur|PronType=Int: que; que
Number=Plur|PronType=Rel: que (Masc); que (Fem); que (Unsp)
Number=Unsp|PronType=Ind: que
Number=Unsp|PronType=Rel: que

VERB

3550 VERB tokens (17% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Tense=EMPTY (3550; 100%), Mood=EMPTY (3549; 100%), Person=EMPTY (3549; 100%), VerbForm=Part (3539; 100%), Number=Sing (2336; 66%).

VERB tokens may have the following values of Gender:

Paradigm ter (Gender values: Masc, Fem)
Number=Sing: tido
Number=Sing|Voice=Pass: tido (Masc); tida (Fem)
Number=Plur: tidas

NUM

158 NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Mult (131; 83%).

NUM tokens may have the following values of Gender:

Gender seems to be a lexical feature of NUM: 100% of lemmas (15) occur with only one value of Gender.

X

18 X tokens (11% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (17; 94%).

X tokens may have the following values of Gender:

Gender seems to be a lexical feature of X: 100% of lemmas (17) occur with only one value of Gender.

ADV

11 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (10; 91%).

ADV tokens may have the following values of Gender:

Paradigm quanto (Gender values: Masc, Fem)
PronType=Ind: quanto
PronType=Int: quanto
PronType=Rel: quanto

AUX

9 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (9; 100%), Number=Sing (9; 100%), Person=EMPTY (9; 100%), Tense=EMPTY (9; 100%), VerbForm=Part (9; 100%).

AUX tokens may have the following values of Gender:

SCONJ

5 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Paradigm que (Gender values: Masc, Fem)
que
PronType=Rel: que

ADP

3 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

PART

1 PART token (33% of all PART tokens) has a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Number=Sing (1; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (27129; 96%), NOUN –[amod]–> ADJ (8846; 99%), PROPN –[det]–> DET (4470; 81%), NOUN –[acl]–> VERB (1603; 67%), NOUN –[conj]–> NOUN (1369; 61%), NOUN –[appos]–> PROPN (1216; 89%), PROPN –[conj]–> PROPN (828; 75%), VERB –[nsubj:pass]–> NOUN (560; 79%), ADJ –[nsubj]–> NOUN (431; 96%), ADJ –[conj]–> ADJ (378; 96%).
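One way such agreement counts could be tallied is sketched below: for every dependency edge where both parent and child carry a Gender value, record the (parent UPOS, relation, child UPOS) pattern and whether the two values match. The sentence is made up, the tuple layout and the name `agreement_counts` are hypothetical, and the percentage is assumed to be the share of agreeing pairs among pairs where both nodes have Gender, which the page itself does not spell out.

```python
from collections import Counter

# One made-up sentence as (id, upos, gender or None, head, deprel) tuples.
SENT = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 0, "root"),
    (3, "ADJ", "Fem", 2, "amod"),
    (4, "DET", "Masc", 5, "det"),
    (5, "NOUN", "Masc", 2, "conj"),
]


def agreement_counts(sentence):
    """Count edges where parent and child both have Gender, and those that agree."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for _tok_id, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return agree, both


agree, both = agreement_counts(SENT)
for (parent_upos, rel, child_upos), n in both.most_common():
    hits = agree[(parent_upos, rel, child_upos)]
    print(f"{parent_upos} -[{rel}]-> {child_upos} ({hits}; {round(100 * hits / n)}%)")
```

Edges where either node lacks Gender are skipped entirely, so they count neither for nor against agreement in this sketch.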