
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-Bosque: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

109130 tokens (48%) have a non-empty value of Gender. 18851 types (73%) occur at least once with a non-empty value of Gender. 14461 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 13 part-of-speech tags: NOUN (41220; 18% of instances), DET (34574; 15% of instances), PROPN (11532; 5% of instances), ADJ (11338; 5% of instances), PRON (6713; 3% of instances), VERB (3537; 2% of instances), NUM (166; 0% of instances), X (17; 0% of instances), ADV (14; 0% of instances), AUX (9; 0% of instances), SCONJ (6; 0% of instances), ADP (3; 0% of instances), PART (1; 0% of instances).
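Counts of this kind can be reproduced approximately from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is hand-written for illustration, the helper names (`parse_feats`, `iter_tokens`, `feature_coverage`) are hypothetical, and it assumes that multiword-token ranges and empty nodes are excluded from the totals.

```python
from collections import Counter

# Tiny hand-written CoNLL-U sample (illustrative only, not taken from the treebank).
SAMPLE = """\
1\tO\to\tDET\t_\tDefinite=Def|Gender=Masc|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tdia\tdia\tNOUN\t_\tGender=Masc|Number=Sing\t3\tnsubj\t_\t_
3\tchegou\tchegar\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for regular word lines."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_coverage(conllu_text, feature="Gender"):
    """Return (all tokens, tokens with the feature, per-UPOS counts)."""
    total = 0
    by_upos = Counter()
    for _form, _lemma, upos, feats in iter_tokens(conllu_text):
        total += 1
        if feature in feats:
            by_upos[upos] += 1
    return total, sum(by_upos.values()), by_upos


total, with_feature, by_upos = feature_coverage(SAMPLE)
print(f"{with_feature} of {total} tokens carry Gender")
for upos, n in by_upos.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```

Dividing each per-tag count by the total number of tokens, as done here, is consistent with the figures above (for example, 41220 NOUN tokens against roughly 18% of the whole treebank).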

NOUN

41220 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (29542; 72%).

NOUN tokens may have the following values of Gender:

Paradigm dia   Masc   Fem
Number=Sing    dia    dia
Number=Plur    dias

Gender seems to be a lexical feature of NOUN. 98% of lemmas (6592) occur with only one value of Gender.
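The "lexical feature" heuristic can be illustrated as the share of lemmas that are only ever seen with a single Gender value. This is a sketch under assumptions: the observation list is invented for illustration, the function name `single_value_share` is hypothetical, and the threshold at which the statistics page calls a feature lexical is not stated here.

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations for NOUN tokens; not real treebank data.
OBSERVATIONS = [
    ("dia", "Masc"), ("dia", "Masc"), ("dia", "Fem"),
    ("casa", "Fem"), ("casa", "Fem"),
    ("livro", "Masc"),
    ("estudante", "Masc"), ("estudante", "Fem"),
]


def single_value_share(observations):
    """Return (lemmas seen with exactly one value, all lemmas seen)."""
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


single, total = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas ({100 * single / total:.0f}%) "
      "occur with only one value of Gender")
```

A high share suggests the value is fixed per lemma (as with nouns), while a low share suggests the feature is inflectional (as with adjectives, whose lemmas take both values).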

DET

34574 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (30779; 89%), Definite=Def (27460; 79%), Number=Sing (27254; 79%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Definite=Def|ExtPos=PROPN|Number=Plur|PronType=ArtAs
Definite=Def|Number=Sing|PronType=Arto, Os, a, o(s)a
Definite=Def|Number=Sing|PronType=Art|Typo=Yesoso
Definite=Def|Number=Plur|PronType=Artos, oas
Definite=Def|Number=Plur|PronType=Art|Typo=Yesoa, As
Definite=Ind|Number=Sing|PronType=Arto
ExtPos=PROPN|Number=Sing|PronType=ArtO
Number=Sing|PronType=Arto, Aa
Number=Sing|PronType=Demoa
Number=Plur|PronType=Artosas
Number=Plur|PronType=Demosas

PROPN

11532 PROPN tokens (61% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (11116; 96%), ExtPos=EMPTY (7643; 66%).

PROPN tokens may have the following values of Gender:

Paradigm SãoMascFem
Abbr=Yes|ExtPos=PROPN|Number=SingS.
Abbr=Yes|Number=SingS.
ExtPos=PROPNSÃO
ExtPos=PROPN|Number=SingSão, SÃOSão
Number=SingSão

Gender seems to be a lexical feature of PROPN. 95% of lemmas (4385) occur with only one value of Gender.

ADJ

11338 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8184; 72%).

ADJ tokens may have the following values of Gender:

Paradigm novo   Masc    Fem
Number=Sing     novo    nova
Number=Plur     novos   novas

PRON

6713 PRON tokens (90% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (4951; 74%), Case=EMPTY (4722; 70%), Person=EMPTY (4570; 68%).

PRON tokens may have the following values of Gender:

Paradigm queMascFem
Case=Acc|Number=Sing|Person=3|PronType=IntQue
Definite=Def|Number=Sing|PronType=Artque
Number=Sing|PronType=Demque
Number=Sing|PronType=Indqueque
Number=Sing|PronType=Intqueque
Number=Sing|PronType=Relqueque
Number=Sing|PronType=Rel|Typo=Yesqu
Number=Plur|PronType=Indque
Number=Plur|PronType=Intqueque
Number=Plur|PronType=Relqueque

VERB

3537 VERB tokens (17% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (3536; 100%), Tense=EMPTY (3536; 100%), Mood=EMPTY (3535; 100%), VerbForm=Part (3534; 100%), Number=Sing (2329; 66%).

VERB tokens may have the following values of Gender:

Paradigm ter             Masc   Fem
Number=Sing              tido
Number=Sing|Voice=Pass   tido   tida
Number=Plur                     tidas

NUM

166 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Mult (131; 79%).

NUM tokens may have the following values of Gender:

Gender seems to be a lexical feature of NUM. 100% of lemmas (21) occur with only one value of Gender.

X

17 X tokens (10% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Number=Sing (16; 94%).

X tokens may have the following values of Gender:

Gender seems to be a lexical feature of X. 100% of lemmas (16) occur with only one value of Gender.

ADV

14 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (13; 93%).

ADV tokens may have the following values of Gender:

Paradigm quantoMascFem
PronType=Indquanto
PronType=Intquanto
PronType=Relquanto

AUX

9 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (9; 100%), Number=Sing (9; 100%), Person=EMPTY (9; 100%), Tense=EMPTY (9; 100%), VerbForm=Part (9; 100%).

AUX tokens may have the following values of Gender:

SCONJ

6 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Paradigm queMascFem
que
PronType=Relqueque

ADP

3 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

PART

1 PART token (33% of all PART tokens) has a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: ExtPos=EMPTY (1; 100%), Number=Sing (1; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where the parent and child nodes agree in Gender: NOUN –[det]–> DET (28280; 100%), NOUN –[amod]–> ADJ (8998; 100%), PROPN –[det]–> DET (4454; 81%), NOUN –[acl]–> VERB (1597; 67%), NOUN –[conj]–> NOUN (1383; 60%), NOUN –[appos]–> PROPN (1216; 90%), PROPN –[conj]–> PROPN (811; 75%), VERB –[nsubj:pass]–> NOUN (572; 79%), ADJ –[nsubj]–> NOUN (435; 97%), ADJ –[conj]–> ADJ (385; 98%).
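Agreement counts like these could be gathered by walking each dependency edge and comparing the Gender of the child with that of its head. The sketch below rests on assumptions: the two sentences are constructed for illustration (the second is deliberately ungrammatical to show a mismatch), the function names are hypothetical, and the percentage is taken to mean "agreeing pairs out of all pairs where both nodes have a Gender value", which the page itself does not spell out.

```python
from collections import Counter

# Constructed CoNLL-U sample; "O dia longa" is intentionally ungrammatical.
SAMPLE = """\
1\tA\to\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tcasa\tcasa\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_
3\tnova\tnovo\tADJ\t_\tGender=Fem|Number=Sing\t2\tamod\t_\t_

1\tO\to\tDET\t_\tDefinite=Def|Gender=Masc|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tdia\tdia\tNOUN\t_\tGender=Masc|Number=Sing\t0\troot\t_\t_
3\tlonga\tlongo\tADJ\t_\tGender=Fem|Number=Sing\t2\tamod\t_\t_
"""


def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(conllu_text):
    """Count edges where both nodes have Gender, and those where it matches."""
    agree, both = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        nodes = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            # id -> (upos, gender, head id, deprel)
            nodes[cols[0]] = (cols[3], gender_of(cols[5]), cols[6], cols[7])
        for upos, gender, head, deprel in nodes.values():
            parent = nodes.get(head)  # None for the root (head "0")
            if parent is None or gender is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if gender == parent[1]:
                agree[key] += 1
    return agree, both


agree, both = agreement_counts(SAMPLE)
for (parent_upos, deprel, child_upos), n in both.items():
    hits = agree[(parent_upos, deprel, child_upos)]
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({hits}; {100 * hits / n:.0f}%)")
```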