home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

54658 tokens (17%) have a non-empty value of Gender. 6927 types (22%) occur at least once with a non-empty value of Gender. 5820 lemmas (41%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: DET (38829; 12% instances), NOUN (8320; 3% instances), PROPN (3093; 1% instances), ADJ (2066; 1% instances), PRON (1486; 0% instances), VERB (839; 0% instances), ADV (10; 0% instances), X (6; 0% instances), NUM (5; 0% instances), ADP (2; 0% instances), AUX (2; 0% instances).

DET

38829 DET tokens (82% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (38013; 98%), Definite=Def (37476; 97%), Number=Sing (33527; 86%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Definite=Def|Number=Sing|PronType=Arto, aa
Definite=Def|Number=Plur|PronType=Artosas
Number=Sing|PronType=Artoa
Number=Sing|PronType=Demo
Number=Plur|PronType=Artosas

NOUN

8320 NOUN tokens (15% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (6281; 75%).

NOUN tokens may have the following values of Gender:

Paradigm presidenteMascFem
presidentepresidente

Gender seems to be lexical feature of NOUN. 98% lemmas (2653) occur only with one value of Gender.

PROPN

3093 PROPN tokens (10% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3018; 98%).

PROPN tokens may have the following values of Gender:

Paradigm TheMascFem
TheThe

Gender seems to be lexical feature of PROPN. 97% lemmas (1962) occur only with one value of Gender.

ADJ

2066 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1589; 77%).

ADJ tokens may have the following values of Gender:

Paradigm primeiroMascFem
Number=Singprimeiroprimeira
Number=Plurprimeirosprimeiras

PRON

1486 PRON tokens (19% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1163; 78%).

PRON tokens may have the following values of Gender:

Paradigm queMascFem
Number=Sing|PronType=Demque
Number=Sing|PronType=Indque
Number=Sing|PronType=Intque
Number=Sing|PronType=Relqueque
Number=Plur|PronType=Relqueque

VERB

839 VERB tokens (3% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (838; 100%), Number=Sing (586; 70%).

VERB tokens may have the following values of Gender:

Paradigm fazerMascFem
Number=Sing|Voice=Passfeitofeita
Number=Plurfeitos
Number=Plur|Voice=Passfeitosfeitas

ADV

10 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (10; 100%).

ADV tokens may have the following values of Gender:

X

6 X tokens (1% of all X tokens) have a non-empty value of Gender.

X tokens may have the following values of Gender:

NUM

5 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (3; 60%).

NUM tokens may have the following values of Gender:

ADP

2 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

AUX

2 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (2; 100%), Number=Sing (2; 100%), Person=EMPTY (2; 100%), Tense=EMPTY (2; 100%), VerbForm=Part (2; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (1671; 99%), NOUN –[appos]–> PROPN (398; 90%), NOUN –[acl]–> VERB (355; 71%), PROPN –[conj]–> PROPN (324; 70%), NOUN –[conj]–> NOUN (293; 67%), VERB –[nsubj:pass]–> NOUN (152; 83%), PROPN –[appos]–> PROPN (150; 75%), NOUN –[appos]–> NOUN (91; 61%), NOUN –[nmod]–> PRON (65; 61%), ADJ –[obl]–> NOUN (63; 55%).