This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-DANTEStocks: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

21331 tokens (26%) have a non-empty value of Gender. 3423 types (32%) occur at least once with a non-empty value of Gender. 2164 lemmas (25%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (11577; 14% instances), DET (6616; 8% instances), ADJ (2104; 3% instances), PRON (478; 1% instances), VERB (467; 1% instances), NUM (61; 0% instances), ADP (19; 0% instances), AUX (5; 0% instances), PROPN (4; 0% instances).
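Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the sample sentence is hand-written for illustration, and the helper names `parse_feats` and `gender_counts` are hypothetical.

```python
from collections import Counter

# Tiny hand-written CoNLL-U fragment (illustrative, not taken from the treebank).
SAMPLE = """\
1\tA\to\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tação\tação\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tsubiu\tsubir\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin\t0\troot\t_\t_
"""

def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

with_gender, total = gender_counts(SAMPLE)
for upos, n in with_gender.most_common():
    print(f"{upos}: {n} of {total[upos]} tokens have Gender")
```

Running the same function over the full text of a treebank file would give per-tag counts comparable to those listed above.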

NOUN

11577 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8555; 74%).
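As a rough illustration of how such co-occurrence counts could be obtained, the sketch below tallies the other feature values found on tokens that carry Gender. The FEATS strings are invented for the example and the function name `cooccurring_values` is hypothetical.

```python
from collections import Counter

# Hypothetical FEATS strings of NOUN tokens; not real treebank data.
noun_feats = [
    "Gender=Fem|Number=Sing",
    "Gender=Fem|Number=Plur",
    "Gender=Masc|Number=Sing",
    "Number=Sing",  # no Gender, so it is ignored below
]

def cooccurring_values(feats_list, feature="Gender"):
    """Count other Feature=Value pairs on tokens that have `feature`."""
    counts, n = Counter(), 0
    for feats in feats_list:
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        if feature not in pairs:
            continue
        n += 1
        for name, value in pairs.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    return counts, n

counts, n = cooccurring_values(noun_feats)
for pair, c in counts.most_common():
    print(f"{pair} ({c}; {c / n:.0%})")
```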

NOUN tokens may have the following values of Gender:

Paradigm ação (columns: Masc / Fem)
Number=Sing: ação
Number=Sing|Typo=Yes: acao
Number=Plur: ações / ações, açõ, açõe
Number=Plur|Typo=Yes: acoes, açõe, açoes

Gender seems to be a lexical feature of NOUN: 98% of lemmas (1626) occur with only one value of Gender.
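The lexical-feature check can be sketched as follows: group the observed Gender values by lemma and count the lemmas that show exactly one value. The (lemma, Gender) pairs are invented for illustration and `single_gender_share` is a hypothetical helper, not part of any UD tool.

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations for NOUN tokens; not real treebank data.
observations = [
    ("ação", "Fem"), ("ação", "Fem"), ("ação", "Masc"),
    ("mercado", "Masc"), ("bolsa", "Fem"), ("bolsa", "Fem"),
]

def single_gender_share(pairs):
    """Return (lemmas seen with exactly one Gender value, all lemmas)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for v in values.values() if len(v) == 1)
    return single, len(values)

single, total = single_gender_share(observations)
print(f"{single} of {total} lemmas ({single / total:.0%}) occur with only one Gender value")
```

A high share suggests that Gender is fixed per lemma rather than inflectional for that part of speech.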

DET

6616 DET tokens (98% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (5898; 89%), Number=Sing (5677; 86%), Definite=Def (5516; 83%).

DET tokens may have the following values of Gender:

Paradigm o (columns: Masc / Fem)
Definite=Def|Number=Sing|PronType=Art: o / a
Definite=Def|Number=Sing|PronType=Art|Typo=Yes: e
Definite=Def|Number=Plur|PronType=Art: os / as
Number=Sing|PronType=Art|Typo=Yes: s
Number=Sing|PronType=Dem: o / a
Number=Plur|PronType=Dem: os / as

ADJ

2104 ADJ tokens (72% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1701; 81%).

ADJ tokens may have the following values of Gender:

Paradigm diário (columns: Masc / Fem)
Number=Sing: diário / diária
Number=Sing|Typo=Yes: diári, diário
Number=Plur: diários

PRON

478 PRON tokens (37% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (420; 88%), Case=EMPTY (395; 83%), Person=EMPTY (287; 60%), PronType=Dem (242; 51%).

PRON tokens may have the following values of Gender:

Paradigm o (columns: Masc / Fem)
Case=Acc|Number=Sing|Person=3|PronType=Prs: a
Definite=Def|Number=Sing|PronType=Art: o / a
Definite=Def|Number=Sing|PronType=Dem: o
Definite=Def|Number=Plur|PronType=Art: os / as
Number=Sing: a
Number=Sing|Person=3|PronType=Dem: o / a
Number=Sing|Person=3|PronType=Dem|Typo=Yes: mo
Number=Sing|PronType=Dem: o / a
Number=Sing|PronType=Int: o
Number=Plur|Person=3|PronType=Dem: os / as
Number=Plur|PronType=Dem: os / as

VERB

467 VERB tokens (7% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (467; 100%), Person=EMPTY (467; 100%), Tense=EMPTY (467; 100%), VerbForm=Part (466; 100%), Number=Sing (369; 79%).

VERB tokens may have the following values of Gender:

Paradigm romper (columns: Masc / Fem)
rompido / rompida

NUM

61 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (60; 98%).

NUM tokens may have the following values of Gender:

Paradigm um (columns: Masc / Fem)
Number=Sing: uma
NumType=Card: um / uma

ADP

19 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

AUX

5 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Abbr=EMPTY (5; 100%), Mood=EMPTY (5; 100%), Number=Sing (5; 100%), Person=EMPTY (5; 100%), Tense=EMPTY (5; 100%), VerbForm=Part (5; 100%).

AUX tokens may have the following values of Gender:

PROPN

4 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (4209; 93%), NOUN –[amod]–> ADJ (1625; 70%), NOUN –[list]–> NOUN (260; 51%), VERB –[nsubj:pass]–> NOUN (64; 96%), ADJ –[nsubj]–> NOUN (47; 64%), ADJ –[conj]–> ADJ (25; 56%), PRON –[det]–> DET (14; 78%), PRON –[amod]–> ADJ (12; 57%), ADJ –[det]–> DET (11; 52%), DET –[fixed]–> NOUN (11; 85%).
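A minimal sketch of how agreement between parent and child could be counted is given below. It assumes that only pairs in which both nodes carry Gender enter the denominator, which is an assumption about how the percentages above are computed. The sentence is invented and `agreement_stats` is a hypothetical helper.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, gender_or_None, head, deprel); not real data.
tokens = [
    (1, "DET", "Fem", 2, "det"),
    (2, "NOUN", "Fem", 4, "nsubj"),
    (3, "ADJ", "Masc", 2, "amod"),
    (4, "VERB", None, 0, "root"),
]

def agreement_stats(sentence):
    """Count parent/child pairs that both have Gender, and those that agree."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for tid, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        both[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return agree, both

agree, both = agreement_stats(tokens)
for (parent_upos, rel, child_upos), n in both.items():
    print(f"{parent_upos} -[{rel}]-> {child_upos}: {agree[(parent_upos, rel, child_upos)]} of {n} agree")
```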