home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-PUD: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

12281 tokens (52%) have a non-empty value of Gender. 4740 types (80%) occur at least once with a non-empty value of Gender. 3266 lemmas (86%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (4598; 20% instances), DET (3537; 15% instances), ADJ (1550; 7% instances), PROPN (1393; 6% instances), PRON (550; 2% instances), VERB (357; 2% instances), NUM (274; 1% instances), ADP (11; 0% instances), AUX (11; 0% instances).

NOUN

4598 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3258; 71%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 98% lemmas (1653) occur only with one value of Gender.

DET

3537 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2771; 78%), PronType=Art (2730; 77%).

DET tokens may have the following values of Gender:

Paradigm oMascFem
Definite=Def|Number=Sing|PronType=Artoa
Number=Singoa
Number=Sing|PronType=Artoa
Number=Plurosas, os
Number=Plur|PronType=Artosas

ADJ

1550 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1073; 69%).

ADJ tokens may have the following values of Gender:

PROPN

1393 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1357; 97%), Foreign=EMPTY (1229; 88%).

PROPN tokens may have the following values of Gender:

Paradigm TrumpMascFem
TrumpTrump

Gender seems to be lexical feature of PROPN. 97% lemmas (970) occur only with one value of Gender.

PRON

550 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (444; 81%), Number=Sing (422; 77%), Case=EMPTY (360; 65%), Number[psor]=EMPTY (316; 57%), PronType=EMPTY (305; 55%).

PRON tokens may have the following values of Gender:

VERB

357 VERB tokens (18% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (357; 100%), Person=EMPTY (357; 100%), Tense=EMPTY (357; 100%), Number=Sing (235; 66%).

VERB tokens may have the following values of Gender:

NUM

274 NUM tokens (58% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

ADP

11 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

AUX

11 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (11; 100%), Person=EMPTY (11; 100%), Tense=EMPTY (11; 100%), Number=Sing (8; 73%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (3044; 100%), NOUN –[amod]–> ADJ (1282; 100%), NOUN –[nmod]–> NOUN (629; 51%), PROPN –[det]–> DET (358; 99%), NOUN –[det]–> PRON (232; 100%), NOUN –[nmod]–> PROPN (190; 54%), NOUN –[conj]–> NOUN (159; 61%), PROPN –[flat]–> PROPN (158; 100%), PROPN –[flat:name]–> PROPN (152; 100%), NOUN –[appos]–> PROPN (140; 92%).