Treebank Statistics: UD_Portuguese-Porttinari: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
64570 tokens (38%) have a non-empty value of Gender.
9713 types (51%) occur at least once with a non-empty value of Gender.
6649 lemmas (52%) occur at least once with a non-empty value of Gender.
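The token, type, and lemma counts above can be reproduced in spirit from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_words`, `gender_coverage`) and the sample rows are hypothetical, and it assumes that types are compared case-sensitively and that only syntactic words (integer IDs) are counted.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_words(conllu_text):
    """Yield (form, lemma, upos, feats) for every syntactic word."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_coverage(conllu_text, feature="Gender"):
    """Return (with_feature, total) pairs for tokens, types, and lemmas."""
    words = list(read_words(conllu_text))
    marked = [w for w in words if feature in w[3]]
    return {
        "tokens": (len(marked), len(words)),
        "types": (len({w[0] for w in marked}), len({w[0] for w in words})),
        "lemmas": (len({w[1] for w in marked}), len({w[1] for w in words})),
    }


# Hypothetical sample rows: (ID, FORM, LEMMA, UPOS, FEATS).
ROWS = [
    ("1", "a", "o", "DET", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art"),
    ("2", "casa", "casa", "NOUN", "Gender=Fem|Number=Sing"),
    ("3", "nova", "novo", "ADJ", "Gender=Fem|Number=Sing"),
    ("4", "diz", "dizer", "VERB", "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"),
    ("5", "a", "o", "DET", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art"),
]
SAMPLE = "# hypothetical sample\n" + "\n".join(
    "\t".join([i, form, lemma, upos, "_", feats, "_", "_", "_", "_"])
    for i, form, lemma, upos, feats in ROWS
)

coverage = gender_coverage(SAMPLE)
```

On the five-word sample, four tokens carry Gender, but they collapse to three types and three lemmas, which is why the token, type, and lemma percentages on this page differ from one another.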
The feature is used with 7 part-of-speech tags: NOUN (29447; 18% instances), DET (24018; 14% instances), ADJ (5441; 3% instances), PRON (2766; 2% instances), VERB (2267; 1% instances), NUM (569; 0% instances), AUX (62; 0% instances).
NOUN
29447 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21248; 72%).
NOUN tokens may have the following values of Gender:
Fem (14026; 48% of non-empty Gender): pessoas, vez, parte, empresa, casa, cidade, história, empresas, gente, forma
Masc (15421; 52% of non-empty Gender): anos, ano, dia, país, tempo, governo, mercado, caso, mundo, acordo
EMPTY (1955): presidente, polícia, segurança, capital, final, clientes, ex-presidente, local, cara, modelo
Paradigm filho | Masc | Fem |
---|---|---|
Number=Sing | filho | filha |
Number=Plur | filhos | filhas |
Gender seems to be a lexical feature of NOUN. 97% of lemmas (4562) occur with only one value of Gender.
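The "lexical feature" observation rests on counting, per lemma, how many distinct Gender values it occurs with. A minimal sketch of that count follows, assuming the words are already available as `(lemma, upos, feats)` tuples; the function name `single_value_share` and the sample data are hypothetical, and this is not necessarily how the page's own figure was computed.

```python
from collections import defaultdict


def single_value_share(words, upos, feature="Gender"):
    """Count lemmas of one UPOS that occur with exactly one value of `feature`.

    Returns (lemmas_with_single_value, lemmas_with_any_value).
    Lemmas that never carry the feature are left out of both numbers.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in words:
        if tag == upos and feature in feats:
            values_by_lemma[lemma].add(feats[feature])
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Hypothetical sample: 'filho' covers both 'filho' (Masc) and 'filha' (Fem).
WORDS = [
    ("casa", "NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("casa", "NOUN", {"Gender": "Fem", "Number": "Plur"}),
    ("ano", "NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("filho", "NOUN", {"Gender": "Masc", "Number": "Sing"}),
    ("filho", "NOUN", {"Gender": "Fem", "Number": "Sing"}),
    ("presidente", "NOUN", {"Number": "Sing"}),
    ("novo", "ADJ", {"Gender": "Fem", "Number": "Sing"}),
]

single, total = single_value_share(WORDS, "NOUN")
```

Here two of the three gendered NOUN lemmas have a single Gender value; a lemma like filho, which inflects for both values, is the kind of case that makes up the remaining 3% in the treebank.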
DET
24018 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (21173; 88%), Number=Sing (19605; 82%), Definite=Def (18892; 79%).
DET tokens may have the following values of Gender:
Fem (11103; 46% of non-empty Gender): a, as, uma, sua, essa, esta, suas, essas, minha, outras
Masc (12915; 54% of non-empty Gender): o, os, um, seu, esse, este, seus, outros, mesmo, todos
EMPTY (300): cada, mais, qualquer, que, menos, tal, demais, quais, qual, tais
Paradigm o | Masc | Fem |
---|---|---|
Number=Sing | o | a |
Number=Plur | os | as |
ADJ
5441 ADJ tokens (64% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (4518; 83%), Number=Sing (3812; 70%).
ADJ tokens may have the following values of Gender:
Fem (2474; 45% of non-empty Gender): primeira, nova, brasileira, segunda, muitas, última, política, boa, novas, pública
Masc (2967; 55% of non-empty Gender): novo, primeiro, últimos, segundo, muitos, bom, preciso, passado, último, brasileiro
EMPTY (3115): maior, grande, melhor, possível, importante, sociais, difícil, grandes, principal, atual
Paradigm novo | Masc | Fem |
---|---|---|
Number=Sing | novo | nova |
Number=Plur | novos | novas |
PRON
2766 PRON tokens (43% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2181; 79%), Person=3 (1821; 66%), Case=EMPTY (1649; 60%).
PRON tokens may have the following values of Gender:
Fem (635; 23% of non-empty Gender): ela, a, elas, as, essa, la, esta, algumas, outra, outras
Masc (2131; 77% of non-empty Gender): o, ele, isso, eles, os, nada, algo, lo, outro, um
EMPTY (3649): que, se, eu, quem, me, tudo, você, nos, nós, ninguém
Paradigm ele | Masc | Fem |
---|---|---|
Number=Sing | ele | ela |
Number=Plur | eles | elas |
VERB
2267 VERB tokens (13% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (2267; 100%), Person=EMPTY (2267; 100%), Tense=EMPTY (2267; 100%), VerbForm=Part (2267; 100%), Voice=Pass (1728; 76%), Number=Sing (1606; 71%).
VERB tokens may have the following values of Gender:
Fem (783; 35% of non-empty Gender): feita, feitas, realizada, procurada, chamada, criada, seguida, usadas, considerada, dada
Masc (1484; 65% of non-empty Gender): feito, devido, usado, visto, apresentado, chamado, conhecido, preso, recebido, apontado
EMPTY (14965): diz, tem, há, disse, pode, fazer, ter, afirma, deve, teve
Paradigm ter | Masc | Fem |
---|---|---|
Number=Sing | tido | |
Number=Plur\|Voice=Pass | | tidas |
NUM
569 NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (559; 98%).
NUM tokens may have the following values of Gender:
Fem (232; 41% of non-empty Gender): uma, duas, meia
Masc (337; 59% of non-empty Gender): um, dois, meio
EMPTY (2649): três, mil, 20, quatro, 30, 2016, 2018, 12, 15, cinco
Paradigm um | Masc | Fem |
---|---|---|
| | um | uma |
AUX
62 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (62; 100%), Number=Sing (62; 100%), Person=EMPTY (62; 100%), Tense=EMPTY (62; 100%), VerbForm=Part (62; 100%).
AUX tokens may have the following values of Gender:
Masc (62; 100% of non-empty Gender): sido
EMPTY (4744): é, foi, ser, está, são, era, foram, será, estão, estava
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (18905; 93%),
NOUN –[amod]–> ADJ (3748; 62%),
NOUN –[conj]–> NOUN (774; 50%),
VERB –[nsubj:pass]–> NOUN (471; 92%),
ADJ –[nsubj]–> NOUN (225; 54%),
PRON –[amod]–> ADJ (142; 55%),
NUM –[nmod]–> NOUN (122; 56%),
PRON –[nmod]–> NOUN (99; 57%),
PRON –[det]–> DET (75; 63%),
PRON –[nsubj]–> NOUN (49; 67%).
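An agreement table of this shape can be approximated by walking every dependency edge and comparing the Gender of parent and child. The sketch below is illustrative only: the function name `gender_agreement`, the dict-based token layout, and the sample sentences are hypothetical, and the denominator (all instances of a parent, relation, child pattern, including those where one node has no Gender) is an assumption, since the page does not state how its percentages are computed.

```python
from collections import Counter


def gender_agreement(sentences, feature="Gender"):
    """Map (parent_upos, deprel, child_upos) to (agreeing_edges, all_edges).

    An edge agrees when both nodes carry `feature` with the same value.
    ASSUMPTION: edges where either node lacks the feature still count
    toward the total.
    """
    agree, total = Counter(), Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # head 0 is the artificial root, not a real node
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            parent_value = parent["feats"].get(feature)
            if parent_value is not None and parent_value == child["feats"].get(feature):
                agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


def tok(idx, upos, head, deprel, **feats):
    """Build one hypothetical token record."""
    return {"id": idx, "upos": upos, "head": head, "deprel": deprel, "feats": feats}


# Hypothetical sentences: "a casa nova" and "o problema grande".
SENTENCES = [
    [tok(1, "DET", 2, "det", Gender="Fem"),
     tok(2, "NOUN", 0, "root", Gender="Fem"),
     tok(3, "ADJ", 2, "amod", Gender="Fem")],
    [tok(1, "DET", 2, "det", Gender="Masc"),
     tok(2, "NOUN", 0, "root", Gender="Masc"),
     tok(3, "ADJ", 2, "amod")],  # 'grande' carries no Gender
]

agreement = gender_agreement(SENTENCES)
```

In the sample, both det edges agree, while only one of the two amod edges does, because an adjective such as grande has an empty Gender and therefore cannot agree.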