Treebank Statistics: UD_Portuguese-DANTEStocks: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
21332 tokens (26%) have a non-empty value of Gender.
3425 types (32%) occur at least once with a non-empty value of Gender.
2166 lemmas (25%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: NOUN (11577; 14% instances), DET (6615; 8% instances), ADJ (2104; 3% instances), PRON (478; 1% instances), VERB (467; 1% instances), NUM (63; 0% instances), ADP (19; 0% instances), AUX (5; 0% instances), PROPN (4; 0% instances).
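Counts like the per-tag figures above can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is hand-made for illustration, and the helper names (`parse_feats`, `iter_tokens`, `gender_by_upos`) are hypothetical. It assumes the standard 10-column CoNLL-U layout and skips multiword-token ranges and empty nodes.

```python
from collections import Counter

# Hand-made CoNLL-U fragment for illustration only (not taken from the treebank).
SAMPLE = """\
# text = a ação subiu
1\ta\to\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_
2\tação\tação\tNOUN\t_\tGender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tsubiu\tsubir\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin\t0\troot\t_\t_
"""

def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def iter_tokens(text):
    """Yield the 10 columns of every regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols

def gender_by_upos(text):
    """Return (tokens per UPOS, tokens per UPOS with a non-empty Gender)."""
    total, with_gender = Counter(), Counter()
    for cols in iter_tokens(text):
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return total, with_gender

total, with_gender = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in the sections below, assuming the page counts syntactic words rather than multiword-token ranges.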
NOUN
11577 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8555; 74%).
NOUN tokens may have the following values of Gender:
Fem (5604; 48% of non-empty Gender): ações, indicação, venda, compra, MM21, alta, 16h, resistências, semana, queda
Masc (5973; 52% of non-empty Gender): gráfico, Rastreamento, dia, ativo, vol, suportes, preço, intraday, volume, fundo
EMPTY (424): trimestre, capital, acionistas, analistas, presidente, recorde, cara, final, pessoal, acionista
| Paradigm ação | Masc | Fem |
|---|---|---|
| Number=Sing | ação | |
| Number=Sing\|Typo=Yes | acao | |
| Number=Plur | ações | ações, açõ, açõe |
| Number=Plur\|Typo=Yes | acoes, açõe, açoes |
Gender seems to be a lexical feature of NOUN: 98% of lemmas (1627) occur with only one value of Gender.
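The "lexical feature" check can be sketched as follows: group the observed Gender values by lemma and count how many lemmas have exactly one. This is an illustrative sketch under assumptions, not the page's own computation. The function name `single_gender_share` and the sample pairs are hypothetical, and the denominator is assumed to be the lemmas that occur at least once with a non-empty Gender.

```python
from collections import defaultdict

def single_gender_share(pairs):
    """pairs: iterable of (lemma, gender), where gender is None when empty.

    Returns (lemmas with exactly one Gender value,
             lemmas with any Gender value,
             share of the former among the latter).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens with an empty Gender
            values[lemma].add(gender)
    if not values:
        return 0, 0, 0.0
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values), single / len(values)

# Hypothetical observations for illustration.
pairs = [
    ("ação", "Fem"), ("ação", "Masc"),   # lemma seen with both values
    ("preço", "Masc"),
    ("venda", "Fem"), ("venda", "Fem"),
    ("capital", None),                    # empty Gender, not counted
]
single, total, share = single_gender_share(pairs)
print(single, total, round(share, 2))
```

A share close to 1 suggests that Gender is fixed per lemma (lexical); a much lower share suggests that it is inflectional, as with determiners and adjectives.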
DET
6615 DET tokens (98% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (5897; 89%), Number=Sing (5676; 86%), Definite=Def (5515; 83%).
DET tokens may have the following values of Gender:
Fem (3799; 57% of non-empty Gender): a, as, uma, sua, essa, esta, suas, minha, alguma, nossa
Masc (2816; 43% of non-empty Gender): o, os, um, esse, este, seu, meu, mesmo, outros, todo
EMPTY (106): que, quais, mais, qual, cada, qualquer, tal, qq, demais, menos
| Paradigm o | Masc | Fem |
|---|---|---|
| Definite=Def\|Number=Sing\|PronType=Art | o | a |
| Definite=Def\|Number=Sing\|PronType=Art\|Typo=Yes | e | |
| Definite=Def\|Number=Plur\|PronType=Art | os | as |
| Number=Sing\|PronType=Art\|Typo=Yes | s | |
| Number=Sing\|PronType=Dem | o | a |
| Number=Plur\|PronType=Dem | os | as |
ADJ
2104 ADJ tokens (72% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (1701; 81%).
ADJ tokens may have the following values of Gender:
Fem (960; 46% of non-empty Gender): nova, última, financeiras, históricas, boa, passada, técnica, boas, linda, exclusiva
Masc (1144; 54% of non-empty Gender): diário, financeiro, bom, 1º, novo, últimos, cons, primeiro, líquido, próximo
EMPTY (813): superior, maiores, semanal, maior, relevante, individual, gerais, forte, melhor, interessante
| Paradigm diário | Masc | Fem |
|---|---|---|
| Number=Sing | diário | diária |
| Number=Sing\|Typo=Yes | diári, diário | |
| Number=Plur | diários | |
PRON
478 PRON tokens (37% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (420; 88%), Case=EMPTY (395; 83%), Person=EMPTY (287; 60%), PronType=Dem (242; 51%).
PRON tokens may have the following values of Gender:
Fem (139; 29% of non-empty Gender): ela, a, essa, uma, as, outras, outra, todas, la, elas
Masc (339; 71% of non-empty Gender): o, isso, nada, os, alguém, ele, algo, todos, esse, isto
EMPTY (821): que, se, quem, eu, q, me, vc, tudo, qual, você
| Paradigm o | Masc | Fem |
|---|---|---|
| Case=Acc\|Number=Sing\|Person=3\|PronType=Prs | | a |
| Definite=Def\|Number=Sing\|PronType=Art | o | a |
| Definite=Def\|Number=Sing\|PronType=Dem | o | |
| Definite=Def\|Number=Plur\|PronType=Art | os | as |
| ExtPos=PRON\|Number=Sing\|Person=3\|PronType=Dem | o | a |
| ExtPos=PRON\|Number=Sing\|Person=3\|PronType=Dem\|Typo=Yes | mo | |
| ExtPos=PRON\|Number=Sing\|PronType=Dem | o | |
| ExtPos=PRON\|Number=Sing\|PronType=Int | o | |
| Number=Sing\|Person=3\|PronType=Dem | o | a |
| Number=Sing\|PronType=Art | | a |
| Number=Sing\|PronType=Dem | o | a |
| Number=Plur\|Person=3\|PronType=Dem | os | as |
| Number=Plur\|PronType=Dem | os | as |
VERB
467 VERB tokens (7% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (467; 100%), Person=EMPTY (467; 100%), Tense=EMPTY (467; 100%), VerbForm=Part (466; 100%), Number=Sing (369; 79%).
VERB tokens may have the following values of Gender:
Fem (172; 37% of non-empty Gender): ajustadas, administrada, capitalizada, coberta, divulgada, instalada, realizada, alugadas, controladas, feita
Masc (295; 63% of non-empty Gender): indicado, comprado, postado, vendido, feito, cancelado, negociado, ajustado, esperado, exercido
EMPTY (6119): romper, analise, confira, resultou, tem, comprar, ver, sobe, fechou, pode
| Paradigm romper | Masc | Fem |
|---|---|---|
| | rompido | rompida |
NUM
63 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (62; 98%).
NUM tokens may have the following values of Gender:
Fem (29; 46% of non-empty Gender): uma, duas, 2
Masc (34; 54% of non-empty Gender): um, dois
EMPTY (4960): 13, 5, 3, 10, 15, 2, 1, 4, 31/12/2013, 6
| Paradigm um | Masc | Fem |
|---|---|---|
| Number=Sing | | uma |
| NumType=Card | um | uma |
ADP
19 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (19; 100% of non-empty Gender): a, as
EMPTY (8741): de, em, a, com, para, por, c/, pra, sobre, até
AUX
5 AUX tokens (0% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Abbr=EMPTY (5; 100%), Mood=EMPTY (5; 100%), Number=Sing (5; 100%), Person=EMPTY (5; 100%), Tense=EMPTY (5; 100%), VerbForm=Part (5; 100%).
AUX tokens may have the following values of Gender:
Masc (5; 100% of non-empty Gender): sido
EMPTY (1327): é, vai, está, tá, foi, foram, será, ser, estão, ta
PROPN
4 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (4; 100% of non-empty Gender): #PETR4, ITUB4, República, siderurgia
EMPTY (11758): petr4, #petr4, vale5, petrobras, #vale5, vale, @live_trade, petr3, bbas3, oibr4
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (4209; 93%),
NOUN –[amod]–> ADJ (1625; 70%),
NOUN –[list]–> NOUN (260; 51%),
VERB –[nsubj:pass]–> NOUN (64; 96%),
ADJ –[nsubj]–> NOUN (47; 64%),
ADJ –[conj]–> ADJ (25; 56%),
PRON –[det]–> DET (14; 78%),
PRON –[amod]–> ADJ (12; 57%),
ADJ –[det]–> DET (11; 55%),
DET –[fixed]–> NOUN (11; 85%).
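
Agreement figures of this kind can be approximated by walking each dependency edge and comparing the Gender of parent and child. The sketch below is an illustration under assumptions rather than the page's actual method: tokens are represented as plain dictionaries with hypothetical keys (`id`, `upos`, `gender`, `head`, `deprel`), and an edge is assumed to count toward the percentage only when both nodes have a non-empty Gender.

```python
from collections import Counter

def agreement_counts(sentence):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    sentence: list of dicts with keys id, upos, gender (or None), head, deprel.
    Returns (agree, both): edges where the values match, and edges where
    both parent and child have a Gender value at all.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # head 0 (root) has no parent token
        if parent is None:
            continue
        parent_gender, child_gender = parent["gender"], child["gender"]
        if parent_gender is None or child_gender is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_gender == child_gender:
            agree[key] += 1
    return agree, both

# Hypothetical sentence with one agreeing and one disagreeing edge.
sentence = [
    {"id": 1, "upos": "DET", "gender": "Fem", "head": 2, "deprel": "det"},
    {"id": 2, "upos": "NOUN", "gender": "Fem", "head": 0, "deprel": "root"},
    {"id": 3, "upos": "ADJ", "gender": "Masc", "head": 2, "deprel": "amod"},
]
agree, both = agreement_counts(sentence)
for key in both:
    print(key, agree[key], "of", both[key])
```

Summing the counters over all sentences and sorting by `agree` would give a ranking comparable to the list above, provided the assumption about the denominator holds.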