Treebank Statistics: UD_Portuguese-GSD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
60313 tokens (19%) have a non-empty value of Number.
9505 types (31%) occur at least once with a non-empty value of Number.
7576 lemmas (53%) occur at least once with a non-empty value of Number.
The feature is used with 12 part-of-speech tags (token count; share of all tokens in the treebank): DET (38837; 12%), NOUN (8351; 3%), PROPN (5452; 2%), VERB (3154; 1%), ADJ (2077; 1%), PRON (1562; 0%), AUX (847; 0%), ADV (10; 0%), NUM (8; 0%), ADP (7; 0%), X (6; 0%), CCONJ (2; 0%).
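The overview counts above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `feature_overview` are hypothetical, the tiny `SAMPLE` sentence is invented for illustration, and treating word forms case-sensitively as "types" is an assumption.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_overview(conllu_text, feature="Number"):
    """Count tokens, word forms, lemmas and UPOS tags that carry `feature`."""
    total = 0
    with_feature = 0
    forms, lemmas = set(), set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if feature in parse_feats(cols[5]):
            with_feature += 1
            forms.add(cols[1])    # assumption: types are case-sensitive forms
            lemmas.add(cols[2])
            by_upos[cols[3]] += 1
    return {
        "tokens": total,
        "tokens_with_feature": with_feature,
        "forms": len(forms),
        "lemmas": len(lemmas),
        "by_upos": by_upos,
    }


# Invented four-word example; columns are joined with tabs as CoNLL-U requires.
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "Os", "o", "DET", "_",
     "Definite=Def|Gender=Masc|Number=Plur|PronType=Art", "2", "det", "_", "_"),
    ("2", "anos", "ano", "NOUN", "_",
     "Gender=Masc|Number=Plur", "3", "nsubj", "_", "_"),
    ("3", "passam", "passar", "VERB", "_",
     "Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin", "0", "root", "_", "_"),
    ("4", "devagarinho", "devagarinho", "ADV", "_", "_", "3", "advmod", "_", "_"),
])

overview = feature_overview(SAMPLE)
print(overview["tokens_with_feature"], "of", overview["tokens"], "tokens have Number")
```

Dividing `by_upos[tag]` by `tokens` gives the per-tag share of all tokens, and dividing by the total count for that tag gives the per-tag coverage reported in the sections below.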
DET
38837 DET tokens (82% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (38013; 98%), Definite=Def (37476; 96%), Gender=Masc (21023; 54%).
DET tokens may have the following values of Number:
Plur (5302; 14% of non-empty Number): as, os, seus, outros, suas, alguns, todos, todas, outras, vários
Sing (33535; 86% of non-empty Number): o, a, um, uma, sua, seu, esta, este, essa, esse
EMPTY (8765): os, um, uma, sua, seu, o, seus, cada, a, suas
| Paradigm o | Sing | Plur |
|---|---|---|
| Definite=Def\|Gender=Masc\|PronType=Art | o, a | os |
| Definite=Def\|Gender=Fem\|PronType=Art | a | as |
| Gender=Masc\|PronType=Art | o | os |
| Gender=Masc\|PronType=Dem | o | |
| Gender=Fem\|PronType=Art | a | as |
NOUN
8351 NOUN tokens (15% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (4644; 56%).
NOUN tokens may have the following values of Number:
Plur (2046; 25% of non-empty Number): anos, km, dias, pessoas, empresas, países, vezes, meses, pontos, minutos
Sing (6305; 75% of non-empty Number): feira, dia, ano, estado, presidente, acordo, país, governo, tempo, área
EMPTY (48239): anos, ano, dia, r, pessoas, presidente, cidade, acordo, governo, parte
| Paradigm ano | Sing | Plur |
|---|---|---|
| _ | ano | anos |
PROPN
5452 PROPN tokens (17% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur (75; 1% of non-empty Number): Estados, EUA, Jogos, Beatles, Set, APPs, Abid, Aflitos, Agentes, Ancestrais
Sing (5377; 99% of non-empty Number): the, Paulo, Brasil, São, Federal, of, &, Sul, Rio, Santos
EMPTY (26827): feira, Brasil, São, Paulo, rio, Nacional, Estado, janeiro, Federal, quinta
| Paradigm R | Sing | Plur |
|---|---|---|
| _ | R | R |
Number seems to be a lexical feature of PROPN: 100% of lemmas (3386) occur with only one value of Number.
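The "lexical feature" observation rests on checking, per lemma, how many distinct values of Number it occurs with. A minimal sketch of that check is shown here; the function name `single_value_lemmas` and the toy token list are hypothetical, and the input is assumed to be already reduced to (lemma, UPOS, Number value) triples.

```python
from collections import defaultdict


def single_value_lemmas(tokens, upos):
    """Return (lemmas with exactly one Number value, all lemmas with Number).

    `tokens` is an iterable of (lemma, upos, number) triples, where `number`
    is None when the token has no Number feature.
    """
    values = defaultdict(set)
    for lemma, tag, number in tokens:
        if tag == upos and number is not None:
            values[lemma].add(number)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Invented toy data, not taken from the treebank.
toy_tokens = [
    ("Brasil", "PROPN", "Sing"),
    ("Brasil", "PROPN", "Sing"),
    ("Estados", "PROPN", "Plur"),
    ("Paulo", "PROPN", None),      # no Number: ignored
    ("ano", "NOUN", "Sing"),
    ("ano", "NOUN", "Plur"),
]

single, total = single_value_lemmas(toy_tokens, "PROPN")
print(f"{single} of {total} PROPN lemmas occur with only one value of Number")
```

When the two returned counts are equal, every lemma of that tag is tied to a single value, which is the situation reported for PROPN above.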
VERB
3154 VERB tokens (11% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (2258; 72%).
VERB tokens may have the following values of Number:
Plur (729; 23% of non-empty Number): cobertos, podem, começaram, passam, têm, voltaram, considerados, continuam, devem, ficaram
Sing (2425; 77% of non-empty Number): disse, tem, acabou, chegou, começou, tornou, passou, afirmou, pode, voltou
EMPTY (24819): é, tem, disse, está, há, fazer, foi, estão, afirmou, partir
| Paradigm ter | Sing | Plur |
|---|---|---|
| ExtPos=AUX\|Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | Temos | |
| ExtPos=AUX\|Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | terá | |
| ExtPos=AUX\|Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | teve | |
| ExtPos=AUX\|Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | tem | |
| Gender=Masc\|VerbForm=Part | tido | |
| Mood=Cnd\|Person=3\|VerbForm=Fin | teria | |
| Mood=Ind\|Person=1\|Tense=Fut\|VerbForm=Fin | teremos | |
| Mood=Ind\|Person=1\|Tense=Imp\|VerbForm=Fin | tínhamos | |
| Mood=Ind\|Person=1\|Tense=Past\|VerbForm=Fin | tive | tivemos |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | tenho | Temos |
| Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | terá | |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | tinha | tinham |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | teve | tiveram |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | tem | têm |
| Mood=Ind\|Person=3\|VerbForm=Fin | tiveram | |
| Mood=Sub\|Person=1\|Tense=Fut\|VerbForm=Fin | tivermos | |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | tenha | tenham |
ADJ
2077 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (1155; 56%).
ADJ tokens may have the following values of Number:
Plur (477; 23% of non-empty Number): últimos, novos, principais, novas, primeiros, diferentes, maiores, pequenos, tropicais, anteriores
Sing (1600; 77% of non-empty Number): maior, grande, primeira, ex, primeiro, novo, segunda, última, último, melhor
EMPTY (12962): maior, grande, primeiro, primeira, novo, segundo, última, segunda, mesmo, nova
| Paradigm primeiro | Sing | Plur |
|---|---|---|
| Gender=Masc | primeiro | primeiros |
| Gender=Fem | primeira | primeiras |
PRON
1562 PRON tokens (20% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Gender=Masc (1103; 71%).
PRON tokens may have the following values of Number:
Plur (334; 21% of non-empty Number): que, se, eles, os, elas, quais, nós, outros, todos, los
Sing (1228; 79% of non-empty Number): que, se, o, ele, ela, isso, onde, a, quem, eu
EMPTY (6158): que, se, ele, isso, o, eu, um, ela, eles, quem
| Paradigm que | Sing | Plur |
|---|---|---|
| Gender=Masc\|PronType=Dem | que | |
| Gender=Masc\|PronType=Ind | que | |
| Gender=Masc\|PronType=Int | que | |
| Gender=Masc\|PronType=Rel | que | que |
| Gender=Fem\|PronType=Rel | que | que |
AUX
847 AUX tokens (12% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (821; 97%), Person=3 (818; 97%), Mood=Ind (751; 89%).
AUX tokens may have the following values of Number:
Plur (207; 24% of non-empty Number): foram, são, estão, serão, estavam, serem, eram, têm, vamos, irão
Sing (640; 76% of non-empty Number): é, foi, está, era, será, vai, estava, havia, tem, ter
EMPTY (6089): é, foi, ser, foram, será, são, vai, pode, sendo, deve
| Paradigm ser | Sing | Plur |
|---|---|---|
| _ | ser | |
| Gender=Masc\|VerbForm=Part | sido | |
| Mood=Cnd\|Person=3\|VerbForm=Fin | seria | seriam |
| Mood=Ind\|Person=1\|Tense=Past\|VerbForm=Fin | fui | fomos |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | Sou | |
| Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | será | serão |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | era | eram |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | foi | foram |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | é | são |
| Mood=Ind\|Person=3\|VerbForm=Fin | foram | |
| Mood=Sub\|Person=3\|Tense=Fut\|VerbForm=Fin | for | forem |
| Mood=Sub\|Person=3\|Tense=Imp\|VerbForm=Fin | fosse | fossem |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | seja | sejam |
| Person=3\|VerbForm=Inf | ser | serem |
ADV
10 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: Polarity=EMPTY (10; 100%).
ADV tokens may have the following values of Number:
Plur (2; 20% of non-empty Number): juntos
Sing (8; 80% of non-empty Number): Mal, Nada, caro, devagarinho, entanto, independente, pouco, quanto
EMPTY (9760): não, mais, também, já, ainda, muito, depois, onde, além, apenas
NUM
8 NUM tokens (0% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (6; 75%).
NUM tokens may have the following values of Number:
Plur (1; 13% of non-empty Number): centenas
Sing (7; 88% of non-empty Number): 2012, 3, 470, cento, cem, sessenta
EMPTY (8504): dois, três, mil, duas, milhões, um, 1, quatro, 2012, 2
ADP
7 ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
Sing (7; 100% of non-empty Number): in, Contra, Pra, at, que
EMPTY (51219): de, em, a, para, por, com, como, entre, sobre, até
X
6 X tokens (1% of all X tokens) have a non-empty value of Number.
X tokens may have the following values of Number:
Sing (6; 100% of non-empty Number): \epsilon=\epsilon_{0}, \kappa, center, market, on, spin
EMPTY (397): disso, deles, delas, dele, do, +, etc, @, comigo, nele
CCONJ
2 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Number.
CCONJ tokens may have the following values of Number:
Sing (2; 100% of non-empty Number): &, E
EMPTY (10424): e, que, mas, ou, se, quando, como, porque, enquanto, pois
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
PROPN –[flat:name]–> PROPN (1877; 98%),
NOUN –[amod]–> ADJ (1662; 98%),
NOUN –[nmod]–> NOUN (1481; 66%),
VERB –[obl]–> NOUN (824; 53%),
VERB –[nsubj]–> NOUN (744; 91%),
NOUN –[nmod]–> PROPN (567; 77%),
VERB –[nsubj]–> PRON (484; 92%),
PROPN –[conj]–> PROPN (460; 98%),
PROPN –[nmod]–> PROPN (443; 98%),
NOUN –[appos]–> PROPN (392; 88%).
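
Agreement figures of this kind can be approximated by walking every parent-child pair in the dependency trees. The sketch below is an illustration only: the function name `agreement_by_relation` and the toy sentences are hypothetical, and it assumes that the percentage is taken over pairs in which both nodes have a non-empty Number, which the page itself does not state.

```python
from collections import Counter


def agreement_by_relation(sentences):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, agreement rate).

    Each sentence is a list of dicts with keys id, head, upos, deprel, number;
    `number` is None when the token has no Number feature.
    """
    agree = Counter()
    both = Counter()
    for sentence in sentences:
        by_id = {tok["id"]: tok for tok in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])  # None for the root (head 0)
            if parent is None:
                continue
            # Assumption: only pairs where both nodes carry Number are counted.
            if child["number"] is None or parent["number"] is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if child["number"] == parent["number"]:
                agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}


def tok(id_, head, upos, deprel, number):
    """Small helper to build a toy token."""
    return {"id": id_, "head": head, "upos": upos, "deprel": deprel, "number": number}


# Invented toy trees: the first agrees on nsubj, the second does not.
toy_sentences = [
    [tok(1, 2, "DET", "det", "Plur"), tok(2, 3, "NOUN", "nsubj", "Plur"),
     tok(3, 0, "VERB", "root", "Plur")],
    [tok(1, 2, "NOUN", "nsubj", "Sing"), tok(2, 0, "VERB", "root", "Plur")],
]

for (parent, rel, child), (count, rate) in agreement_by_relation(toy_sentences).items():
    print(f"{parent} -[{rel}]-> {child} ({count}; {rate:.0%})")
```

Sorting the result by the agreeing-pair count and keeping the first ten entries would give a list in the same shape as the one above.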