Treebank Statistics: UD_Portuguese-PUD: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
13912 tokens (59%) have a non-empty value of Number.
5371 types (91%) occur at least once with a non-empty value of Number.
3632 lemmas (96%) occur at least once with a non-empty value of Number.
The feature is used with 9 part-of-speech tags: NOUN (4600; 20% instances), DET (3540; 15% instances), ADJ (1550; 7% instances), VERB (1473; 6% instances), PROPN (1393; 6% instances), AUX (681; 3% instances), PRON (640; 3% instances), NUM (24; 0% instances), ADP (11; 0% instances).
NOUN
4600 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (2528; 55%).
NOUN tokens may have the following values of Number:
Plur(1340; 29% of non-emptyNumber): anos, pessoas, vezes, estados, meses, ações, dados, partes, terras, diasSing(3260; 71% of non-emptyNumber): vez, guerra, ano, parte, governo, cidade, estado, mundo, acordo, século
DET
3540 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (3220; 91%), Definite=EMPTY (3040; 86%), Gender=Masc (1879; 53%).
DET tokens may have the following values of Number:
Plur(769; 22% of non-emptyNumber): os, as, muitos, várias, outras, muitas, outros, vários, alguns, estesSing(2771; 78% of non-emptyNumber): o, a, um, uma, esta, este, cada, isso, outro, mesmoEMPTY(1): uma
| Paradigm o | Sing | Plur |
|---|---|---|
| Case=Acc|Definite=Def|Person=1 | os | |
| Case=Dat|Definite=Def|Person=1 | os | |
| Definite=Def|Gender=Masc | o | os |
| Definite=Def|Gender=Fem | a | as, os |
| Gender=Masc | o | os |
| Gender=Fem | a | as |
ADJ
1550 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=Masc (843; 54%).
ADJ tokens may have the following values of Number:
Plur(477; 31% of non-emptyNumber): novos, grandes, últimos, mais, Unidos, agrícolas, indígenas, políticos, Olímpicos, americanosSing(1073; 69% of non-emptyNumber): grande, primeira, maior, nova, mais, primeiro, nacional, novo, melhor, segunda
VERB
1473 VERB tokens (72% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Gender=EMPTY (1116; 76%), Person=3 (1063; 72%), Mood=Ind (1042; 71%).
VERB tokens may have the following values of Number:
Plur(423; 29% of non-emptyNumber): têm, estão, incluem, começaram, dizem, tinham, conquistaram, decidiram, fornecem, ocorreramSing(1050; 71% of non-emptyNumber): disse, há, tem, começou, diz, é, está, fez, tornou, devidoEMPTY(559): fazer, ter, partir, incluindo, manter, ajudar, criar, levar, deixar, encontrar
PROPN
1393 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Foreign=EMPTY (1229; 88%), Gender=Masc (826; 59%).
PROPN tokens may have the following values of Number:
Plur(36; 3% of non-emptyNumber): EUA, Alpes, Andes, Balcãs, Kitai, Américas, Antillas, Caribs, Estados, FilipinasSing(1357; 97% of non-emptyNumber): China, Trump, Mediterrâneo, América, the, Austrália, Europa, França, Grécia, Hong
Number seems to be lexical feature of PROPN. 100% lemmas (995) occur only with one value of Number.
AUX
681 AUX tokens (84% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Person=3 (644; 95%), Mood=Ind (602; 88%).
AUX tokens may have the following values of Number:
Plur(215; 32% of non-emptyNumber): foram, são, estão, podem, tinham, eram, estavam, têm, poderiam, serãoSing(466; 68% of non-emptyNumber): é, foi, está, pode, tinha, estava, era, tem, seria, poderiaEMPTY(126): ser, sido, ter, estar, sendo, tendo, tornar, tornado, tornando, começado
PRON
640 PRON tokens (69% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Person=3 (461; 72%), Number[psor]=EMPTY (406; 63%), PronType=EMPTY (394; 62%), Case=EMPTY (389; 61%), Gender=Masc (367; 57%).
PRON tokens may have the following values of Number:
Plur(156; 24% of non-emptyNumber): eles, suas, seus, quais, nós, estes, aqueles, elas, os, vocêsSing(484; 76% of non-emptyNumber): ele, sua, seu, ela, o, eu, qual, isso, isto, loEMPTY(294): que, se, quem, si
NUM
24 NUM tokens (5% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: Gender=EMPTY (21; 88%).
NUM tokens may have the following values of Number:
Plur(18; 75% of non-emptyNumber): milhões, bilhões, bnSing(6; 25% of non-emptyNumber): bilhão, bn, milhão, um, CincoEMPTY(445): dois, um, três, duas, quatro, uma, 10, 3, seis, 1
ADP
11 ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
Plur(2; 18% of non-emptyNumber): Aqueles, nestesSing(9; 82% of non-emptyNumber): a, nessa, nesse, consigo, daquelaEMPTY(3806): de, em, a, para, por, com, como, que, durante, entre
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (3040; 100%),
NOUN –[amod]–> ADJ (1277; 100%),
NOUN –[nmod]–> NOUN (728; 60%),
VERB –[nsubj]–> NOUN (431; 83%),
PROPN –[det]–> DET (355; 99%),
NOUN –[nmod]–> PROPN (249; 71%),
NOUN –[det]–> PRON (232; 100%),
NOUN –[conj]–> NOUN (201; 77%),
VERB –[aux:pass]–> AUX (183; 80%),
VERB –[nsubj]–> PRON (169; 53%).