Treebank Statistics: UD_Portuguese-PetroGold: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
148337 tokens (59%) have a non-empty value of Number.
14142 types (93%) occur at least once with a non-empty value of Number.
9311 lemmas (89%) occur at least once with a non-empty value of Number.
The feature is used with 10 part-of-speech tags: NOUN (57548; 23% instances), DET (36327; 14% instances), ADJ (17069; 7% instances), VERB (16245; 6% instances), PROPN (11928; 5% instances), AUX (5434; 2% instances), PRON (3514; 1% instances), ADV (215; 0% instances), NUM (56; 0% instances), X (1; 0% instances).
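Per-tag counts of this kind can be recomputed from the treebank's CoNLL-U file. The sketch below is a minimal illustration, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `number_by_upos`) and the four-token `SAMPLE` are hypothetical, and it assumes the standard ten-column CoNLL-U layout with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for every syntactic word."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def number_by_upos(conllu_text):
    """Map each UPOS tag to (tokens with Number, all tokens)."""
    total = Counter()
    with_number = Counter()
    for _form, _lemma, upos, feats in read_tokens(conllu_text):
        total[upos] += 1
        if "Number" in feats:
            with_number[upos] += 1
    return {upos: (with_number[upos], total[upos]) for upos in total}


# Made-up four-token sentence, for illustration only (not taken from the treebank).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "os", "o", "DET", "_",
     "Definite=Def|Gender=Masc|Number=Plur|PronType=Art", "2", "det", "_", "_"),
    ("2", "fluidos", "fluido", "NOUN", "_",
     "Gender=Masc|Number=Plur", "3", "nsubj", "_", "_"),
    ("3", "podem", "poder", "VERB", "_",
     "Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin", "0", "root", "_", "_"),
    ("4", "aumentar", "aumentar", "VERB", "_",
     "VerbForm=Inf", "3", "xcomp", "_", "_"),
])

print(number_by_upos(SAMPLE))
```

On the toy sample, VERB comes out as 1 of 2 tokens because the infinitive carries no Number, which mirrors the EMPTY rows reported for VERB on this page.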
NOUN
57548 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Masc (28798; 50%).
NOUN tokens may have the following values of Number:
Plur (16036; 28% of non-empty Number): fluidos, dados, resultados, valores, fácies, propriedades, emissões, custos, poços, características
Sing (41512; 72% of non-empty Number): óleo, água, figura, fluido, petróleo, gás, produção, área, argila, processo
EMPTY (12): place, ,, cima, cm, d’água, e, hk, Å
| Paradigm óleo | Sing | Plur |
|---|---|---|
| _ | Óleo | |
| Gender=Masc | óleo | óleos |
| Gender=Fem | óleo | |
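A paradigm table like the one above groups the forms of one lemma by their remaining features (rows) and by the value of Number (columns). The following is a rough sketch of that grouping under stated assumptions: `paradigm` is a hypothetical helper, the input tokens are made up, and the row label is built by joining the other features in plain sorted order, which may differ in detail from the ordering used to produce this page.

```python
from collections import defaultdict


def paradigm(tokens, lemma, feature="Number"):
    """Group the forms of `lemma` by their other features and by `feature`.

    tokens: iterable of (form, lemma, feats_dict) triples.
    Returns {row_label: {feature_value: sorted list of forms}}.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        # Row label: every feature except the one under study; "_" if none remain.
        others = "|".join(
            f"{name}={value}" for name, value in sorted(feats.items()) if name != feature
        ) or "_"
        table[others][feats[feature]].add(form)
    return {
        row: {value: sorted(forms) for value, forms in columns.items()}
        for row, columns in table.items()
    }


# Made-up tokens for illustration only.
tokens = [
    ("óleo", "óleo", {"Gender": "Masc", "Number": "Sing"}),
    ("óleos", "óleo", {"Gender": "Masc", "Number": "Plur"}),
    ("Óleo", "óleo", {"Number": "Sing"}),
    ("água", "água", {"Gender": "Fem", "Number": "Sing"}),
]
print(paradigm(tokens, "óleo"))
```

A row with a value in only one column, such as `_` here, corresponds to a feature combination that was attested with a single value of Number.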
DET
36327 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: PronType=Art (31749; 87%), Definite=Def (29007; 80%), Gender=Fem (18194; 50%).
DET tokens may have the following values of Number:
Plur (8215; 23% of non-empty Number): os, as, estes, estas, suas, esses, todos, tais, essas, outros
Sing (28112; 77% of non-empty Number): a, o, um, uma, este, esta, sua, esse, cada, seu
| Paradigm o | Sing | Plur |
|---|---|---|
| Definite=Def\|Gender=Masc\|PronType=Art | o | os |
| Definite=Def\|Gender=Fem\|PronType=Art | a, á | as, A |
| Gender=Masc | o | |
| Gender=Masc\|PronType=Art | | os |
ADJ
17069 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Gender=Fem (8545; 50%).
ADJ tokens may have the following values of Number:
Plur (5977; 35% of non-empty Number): diferentes, principais, grandes, maiores, presentes, magnéticos, magnéticas, sedimentares, químicos, altas
Sing (11092; 65% of non-empty Number): maior, grande, menor, possível, magnético, total, natural, magnética, presente, necessário
EMPTY (11): subsea, primeira, próximo
| Paradigm maior | Sing | Plur |
|---|---|---|
| Gender=Masc | maior | maiores |
| Gender=Fem | maior | maiores |
VERB
16245 VERB tokens (80% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Voice=EMPTY (12244; 75%), Tense=EMPTY (8939; 55%), Mood=EMPTY (8854; 55%), Person=EMPTY (8784; 54%), VerbForm=Part (8776; 54%).
VERB tokens may have the following values of Number:
Plur (5911; 36% of non-empty Number): podem, apresentam, utilizados, obtidos, apresentados, possuem, realizados, associados, ocorrem, preparados
Sing (10334; 64% of non-empty Number): pode, devido, apresenta, utilizado, tem, deve, mostra, ocorre, possui, seja
EMPTY (4112): partir, utilizando, observar, seguir, aumentar, podendo, obter, formando, apresentar, contendo
| Paradigm poder | Sing | Plur |
|---|---|---|
| Mood=Cnd\|Person=3\|VerbForm=Fin | poderia | poderiam |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | | podemos |
| Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | poderá | poderão |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | | podiam |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | pôde | puderam |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | pode | podem |
| Mood=Sub\|Person=3\|Tense=Imp\|VerbForm=Fin | pudesse | pudessem |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | possa | possam |
| Person=1\|VerbForm=Inf | | podermos |
| Person=3\|VerbForm=Inf | | poderem |
PROPN
11928 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur (219; 2% of non-empty Number): RCEs, GPM, estados, ARGILAS, Formações, MW, Barras, Camadas, Campos, Cartas
Sing (11709; 98% of non-empty Number): et, al., CO2, Bacia, Cabo, Frio, Santos, Campos, &, grande
EMPTY (69): ., ,, -, /, Quiricó, +, E, ;, =, cm2/g=
| Paradigm Santos | Sing | Plur |
|---|---|---|
| _ | Santos | |
| Gender=Masc | Santos, SANTOS | Santos |
Number seems to be a lexical feature of PROPN: 98% of lemmas (3116) occur with only one value of Number.
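The lexical-feature observation rests on a simple count: for each lemma of a given tag, collect the distinct values of Number it occurs with and see how many lemmas have exactly one. A minimal sketch of that count follows; `single_value_share` is a hypothetical helper and the observations are made up. The threshold above which a feature is reported as lexical is not stated on this page, so none is assumed here.

```python
from collections import defaultdict


def single_value_share(observations):
    """Count lemmas that occur with exactly one value of the feature.

    observations: iterable of (lemma, value) pairs for one UPOS tag,
    restricted to tokens whose value is non-empty.
    Returns (number of single-valued lemmas, their share of all lemmas).
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, single / len(values_by_lemma)


# Made-up observations for illustration only.
observations = [
    ("Santos", "Sing"),
    ("Santos", "Plur"),
    ("Bacia", "Sing"),
    ("Cabo", "Sing"),
    ("Cabo", "Sing"),
]
count, share = single_value_share(observations)
print(count, round(share, 2))
```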
AUX
5434 AUX tokens (83% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Person=3 (5428; 100%), VerbForm=Fin (5342; 98%), Mood=Ind (5079; 93%), Tense=Pres (3483; 64%).
AUX tokens may have the following values of Number:
Plur (2032; 37% of non-empty Number): são, foram, estão, serão, serem, eram, sejam, têm, seriam, estejam
Sing (3402; 63% of non-empty Number): é, foi, está, será, era, seja, seria, tem, for, irá
EMPTY (1140): ser, sendo, estar, sido, ter, tendo, estando, é, e, estado
| Paradigm ser | Sing | Plur |
|---|---|---|
| ExtPos=SCONJ\|Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | é | |
| Gender=Masc\|VerbForm=Part | sido | |
| Mood=Cnd\|Person=3\|VerbForm=Fin | seria | seriam |
| Mood=Ind\|Person=3\|Tense=Fut\|VerbForm=Fin | será | serão |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | era | eram |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | foi | |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | é, È, Ǻ | são |
| Mood=Ind\|Person=3\|VerbForm=Fin | | foram |
| Mood=Sub\|Person=3\|Tense=Fut\|VerbForm=Fin | for, sera | forem |
| Mood=Sub\|Person=3\|Tense=Imp\|VerbForm=Fin | fosse | fossem |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | seja | sejam |
| Person=3\|VerbForm=Inf | ser | serem |
| Tense=Pres\|VerbForm=Fin | é | |
| VerbForm=Part | sido | |
PRON
3514 PRON tokens (65% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Gender=Masc (2289; 65%), PronType=Rel (1989; 57%).
PRON tokens may have the following values of Number:
Plur (1116; 32% of non-empty Number): que, eles, estes, elas, os, quais, outros, as, estas, aqueles
Sing (2398; 68% of non-empty Number): que, o, isso, a, isto, este, qual, um, uma, esta
EMPTY (1886): se, um
| Paradigm que | Sing | Plur |
|---|---|---|
| Gender=Masc | que | que |
| Gender=Fem | que | que |
| que |
ADV
215 ADV tokens (3% of all ADV tokens) have a non-empty value of Number.
ADV tokens may have the following values of Number:
Plur (44; 20% of non-empty Number): onde
Sing (171; 80% of non-empty Number): onde, Antes, SIM, melhor
EMPTY (6224): mais, não, também, através, já, muito, assim, bem, ainda, além
| Paradigm onde | Sing | Plur |
|---|---|---|
| Gender=Masc | onde | onde |
| Gender=Fem | onde | onde |
NUM
56 NUM tokens (1% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (36; 64%).
NUM tokens may have the following values of Number:
Sing (56; 100% of non-empty Number): 1, 19, 2.3, 4, 8, II.7, III.2, ii, 36º, 43º
EMPTY (7233): dois, 1, 3, 2, 5, 10, duas, três, 4, 2005
Number seems to be a lexical feature of NUM: 100% of lemmas (54) occur with only one value of Number.
X
1 X token (0% of all X tokens) has a non-empty value of Number.
The most frequent other feature values with which X and Number co-occurred: Foreign=EMPTY (1; 100%).
X tokens may have the following values of Number:
Plur (1; 100% of non-empty Number): drill-in
EMPTY (216): in, drill, n, flow, core, ., booster, pin, situ, stripe
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[det]–> DET (32939; 100%),
NOUN –[amod]–> ADJ (14486; 99%),
NOUN –[nmod]–> NOUN (13125; 64%),
VERB –[obl]–> NOUN (4219; 53%),
NOUN –[acl]–> VERB (4088; 93%),
PROPN –[flat:name]–> PROPN (3657; 96%),
VERB –[nsubj]–> NOUN (3652; 93%),
NOUN –[conj]–> NOUN (3363; 77%),
VERB –[aux:pass]–> AUX (2703; 80%),
VERB –[nsubj:pass]–> NOUN (2492; 90%).
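Agreement counts of this kind can be approximated by walking every parent-child pair in the dependency trees. The sketch below is illustrative only: `agreement_stats` and the dictionary layout of each token are hypothetical, the sentence is made up, and it assumes that the percentage is taken over pairs in which both nodes carry a Number value, which this page does not state explicitly.

```python
from collections import Counter


def agreement_stats(sentence):
    """Count Number agreement per (parent UPOS, relation, child UPOS).

    sentence: list of dicts with keys id, head, upos, deprel, number
    (number is None when the token has no Number feature).
    Returns {key: (agreeing pairs, pairs where both nodes have Number)}.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree = Counter()
    both = Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has no parent node
        if child["number"] is None or parent["number"] is None:
            continue  # assumption: pairs lacking Number on either side are ignored
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if child["number"] == parent["number"]:
            agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Made-up sentence "os fluidos de a bacia", for illustration only.
sentence = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "number": "Plur"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "number": "Plur"},
    {"id": 3, "head": 5, "upos": "ADP", "deprel": "case", "number": None},
    {"id": 4, "head": 5, "upos": "DET", "deprel": "det", "number": "Sing"},
    {"id": 5, "head": 2, "upos": "NOUN", "deprel": "nmod", "number": "Sing"},
]
print(agreement_stats(sentence))
```

In the toy sentence both determiners agree with their nouns, while the nmod pair does not, which matches the pattern above: det agreement is near-total, whereas nmod agreement is much lower because a nominal modifier has its own referent and number.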