Treebank Statistics: UD_English-GUM: Features: Number
This feature is universal.
It occurs with 3 different values: Plur, Ptan, Sing.
88630 tokens (38%) have a non-empty value of Number.
14141 types (77%) occur at least once with a non-empty value of Number.
10925 lemmas (76%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: NOUN (38902; 17% of all tokens), PRON (17278; 7% of all tokens), PROPN (13284; 6% of all tokens), VERB (9058; 4% of all tokens), AUX (8575; 4% of all tokens), DET (1496; 1% of all tokens), SYM (33; 0% of all tokens), NUM (4; 0% of all tokens).
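The per-tag counts above can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, assuming the standard 10-column tab-separated layout (UPOS in column 4, FEATS in column 6); the sample sentence and the function names `parse_feats` and `number_by_upos` are hypothetical and not part of the GUM tooling.

```python
from collections import Counter

# Tiny hand-written CoNLL-U sample (hypothetical, not taken from GUM).
SAMPLE = """\
# text = These people have the time
1\tThese\tthis\tDET\tDT\tNumber=Plur|PronType=Dem\t2\tdet\t_\t_
2\tpeople\tperson\tNOUN\tNNS\tNumber=Plur\t3\tnsubj\t_\t_
3\thave\thave\tVERB\tVBP\tMood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
4\tthe\tthe\tDET\tDT\tDefinite=Def|PronType=Art\t5\tdet\t_\t_
5\ttime\ttime\tNOUN\tNN\tNumber=Sing\t3\tobj\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Number=Plur|PronType=Dem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def number_by_upos(conllu_text):
    """Count, per UPOS tag, tokens with a non-empty Number and all tokens."""
    with_number = Counter()
    totals = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        totals[upos] += 1
        if "Number" in parse_feats(cols[5]):
            with_number[upos] += 1
    return with_number, totals


with_number, totals = number_by_upos(SAMPLE)
for upos in sorted(totals):
    print(upos, with_number[upos], "of", totals[upos])
```

Skipping non-integer IDs is a design choice of this sketch: it counts syntactic words only, which may or may not match how the published statistics treat multiword tokens.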
NOUN
38902 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
Plur (10649; 27% of non-empty Number): people, years, things, days, guys, minutes, months, others, studies, children
Ptan (116; 0% of non-empty Number): clothes, thanks, pants, means, glasses, 1960s, politics, jeans, surroundings, 1970s
Sing (28137; 72% of non-empty Number): time, day, way, year, world, life, today, city, work, lot
EMPTY (2): m, per
| Paradigm person | Sing | Plur |
|---|---|---|
| _ | person | people, persons |
| Typo=Yes | people |
PRON
17278 PRON tokens (86% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (15367; 89%), Poss=EMPTY (14154; 82%), Gender=EMPTY (12095; 70%), Case=Nom (9670; 56%).
PRON tokens may have the following values of Number:
Plur (3961; 23% of non-empty Number): we, they, our, their, them, us, you, those, these, ‘s
Sing (13317; 77% of non-empty Number): i, it, you, he, that, his, your, my, this, she
EMPTY (2703): that, what, there, which, who, whatever, whose, whom, one, whoever
| Paradigm you | Sing | Plur |
|---|---|---|
| Case=Acc | you | you |
| Case=Acc\|Style=Coll\|Typo=Yes | ya | |
| Case=Acc\|Typo=Yes | y' | you |
| Case=Nom | you | you |
| Case=Nom\|Typo=Yes | you, Ya |
PROPN
13284 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur (669; 5% of non-empty Number): States, Americans, Nations, skittles, Chathams, Pirates, Mets, Sox, Democrats, Olmec
Ptan (32; 0% of non-empty Number): Netherlands, Olympics, Paralympics, Philippines, Vans, Analytics, Forties, Maldives
Sing (12583; 95% of non-empty Number): University, President, York, America, New, south, Warhol, State, figure, north
EMPTY (1): #langu
| Paradigm State | Sing | Plur |
|---|---|---|
| _ | State, States | States |
Number seems to be a lexical feature of PROPN: 98% of lemmas (4408) occur with only one value of Number.
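The "lexical feature" observation rests on a simple count: for each lemma, collect the set of Number values it occurs with, then check how many lemmas have exactly one. A minimal sketch, assuming the input is a list of (lemma, Number value) pairs already extracted from the treebank; the function name `single_value_share` and the sample pairs are hypothetical.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (lemmas with exactly one Number value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, number in pairs:
        values[lemma].add(number)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical PROPN observations, not actual GUM counts.
pairs = [
    ("State", "Sing"),
    ("State", "Plur"),
    ("Netherlands", "Ptan"),
    ("York", "Sing"),
    ("York", "Sing"),
]
single, total = single_value_share(pairs)
print(f"{single} of {total} lemmas occur with only one value of Number")
```

A high share suggests that Number is fixed per lemma for that part of speech rather than varying by inflection.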
VERB
9058 VERB tokens (37% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (9058; 100%), Voice=EMPTY (9058; 100%), Mood=Ind (9049; 100%), Person=3 (6075; 67%), Tense=Pres (5256; 58%).
VERB tokens may have the following values of Number:
Plur (2541; 28% of non-empty Number): have, are, had, know, need, got, want, make, did, do
Sing (6517; 72% of non-empty Number): know, said, think, has, have, had, is, ‘s, want, mean
EMPTY (15681): have, do, make, get, see, go, know, going, take, united
| Paradigm have | Sing | Plur |
|---|---|---|
| Person=1\|Tense=Past | had | had |
| Person=1\|Tense=Pres | have, 've | have |
| Person=2\|Tense=Past | had | had |
| Person=2\|Tense=Pres | have | have |
| Person=3\|Tense=Past | had | had |
| Person=3\|Tense=Pres\|Typo=Yes | has | |
| Person=3\|Tense=Pres | has | have |
AUX
8575 AUX tokens (67% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (8574; 100%), Mood=Ind (8563; 100%), Person=3 (6683; 78%), Tense=Pres (6391; 75%).
AUX tokens may have the following values of Number:
Plur (2029; 24% of non-empty Number): are, were, have, ‘re, do, did, had, ‘ve, ’re, was
Sing (6546; 76% of non-empty Number): is, was, ‘s, has, do, ‘m, did, ’s, had, does
EMPTY (4259): be, can, will, would, been, could, should, may, have, ‘ll
| Paradigm be | Sing | Plur |
|---|---|---|
| _ | Be | |
| Mood=Ind\|Person=1\|Tense=Past\|VerbForm=Fin | was, were | were, was |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | 'm, am, ’m | are, 're, ’re |
| Mood=Ind\|Person=2\|Tense=Past\|VerbForm=Fin | were, was | were |
| Mood=Ind\|Person=2\|Tense=Pres\|VerbForm=Fin | 're, are, ’re | 're, are |
| Mood=Ind\|Person=3\|Tense=Past\|Typo=Yes\|VerbForm=Fin | was, where | were |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | was | were, was |
| Mood=Ind\|Person=3\|Tense=Pres\|Typo=Yes\|VerbForm=Fin | is | are |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | is, 's, ’s, S’, s | are, 're, ’re, R, am |
| Mood=Sub\|Person=3\|Tense=Past\|VerbForm=Fin | were | |
| Mood=Sub\|Person=3\|Tense=Pres\|VerbForm=Fin | be | be |
DET
1496 DET tokens (8% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (1496; 100%), PronType=Dem (1495; 100%).
DET tokens may have the following values of Number:
Plur (370; 25% of non-empty Number): these, those
Sing (1126; 75% of non-empty Number): this, that, half
EMPTY (17581): the, a, an, all, some, no, any, every, another, each
| Paradigm this | Sing | Plur |
|---|---|---|
| _ | this | these |
| Typo=Yes | this |
SYM
33 SYM tokens (10% of all SYM tokens) have a non-empty value of Number.
The most frequent other feature values with which SYM and Number co-occurred: ExtPos=EMPTY (33; 100%).
SYM tokens may have the following values of Number:
Sing (33; 100% of non-empty Number): %
EMPTY (300): /, –, $, -, +, =, DKK, €, #, £
NUM
4 NUM tokens (0% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (4; 100%), NumType=Frac (4; 100%).
NUM tokens may have the following values of Number:
Sing (4; 100% of non-empty Number): half
EMPTY (4270): one, two, 1, three, 2, 3, four, five, 4, 10
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
VERB –[nsubj]–> PRON (4356; 54%),
NOUN –[nmod]–> NOUN (3487; 62%),
NOUN –[compound]–> NOUN (2469; 67%),
NOUN –[conj]–> NOUN (2203; 80%),
VERB –[nsubj]–> NOUN (2093; 67%),
NOUN –[nmod:poss]–> PRON (1941; 65%),
PROPN –[flat]–> PROPN (1604; 100%),
NOUN –[cop]–> AUX (1403; 76%),
PROPN –[compound]–> PROPN (1399; 90%),
VERB –[nsubj]–> PROPN (872; 72%).
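Agreement counts of this kind can be computed by walking each dependency edge and comparing the Number value of the parent with that of the child. The sketch below is illustrative only: it assumes the percentage is taken over edges where both nodes carry a Number value (the page does not state the denominator), and the sentence, tuple layout, and function name `agreement_counts` are hypothetical.

```python
from collections import Counter

# One hypothetical sentence as (id, form, upos, number, head, deprel);
# number is None when the token has no Number feature.
SENTENCE = [
    (1, "These", "DET", "Plur", 2, "det"),
    (2, "people", "NOUN", "Plur", 3, "nsubj"),
    (3, "have", "VERB", "Plur", 0, "root"),
    (4, "the", "DET", None, 5, "det"),
    (5, "time", "NOUN", "Sing", 3, "obj"),
]


def agreement_counts(sentence):
    """Count edges (parent UPOS, deprel, child UPOS) that agree in Number.

    Returns (agree, both): edges whose Number values match, and all edges
    where parent and child both have a Number value (assumed denominator).
    """
    by_id = {tok[0]: tok for tok in sentence}
    agree = Counter()
    both = Counter()
    for _id, _form, upos, number, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head 0)
        if parent is None or number is None or parent[3] is None:
            continue
        key = (parent[2], deprel, upos)
        both[key] += 1
        if parent[3] == number:
            agree[key] += 1
    return agree, both


agree, both = agreement_counts(SENTENCE)
for key in sorted(both):
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]}")
```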