Treebank Statistics: UD_English-GUM: Features: Number
This feature is universal.
It occurs with 3 different values: Plur
, Ptan
, Sing
.
80329 tokens (38%) have a non-empty value of Number
.
13421 types (77%) occur at least once with a non-empty value of Number
.
10413 lemmas (76%) occur at least once with a non-empty value of Number
.
The feature is used with 11 part-of-speech tags: NOUN (35511; 17% instances), PRON (15472; 7% instances), PROPN (12188; 6% instances), VERB (8193; 4% instances), AUX (7602; 4% instances), DET (1315; 1% instances), SYM (26; 0% instances), ADV (13; 0% instances), NUM (7; 0% instances), ADJ (1; 0% instances), PUNCT (1; 0% instances).
NOUN
35511 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Plur
(9822; 28% of non-emptyNumber
): people, years, things, days, guys, data, minutes, others, studies, childrenPtan
(94; 0% of non-emptyNumber
): clothes, species, thanks, pants, glasses, means, newspapers, politics, jeans, surroundingsSing
(25595; 72% of non-emptyNumber
): time, day, way, life, world, year, city, today, work, exampleEMPTY
(1): per
Paradigm person | Sing | Plur |
---|---|---|
person | people, persons |
PRON
15472 PRON tokens (87% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: PronType=Prs (13799; 89%), Poss=EMPTY (12564; 81%), Gender=EMPTY (10792; 70%), Case=Nom (8589; 56%).
PRON
tokens may have the following values of Number
:
Plur
(3483; 23% of non-emptyNumber
): we, they, our, their, them, us, you, those, these, ‘sSing
(11989; 77% of non-emptyNumber
): i, it, you, he, that, his, your, my, this, sheEMPTY
(2346): that, what, which, there, who, whatever, whose, whom, one, whoever
Paradigm you | Sing | Plur |
---|---|---|
Case=Acc | you | you |
Case=Acc|Style=Coll|Typo=Yes | ya | |
Case=Acc|Typo=Yes | y' | |
Case=Nom | you | you |
Case=Nom|Typo=Yes | you, Ya |
PROPN
12188 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Plur
(607; 5% of non-emptyNumber
): States, Americans, Nations, skittles, Chathams, Mets, Sox, Democrats, Olmec, MuslimsPtan
(34; 0% of non-emptyNumber
): Netherlands, Olympics, Commons, Paralympics, Philippines, Vans, Analytics, Andes, Forties, MaldivesSing
(11547; 95% of non-emptyNumber
): University, President, York, New, America, Warhol, north, figure, south, ScientologyEMPTY
(1): #langu
Paradigm State | Sing | Plur |
---|---|---|
State, States | States |
Number
seems to be lexical feature of PROPN
. 98% lemmas (4119) occur only with one value of Number
.
VERB
8193 VERB tokens (37% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Voice=EMPTY (8193; 100%), VerbForm=Fin (8178; 100%), Mood=Ind (8172; 100%), Person=3 (5525; 67%), Tense=Pres (4655; 57%).
VERB
tokens may have the following values of Number
:
Plur
(2225; 27% of non-emptyNumber
): have, are, had, know, need, want, make, do, go, gotSing
(5968; 73% of non-emptyNumber
): know, said, think, has, have, had, is, ‘s, mean, wantEMPTY
(14085): have, do, make, get, see, go, know, take, united, going
Paradigm have | Sing | Plur |
---|---|---|
Person=1|Tense=Past | had | had |
Person=1|Tense=Pres | have, 've | have |
Person=2|Tense=Past | had | had |
Person=2|Tense=Pres | have | have |
Person=3|Tense=Past | had | had |
Person=3|Tense=Pres|Typo=Yes | has | |
Person=3|Tense=Pres | has | have |
AUX
7602 AUX tokens (67% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (7598; 100%), Mood=Ind (7588; 100%), Person=3 (5922; 78%), Tense=Pres (5610; 74%).
AUX
tokens may have the following values of Number
:
Plur
(1798; 24% of non-emptyNumber
): are, were, have, ‘re, do, did, had, ‘ve, ’re, wasSing
(5804; 76% of non-emptyNumber
): is, was, ‘s, has, do, ‘m, did, ’s, had, doesEMPTY
(3753): be, can, will, would, been, should, could, may, ‘ll, being
Paradigm be | Sing | Plur |
---|---|---|
_ | R, Be | |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | was | were, was |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | 'm, am, ’m | are, 're, ’re |
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin | were, was | were |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | 're, are, ’re | 're, are |
Mood=Ind|Person=3|Tense=Past|Typo=Yes|VerbForm=Fin | was, where | were |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | was | were, was |
Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=Fin | is, s | are |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | is, 's, ’s, S’ | are, 're, ’re, am |
Mood=Sub|Person=3|Tense=Past|VerbForm=Fin | were | |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | be | be |
DET
1315 DET tokens (8% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Definite=EMPTY (1315; 100%), PronType=Dem (1314; 100%).
DET
tokens may have the following values of Number
:
Plur
(324; 25% of non-emptyNumber
): these, thoseSing
(991; 75% of non-emptyNumber
): this, that, halfEMPTY
(16017): the, a, an, all, some, no, any, every, another, each
Paradigm this | Sing | Plur |
---|---|---|
this | these |
SYM
26 SYM tokens (8% of all SYM
tokens) have a non-empty value of Number
.
SYM
tokens may have the following values of Number
:
Sing
(26; 100% of non-emptyNumber
): %EMPTY
(290): /, –, $, -, +, =, DKK, €, £, §
ADV
13 ADV tokens (0% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: Degree=EMPTY (13; 100%), PronType=EMPTY (13; 100%).
ADV
tokens may have the following values of Number
:
Sing
(13; 100% of non-emptyNumber
): Always, Now, Alike, Little, Loud, Out, Too, Truly, northwest, southEMPTY
(10097): so, when, just, then, also, how, now, more, here, really
Number
seems to be lexical feature of ADV
. 100% lemmas (10) occur only with one value of Number
.
NUM
7 NUM tokens (0% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (7; 100%), NumType=Frac (4; 57%).
NUM
tokens may have the following values of Number
:
Sing
(7; 100% of non-emptyNumber
): half, Seven, ThreeEMPTY
(3985): one, two, 1, three, 2, 3, four, 4, five, 10
ADJ
1 ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=EMPTY (1; 100%).
ADJ
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): AloneEMPTY
(13950): other, first, new, many, good, little, more, different, such, same
PUNCT
1 PUNCT tokens (0% of all PUNCT
tokens) have a non-empty value of Number
.
PUNCT
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): pointEMPTY
(28955): ,, ., -, “, ?, (, ), —, [, :
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
VERB –[nsubj]–> PRON (3892; 55%),
NOUN –[nmod]–> NOUN (3179; 61%),
NOUN –[compound]–> NOUN (2256; 67%),
NOUN –[conj]–> NOUN (2054; 80%),
VERB –[nsubj]–> NOUN (1917; 68%),
NOUN –[nmod:poss]–> PRON (1805; 65%),
PROPN –[flat]–> PROPN (1648; 99%),
PROPN –[compound]–> PROPN (1272; 91%),
NOUN –[cop]–> AUX (1263; 77%),
NOUN –[nmod]–> PROPN (790; 67%).