Treebank Statistics: UD_English-GUM: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
63470 tokens (39%) have a non-empty value of Number
.
12048 types (78%) occur at least once with a non-empty value of Number
.
9513 lemmas (77%) occur at least once with a non-empty value of Number
.
The feature is used with 11 part-of-speech tags: NOUN (28390; 17% instances), PRON (11034; 7% instances), PROPN (10403; 6% instances), VERB (6502; 4% instances), AUX (5685; 3% instances), DET (927; 1% instances), ADJ (499; 0% instances), SYM (15; 0% instances), ADV (8; 0% instances), NUM (6; 0% instances), PUNCT (1; 0% instances).
NOUN
28390 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Number
.
NOUN
tokens may have the following values of Number
:
Plur
(8064; 28% of non-emptyNumber
): people, years, things, guys, days, studies, minutes, months, children, languagesSing
(20326; 72% of non-emptyNumber
): time, city, day, way, world, year, life, today, something, exampleEMPTY
(13): etc., etc
Paradigm person | Sing | Plur |
---|---|---|
person | people, persons |
PRON
11034 PRON tokens (86% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: PronType=Prs (10046; 91%), Poss=EMPTY (8733; 79%), Gender=EMPTY (7429; 67%), Case=Nom (6116; 55%).
PRON
tokens may have the following values of Number
:
Plur
(2475; 22% of non-emptyNumber
): we, they, their, our, them, us, you, those, these, ‘sSing
(8559; 78% of non-emptyNumber
): i, it, you, he, his, your, my, she, this, thatEMPTY
(1724): that, which, what, there, who, whose, whatever, whom, one, whoever
Paradigm you | Sing | Plur |
---|---|---|
Case=Acc | you, y- | you |
Case=Acc|Typo=Yes | ya | |
Case=Nom | you | you |
Number
seems to be lexical feature of PRON
. 92% lemmas (36) occur only with one value of Number
.
PROPN
10403 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Plur
(538; 5% of non-emptyNumber
): States, skittles, Americans, Nations, Chathams, Mets, Netherlands, Sox, Democrats, OlmecSing
(9865; 95% of non-emptyNumber
): New, University, York, President, Scientology, north, America, figure, Warhol, lee
Paradigm State | Sing | Plur |
---|---|---|
State, States | States |
Number
seems to be lexical feature of PROPN
. 98% lemmas (3684) occur only with one value of Number
.
VERB
6502 VERB tokens (37% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: VerbForm=Fin (6485; 100%), Mood=Ind (6480; 100%), Person=3 (4607; 71%), Tense=Pres (3534; 54%).
VERB
tokens may have the following values of Number
:
Plur
(1769; 27% of non-emptyNumber
): have, are, had, need, were, know, want, did, make, gotSing
(4733; 73% of non-emptyNumber
): said, has, is, know, have, had, think, was, want, ‘sEMPTY
(10988): have, make, get, do, see, go, know, united, take, going
Paradigm have | Sing | Plur |
---|---|---|
Person=1|Tense=Past | had | had |
Person=1|Tense=Pres | have, 've | have |
Person=2|Tense=Past | had | had |
Person=2|Tense=Pres | have | |
Person=3|Tense=Past | had | had |
Person=3|Tense=Pres|Typo=Yes | has | |
Person=3|Tense=Pres | has | have |
AUX
5685 AUX tokens (70% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: VerbForm=Fin (5683; 100%), Mood=Ind (5358; 94%), Person=3 (4558; 80%), Tense=Pres (3794; 67%).
AUX
tokens may have the following values of Number
:
Plur
(1442; 25% of non-emptyNumber
): are, were, have, ‘re, do, will, did, had, can, ‘veSing
(4243; 75% of non-emptyNumber
): is, was, ‘s, has, do, ‘m, had, did, ’s, ‘reEMPTY
(2392): be, can, will, been, would, should, could, may, have, ‘ll
Paradigm be | Sing | Plur |
---|---|---|
_ | Be | |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | was | were, was |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | 'm, am, ’m, m | are, 're, ’re |
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin | were | were |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | 're, are, ’re | 're, are |
Mood=Ind|Person=3|Tense=Past|Typo=Yes|VerbForm=Fin | was, where | were |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | was | were, was |
Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=Fin | is, s | are |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | is, 's, ’s, S’ | are, 're, ’re |
Mood=Sub|Person=3|Tense=Pres|VerbForm=Fin | be |
DET
927 DET tokens (7% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Definite=EMPTY (927; 100%), PronType=Dem (927; 100%).
DET
tokens may have the following values of Number
:
Plur
(241; 26% of non-emptyNumber
): these, thoseSing
(686; 74% of non-emptyNumber
): this, thatEMPTY
(12690): the, a, an, all, some, no, any, every, each, another
Paradigm this | Sing | Plur |
---|---|---|
this | these |
ADJ
499 ADJ tokens (5% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (495; 99%).
ADJ
tokens may have the following values of Number
:
Sing
(499; 100% of non-emptyNumber
): national, International, New, American, Democratic, Red, Civic, Creative, Open, GreatEMPTY
(10517): other, first, many, good, new, little, different, more, such, last
Number
seems to be lexical feature of ADJ
. 100% lemmas (205) occur only with one value of Number
.
SYM
15 SYM tokens (6% of all SYM
tokens) have a non-empty value of Number
.
SYM
tokens may have the following values of Number
:
Sing
(15; 100% of non-emptyNumber
): %EMPTY
(237): /, –, -, $, +, =, DKK, €, £, #
ADV
8 ADV tokens (0% of all ADV
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADV
and Number
co-occurred: Degree=EMPTY (8; 100%), PronType=EMPTY (8; 100%).
ADV
tokens may have the following values of Number
:
Sing
(8; 100% of non-emptyNumber
): Always, Little, Loud, Out, northwest, southEMPTY
(7500): so, when, just, also, then, now, more, very, here, there
NUM
6 NUM tokens (0% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumForm=Word (6; 100%).
NUM
tokens may have the following values of Number
:
Sing
(6; 100% of non-emptyNumber
): half, Seven, ThreeEMPTY
(3169): one, two, 1, 2, three, 3, four, 6, 4, 10
PUNCT
1 PUNCT tokens (0% of all PUNCT
tokens) have a non-empty value of Number
.
PUNCT
tokens may have the following values of Number
:
Sing
(1; 100% of non-emptyNumber
): ?EMPTY
(22118): ,, ., “, -, (, ), ?, [, ], :
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
VERB –[nsubj]–> PRON (2919; 55%),
NOUN –[nmod]–> NOUN (2514; 60%),
NOUN –[compound]–> NOUN (1853; 66%),
VERB –[nsubj]–> NOUN (1616; 68%),
NOUN –[conj]–> NOUN (1611; 79%),
NOUN –[nmod:poss]–> PRON (1457; 65%),
PROPN –[flat]–> PROPN (1396; 100%),
PROPN –[compound]–> PROPN (1143; 90%),
NOUN –[cop]–> AUX (911; 77%),
VERB –[nsubj]–> PROPN (698; 75%).