Number: number
In English, Number is a feature of nouns and other parts of speech that mark agreement with nouns, i.e. personal pronouns, verbs, and some determiners.
Sing: singular
A singular noun denotes one person, animal or thing. Every noun with the PTB tag NN or NNP is marked with this feature.
Examples:
- an apple, a train
Pronouns that refer to a single person, an animal or a thing are also marked with this feature.
- I, me, my, myself, he, his, him, himself, she, her, hers, herself, it, its, itself
We also mark all verbs with the PTB tag VBZ with this feature.
Examples:
- she does
- he walks
Further, we mark inflections of be that can only have a singular noun or pronoun in subject position with this feature.
- am, is, was
Demonstrative determiners of singular nouns and demonstrative pronouns that refer to singular nouns are also marked with this feature.
- this, that
Plur: plural
A plural noun denotes several persons, animals or things. Every noun with the PTB tag NNS or NNPS is marked with this feature.
Examples:
- several apples, two trains
Pronouns that refer to several persons, animals or things are also marked with this feature.
- we, us, our, ours, ourselves, yourselves, they, them, their, theirs, themselves
Demonstrative determiners of plural nouns and demonstrative pronouns that refer to plural nouns are also marked with this feature.
- these, those
We currently don’t mark pluralia tantum or collective/mass nouns.
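The rules above for Sing and Plur can be summarised as a small lookup. The following is only a sketch under stated assumptions, not the conversion code used for UD_English: the function name `number_feature` is hypothetical, and restricting the closed-class word lists to the PTB tags PRP, PRP$, DT, VBP and VBD is an assumption made here so that, for example, complementizer "that" is left unmarked.

```python
# Hypothetical helper (not the actual UD_English converter): derive the
# Number feature from a word form and its PTB tag, following the rules above.

SING_TAGS = {"NN", "NNP", "VBZ"}   # singular nouns and VBZ verbs
PLUR_TAGS = {"NNS", "NNPS"}        # plural nouns

PRONOUN_TAGS = {"PRP", "PRP$"}     # assumption: pronoun lists apply to these tags
SING_PRONOUNS = {
    "i", "me", "my", "myself", "he", "his", "him", "himself",
    "she", "her", "hers", "herself", "it", "its", "itself",
}
PLUR_PRONOUNS = {
    "we", "us", "our", "ours", "ourselves", "yourselves",
    "they", "them", "their", "theirs", "themselves",
}

# Forms of "be" that only take a singular subject ("is" is already VBZ).
SING_BE = {("am", "VBP"), ("was", "VBD")}

# Demonstratives; assumption: only counted when tagged DT.
SING_DEMONSTRATIVES = {"this", "that"}
PLUR_DEMONSTRATIVES = {"these", "those"}


def number_feature(form, ptb_tag):
    """Return "Sing", "Plur", or None when Number is left empty."""
    word = form.lower()
    if ptb_tag in SING_TAGS:
        return "Sing"
    if ptb_tag in PLUR_TAGS:
        return "Plur"
    if ptb_tag in PRONOUN_TAGS:
        if word in SING_PRONOUNS:
            return "Sing"
        if word in PLUR_PRONOUNS:
            return "Plur"
    if (word, ptb_tag) in SING_BE:
        return "Sing"
    if ptb_tag == "DT":
        if word in SING_DEMONSTRATIVES:
            return "Sing"
        if word in PLUR_DEMONSTRATIVES:
            return "Plur"
    return None
```

Forms such as "you" or "are" fall through to `None`, which matches their appearance among the EMPTY values in the statistics for this treebank.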
Treebank Statistics (UD_English)
This feature is universal.
It occurs with 2 different values: Plur, Sing.
85754 tokens (34%) have a non-empty value of Number.
14556 types (75%) occur at least once with a non-empty value of Number.
11966 lemmas (73%) occur at least once with a non-empty value of Number.
The feature is used with 13 part-of-speech tags: en-pos/NOUN (43786; 17% instances), en-pos/PROPN (16896; 7% instances), en-pos/PRON (16019; 6% instances), en-pos/VERB (5697; 2% instances), en-pos/AUX (1871; 1% instances), en-pos/DET (1426; 1% instances), en-pos/SYM (48; 0% instances), en-pos/ADJ (5; 0% instances), en-pos/X (2; 0% instances), en-pos/ADP (1; 0% instances), en-pos/ADV (1; 0% instances), en-pos/INTJ (1; 0% instances), en-pos/NUM (1; 0% instances).
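Counts like these can be recomputed from a CoNLL-U file. The sketch below is illustrative only: the helper names and the four-token `SAMPLE` are made up for this example, and it is not the script that generated the figures on this page.

```python
from collections import Counter

# Tiny made-up CoNLL-U sample (10 tab-separated columns per token line).
SAMPLE = """\
# text = She eats apples.
1\tShe\tshe\tPRON\tPRP\tCase=Nom|Gender=Fem|Number=Sing|Person=3|PronType=Prs\t2\tnsubj\t_\t_
2\teats\teat\tVERB\tVBZ\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_
3\tapples\tapple\tNOUN\tNNS\tNumber=Plur\t2\tdobj\t_\t_
4\t.\t.\tPUNCT\t.\t_\t2\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn the FEATS column ("A=x|B=y" or "_") into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield the column list of every ordinary token line.

    Comments, blank lines, multiword-token ranges ("1-2") and
    empty nodes ("1.1") are skipped.
    """
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def number_counts_by_upos(conllu_text):
    """Return (tokens with Number, all tokens), each counted per UPOS tag."""
    with_number = Counter()
    total = Counter()
    for cols in iter_tokens(conllu_text):
        upos = cols[3]
        total[upos] += 1
        if "Number" in parse_feats(cols[5]):
            with_number[upos] += 1
    return with_number, total


with_number, total = number_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_number[upos], "of", total[upos])
```

Dividing `with_number[upos]` by `total[upos]` gives the per-tag share reported in the sections that follow.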
NOUN
43786 en-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.
NOUN tokens may have the following values of Number:
- Plur (10271; 23% of non-empty Number): people, years, days, things, questions, times, months, guys, friends, places
- Sing (33515; 77% of non-empty Number): time, service, place, thanks, food, way, year, day, number, pm
- EMPTY (4): answers, equivalant, folks, marrying

| Paradigm time | Sing | Plur |
|---|---|---|
| | time | times |
PROPN
16896 en-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
- Plur (638; 4% of non-empty Number): americans, Beatles, Iraqis, Palestinians, Islands, Tigers, Shiites, Seas, Muslims, Christians
- Sing (16258; 96% of non-empty Number): Bush, US, al, Iraq, enron, united, Iran, New, China, states
- EMPTY (3): Central, Modern, english

| Paradigm States | Sing | Plur |
|---|---|---|
| | States | States |
Number seems to be a lexical feature of PROPN. 100% of lemmas (5490) occur with only one value of Number.
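One way to arrive at a "lexical feature" figure of this kind is to collect, for each lemma of a given part of speech, the set of Number values it occurs with, and then report the share of lemmas with exactly one value. The sketch below assumes that reading; the function name `single_value_share` and the toy token list are hypothetical, and how the statistic on this page was actually computed is not stated in the text.

```python
from collections import defaultdict


def single_value_share(tokens, upos):
    """Return (number of lemmas, share of lemmas with exactly one Number value).

    `tokens` is an iterable of (lemma, upos, number) triples, where
    `number` is "Sing", "Plur", or None for an empty value.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tok_upos, number in tokens:
        if tok_upos == upos and number is not None:
            values_by_lemma[lemma].add(number)
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return len(values_by_lemma), single / len(values_by_lemma)


# Toy data, made up for illustration.
toy_tokens = [
    ("Bush", "PROPN", "Sing"),
    ("Beatles", "PROPN", "Plur"),
    ("Iraq", "PROPN", "Sing"),
    ("Iraq", "PROPN", "Sing"),
    ("time", "NOUN", "Sing"),
    ("time", "NOUN", "Plur"),
]

print(single_value_share(toy_tokens, "PROPN"))  # (3, 1.0)
print(single_value_share(toy_tokens, "NOUN"))   # (1, 0.0)
```

A share of 1.0 corresponds to the "100% of lemmas" wording used here: no lemma in that class shows both a Sing and a Plur form.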
PRON
16019 en-pos/PRON tokens (75% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (15055; 94%), Poss=EMPTY (13193; 82%), Gender=EMPTY (11204; 70%), Case=Nom (9592; 60%).
PRON tokens may have the following values of Number:
- Plur (4115; 26% of non-empty Number): they, we, their, our, them, us, those, these, themselves, ourselves
- Sing (11904; 74% of non-empty Number): i, it, my, he, me, this, his, that, him, she
- EMPTY (5209): you, your, what, there, who, one, mine, whom, ‘s, it’s

Number seems to be a lexical feature of PRON. 100% of lemmas (22) occur with only one value of Number.
VERB
5697 en-pos/VERB tokens (17% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (5691; 100%), Mood=Ind (5691; 100%), Person=3 (5518; 97%), Tense=Pres (4715; 83%).
VERB tokens may have the following values of Number:
- Plur (2; 0% of non-empty Number): associates, rays
- Sing (5695; 100% of non-empty Number): is, was, ‘s, has, am, s, says, makes, seems, needs
- EMPTY (28381): have, are, be, get, know, had, go, do, want, said

Number seems to be a lexical feature of VERB. 100% of lemmas (439) occur with only one value of Number.
AUX
1871 en-pos/AUX tokens (19% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (1871; 100%), Mood=Ind (1871; 100%), Person=3 (1656; 89%), Tense=Pres (1362; 73%).
AUX tokens may have the following values of Number:
- Sing (1871; 100% of non-empty Number): is, was, has, does, am, ‘s, s, gets, `s
- EMPTY (7877): will, can, would, have, do, be, are, could, should, did
DET
1426 en-pos/DET tokens (7% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (1426; 100%), PronType=Dem (1425; 100%).
DET tokens may have the following values of Number:
- Plur (292; 20% of non-empty Number): these, those
- Sing (1134; 80% of non-empty Number): this, that, A
- EMPTY (19698): the, a, that, an, all, some, which, any, no, another
SYM
48 en-pos/SYM tokens (6% of all SYM tokens) have a non-empty value of Number.
SYM tokens may have the following values of Number:
- Sing (48; 100% of non-empty Number): %, 1%P701!.doc, Assh@%$e
- EMPTY (711): $, -, :), /, +, |, :(, :-), :D, x
ADJ
5 en-pos/ADJ tokens (0% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Degree=EMPTY (5; 100%).
ADJ tokens may have the following values of Number:
- Sing (5; 100% of non-empty Number): Global, Pakistani, criminal, female, middle
- EMPTY (15952): good, great, other, best, new, many, more, last, same, few
X
2 en-pos/X tokens (0% of all X tokens) have a non-empty value of Number.
X tokens may have the following values of Number:
- Sing (2; 100% of non-empty Number): URSULA
- EMPTY (1139): etc, 1, 2, etc., 3, a, carol.st.clair@enron.com, 4, over, -
ADV
1 en-pos/ADV tokens (0% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: PronType=EMPTY (1; 100%).
ADV tokens may have the following values of Number:
- Sing (1; 100% of non-empty Number): best
- EMPTY (13041): so, just, when, very, also, how, now, even, there, then
NUM
1 en-pos/NUM tokens (0% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (1; 100%).
NUM tokens may have the following values of Number:
- Sing (1; 100% of non-empty Number): 9/11
- EMPTY (4912): one, two, 2, 3, 5, 1, 10, 4, three, 20
INTJ
1 en-pos/INTJ tokens (0% of all INTJ tokens) have a non-empty value of Number.
INTJ tokens may have the following values of Number:
- Sing (1; 100% of non-empty Number): appetit
- EMPTY (922): please, yes, well, no, hi, like, ok, lol, hey, oh
ADP
1 en-pos/ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
- Sing (1; 100% of non-empty Number): auto
- EMPTY (21680): of, in, to, for, on, with, at, from, by, as
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[compound]–> NOUN (3916; 71%),
NOUN –[nmod]–> NOUN (3203; 61%),
PROPN –[compound]–> PROPN (2899; 92%),
NOUN –[conj]–> NOUN (1931; 79%),
NOUN –[nmod:poss]–> PRON (1810; 50%),
PROPN –[name]–> PROPN (1526; 99%),
NOUN –[cop]–> VERB (1209; 61%),
NOUN –[nmod]–> PROPN (1141; 71%),
NOUN –[compound]–> PROPN (898; 72%),
PROPN –[conj]–> PROPN (854; 96%).
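Agreement counts of this kind can be gathered by walking each dependency edge and comparing the Number value of the parent with that of the child. The following is a minimal sketch, not the script behind the figures above: the token dictionaries, the function name `agreement_counts`, and the choice to consider only edges where both nodes carry a Number value are assumptions made for illustration.

```python
from collections import Counter


def agreement_counts(sentences):
    """Count parent/child pairs per (parent UPOS, relation, child UPOS).

    Each sentence is a list of dicts with the keys "id", "head", "upos",
    "deprel" and "number" (None when Number is empty).
    Returns (agree, seen): edges whose two nodes share a Number value, and
    all edges where both nodes have some Number value (assumed denominator).
    """
    agree = Counter()
    seen = Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:  # the root has head 0, which has no token
                continue
            if child["number"] is None or parent["number"] is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            seen[key] += 1
            if child["number"] == parent["number"]:
                agree[key] += 1
    return agree, seen


# Two made-up noun compounds: "apple trees" and "coffee cup".
toy_sentences = [
    [
        {"id": 1, "head": 2, "upos": "NOUN", "deprel": "compound", "number": "Sing"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "number": "Plur"},
    ],
    [
        {"id": 1, "head": 2, "upos": "NOUN", "deprel": "compound", "number": "Sing"},
        {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "number": "Sing"},
    ],
]

agree, seen = agreement_counts(toy_sentences)
key = ("NOUN", "compound", "NOUN")
print(agree[key], "of", seen[key], "edges agree")
```

In the toy data only "coffee cup" agrees, since "apple" is Sing while its head "trees" is Plur. The same pattern explains why NOUN –[compound]–> NOUN agrees in well under 100% of cases.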