## Treebank Statistics: UD_English: Features: `Number`

This feature is universal.
It occurs with 2 different values: `Plur`

, `Sing`

.

85775 tokens (34%) have a non-empty value of `Number`

.
14560 types (75%) occur at least once with a non-empty value of `Number`

.
11969 lemmas (73%) occur at least once with a non-empty value of `Number`

.
The feature is used with 13 part-of-speech tags: `NOUN` (43109; 17% instances), `PROPN` (16897; 7% instances), `PRON` (16728; 7% instances), `AUX` (5215; 2% instances), `VERB` (2353; 1% instances), `DET` (1415; 1% instances), `SYM` (47; 0% instances), `ADJ` (5; 0% instances), `X` (2; 0% instances), `ADP` (1; 0% instances), `ADV` (1; 0% instances), `INTJ` (1; 0% instances), `NUM` (1; 0% instances).

`NOUN`

43109 `NOUN` tokens (100% of all `NOUN`

tokens) have a non-empty value of `Number`

.

`NOUN`

tokens may have the following values of `Number`

:

`Plur`

(10271; 24% of non-empty`Number`

):*people, years, days, things, questions, times, months, guys, friends, places*`Sing`

(32838; 76% of non-empty`Number`

):*time, service, place, thanks, food, way, year, day, number, pm*`EMPTY`

(3):*equivalant, folks, staff*

Paradigm time | Sing | Plur |
---|---|---|

time | times |

`PROPN`

16897 `PROPN` tokens (100% of all `PROPN`

tokens) have a non-empty value of `Number`

.

`PROPN`

tokens may have the following values of `Number`

:

`Plur`

(638; 4% of non-empty`Number`

):*americans, Beatles, Iraqis, Palestinians, Islands, Tigers, Shiites, Seas, Muslims, Christians*`Sing`

(16259; 96% of non-empty`Number`

):*Bush, US, al, Iraq, enron, united, Iran, New, China, states*`EMPTY`

(3):*Central, Modern, english*

Paradigm States | Sing | Plur |
---|---|---|

States | States |

`Number`

seems to be **lexical feature** of `PROPN`

. 100% lemmas (5491) occur only with one value of `Number`

.

`PRON`

16728 `PRON` tokens (73% of all `PRON`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `PRON`

and `Number`

co-occurred: `PronType``=Prs` (15076; 90%), `Poss``=EMPTY` (13902; 83%), `Gender``=EMPTY` (11913; 71%), `Case``=Nom` (9591; 57%).

`PRON`

tokens may have the following values of `Number`

:

`Plur`

(4136; 25% of non-empty`Number`

):*they, we, their, our, them, us, those, these, themselves, ‘s*`Sing`

(12592; 75% of non-empty`Number`

):*i, it, my, he, me, this, his, that, him, she*`EMPTY`

(6225):*you, your, that, what, there, who, which, one, mine, whom*

`Number`

seems to be **lexical feature** of `PRON`

. 100% lemmas (36) occur only with one value of `Number`

.

`AUX`

5215 `AUX` tokens (34% of all `AUX`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `AUX`

and `Number`

co-occurred: `Mood``=Ind` (5215; 100%), `VerbForm``=Fin` (5215; 100%), `Person``=3` (4849; 93%), `Tense``=Pres` (3860; 74%).

`AUX`

tokens may have the following values of `Number`

:

`Sing`

(5215; 100% of non-empty`Number`

):*is, was, has, ‘s, am, does, s, ’s, `s, ai*`EMPTY`

(10143):*be, are, will, can, would, have, do, were, been, could*

`VERB`

2353 `VERB` tokens (8% of all `VERB`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `VERB`

and `Number`

co-occurred: `Mood``=Ind` (2347; 100%), `VerbForm``=Fin` (2347; 100%), `Tense``=Pres` (2217; 94%).

`VERB`

tokens may have the following values of `Number`

:

`Plur`

(2; 0% of non-empty`Number`

):*associates, rays*`Sing`

(2351; 100% of non-empty`Number`

):*is, has, was, says, ‘s, makes, seems, needs, looks, comes*`EMPTY`

(26139):*have, get, know, had, go, do, want, said, see, going*

`Number`

seems to be **lexical feature** of `VERB`

. 100% lemmas (438) occur only with one value of `Number`

.

`DET`

1415 `DET` tokens (7% of all `DET`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `DET`

and `Number`

co-occurred: `Definite``=EMPTY` (1415; 100%), `PronType``=Dem` (1414; 100%).

`DET`

tokens may have the following values of `Number`

:

`Plur`

(292; 21% of non-empty`Number`

):*these, those*`Sing`

(1123; 79% of non-empty`Number`

):*this, that, A*`EMPTY`

(18663):*the, a, an, all, some, any, no, another, every, each*

`SYM`

47 `SYM` tokens (6% of all `SYM`

tokens) have a non-empty value of `Number`

.

`SYM`

tokens may have the following values of `Number`

:

`Sing`

(47; 100% of non-empty`Number`

):*%, 1%P701!.doc*`EMPTY`

(710):*$, -, :), /, +, |, :(, :-), :D, x*

`ADJ`

5 `ADJ` tokens (0% of all `ADJ`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `ADJ`

and `Number`

co-occurred: `Degree``=EMPTY` (5; 100%).

`ADJ`

tokens may have the following values of `Number`

:

`Sing`

(5; 100% of non-empty`Number`

):*Global, Pakistani, criminal, female, middle*`EMPTY`

(15953):*good, great, other, best, new, many, more, last, same, few*

`X`

2 `X` tokens (0% of all `X`

tokens) have a non-empty value of `Number`

.

`X`

tokens may have the following values of `Number`

:

`Sing`

(2; 100% of non-empty`Number`

):*URSULA*`EMPTY`

(1138):*etc, 1, 2, etc., 3, a, carol.st.clair@enron.com, 4, over, -*

`ADP`

1 `ADP` tokens (0% of all `ADP`

tokens) have a non-empty value of `Number`

.

`ADP`

tokens may have the following values of `Number`

:

`Sing`

(1; 100% of non-empty`Number`

):*auto*`EMPTY`

(21679):*of, in, to, for, on, with, at, from, by, as*

`ADV`

1 `ADV` tokens (0% of all `ADV`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `ADV`

and `Number`

co-occurred: `PronType``=EMPTY` (1; 100%).

`ADV`

tokens may have the following values of `Number`

:

`Sing`

(1; 100% of non-empty`Number`

):*best*`EMPTY`

(13040):*so, just, when, very, also, how, now, even, there, then*

`INTJ`

1 `INTJ` tokens (0% of all `INTJ`

tokens) have a non-empty value of `Number`

.

`INTJ`

tokens may have the following values of `Number`

:

`Sing`

(1; 100% of non-empty`Number`

):*appetit*`EMPTY`

(922):*please, yes, well, no, hi, like, ok, lol, hey, oh*

`NUM`

1 `NUM` tokens (0% of all `NUM`

tokens) have a non-empty value of `Number`

.

The most frequent other feature values with which `NUM`

and `Number`

co-occurred: `NumType``=EMPTY` (1; 100%).

`NUM`

tokens may have the following values of `Number`

:

`Sing`

(1; 100% of non-empty`Number`

):*9/11*`EMPTY`

(4911):*one, two, 2, 3, 5, 1, 10, 4, three, 20*

## Relations with Agreement in `Number`

The 10 most frequent relations where parent and child node agree in `Number`

:
`NOUN –[ compound]–> NOUN` (3910; 71%),

`NOUN –[`(3045; 61%),

`nmod`]–> NOUN`PROPN –[`(2613; 91%),

`compound`]–> PROPN`NOUN –[`(1925; 79%),

`conj`]–> NOUN`NOUN –[`(1815; 50%),

`nmod:poss`]–> PRON`PROPN –[`(1814; 99%),

`flat`]–> PROPN`NOUN –[`(1186; 61%),

`cop`]–> AUX`NOUN –[`(1108; 71%),

`nmod`]–> PROPN`NOUN –[`(917; 72%),

`compound`]–> PROPN`PROPN –[`(855; 95%).

`conj`]–> PROPN