home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: Features: Number

This feature is universal. It occurs with 3 different values: Plur, Ptan, Sing.

80329 tokens (38%) have a non-empty value of Number. 13421 types (77%) occur at least once with a non-empty value of Number. 10413 lemmas (76%) occur at least once with a non-empty value of Number. The feature is used with 11 part-of-speech tags: NOUN (35511; 17% instances), PRON (15472; 7% instances), PROPN (12188; 6% instances), VERB (8193; 4% instances), AUX (7602; 4% instances), DET (1315; 1% instances), SYM (26; 0% instances), ADV (13; 0% instances), NUM (7; 0% instances), ADJ (1; 0% instances), PUNCT (1; 0% instances).

NOUN

35511 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm personSingPlur
personpeople, persons

PRON

15472 PRON tokens (87% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (13799; 89%), Poss=EMPTY (12564; 81%), Gender=EMPTY (10792; 70%), Case=Nom (8589; 56%).

PRON tokens may have the following values of Number:

Paradigm youSingPlur
Case=Accyouyou
Case=Acc|Style=Coll|Typo=Yesya
Case=Acc|Typo=Yesy'
Case=Nomyouyou
Case=Nom|Typo=Yesyou, Ya

PROPN

12188 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm StateSingPlur
State, StatesStates

Number seems to be lexical feature of PROPN. 98% lemmas (4119) occur only with one value of Number.

VERB

8193 VERB tokens (37% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Voice=EMPTY (8193; 100%), VerbForm=Fin (8178; 100%), Mood=Ind (8172; 100%), Person=3 (5525; 67%), Tense=Pres (4655; 57%).

VERB tokens may have the following values of Number:

Paradigm haveSingPlur
Person=1|Tense=Pasthadhad
Person=1|Tense=Preshave, 'vehave
Person=2|Tense=Pasthadhad
Person=2|Tense=Preshavehave
Person=3|Tense=Pasthadhad
Person=3|Tense=Pres|Typo=Yeshas
Person=3|Tense=Preshashave

AUX

7602 AUX tokens (67% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (7598; 100%), Mood=Ind (7588; 100%), Person=3 (5922; 78%), Tense=Pres (5610; 74%).

AUX tokens may have the following values of Number:

Paradigm beSingPlur
_R, Be
Mood=Ind|Person=1|Tense=Past|VerbForm=Finwaswere, was
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin'm, am, ’mare, 're, ’re
Mood=Ind|Person=2|Tense=Past|VerbForm=Finwere, waswere
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin're, are, ’re're, are
Mood=Ind|Person=3|Tense=Past|Typo=Yes|VerbForm=Finwas, wherewere
Mood=Ind|Person=3|Tense=Past|VerbForm=Finwaswere, was
Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=Finis, sare
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finis, 's, ’s, S’are, 're, ’re, am
Mood=Sub|Person=3|Tense=Past|VerbForm=Finwere
Mood=Sub|Person=3|Tense=Pres|VerbForm=Finbebe

DET

1315 DET tokens (8% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (1315; 100%), PronType=Dem (1314; 100%).

DET tokens may have the following values of Number:

Paradigm thisSingPlur
thisthese

SYM

26 SYM tokens (8% of all SYM tokens) have a non-empty value of Number.

SYM tokens may have the following values of Number:

ADV

13 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: Degree=EMPTY (13; 100%), PronType=EMPTY (13; 100%).

ADV tokens may have the following values of Number:

Number seems to be lexical feature of ADV. 100% lemmas (10) occur only with one value of Number.

NUM

7 NUM tokens (0% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (7; 100%), NumType=Frac (4; 57%).

NUM tokens may have the following values of Number:

ADJ

1 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=EMPTY (1; 100%).

ADJ tokens may have the following values of Number:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Number.

PUNCT tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: VERB –[nsubj]–> PRON (3892; 55%), NOUN –[nmod]–> NOUN (3179; 61%), NOUN –[compound]–> NOUN (2256; 67%), NOUN –[conj]–> NOUN (2054; 80%), VERB –[nsubj]–> NOUN (1917; 68%), NOUN –[nmod:poss]–> PRON (1805; 65%), PROPN –[flat]–> PROPN (1648; 99%), PROPN –[compound]–> PROPN (1272; 91%), NOUN –[cop]–> AUX (1263; 77%), NOUN –[nmod]–> PROPN (790; 67%).