home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

63470 tokens (39%) have a non-empty value of Number. 12048 types (78%) occur at least once with a non-empty value of Number. 9513 lemmas (77%) occur at least once with a non-empty value of Number. The feature is used with 11 part-of-speech tags: NOUN (28390; 17% instances), PRON (11034; 7% instances), PROPN (10403; 6% instances), VERB (6502; 4% instances), AUX (5685; 3% instances), DET (927; 1% instances), ADJ (499; 0% instances), SYM (15; 0% instances), ADV (8; 0% instances), NUM (6; 0% instances), PUNCT (1; 0% instances).

NOUN

28390 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm personSingPlur
personpeople, persons

PRON

11034 PRON tokens (86% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (10046; 91%), Poss=EMPTY (8733; 79%), Gender=EMPTY (7429; 67%), Case=Nom (6116; 55%).

PRON tokens may have the following values of Number:

Paradigm youSingPlur
Case=Accyou, y-you
Case=Acc|Typo=Yesya
Case=Nomyouyou

Number seems to be lexical feature of PRON. 92% lemmas (36) occur only with one value of Number.

PROPN

10403 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm StateSingPlur
State, StatesStates

Number seems to be lexical feature of PROPN. 98% lemmas (3684) occur only with one value of Number.

VERB

6502 VERB tokens (37% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (6485; 100%), Mood=Ind (6480; 100%), Person=3 (4607; 71%), Tense=Pres (3534; 54%).

VERB tokens may have the following values of Number:

Paradigm haveSingPlur
Person=1|Tense=Pasthadhad
Person=1|Tense=Preshave, 'vehave
Person=2|Tense=Pasthadhad
Person=2|Tense=Preshave
Person=3|Tense=Pasthadhad
Person=3|Tense=Pres|Typo=Yeshas
Person=3|Tense=Preshashave

AUX

5685 AUX tokens (70% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (5683; 100%), Mood=Ind (5358; 94%), Person=3 (4558; 80%), Tense=Pres (3794; 67%).

AUX tokens may have the following values of Number:

Paradigm beSingPlur
_Be
Mood=Ind|Person=1|Tense=Past|VerbForm=Finwaswere, was
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin'm, am, ’m, mare, 're, ’re
Mood=Ind|Person=2|Tense=Past|VerbForm=Finwerewere
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin're, are, ’re're, are
Mood=Ind|Person=3|Tense=Past|Typo=Yes|VerbForm=Finwas, wherewere
Mood=Ind|Person=3|Tense=Past|VerbForm=Finwaswere, was
Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=Finis, sare
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finis, 's, ’s, S’are, 're, ’re
Mood=Sub|Person=3|Tense=Pres|VerbForm=Finbe

DET

927 DET tokens (7% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (927; 100%), PronType=Dem (927; 100%).

DET tokens may have the following values of Number:

Paradigm thisSingPlur
thisthese

ADJ

499 ADJ tokens (5% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (495; 99%).

ADJ tokens may have the following values of Number:

Number seems to be lexical feature of ADJ. 100% lemmas (205) occur only with one value of Number.

SYM

15 SYM tokens (6% of all SYM tokens) have a non-empty value of Number.

SYM tokens may have the following values of Number:

ADV

8 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: Degree=EMPTY (8; 100%), PronType=EMPTY (8; 100%).

ADV tokens may have the following values of Number:

NUM

6 NUM tokens (0% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (6; 100%).

NUM tokens may have the following values of Number:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Number.

PUNCT tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: VERB –[nsubj]–> PRON (2919; 55%), NOUN –[nmod]–> NOUN (2514; 60%), NOUN –[compound]–> NOUN (1853; 66%), VERB –[nsubj]–> NOUN (1616; 68%), NOUN –[conj]–> NOUN (1611; 79%), NOUN –[nmod:poss]–> PRON (1457; 65%), PROPN –[flat]–> PROPN (1396; 100%), PROPN –[compound]–> PROPN (1143; 90%), NOUN –[cop]–> AUX (911; 77%), VERB –[nsubj]–> PROPN (698; 75%).