home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: Features: Number

This feature is universal. It occurs with 3 different values: Plur, Ptan, Sing.

98512 tokens (38%) have a non-empty value of Number. 15091 types (79%) occur at least once with a non-empty value of Number. 11501 lemmas (77%) occur at least once with a non-empty value of Number. The feature is used with 8 part-of-speech tags: NOUN (42913; 17% instances), PRON (18946; 7% instances), PROPN (14545; 6% instances), VERB (11030; 4% instances), AUX (9403; 4% instances), DET (1633; 1% instances), SYM (38; 0% instances), NUM (4; 0% instances).

NOUN

42913 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm personSingPlur
_personpeople, persons
Typo=Yespeople

PRON

18946 PRON tokens (86% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (16852; 89%), Poss=EMPTY (15481; 82%), Gender=EMPTY (13352; 70%), Case=Nom (10575; 56%).

PRON tokens may have the following values of Number:

Paradigm youSingPlur
Case=Accyouyou
Case=Acc|Style=Coll|Typo=Yesya
Case=Acc|Typo=Yesy'you
Case=Nomyouyou
Case=Nom|Typo=Yesyou, Ya

PROPN

14545 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm StateSingPlur
State, StatesStates

Number seems to be lexical feature of PROPN. 97% lemmas (4674) occur only with one value of Number.

VERB

11030 VERB tokens (41% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Voice=EMPTY (11030; 100%), VerbForm=Fin (11029; 100%), Mood=Ind (9960; 90%), Person=3 (6695; 61%), Tense=Pres (5853; 53%).

VERB tokens may have the following values of Number:

Paradigm haveSingPlur
Mood=Imp|Person=2haveHave
Mood=Ind|Person=1|Tense=Pasthadhad
Mood=Ind|Person=1|Tense=Preshave, 'vehave
Mood=Ind|Person=2|Tense=Pasthadhad
Mood=Ind|Person=2|Tense=Preshavehave
Mood=Ind|Person=3|Tense=Pasthadhad
Mood=Ind|Person=3|Tense=Pres|Typo=Yeshas
Mood=Ind|Person=3|Tense=Preshashave

AUX

9403 AUX tokens (67% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (9402; 100%), Mood=Ind (9359; 100%), Person=3 (7315; 78%), Tense=Pres (7016; 75%).

AUX tokens may have the following values of Number:

Paradigm beSingPlur
_Be
Mood=Imp|Person=2|VerbForm=Finbe
Mood=Ind|Person=1|Tense=Past|VerbForm=Finwas, werewere, was
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin'm, am, ’mare, 're, ’re
Mood=Ind|Person=2|Tense=Past|VerbForm=Finwere, waswere
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin're, are, ’re're, are
Mood=Ind|Person=3|Tense=Past|Typo=Yes|VerbForm=Finwas, wherewere
Mood=Ind|Person=3|Tense=Past|VerbForm=Finwaswere, was
Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=Finisare
Mood=Ind|Person=3|Tense=Pres|VerbForm=Finis, 's, ’s, S’, sare, 're, ’re, R, am
Mood=Sub|Person=1|Tense=Past|VerbForm=Finwere
Mood=Sub|Person=3|Tense=Past|VerbForm=Finwere
Mood=Sub|Person=3|Tense=Pres|VerbForm=Finbebe

DET

1633 DET tokens (8% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Definite=EMPTY (1633; 100%), PronType=Dem (1632; 100%).

DET tokens may have the following values of Number:

Paradigm thisSingPlur
thisthese
Typo=Yesthis

SYM

38 SYM tokens (11% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: ExtPos=EMPTY (38; 100%).

SYM tokens may have the following values of Number:

NUM

4 NUM tokens (0% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (4; 100%), NumType=Frac (4; 100%).

NUM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: VERB –[nsubj]–> PRON (4785; 54%), NOUN –[nmod]–> NOUN (3821; 62%), NOUN –[compound]–> NOUN (2760; 67%), NOUN –[conj]–> NOUN (2434; 80%), VERB –[nsubj]–> NOUN (2329; 67%), NOUN –[nmod:poss]–> PRON (2157; 65%), PROPN –[flat]–> PROPN (1741; 100%), PROPN –[compound]–> PROPN (1535; 90%), NOUN –[cop]–> AUX (1511; 76%), VERB –[nsubj]–> PROPN (943; 71%).