
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: Features: Number

This feature is universal. It occurs with 3 different values: Dual, Plur, Sing.

893561 tokens (51%) have a non-empty value of Number. 149696 types (99%) occur at least once with a non-empty value of Number. 47100 lemmas (85%) occur at least once with a non-empty value of Number. The feature is used with 8 part-of-speech tags; each count is followed by its share of all tokens in the treebank: NOUN (373634; 21%), VERB (166378; 9%), ADJ (146580; 8%), PRON (84719; 5%), DET (55996; 3%), PROPN (54576; 3%), AUX (9777; 1%), NUM (1901; 0%).
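As an illustration of how counts like these can be derived, here is a minimal sketch that tallies tokens with a non-empty Number value per part-of-speech tag in CoNLL-U text. The sample sentence and the function names `parse_feats` and `number_counts` are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
from collections import Counter

# Hypothetical three-token CoNLL-U fragment (not taken from UD_Russian-Taiga).
SAMPLE = """\
1\tруки\tрука\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Plur\t2\tnsubj\t_\t_
2\tмогут\tмочь\tVERB\t_\tAspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act\t0\troot\t_\t_
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""

def parse_feats(column):
    """Turn a FEATS column such as 'Case=Nom|Number=Plur' into a dict."""
    if column == "_":
        return {}
    return dict(pair.split("=", 1) for pair in column.split("|"))

def number_counts(conllu_text):
    """Return (tokens with Number, all tokens, per-UPOS counts of Number)."""
    with_number, total, by_upos = 0, 0, Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (1-2) and empty nodes (1.1).
        if not cols[0].isdigit():
            continue
        total += 1
        if "Number" in parse_feats(cols[5]):
            with_number += 1
            by_upos[cols[3]] += 1
    return with_number, total, by_upos

with_number, total, by_upos = number_counts(SAMPLE)
print(f"{with_number} of {total} tokens have Number: {dict(by_upos)}")
```

Dividing each per-tag count by the total number of tokens gives shares comparable to the percentages listed above.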

NOUN

373634 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Animacy=Inan (310893; 83%).
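A co-occurrence figure of this kind can be sketched as follows. The token list and the function name `cooccurring_values` are hypothetical, and the sketch counts only features that are present on a token; the EMPTY pseudo-value that appears in some of the lists on this page (apparently marking an absent feature) is not modelled.

```python
from collections import Counter

# Hypothetical tokens as (UPOS, features) pairs; not real treebank data.
TOKENS = [
    ("NOUN", {"Animacy": "Inan", "Case": "Nom", "Number": "Sing"}),
    ("NOUN", {"Animacy": "Inan", "Case": "Acc", "Number": "Plur"}),
    ("NOUN", {"Animacy": "Anim", "Case": "Nom", "Number": "Sing"}),
    ("NOUN", {"Animacy": "Inan", "Case": "Gen"}),  # no Number: ignored
    ("VERB", {"Number": "Sing", "VerbForm": "Fin"}),
]

def cooccurring_values(tokens, upos, feature="Number"):
    """Count the other Feature=Value pairs on tokens of `upos` that carry `feature`."""
    counts, total = Counter(), 0
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        total += 1
        for name, value in feats.items():
            if name != feature:
                counts[f"{name}={value}"] += 1
    # Report (pair, count, share of tokens that carry the feature), most frequent first.
    return [(pair, n, round(100 * n / total)) for pair, n in counts.most_common()]

for pair, n, pct in cooccurring_values(TOKENS, "NOUN"):
    print(f"{pair} ({n}; {pct}%)")
```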

NOUN tokens may have the following values of Number:

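Paradigm рука (columns: Sing, Dual, Plur)
Case=Acc: Sing руку; Dual руце; Plur руки, ру́ки, руки́
Case=Dat: Sing руке; Plur рукам
Case=Gen: Sing руки; Plur рук
Case=Ins: Sing рукой, рукою; Plur руками
Case=Loc: Sing руке; Plur руках
Case=Nom: Sing рука, рука́; Plur руки, руци
Case=Nom|Typo=Yes: вуки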

VERB

166378 VERB tokens (79% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (143627; 86%), Mood=Ind (137621; 83%), Voice=Act (119127; 72%), Person=EMPTY (103100; 62%), Tense=Past (97192; 58%), Gender=EMPTY (90234; 54%), Aspect=Imp (87406; 53%).

VERB tokens may have the following values of Number:

Paradigm мочь (columns: Sing, Plur)
Case=Acc|Gender=Neut|Tense=Pres|VerbForm=Part: Sing могущее
Case=Gen|Gender=Masc|Tense=Pres|VerbForm=Part: Sing могущего
Case=Gen|Gender=Neut|Tense=Pres|VerbForm=Part: Sing могущего
Case=Gen|Tense=Pres|VerbForm=Part: Plur могущих
Case=Nom|Gender=Masc|Tense=Pres|VerbForm=Part: Sing могущий
Case=Nom|Gender=Fem|Tense=Pres|VerbForm=Part: Sing могущая
Case=Nom|Tense=Pres|Variant=Short|VerbForm=Part: могуще
Case=Nom|Tense=Pres|VerbForm=Part: Plur могущие
Gender=Masc|Mood=Ind|Tense=Past|VerbForm=Fin: Sing мог
Gender=Fem|Mood=Ind|Tense=Past|VerbForm=Fin: Sing могла
Gender=Neut|Mood=Ind|Tense=Past|VerbForm=Fin: Sing могло
Mood=Ind|Person=1|Tense=Pres|Typo=Yes|VerbForm=Fin: Sing могу-у-у
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin: Sing могу; Plur можем
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin: Sing можешь, могешь; Plur можете
Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=Fin: могуь
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin: Sing может, могёт, може; Plur могут
Mood=Ind|Tense=Past|VerbForm=Fin: Plur могли

ADJ

146580 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (140773; 96%).

ADJ tokens may have the following values of Number:

Paradigm русский (columns: Sing, Plur)
Animacy=Anim|Case=Acc: Plur русских
Animacy=Anim|Case=Acc|Gender=Masc: Sing русского
Animacy=Inan|Case=Acc: Plur русские
Animacy=Inan|Case=Acc|Gender=Masc: Sing русский, русская
Case=Acc|Gender=Fem: Sing русскую
Case=Acc|Gender=Fem|Typo=Yes: Sing Рускую
Case=Acc|Gender=Neut: Sing русское
Case=Dat: Plur русским
Case=Dat|Gender=Masc: Sing русскому
Case=Dat|Gender=Fem: Sing русской
Case=Dat|Gender=Neut: Sing русскому
Case=Gen: Plur русских
Case=Gen|Gender=Masc: Sing русского
Case=Gen|Gender=Fem: Sing русской, Русския
Case=Gen|Gender=Fem|Typo=Yes: Sing Рускыя
Case=Gen|Gender=Neut: Sing русского
Case=Ins: Plur русскими
Case=Ins|Gender=Masc: Sing русским
Case=Ins|Gender=Fem: Sing русской
Case=Ins|Gender=Neut: Sing русским
Case=Loc: Plur русских
Case=Loc|Gender=Masc: Sing русском
Case=Loc|Gender=Fem: Sing русской
Case=Loc|Gender=Neut: Sing русском
Case=Nom: Plur русские
Case=Nom|Gender=Masc: Sing русский
Case=Nom|Gender=Fem: Sing русская
Case=Nom|Gender=Neut: Sing русское
Case=Nom|Typo=Yes: Plur руския

PRON

84719 PRON tokens (95% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: PronType=Prs (60180; 71%), Animacy=EMPTY (60179; 71%), Case=Nom (45142; 53%).

PRON tokens may have the following values of Number:

Paradigm он (columns: Sing, Plur)
Case=Acc|Gender=Masc: Sing его, него, Эго
Case=Acc|Gender=Neut: Sing его
Case=Dat|Gender=Masc: Sing ему, нему
Case=Dat: Plur им
Case=Gen|Gender=Masc: Sing него, его
Case=Ins|Gender=Masc: Sing ним, им
Case=Ins|Gender=Neut: Sing ним
Case=Loc|Gender=Masc: Sing нем, нём, нëм
Case=Nom|Gender=Masc: Sing он
Case=Nom|Gender=Masc|Typo=Yes: Sing она, от

Number seems to be a lexical feature of PRON. 91% of lemmas (60) occur with only one value of Number.
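The check behind a statement like this can be sketched as follows: collect the set of Number values seen for each lemma of a given tag and count the lemmas whose set has exactly one member. The observations and the function name `single_value_share` are hypothetical; the threshold at which a feature is reported as lexical is not stated on this page and is not modelled here.

```python
from collections import defaultdict

# Hypothetical (lemma, UPOS, Number) observations; not real treebank data.
OBSERVATIONS = [
    ("я", "PRON", "Sing"),
    ("я", "PRON", "Sing"),
    ("мы", "PRON", "Plur"),
    ("кто", "PRON", "Sing"),
    ("он", "PRON", "Sing"),
    ("он", "PRON", "Plur"),
]

def single_value_share(observations, upos):
    """Return (lemmas seen with exactly one Number value, all lemmas with Number)."""
    values = defaultdict(set)
    for lemma, tag, number in observations:
        if tag == upos:
            values[lemma].add(number)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

single, total = single_value_share(OBSERVATIONS, "PRON")
print(f"{single} of {total} lemmas ({100 * single / total:.0f}%) occur with only one value")
```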

DET

55996 DET tokens (88% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Animacy=EMPTY (48901; 87%), Poss=EMPTY (44621; 80%).

DET tokens may have the following values of Number:

Paradigm этот (columns: Sing, Plur)
Animacy=Anim|Case=Acc|Gender=Masc: Sing этого
Animacy=Anim|Case=Acc: Plur этих
Animacy=Inan|Case=Acc|ExtPos=DET|Gender=Masc: Sing этот
Animacy=Inan|Case=Acc|ExtPos=DET: Plur эти
Animacy=Inan|Case=Acc|Gender=Masc: Sing этот
Animacy=Inan|Case=Acc: Plur эти, этих
Animacy=Inan|Case=Gen|Gender=Neut: Sing этого
Animacy=Inan|Case=Nom|Gender=Masc: Sing этот
Case=Acc|ExtPos=DET|Gender=Fem: Sing эту
Case=Acc|ExtPos=DET|Gender=Neut: Sing это
Case=Acc|Gender=Fem: Sing эту
Case=Acc|Gender=Neut: Sing это
Case=Acc|Gender=Neut|Typo=Yes: Sing это
Case=Dat|ExtPos=DET|Gender=Masc: Sing этому
Case=Dat|ExtPos=DET|Gender=Fem: Sing этой
Case=Dat|ExtPos=DET|Gender=Neut: Sing этому
Case=Dat|ExtPos=DET: Plur этим
Case=Dat|Gender=Masc: Sing этому
Case=Dat|Gender=Fem: Sing этой
Case=Dat|Gender=Neut: Sing этому
Case=Dat: Plur этим
Case=Gen|ExtPos=DET|Gender=Masc: Sing этого
Case=Gen|ExtPos=DET|Gender=Fem: Sing этой
Case=Gen|ExtPos=DET|Gender=Neut: Sing этого
Case=Gen|ExtPos=DET: Plur этих
Case=Gen|Gender=Masc: Sing этого
Case=Gen|Gender=Fem: Sing этой
Case=Gen|Gender=Neut: Sing этого
Case=Gen|Gender=Neut|Typo=Yes: Sing этово
Case=Gen: Plur этих
Case=Ins|ExtPos=DET|Gender=Masc: Sing этим
Case=Ins|ExtPos=DET|Gender=Fem: Sing этой
Case=Ins|ExtPos=DET|Gender=Neut: Sing этим
Case=Ins|ExtPos=DET: Plur Этими
Case=Ins|Gender=Masc: Sing этим
Case=Ins|Gender=Fem: Sing этой, этою
Case=Ins|Gender=Neut: Sing этим
Case=Ins: Plur этими
Case=Loc|ExtPos=DET|Gender=Masc: Sing этом
Case=Loc|ExtPos=DET|Gender=Fem: Sing этой
Case=Loc|ExtPos=DET: Plur этих
Case=Loc|Gender=Masc: Sing этом
Case=Loc|Gender=Fem: Sing этой
Case=Loc|Gender=Neut: Sing этом
Case=Loc: Plur этих
Case=Nom|ExtPos=DET|Gender=Masc: Sing этот
Case=Nom|ExtPos=DET|Gender=Fem: Sing эта
Case=Nom|ExtPos=DET|Gender=Neut: Sing Это
Case=Nom|ExtPos=DET: Plur эти
Case=Nom|Gender=Masc: Sing этот, Это
Case=Nom|Gender=Fem: Sing эта
Case=Nom|Gender=Neut: Sing это
Case=Nom|Gender=Neut|Typo=Yes: Sing это
Case=Nom: Plur эти

PROPN

54576 PROPN tokens (81% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Abbr=EMPTY (54565; 100%), Animacy=Anim (45261; 83%), Gender=Masc (35768; 66%), Case=Nom (29185; 53%).

PROPN tokens may have the following values of Number:

Paradigm Пушкин (columns: Sing, Plur)
Animacy=Anim|Case=Acc|NameType=Sur: Sing Пушкина
Animacy=Anim|Case=Acc|NameType=Sur|Typo=Yes: Sing Пуш-ки-на
Animacy=Anim|Case=Dat|NameType=Sur: Sing Пушкину
Animacy=Anim|Case=Gen|NameType=Sur: Sing Пушкина; Plur Пушкиных
Animacy=Anim|Case=Ins|NameType=Sur: Sing Пушкиным
Animacy=Anim|Case=Loc|NameType=Sur: Sing Пушкине
Animacy=Anim|Case=Nom|NameType=Sur: Sing Пушкин; Plur Пушкины
Animacy=Inan|Case=Loc|NameType=Geo: Sing Пушкине
Animacy=Inan|Case=Nom|NameType=Geo: Sing Пушкин

Number seems to be a lexical feature of PROPN. 98% of lemmas (7847) occur with only one value of Number.

AUX

9777 AUX tokens (75% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Voice=Act (9777; 100%), VerbForm=Fin (9772; 100%), Mood=Ind (9654; 99%), Tense=Past (7476; 76%), Person=EMPTY (7474; 76%).

AUX tokens may have the following values of Number:

Paradigm быть (columns: Sing, Plur)
Animacy=Anim|Aspect=Imp|Case=Acc|Gender=Masc|Tense=Past|VerbForm=Part: Sing бывшего
Aspect=Imp|Case=Nom|Gender=Masc|Tense=Past|VerbForm=Part: Sing бывший
Aspect=Imp|Case=Nom|Gender=Fem|Tense=Past|VerbForm=Part: Sing бывшая
Aspect=Imp|Case=Nom|Tense=Past|VerbForm=Part: Plur бывшие
Aspect=Imp|Gender=Masc|Mood=Ind|Tense=Past|VerbForm=Fin: Sing был
Aspect=Imp|Gender=Fem|Mood=Ind|Tense=Past|VerbForm=Fin: Sing была
Aspect=Imp|Gender=Neut|Mood=Ind|Tense=Past|VerbForm=Fin: Sing было
Aspect=Imp|Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin: есть
Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin: Sing есть; Plur есть
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Fin: Plur были
Gender=Masc|Mood=Ind|Tense=Past|VerbForm=Fin: Sing был, бывший
Gender=Fem|Mood=Ind|Tense=Past|VerbForm=Fin: Sing была, бывшая
Gender=Neut|Mood=Ind|Tense=Past|VerbForm=Fin: Sing было
Mood=Imp|Person=2|Typo=Yes|VerbForm=Fin: Plur будте
Mood=Imp|Person=2|VerbForm=Fin: Sing будь; Plur будьте
Mood=Ind|Person=1|Tense=Fut|VerbForm=Fin: Sing буду; Plur будем
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin: Sing есмь; Plur есмы
Mood=Ind|Person=2|Tense=Fut|VerbForm=Fin: Sing будешь; Plur будете
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin: Sing еси; Plur есте
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin: Sing будет; Plur будут
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin: Sing беаше, бысть
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin: Sing есть; Plur есть, суть
Mood=Ind|Tense=Past|VerbForm=Fin: Plur были, бывших

NUM

1901 NUM tokens (15% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (1901; 100%), NumType=Card (1901; 100%).

NUM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (104901; 98%), NOUN –[nmod]–> NOUN (46735; 61%), VERB –[nsubj]–> NOUN (44646; 94%), NOUN –[det]–> DET (37383; 84%), VERB –[obl]–> NOUN (34370; 53%), VERB –[nsubj]–> PRON (31180; 97%), NOUN –[conj]–> NOUN (30662; 83%), VERB –[conj]–> VERB (23694; 83%), VERB –[nsubj]–> PROPN (12032; 91%), ADJ –[conj]–> ADJ (10211; 98%).
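Agreement counts of this kind can be sketched as below. The sentence and the function name `agreement_stats` are hypothetical, and the sketch assumes that the percentage is taken over parent-child pairs in which both nodes carry Number; the page does not state which denominator it uses.

```python
from collections import Counter

# Hypothetical sentence as (id, UPOS, Number or None, head id, deprel); not real data.
SENTENCE = [
    (1, "ADJ", "Plur", 2, "amod"),
    (2, "NOUN", "Plur", 3, "nsubj"),
    (3, "VERB", "Plur", 0, "root"),
    (4, "NOUN", "Sing", 3, "obl"),
]

def agreement_stats(sentence):
    """Map each 'PARENT -[deprel]-> CHILD' pattern to (agreeing pairs, all pairs),
    counting only pairs where both nodes have a Number value (an assumption)."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for tok_id, upos, number, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root, whose head id is 0
        if parent is None or number is None or parent[2] is None:
            continue
        key = f"{parent[1]} -[{deprel}]-> {upos}"
        total[key] += 1
        if number == parent[2]:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}

for pattern, (n_agree, n_all) in agreement_stats(SENTENCE).items():
    print(f"{pattern} ({n_agree}; {100 * n_agree / n_all:.0f}%)")
```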