home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

99232 tokens (50%) have a non-empty value of Number. 35609 types (94%) occur at least once with a non-empty value of Number. 16517 lemmas (82%) occur at least once with a non-empty value of Number. The feature is used with 8 part-of-speech tags: NOUN (43554; 22% instances), VERB (18408; 9% instances), ADJ (15894; 8% instances), PRON (10860; 6% instances), DET (5335; 3% instances), PROPN (3793; 2% instances), AUX (1194; 1% instances), NUM (194; 0% instances).

NOUN

43554 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Animacy=Inan (37106; 85%).

NOUN tokens may have the following values of Number:

Paradigm магазинSingPlur
Case=Accмагазинмагазины
Case=Datмагазину
Case=Genмагазинамагазинов
Case=Insмагазиноммагазинами
Case=Locмагазинемагазинах
Case=Nomмагазинмагазины

VERB

18408 VERB tokens (74% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (16768; 91%), Mood=Ind (15792; 86%), Voice=Act (13818; 75%), Gender=EMPTY (12523; 68%), Aspect=Imp (11161; 61%).

VERB tokens may have the following values of Number:

Paradigm бытьSingPlur
Aspect=Imp|Case=Nom|Gender=Masc|Tense=Past|VerbForm=Partбывший
Aspect=Imp|Gender=Masc|Mood=Ind|Tense=Past|Typo=Yes|VerbForm=Finбыл
Aspect=Imp|Gender=Masc|Mood=Ind|Tense=Past|VerbForm=Finбыл
Aspect=Imp|Gender=Fem|Mood=Ind|Tense=Past|VerbForm=Finбыла
Aspect=Imp|Gender=Neut|Mood=Ind|Tense=Past|VerbForm=Finбыло
Aspect=Imp|Mood=Imp|Person=2|Typo=Yes|VerbForm=FinБыди
Aspect=Imp|Mood=Ind|Person=1|Tense=Pres|VerbForm=Finесть
Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|Typo=Yes|VerbForm=FinЕсли
Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|VerbForm=Finестьесть
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Finбыли
Mood=Ind|Person=3|Tense=Fut|VerbForm=Finбудетбудут

ADJ

15894 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (15614; 98%), Variant=EMPTY (13532; 85%).

ADJ tokens may have the following values of Number:

Paradigm хорошийSingPlur
Animacy=Anim|Case=Accхороших
Animacy=Inan|Case=Accхорошие
Animacy=Inan|Case=Acc|Gender=Mascхороший
Animacy=Inan|Case=Acc|Typo=Yesхорошии
Case=Acc|Gender=Femхорошую
Case=Acc|Gender=Neutхорошее
Case=Datхорошим
Case=Dat|Gender=Femхорошей
Case=Genхороших
Case=Gen|Gender=Mascхорошего
Case=Gen|Gender=Femхорошей
Case=Gen|Gender=Neutхорошего
Case=Insхорошими
Case=Ins|Gender=Mascхорошим
Case=Ins|Gender=Femхорошей
Case=Ins|Gender=Neutхорошим
Case=Loc|Gender=Mascхорошем
Case=Loc|Gender=Neutхорошем
Case=Nomхорошие
Case=Nom|Gender=Mascхороший, хоро, хорошии
Case=Nom|Gender=Masc|Typo=YesХорлший
Case=Nom|Gender=Femхорошая, Шорошая
Case=Nom|Gender=Neutхорошее
Case=Nom|Gender=Neut|Typo=YesХорошое
Case=Nom|Typo=Yesхорошии
Gender=Masc|Variant=Shortхорош
Gender=Fem|Variant=Shortхороша
Gender=Neut|Variant=Shortхорошо
Variant=Shortхороши

PRON

10860 PRON tokens (97% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Animacy=EMPTY (6746; 62%), PronType=Prs (6399; 59%), Case=Nom (5587; 51%).

PRON tokens may have the following values of Number:

Paradigm онSingPlur
Case=Acc|Gender=Masc|PronType=Prsего, него, Эго
Case=Acc|Gender=Neut|PronType=Prsего
Case=Dat|Gender=Masc|PronType=Prsему, нему
Case=Dat|PronType=Prsим
Case=Gen|Gender=Masc|PronType=Prsнего, его
Case=Ins|Gender=Masc|PronType=Demим
Case=Ins|Gender=Masc|PronType=Prsним, им
Case=Ins|Gender=Neut|PronType=Prsним
Case=Loc|Gender=Masc|PronType=Prsнем, нём, нëм
Case=Nom|Gender=Masc|PronType=Prsон
Case=Nom|Gender=Masc|PronType=Prs|Typo=Yesона, от

Number seems to be lexical feature of PRON. 93% lemmas (56) occur only with one value of Number.

DET

5335 DET tokens (94% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Animacy=EMPTY (4429; 83%), Poss=EMPTY (4168; 78%).

DET tokens may have the following values of Number:

Paradigm этотSingPlur
Animacy=Anim|Case=Acc|Gender=Mascэтого
Animacy=Anim|Case=Accэтих
Animacy=Inan|Case=Acc|Gender=Mascэтот
Animacy=Inan|Case=Accэти, этих
Case=Acc|Gender=Femэту
Case=Acc|Gender=Neutэто
Case=Acc|Gender=Neut|Typo=Yesэто
Case=Dat|Gender=Mascэтому
Case=Dat|Gender=Femэтой
Case=Dat|Gender=Neutэтому
Case=Datэтим
Case=Gen|Gender=Mascэтого
Case=Gen|Gender=Femэтой
Case=Gen|Gender=Neutэтого
Case=Genэтих
Case=Ins|Gender=Mascэтим
Case=Ins|Gender=Femэтой
Case=Ins|Gender=Neutэтим
Case=Insэтими
Case=Loc|Gender=Mascэтом
Case=Loc|Gender=Femэтой
Case=Loc|Gender=Neutэтом
Case=Locэтих
Case=Nom|Gender=Mascэтот
Case=Nom|Gender=Femэта
Case=Nom|Gender=Neutэто
Case=Nom|Gender=Neut|Typo=Yesэто
Case=Nomэти

PROPN

3793 PROPN tokens (85% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (2411; 64%), Animacy=Anim (1986; 52%).

PROPN tokens may have the following values of Number:

Paradigm СочиSingPlur
Case=AccсочиСочи
Case=GenСочи
Case=LocСочи
Case=NomСочи

Number seems to be lexical feature of PROPN. 99% lemmas (1812) occur only with one value of Number.

AUX

1194 AUX tokens (76% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Voice=Act (1194; 100%), VerbForm=Fin (1192; 100%), Mood=Ind (1178; 99%), Aspect=Imp (877; 73%), Person=EMPTY (717; 60%), Tense=Past (717; 60%), Gender=EMPTY (609; 51%).

AUX tokens may have the following values of Number:

Paradigm бытьSingPlur
Aspect=Imp|Case=Nom|Gender=Masc|Tense=Past|VerbForm=Partбывший
Aspect=Imp|Case=Nom|Gender=Fem|Tense=Past|VerbForm=Partбывшая
Aspect=Imp|Gender=Masc|Mood=Ind|Tense=Past|VerbForm=Finбыл
Aspect=Imp|Gender=Fem|Mood=Ind|Tense=Past|VerbForm=Finбыла
Aspect=Imp|Gender=Neut|Mood=Ind|Tense=Past|VerbForm=Finбыло
Aspect=Imp|Mood=Ind|Person=1|Tense=Pres|VerbForm=Finесть
Aspect=Imp|Mood=Ind|Person=3|Tense=Pres|VerbForm=Finестьесть
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Finбыли
Mood=Imp|Person=2|Typo=Yes|VerbForm=Finбудте
Mood=Imp|Person=2|VerbForm=Finбудьбудьте
Mood=Ind|Person=1|Tense=Fut|VerbForm=Finбудубудем
Mood=Ind|Person=2|Tense=Fut|VerbForm=Finбудешьбудете
Mood=Ind|Person=3|Tense=Fut|VerbForm=Finбудетбудут

NUM

194 NUM tokens (6% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (194; 100%), NumType=Card (194; 100%), Gender=Masc (100; 52%).

NUM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (10423; 97%), NOUN –[nmod]–> NOUN (4723; 61%), VERB –[nsubj]–> NOUN (4261; 88%), NOUN –[det]–> DET (3933; 91%), VERB –[nsubj]–> PRON (3520; 96%), NOUN –[conj]–> NOUN (2742; 72%), VERB –[conj]–> VERB (2642; 74%), ADJ –[nsubj]–> NOUN (1244; 87%), ADJ –[conj]–> ADJ (807; 91%), VERB –[parataxis]–> VERB (700; 55%).