home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

635498 tokens (42%) have a non-empty value of Gender. 109813 types (78%) occur at least once with a non-empty value of Gender. 41810 lemmas (79%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (360669; 24% instances), ADJ (91801; 6% instances), VERB (55679; 4% instances), PROPN (44965; 3% instances), PRON (39232; 3% instances), DET (32356; 2% instances), AUX (6205; 0% instances), NUM (4591; 0% instances).

NOUN

360669 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (311282; 86%), Number=Sing (254830; 71%).

NOUN tokens may have the following values of Gender:

Paradigm спецпитаниеMascFemNeut
Case=Accспецпитание
Case=Genспецпитанияспецпитания

Gender seems to be lexical feature of NOUN. 100% lemmas (18960) occur only with one value of Gender.

ADJ

91801 ADJ tokens (65% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (91801; 100%), Degree=Pos (90767; 99%).

ADJ tokens may have the following values of Gender:

Paradigm новыйMascFemNeut
Animacy=Anim|Case=Accнового
Animacy=Inan|Case=Accновый
Case=Accновуюновое
Case=Datновомуновойновому
Case=Genновогоновойнового
Case=Insновымновойновым
Case=Locновомновойновом
Case=Nomновыйноваяновое
Variant=Shortновнованово

VERB

55679 VERB tokens (32% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (55679; 100%), Person=EMPTY (55679; 100%), Tense=Past (51968; 93%), Mood=Ind (40915; 73%), VerbForm=Fin (40915; 73%), Aspect=Perf (35086; 63%), Voice=Act (33362; 60%).

VERB tokens may have the following values of Gender:

Paradigm мочьMascFemNeut
Aspect=Imp|Case=Acc|Tense=Pres|VerbForm=Partмогущую
Aspect=Imp|Case=Nom|Tense=Pres|VerbForm=Partмогущее
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Finмогмогламогло
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Finсмогсмогласмогло

PROPN

44965 PROPN tokens (90% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (44433; 99%), Animacy=Anim (24982; 56%).

PROPN tokens may have the following values of Gender:

Paradigm ЕльцинMascNeut
Case=AccЕльцина
Case=DatЕльцину
Case=GenЕльцинаЕльцина
Case=InsЕльциным
Case=LocЕльцине
Case=NomЕльцин

Gender seems to be lexical feature of PROPN. 99% lemmas (7976) occur only with one value of Gender.

PRON

39232 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (39221; 100%), Person=EMPTY (23347; 60%), Animacy=Inan (20912; 53%).

PRON tokens may have the following values of Gender:

Paradigm тоMascFemNeut
Case=Accтомто
Case=Dat|ExtPos=ADVтому
Case=Dat|ExtPos=NOUNт.
Case=Datтому, т.
Case=Gen|ExtPos=ADVтого
Case=Genтоготого
Case=Ins|ExtPos=ADVтемтем
Case=Ins|ExtPos=SCONJтем
Case=Insтемтем
Case=Loc|ExtPos=PRONтом
Case=Loc|ExtPos=VERBтом
Case=Locтом
Case=Nom|ExtPos=CCONJто, т., т
Case=Nom|ExtPos=PARTто
Case=Nom|ExtPos=SCONJто
Case=Nomто

Gender seems to be lexical feature of PRON. 93% lemmas (26) occur only with one value of Gender.

DET

32356 DET tokens (59% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (32354; 100%), Poss=EMPTY (26441; 82%).

DET tokens may have the following values of Gender:

Paradigm этотMascFemNeut
Animacy=Anim|Case=Accэтого
Animacy=Inan|Case=Accэтот
Case=Accэтот, этоэтуэто
Case=Datэтомуэтойэтому
Case=Genэтогоэтойэтого
Case=Insэтимэтойэтим
Case=Locэтомэтойэтом
Case=Nomэтотэтаэто

AUX

6205 AUX tokens (45% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (6205; 100%), Person=EMPTY (6205; 100%), Tense=Past (6205; 100%), Voice=Act (6205; 100%), Mood=Ind (6200; 100%), VerbForm=Fin (6200; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
Case=Loc|VerbForm=Partбывшем
Case=Nom|VerbForm=Partбывший
Mood=Ind|VerbForm=Finбылбылабыло

NUM

4591 NUM tokens (26% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (4591; 100%), NumType=Card (4500; 98%), Animacy=EMPTY (3670; 80%), Number=Sing (2695; 59%).

NUM tokens may have the following values of Gender:

Paradigm одинMascFemNeut
Animacy=Anim|Case=Accодного
Animacy=Inan|Case=Acc|ExtPos=NUMодин
Animacy=Inan|Case=Accодин
Case=Acc|ExtPos=ADVодин
Case=Acc|ExtPos=NUMодну
Case=Acc|ExtPos=PRONоднуодно
Case=Accоднуодно
Case=Datодномуоднойодному
Case=Gen|ExtPos=NUMодного
Case=Genодногооднойодного
Case=Insодним, однимиодной, одноюодним
Case=Loc|ExtPos=NUMодномодной
Case=Locодномоднойодном
Case=Nom|ExtPos=NUMодиноднаОдно
Case=Nom|ExtPos=PRONодин
Case=Nomодиноднаодно

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (69187; 65%), NOUN –[det]–> DET (21824; 58%), PROPN –[flat:name]–> PROPN (5762; 85%), VERB –[conj]–> VERB (5528; 55%), VERB –[nsubj]–> PROPN (4609; 67%), NOUN –[appos]–> PROPN (4254; 75%), ADJ –[nsubj]–> NOUN (3609; 62%), ADJ –[conj]–> ADJ (3359; 94%), NOUN –[amod]–> VERB (2683; 61%), NOUN –[appos]–> NOUN (2066; 58%).