home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

633970 tokens (42%) have a non-empty value of Gender. 110069 types (78%) occur at least once with a non-empty value of Gender. 41705 lemmas (79%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (359763; 24% instances), ADJ (92822; 6% instances), VERB (55675; 4% instances), PROPN (43362; 3% instances), PRON (39190; 3% instances), DET (32357; 2% instances), AUX (6210; 0% instances), NUM (4591; 0% instances).

NOUN

359763 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (310448; 86%), Number=Sing (254111; 71%).

NOUN tokens may have the following values of Gender:

Paradigm спецпитаниеMascFemNeut
Case=Accспецпитание
Case=Genспецпитанияспецпитания

Gender seems to be lexical feature of NOUN. 100% lemmas (18759) occur only with one value of Gender.

ADJ

92822 ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (92822; 100%), Degree=Pos (91388; 98%).

ADJ tokens may have the following values of Gender:

Paradigm новыйMascFemNeut
Animacy=Anim|Case=Accнового
Animacy=Inan|Case=Accновый
Case=Accновуюновое
Case=Datновомуновойновому
Case=Genновогоновойнового
Case=Insновымновойновым
Case=Locновомновойновом
Case=Nomновыйноваяновое
Variant=Shortновнованово

VERB

55675 VERB tokens (32% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (55675; 100%), Person=EMPTY (55675; 100%), Tense=Past (51964; 93%), Mood=Ind (40910; 73%), VerbForm=Fin (40910; 73%), Aspect=Perf (35086; 63%), Voice=Act (33372; 60%).

VERB tokens may have the following values of Gender:

Paradigm мочьMascFemNeut
Case=Acc|Tense=Pres|VerbForm=Partмогущую
Case=Nom|Tense=Pres|VerbForm=Partмогущее
Mood=Ind|Tense=Past|VerbForm=Finмогмогламогло

PROPN

43362 PROPN tokens (88% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Abbr=EMPTY (43362; 100%), Number=Sing (42847; 99%), Animacy=Anim (25080; 58%).

PROPN tokens may have the following values of Gender:

Paradigm ЕльцинMascNeut
Case=AccЕльцина
Case=DatЕльцину
Case=GenЕльцина, ЕЛЬЦИНАЕльцина
Case=InsЕльциным
Case=LocЕльцине
Case=NomЕльцин

Gender seems to be lexical feature of PROPN. 99% lemmas (7927) occur only with one value of Gender.

PRON

39190 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (39179; 100%), Person=EMPTY (23312; 59%), Animacy=Inan (20879; 53%).

PRON tokens may have the following values of Gender:

Paradigm тоMascFemNeut
Case=Accтомто
Case=Dat|ExtPos=ADVтому
Case=Dat|ExtPos=NOUNт.
Case=Datтому
Case=Gen|ExtPos=ADVтого
Case=Genтоготого
Case=Ins|ExtPos=ADVтемтем
Case=Ins|ExtPos=SCONJтем
Case=Insтемтем
Case=Loc|ExtPos=PRONтом
Case=Loc|ExtPos=VERBтом
Case=Locтом
Case=Nom|ExtPos=CCONJто, т., т
Case=Nom|ExtPos=NOUNто
Case=Nom|ExtPos=PARTто
Case=Nom|ExtPos=SCONJто
Case=Nomто

Gender seems to be lexical feature of PRON. 93% lemmas (26) occur only with one value of Gender.

DET

32357 DET tokens (59% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (32355; 100%), Poss=EMPTY (26442; 82%).

DET tokens may have the following values of Gender:

Paradigm этотMascFemNeut
Animacy=Anim|Case=Accэтого
Animacy=Inan|Case=Accэтот
Case=Accэтот, этоэтуэто
Case=Datэтомуэтойэтому
Case=Genэтогоэтойэтого
Case=Insэтимэтойэтим
Case=Locэтомэтойэтом
Case=Nomэтотэтаэто

AUX

6210 AUX tokens (45% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (6210; 100%), Person=EMPTY (6210; 100%), Tense=Past (6210; 100%), Voice=Act (6210; 100%), Mood=Ind (6205; 100%), VerbForm=Fin (6205; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
Case=Loc|VerbForm=Partбывшем
Case=Nom|VerbForm=Partбывший
Mood=Ind|VerbForm=Finбылбылабыло

NUM

4591 NUM tokens (26% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (4591; 100%), NumType=Card (4500; 98%), Animacy=EMPTY (3670; 80%), Number=Sing (2695; 59%).

NUM tokens may have the following values of Gender:

Paradigm одинMascFemNeut
Animacy=Anim|Case=Accодного
Animacy=Inan|Case=Acc|ExtPos=NUMодин
Animacy=Inan|Case=Accодин
Case=Acc|ExtPos=ADVодин
Case=Acc|ExtPos=NUMодну
Case=Acc|ExtPos=PRONоднуодно
Case=Accоднуодно
Case=Datодномуоднойодному
Case=Gen|ExtPos=NUMодного
Case=Genодногооднойодного
Case=Insодним, однимиодной, одноюодним
Case=Loc|ExtPos=NUMодномодной
Case=Locодномоднойодном
Case=Nom|ExtPos=ADVодин
Case=Nom|ExtPos=NUMодиноднаОдно
Case=Nom|ExtPos=PRONодин
Case=Nomодиноднаодно

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (69767; 65%), NOUN –[det]–> DET (21807; 58%), PROPN –[flat:name]–> PROPN (5836; 86%), VERB –[conj]–> VERB (5530; 55%), VERB –[nsubj]–> PROPN (4571; 67%), NOUN –[appos]–> PROPN (4319; 75%), ADJ –[nsubj]–> NOUN (3605; 61%), ADJ –[conj]–> ADJ (3421; 94%), NOUN –[amod]–> VERB (2684; 61%), NOUN –[appos]–> NOUN (2030; 58%).