This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

461772 tokens (42%) have a non-empty value of Gender. 89151 types (77%) occur at least once with a non-empty value of Gender. 35251 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (270589; 24% instances), ADJ (71465; 6% instances), VERB (37473; 3% instances), PROPN (36937; 3% instances), PRON (21747; 2% instances), DET (16428; 1% instances), AUX (4141; 0% instances), NUM (2992; 0% instances).
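Counts of this kind can be approximated directly from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten tab-separated CoNLL-U columns, and the helper names `parse_feats` and `gender_coverage` as well as the three-token sample are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_coverage(conllu_text):
    """Count tokens per UPOS tag: all tokens, and tokens with a Gender value."""
    total, with_gender = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return total, with_gender


# Hypothetical toy input, not taken from UD_Russian-SynTagRus.
SAMPLE = "\n".join([
    "# text = Она могла.",
    "1\tОна\tона\tPRON\t_\tCase=Nom|Gender=Fem|Number=Sing|Person=3\t2\tnsubj\t_\t_",
    "2\tмогла\tмочь\tVERB\t_\tAspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

total, with_gender = gender_coverage(SAMPLE)
for upos in total:
    share = with_gender[upos] / total[upos]
    print(f"{upos}: {with_gender[upos]} of {total[upos]} tokens ({share:.0%}) have Gender")
```

Run on the full treebank file instead of `SAMPLE`, the same loop would yield per-tag totals comparable to the ones listed above, provided the counting conventions match.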

NOUN

270589 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (235101; 87%), Number=Sing (189516; 70%).

NOUN tokens may have the following values of Gender:

Paradigm о Masc Fem Neut
Case=Acc|Number=Sing о
Case=Acc|Number=Plur о.
Case=Nom|Number=Sing О

Gender seems to be a lexical feature of NOUN: 100% of lemmas (16345) occur with only one value of Gender.
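The lexical-feature test can be illustrated as the share of lemmas that are attested with exactly one Gender value. This is a sketch under assumptions: the function name `single_gender_share`, the `(lemma, upos, gender)` input tuples, and the toy tokens are hypothetical, and the page's own counting rules may differ in detail.

```python
from collections import defaultdict


def single_gender_share(tokens, upos="NOUN"):
    """Return (number of lemmas, share of lemmas with exactly one Gender value).

    tokens: iterable of (lemma, upos, gender) where gender is None if absent.
    """
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return len(values), single / len(values)


# Hypothetical toy data: "сирота" is tagged with two different genders.
TOKENS = [
    ("книга", "NOUN", "Fem"),
    ("книга", "NOUN", "Fem"),
    ("стол", "NOUN", "Masc"),
    ("сирота", "NOUN", "Masc"),
    ("сирота", "NOUN", "Fem"),
    ("другой", "ADJ", "Masc"),  # ignored: not a NOUN
]

n_lemmas, share = single_gender_share(TOKENS)
print(f"{share:.0%} of {n_lemmas} NOUN lemmas occur with only one Gender value")
```

A share close to 100% suggests the feature is fixed per lemma (lexical); a markedly lower share, as would be expected for ADJ, suggests it is inflectional.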

ADJ

71465 ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (71465; 100%), Degree=Pos (71073; 99%).

ADJ tokens may have the following values of Gender:

Paradigm другой         Masc      Fem              Neut
Animacy=Anim|Case=Acc   другого   -                -
Animacy=Inan|Case=Acc   другой    -                -
Case=Acc                -         другую, другой   другое
Case=Dat                другому   другой           другому
Case=Gen                другого   другой           другого
Case=Ins                другим    другой           другим
Case=Loc                другом    другой           другом
Case=Nom                другой    другая, другой   другое, др.

VERB

37473 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (37473; 100%), Person=EMPTY (37473; 100%), Tense=Past (34600; 92%), Case=EMPTY (30137; 80%), Mood=Ind (26319; 70%), VerbForm=Fin (26319; 70%), Aspect=Perf (23620; 63%), Voice=Act (21661; 58%).

VERB tokens may have the following values of Gender:

Paradigm мочь                                   Masc   Fem       Neut
Aspect=Imp|Case=Acc|Tense=Pres|VerbForm=Part    -      могущую   -
Aspect=Imp|Case=Nom|Tense=Pres|VerbForm=Part    -      -         могущее
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Fin     мог    могла     могло
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Fin    смог   смогла    смогло

PROPN

36937 PROPN tokens (89% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (36238; 98%), Animacy=Anim (19096; 52%).

PROPN tokens may have the following values of Gender:

Paradigm GONGO Masc Fem Neut
Case=Gen|Number=Sing GONGO
Case=Ins|Number=Plur GONGO
Case=Nom|Number=Plur GONGO

Gender seems to be a lexical feature of PROPN: 98% of lemmas (6764) occur with only one value of Gender.

PRON

21747 PRON tokens (44% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (21744; 100%), Person=EMPTY (11116; 51%), Animacy=Inan (11109; 51%).

PRON tokens may have the following values of Gender:

Paradigm то Masc Fem Neut
Case=Acc том то
Case=Dat тому, т.
Case=Gen того того
Case=Ins тем тем
Case=Loc том
Case=Nom то, т., т

DET

16428 DET tokens (59% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (16426; 100%).

DET tokens may have the following values of Gender:

Paradigm этот   Masc               Fem    Neut
Case=Acc        этот, этого, это   эту    это
Case=Dat        этому              этой   этому
Case=Gen        этого              этой   этого
Case=Ins        этим               этой   этим
Case=Loc        этом               этой   этом
Case=Nom        этот               эта    это

AUX

4141 AUX tokens (43% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (4141; 100%), Number=Sing (4141; 100%), Person=EMPTY (4141; 100%), Tense=Past (4141; 100%), Voice=Act (4141; 100%), Mood=Ind (4138; 100%), VerbForm=Fin (4138; 100%).

AUX tokens may have the following values of Gender:

Paradigm быть Masc Fem Neut
Case=Loc|VerbForm=Part бывшем
Case=Nom|VerbForm=Part бывший
Mood=Ind|VerbForm=Fin был была было

NUM

2992 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm один           Masc     Fem     Neut
Animacy=Anim|Case=Acc   одного   -       -
Animacy=Inan|Case=Acc   один     -       -
Case=Acc                -        одну    одно
Case=Dat                одному   одной   одному
Case=Gen                одного   одной   одного
Case=Ins                одним    одной   одним
Case=Loc                одном    одной   одном
Case=Nom                один     одна    одно

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (52763; 66%), NOUN –[det]–> DET (13954; 58%), PROPN –[flat:name]–> PROPN (5313; 99%), NOUN –[acl]–> VERB (3968; 50%), NOUN –[appos]–> PROPN (3525; 80%), VERB –[conj]–> VERB (3358; 53%), VERB –[nsubj]–> PROPN (3146; 63%), ADJ –[nsubj]–> NOUN (2888; 62%), ADJ –[conj]–> ADJ (2500; 94%), NOUN –[amod]–> VERB (2054; 60%).
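Agreement counts like these can be sketched by walking each dependency edge and comparing the Gender of parent and child. The code below is an illustration under assumptions, not the generator of this page: the names `gender_of` and `agreement_counts`, the token tuples, and the two toy sentences are hypothetical, and taking all edges of a given type as the denominator for the percentage is a guess about how the figures above were computed.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(sentences):
    """Count edges (parent UPOS, deprel, child UPOS): agreeing in Gender, and all.

    sentences: list of sentences; each token is (id, upos, feats, head, deprel).
    """
    agree, total = Counter(), Counter()
    for sent in sentences:
        by_id = {tok[0]: tok for tok in sent}
        for _tid, upos, feats, head, deprel in sent:
            if head == 0:
                continue  # the root has no parent to agree with
            parent = by_id[head]
            key = (parent[1], deprel, upos)
            total[key] += 1
            child_gender = gender_of(feats)
            if child_gender is not None and child_gender == gender_of(parent[2]):
                agree[key] += 1
    return agree, total


# Hypothetical toy sentences: "другая книга лежала" and "другие книги".
SENTENCES = [
    [
        (1, "ADJ", "Case=Nom|Degree=Pos|Gender=Fem|Number=Sing", 2, "amod"),
        (2, "NOUN", "Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing", 3, "nsubj"),
        (3, "VERB", "Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin", 0, "root"),
    ],
    [
        (1, "ADJ", "Case=Nom|Degree=Pos|Number=Plur", 2, "amod"),  # plural: no Gender
        (2, "NOUN", "Animacy=Inan|Case=Nom|Gender=Fem|Number=Plur", 0, "root"),
    ],
]

agree, total = agreement_counts(SENTENCES)
for (parent, deprel, child), n in agree.most_common(10):
    share = n / total[(parent, deprel, child)]
    print(f"{parent} -[{deprel}]-> {child} ({n}; {share:.0%})")
```

In the toy data the plural adjective carries no Gender, so only one of the two `amod` edges counts as agreeing. The same effect may explain why the NOUN-ADJ share above is well below 100%.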