This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ru/feat issue tracker

Gender: gender

Gender is a lexical feature of nouns and inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. There are three values of gender: masculine, feminine, and neuter.

See also the related feature of Animacy.

Masc: masculine gender

Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.

Examples

Fem: feminine gender

Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.

Examples

Neut: neuter gender

This third gender is for nouns that are neither masculine nor feminine (grammatically). Nouns whose nominative suffix is -о  or -е  (including a large group of deverbative nouns denoting actions) are usually neuter.

Examples


Treebank Statistics (UD_Russian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

51180 tokens (51%) have a non-empty value of Gender. 25253 types (84%) occur at least once with a non-empty value of Gender. 15578 lemmas (83%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: ru-pos/NOUN (27197; 27% instances), ru-pos/ADJ (9591; 10% instances), ru-pos/PROPN (7074; 7% instances), ru-pos/VERB (4010; 4% instances), ru-pos/PRON (1412; 1% instances), ru-pos/DET (851; 1% instances), ru-pos/NUM (601; 1% instances), ru-pos/AUX (444; 0% instances).

NOUN

27197 ru-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (23438; 86%), Number=Sing (20473; 75%).

NOUN tokens may have the following values of Gender:

Paradigm WINDOWSMascFemNeut
Case=Acc|Number=SingWindows
Case=Loc|Number=PlurWindows
Case=Nom|Number=PlurWindows

Gender seems to be lexical feature of NOUN. 99% lemmas (6314) occur only with one value of Gender.

ADJ

9591 ru-pos/ADJ tokens (78% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9590; 100%), Animacy=Inan (8715; 91%), Variant=Full (6889; 72%).

ADJ tokens may have the following values of Gender:

Paradigm ПЕРВЫЙMascFemNeut
Animacy=Anim|Case=Genпервогопервой
Animacy=Anim|Case=Insпервым
Animacy=Anim|Case=Nomпервый
Animacy=Inan|Case=Accпервыйпервуюпервое
Animacy=Inan|Case=Datпервому
Animacy=Inan|Case=Genпервогопервойпервого
Animacy=Inan|Case=Insпервымпервойпервым
Animacy=Inan|Case=Locпервомпервойпервом
Animacy=Inan|Case=Nomпервыйперваяпервое

PROPN

7074 ru-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (6837; 97%), Animacy=Inan (3654; 52%).

PROPN tokens may have the following values of Gender:

Paradigm ДЕMascFemNeut
Animacy=Anim|Case=Accде
Animacy=Anim|Case=Genде
Animacy=Anim|Case=Insдеде
Animacy=Anim|Case=Locде
Animacy=Anim|Case=Nomде
Animacy=Inan|Case=LocДе
Animacy=Inan|Case=Nomде

Gender seems to be lexical feature of PROPN. 99% lemmas (4836) occur only with one value of Gender.

VERB

4010 ru-pos/VERB tokens (46% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (4010; 100%), Person=EMPTY (4010; 100%), Tense=Past (3765; 94%), Variant=EMPTY (3257; 81%), Aspect=Perf (2640; 66%), Animacy=EMPTY (2587; 65%), VerbForm=Fin (2587; 65%), Mood=Ind (2587; 65%), Case=EMPTY (2587; 65%), Voice=EMPTY (2587; 65%).

VERB tokens may have the following values of Gender:

Paradigm БЫТЬMascFemNeut
былбылабыло

PRON

1412 ru-pos/PRON tokens (74% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1412; 100%), Person=EMPTY (774; 55%).

PRON tokens may have the following values of Gender:

Paradigm КОТОРЫЙMascFemNeut
Animacy=Anim|Case=Accкоторого, которыйкоторую
Animacy=Anim|Case=Datкоторому
Animacy=Anim|Case=Genкоторогокоторой
Animacy=Anim|Case=Insкоторымкоторой
Animacy=Anim|Case=Nomкоторыйкоторая
Animacy=Inan|Case=Accкоторыйкоторуюкоторое, которого
Animacy=Inan|Case=Datкоторомукоторойкоторому
Animacy=Inan|Case=Genкоторогокоторойкоторого
Animacy=Inan|Case=Insкоторымкоторой
Animacy=Inan|Case=Locкоторомкоторойкотором
Animacy=Inan|Case=Nomкоторыйкотораякоторое

Gender seems to be lexical feature of PRON. 92% lemmas (12) occur only with one value of Gender.

DET

851 ru-pos/DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (851; 100%), Person=EMPTY (814; 96%), Animacy=Inan (767; 90%), Reflex=EMPTY (651; 76%).

DET tokens may have the following values of Gender:

Paradigm ЭТОТMascFemNeut
Animacy=Anim|Case=Accэтого
Animacy=Anim|Case=Genэтого
Animacy=Anim|Case=Locэтом
Animacy=Anim|Case=Nomэтот
Animacy=Inan|Case=Accэтотэтуэто
Animacy=Inan|Case=Datэтомуэтойэтому
Animacy=Inan|Case=Genэтогоэтой, этоэтого
Animacy=Inan|Case=Insэтимэтойэтим
Animacy=Inan|Case=Locэтомэтойэтом
Animacy=Inan|Case=Nomэтотэтаэто

NUM

601 ru-pos/NUM tokens (30% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Animacy=Inan (485; 81%), Number=Sing (303; 50%).

NUM tokens may have the following values of Gender:

Paradigm ОДИНMascFemNeut
Animacy=Anim|Case=Acc|Number=Singодного
Animacy=Anim|Case=Dat|Number=Singодному
Animacy=Anim|Case=Gen|Number=Singодногоодного
Animacy=Anim|Case=Ins|Number=Singоднимодной
Animacy=Anim|Case=Nom|Number=Singодинодна
Animacy=Inan|Case=Acc|Number=Singодиноднуодно, одного
Animacy=Inan|Case=Dat|Number=Singодномуодной
Animacy=Inan|Case=Gen|Number=Singодногооднойодного
Animacy=Inan|Case=Ins|Number=Singоднимоднойодним
Animacy=Inan|Case=Loc|Number=Singодномоднойодном
Animacy=Inan|Case=Loc|Number=Plurодних
Animacy=Inan|Case=Nom|Number=Singодиноднаодно

Gender seems to be lexical feature of NUM. 93% lemmas (124) occur only with one value of Gender.

AUX

444 ru-pos/AUX tokens (72% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (444; 100%), Tense=Past (444; 100%), Number=Sing (444; 100%), VerbForm=Fin (442; 100%), Mood=Ind (442; 100%).

AUX tokens may have the following values of Gender:

Paradigm БЫТЬMascFemNeut
Animacy=Anim|Case=Gen|VerbForm=Part|Voice=Actбывшего
Animacy=Anim|Case=Ins|VerbForm=Part|Voice=Actбывшим
Mood=Ind|VerbForm=Finбылбылабыло

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (7454; 73%), NOUN –[conj]–> NOUN (1019; 55%), PROPN –[name]–> PROPN (987; 99%), NOUN –[appos]–> PROPN (869; 67%), NOUN –[det]–> DET (662; 52%), NOUN –[acl]–> VERB (518; 53%), NOUN –[appos]–> NOUN (479; 52%), VERB –[nsubj]–> PROPN (456; 68%), PROPN –[conj]–> PROPN (428; 74%), VERB –[auxpass]–> AUX (401; 95%).


Treebank Statistics (UD_Russian-SynTagRus)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

439285 tokens (41%) have a non-empty value of Gender. 88407 types (78%) occur at least once with a non-empty value of Gender. 33405 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: ru-pos/NOUN (269867; 25% instances), ru-pos/ADJ (76038; 7% instances), ru-pos/PROPN (36956; 3% instances), ru-pos/VERB (36725; 3% instances), ru-pos/DET (12735; 1% instances), ru-pos/AUX (4089; 0% instances), ru-pos/NUM (2875; 0% instances).

NOUN

269867 ru-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (233902; 87%), Number=Sing (191750; 71%).

NOUN tokens may have the following values of Gender:

Paradigm тоMascFemNeut
Case=Accто
Case=Datтому
Case=Genтоготого
Case=Insтемтем
Case=Locтом
Case=Nomто

Gender seems to be lexical feature of NOUN. 100% lemmas (16405) occur only with one value of Gender.

ADJ

76038 ru-pos/ADJ tokens (66% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (76038; 100%), Degree=Pos (75767; 100%).

ADJ tokens may have the following values of Gender:

Paradigm которыйMascFemNeut
Animacy=Anim|Case=Accкоторого
Animacy=Inan|Case=Accкоторыйкоторые
Case=Accкоторуюкоторое
Case=Datкоторомукоторойкоторому
Case=Genкоторогокоторойкоторого
Case=Insкоторымкоторойкоторым
Case=Locкоторомкоторойкотором
Case=Nomкоторыйкотораякоторое

PROPN

36956 ru-pos/PROPN tokens (93% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (35808; 97%), Animacy=Inan (19202; 52%).

PROPN tokens may have the following values of Gender:

Paradigm gongoMascFemNeut
Case=Gen|Number=SingGONGO
Case=Ins|Number=PlurGONGO
Case=Nom|Number=PlurGONGO

Gender seems to be lexical feature of PROPN. 98% lemmas (7462) occur only with one value of Gender.

VERB

36725 ru-pos/VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (36725; 100%), Person=EMPTY (36725; 100%), Tense=Past (33933; 92%), Case=EMPTY (29549; 80%), Voice=Act (28776; 78%), Mood=Ind (25810; 70%), VerbForm=Fin (25810; 70%), Aspect=Perf (23217; 63%).

VERB tokens may have the following values of Gender:

Paradigm мочьMascFemNeut
Aspect=Imp|Case=Acc|Tense=Pres|VerbForm=Partмогущую
Aspect=Imp|Case=Nom|Tense=Pres|VerbForm=Partмогущее
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Finмогмогламогло
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Finсмогсмогласмогло

DET

12735 ru-pos/DET tokens (59% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (12735; 100%).

DET tokens may have the following values of Gender:

Paradigm этотMascFemNeut
Case=Accэтот, этого, этоэтуэто
Case=Datэтомуэтойэтому
Case=Genэтогоэтойэтого
Case=Insэтимэтойэтим
Case=Locэтомэтойэтом
Case=Nomэтотэтаэто

AUX

4089 ru-pos/AUX tokens (51% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (4089; 100%), Person=EMPTY (4089; 100%), Voice=Act (4089; 100%), Tense=Past (4089; 100%), Aspect=Imp (4089; 100%), Mood=Ind (4086; 100%), VerbForm=Fin (4086; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
Case=Loc|VerbForm=Partбывшем
Case=Nom|VerbForm=Partбывший
Mood=Ind|VerbForm=Finбылбылабыло

NUM

2875 ru-pos/NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.

NUM tokens may have the following values of Gender:

Paradigm одинMascFemNeut
Animacy=Anim|Case=Accодного
Animacy=Inan|Case=Accодин
Case=Accоднуодно
Case=Datодномуоднойодному
Case=Genодногооднойодного
Case=Insоднимоднойодним
Case=Locодномоднойодном
Case=Nomодиноднаодно

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (54026; 66%), NOUN –[det]–> DET (12483; 59%), NOUN –[amod]–> VERB (5837; 56%), PROPN –[name]–> PROPN (5180; 100%), NOUN –[appos]–> PROPN (4169; 81%), ADJ –[nsubj]–> NOUN (3484; 65%), VERB –[conj]–> VERB (3282; 54%), ADJ –[conj]–> ADJ (2688; 95%), VERB –[nsubj]–> PROPN (2418; 59%), PROPN –[amod]–> ADJ (1851; 90%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]