
This page pertains to UD version 2.

Treebank Statistics: UD_Russian: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

51179 tokens (51%) have a non-empty value of Gender. 25253 types (84%) occur at least once with a non-empty value of Gender. 15578 lemmas (83%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags (token count; share of all tokens): NOUN (27196; 27%), ADJ (9591; 10%), PROPN (7074; 7%), VERB (3835; 4%), PRON (1411; 1%), DET (851; 1%), AUX (620; 1%), NUM (601; 1%).
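Counts like these can in principle be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: `SAMPLE` is a hypothetical three-token sentence (not taken from UD_Russian), the helper names `parse_feats` and `gender_by_upos` are illustrative, and skipping multiword ranges and empty nodes is an assumption about how tokens are counted.

```python
from collections import Counter

# Hypothetical three-token sample in CoNLL-U format (not actual UD_Russian data).
SAMPLE = (
    "1\tпервая\tпервый\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_\n"
    "2\tкнига\tкнига\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_\n"
    "3\tлежит\tлежать\tVERB\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres\t0\troot\t_\t_\n"
)


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Return (total token count, Counter of UPOS tags for tokens with Gender)."""
    total = 0
    with_gender = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # assumption: skip multiword ranges (1-2) and empty nodes (1.1)
        total += 1
        if "Gender" in parse_feats(cols[5]):  # column 6 is FEATS
            with_gender[cols[3]] += 1         # column 4 is UPOS
    return total, with_gender


total, counts = gender_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos}: {n} tokens, {100 * n / total:.0f}% of all tokens")
```

Dividing each per-tag count by the total token count gives the share reported in parentheses above.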

NOUN

27196 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (23437; 86%), Number=Sing (20472; 75%).

NOUN tokens may have the following values of Gender:

Paradigm WINDOWS: Masc; Fem; Neut
Case=Acc|Number=Sing: Windows
Case=Loc|Number=Plur: Windows
Case=Nom|Number=Plur: Windows

Gender seems to be a lexical feature of NOUN: 99% of lemmas (6313) occur with only one value of Gender.
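The "lexical feature" test amounts to asking what share of lemmas is ever seen with more than one Gender value. A minimal sketch of that check follows; the `OBSERVATIONS` list is hypothetical illustration data and `single_value_share` is an illustrative name, not part of any UD tool.

```python
from collections import defaultdict

# Hypothetical (lemma, UPOS, Gender) observations; not actual UD_Russian data.
OBSERVATIONS = [
    ("книга", "NOUN", "Fem"),
    ("книга", "NOUN", "Fem"),
    ("стол", "NOUN", "Masc"),
    ("сирота", "NOUN", "Masc"),
    ("сирота", "NOUN", "Fem"),
    ("первый", "ADJ", "Masc"),
]


def single_value_share(observations, upos):
    """Return (lemmas with exactly one Gender value, lemmas seen with Gender) for one UPOS."""
    values = defaultdict(set)
    for lemma, tag, gender in observations:
        if tag == upos:
            values[lemma].add(gender)
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values)


single, total = single_value_share(OBSERVATIONS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A share close to 100% suggests the value is a property of the lemma rather than of the inflected form.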

ADJ

9591 ADJ tokens (78% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9590; 100%), Animacy=Inan (8715; 91%).

ADJ tokens may have the following values of Gender:

Paradigm ПЕРВЫЙ: Masc; Fem; Neut
Animacy=Anim|Case=Gen: первого; первой
Animacy=Anim|Case=Ins: первым
Animacy=Anim|Case=Nom: первый
Animacy=Inan|Case=Acc: первый; первую; первое
Animacy=Inan|Case=Dat: первому
Animacy=Inan|Case=Gen: первого; первой; первого
Animacy=Inan|Case=Ins: первым; первой; первым
Animacy=Inan|Case=Loc: первом; первой; первом
Animacy=Inan|Case=Nom: первый; первая; первое
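A paradigm table of this kind groups the observed forms of one lemma by their remaining features (rows) and by Gender (columns). The sketch below shows one way to build such a grouping; the `TOKENS` list is hypothetical, `build_paradigm` is an illustrative name, and keeping every non-Gender feature in the row key is an assumption (the page may drop some features from its row labels).

```python
from collections import defaultdict

GENDERS = ("Masc", "Fem", "Neut")

# Hypothetical (form, FEATS) pairs for one lemma; not actual treebank rows.
TOKENS = [
    ("первый", "Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing"),
    ("первая", "Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing"),
    ("первое", "Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing"),
    ("первой", "Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing"),
]


def build_paradigm(tokens):
    """Map each row key (FEATS without Gender) to {gender: set of forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, feats in tokens:
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        gender = pairs.pop("Gender", None)
        if gender is None:
            continue  # tokens without Gender do not belong in this table
        key = "|".join(f"{k}={v}" for k, v in sorted(pairs.items()))
        table[key][gender].add(form)
    return table


table = build_paradigm(TOKENS)
for key in sorted(table):
    # An unattested cell is printed as "-" so that columns stay distinguishable.
    cells = [", ".join(sorted(table[key].get(g, ()))) or "-" for g in GENDERS]
    print(key, *cells, sep="\t")
```

Printing an explicit placeholder for unattested cells keeps the Masc, Fem, and Neut columns apart even when a row has fewer than three forms.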

PROPN

7074 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (6837; 97%), Animacy=Inan (3654; 52%).

PROPN tokens may have the following values of Gender:

Paradigm ДЕ: Masc; Fem; Neut
Animacy=Anim|Case=Acc: де
Animacy=Anim|Case=Gen: де
Animacy=Anim|Case=Ins: де; де
Animacy=Anim|Case=Loc: де
Animacy=Anim|Case=Nom: де
Animacy=Inan|Case=Loc: Де
Animacy=Inan|Case=Nom: де

Gender seems to be a lexical feature of PROPN: 99% of lemmas (4836) occur with only one value of Gender.

VERB

3835 VERB tokens (46% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (3835; 100%), Person=EMPTY (3835; 100%), Tense=Past (3592; 94%), Variant=EMPTY (3083; 80%), Aspect=Perf (2611; 68%), Animacy=EMPTY (2415; 63%), Case=EMPTY (2415; 63%), Mood=Ind (2415; 63%), VerbForm=Fin (2415; 63%).
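In lists like this one, `EMPTY` appears to stand for "the feature is not set on the token", so `Person=EMPTY (3835; 100%)` would mean that no gendered VERB token carries a Person value. The sketch below counts co-occurring values under that reading; the `VERB_FEATS` strings are hypothetical and `cooccurrence` is an illustrative name.

```python
from collections import Counter

# Hypothetical FEATS strings of VERB tokens that carry Gender.
VERB_FEATS = [
    "Aspect=Perf|Gender=Masc|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin",
    "Aspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin",
    "Aspect=Perf|Case=Nom|Gender=Neut|Number=Sing|Tense=Past|VerbForm=Part",
]
TRACKED = ("Number", "Person", "Tense", "Case")


def cooccurrence(feats_list, tracked):
    """Count 'Feature=Value' pairs, using 'EMPTY' when a tracked feature is absent."""
    counts = Counter()
    for feats in feats_list:
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        for name in tracked:
            counts[f"{name}={pairs.get(name, 'EMPTY')}"] += 1
    return counts


counts = cooccurrence(VERB_FEATS, TRACKED)
for value, n in counts.most_common():
    print(f"{value} ({n}; {100 * n / len(VERB_FEATS):.0f}%)")
```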

VERB tokens may have the following values of Gender:

Paradigm БЫТЬ: Masc; Fem; Neut
был; была; было

PRON

1411 PRON tokens (73% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1411; 100%), Person=EMPTY (773; 55%).

PRON tokens may have the following values of Gender:

Paradigm КОТОРЫЙ: Masc; Fem; Neut
Animacy=Anim|Case=Acc: которого, который; которую
Animacy=Anim|Case=Dat: которому
Animacy=Anim|Case=Gen: которого; которой
Animacy=Anim|Case=Ins: которым; которой
Animacy=Anim|Case=Nom: который; которая
Animacy=Inan|Case=Acc: который; которую; которое, которого
Animacy=Inan|Case=Dat: которому; которой; которому
Animacy=Inan|Case=Gen: которого; которой; которого
Animacy=Inan|Case=Ins: которым; которой
Animacy=Inan|Case=Loc: котором; которой; котором
Animacy=Inan|Case=Nom: который; которая; которое

Gender seems to be a lexical feature of PRON: 92% of lemmas (12) occur with only one value of Gender.

DET

851 DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (851; 100%), Person=EMPTY (814; 96%), Animacy=Inan (767; 90%), Reflex=EMPTY (651; 76%).

DET tokens may have the following values of Gender:

Paradigm ЭТОТ: Masc; Fem; Neut
Animacy=Anim|Case=Acc: этого
Animacy=Anim|Case=Gen: этого
Animacy=Anim|Case=Loc: этом
Animacy=Anim|Case=Nom: этот
Animacy=Inan|Case=Acc: этот; эту; это
Animacy=Inan|Case=Dat: этому; этой; этому
Animacy=Inan|Case=Gen: этого; этой, это; этого
Animacy=Inan|Case=Ins: этим; этой; этим
Animacy=Inan|Case=Loc: этом; этой; этом
Animacy=Inan|Case=Nom: этот; эта; это

AUX

620 AUX tokens (62% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (620; 100%), Number=Sing (619; 100%), Tense=Past (617; 100%), Mood=Ind (614; 99%), VerbForm=Fin (614; 99%), Voice=EMPTY (593; 96%), Aspect=Imp (590; 95%).

AUX tokens may have the following values of Gender:

Paradigm БЫТЬ: Masc; Fem; Neut
Animacy=Anim|Case=Gen|VerbForm=Part|Voice=Act: бывшего
Animacy=Anim|Case=Ins|VerbForm=Part|Voice=Act: бывшим
Mood=Ind|VerbForm=Fin: был; была; было

NUM

601 NUM tokens (30% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (601; 100%), Animacy=Inan (485; 81%), Number=Sing (303; 50%).

NUM tokens may have the following values of Gender:

Paradigm ОДИН: Masc; Fem; Neut
Animacy=Anim|Case=Acc|Number=Sing: одного
Animacy=Anim|Case=Dat|Number=Sing: одному
Animacy=Anim|Case=Gen|Number=Sing: одного; одного
Animacy=Anim|Case=Ins|Number=Sing: одним; одной
Animacy=Anim|Case=Nom|Number=Sing: один; одна
Animacy=Inan|Case=Acc|Number=Sing: один; одну; одно, одного
Animacy=Inan|Case=Dat|Number=Sing: одному; одной
Animacy=Inan|Case=Gen|Number=Sing: одного; одной; одного
Animacy=Inan|Case=Ins|Number=Sing: одним; одной; одним
Animacy=Inan|Case=Loc|Number=Sing: одном; одной; одном
Animacy=Inan|Case=Loc|Number=Plur: одних
Animacy=Inan|Case=Nom|Number=Sing: один; одна; одно

Gender seems to be a lexical feature of NUM: 93% of lemmas (124) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (7216; 73%), NOUN –[conj]–> NOUN (1103; 55%), PROPN –[flat]–> PROPN (984; 99%), NOUN –[appos]–> PROPN (843; 67%), NOUN –[det]–> DET (657; 51%), NOUN –[acl]–> VERB (514; 53%), NOUN –[appos]–> NOUN (483; 53%), VERB –[nsubj]–> PROPN (465; 68%), PROPN –[conj]–> PROPN (424; 74%), VERB –[aux:pass]–> AUX (402; 95%).
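Agreement counts of this kind can be approximated by comparing the Gender value of each node with that of its head. The sketch below is one possible reading, not the page's own procedure: `SENTENCE` is a hypothetical four-token tree, `agreement_counts` is an illustrative name, and taking only pairs where both nodes carry Gender as the denominator is an assumption (the percentages above may use a different base).

```python
from collections import Counter

# Hypothetical sentence as (id, UPOS, feats, head id, deprel); not treebank data.
SENTENCE = [
    (1, "ADJ", {"Gender": "Fem"}, 2, "amod"),
    (2, "NOUN", {"Gender": "Fem"}, 3, "nsubj"),
    (3, "VERB", {"Gender": "Fem"}, 0, "root"),
    (4, "NOUN", {"Gender": "Masc"}, 3, "obj"),
]


def agreement_counts(sentence):
    """Return (agreeing, comparable) Counters keyed by 'PARENT -[deprel]-> CHILD'."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, comparable = Counter(), Counter()
    for _tok_id, upos, feats, head, deprel in sentence:
        parent = by_id.get(head)  # head 0 is the artificial root, so no parent
        if parent is None or "Gender" not in feats or "Gender" not in parent[2]:
            continue
        key = f"{parent[1]} -[{deprel}]-> {upos}"
        comparable[key] += 1
        if feats["Gender"] == parent[2]["Gender"]:
            agree[key] += 1
    return agree, comparable


agree, comparable = agreement_counts(SENTENCE)
for key, n in comparable.most_common():
    print(f"{key}: {agree[key]} of {n} agree in Gender")
```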