This page pertains to UD version 2.

Treebank Statistics: UD_Russian-GSD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

51214 tokens (52%) have a non-empty value of Gender. 25269 types (84%) occur at least once with a non-empty value of Gender. 15580 lemmas (84%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (27194; 27% of instances), ADJ (9623; 10% of instances), PROPN (7074; 7% of instances), VERB (3832; 4% of instances), PRON (1415; 1% of instances), DET (850; 1% of instances), AUX (625; 1% of instances), NUM (601; 1% of instances).
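Counts of this kind can be reproduced from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the sample sentence is made up for illustration, and the helper names `parse_feats` and `gender_by_upos` are hypothetical. It relies only on the standard CoNLL-U column order (UPOS in column 4, FEATS in column 6).

```python
from collections import Counter

# Made-up CoNLL-U fragment for illustration (not taken from UD_Russian-GSD).
SAMPLE = """\
# text = Первая книга была здесь
1\tПервая\tпервый\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Fem|Number=Sing\t2\tamod\t_\t_
2\tкнига\tкнига\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Sing\t4\tnsubj\t_\t_
3\tбыла\tбыть\tAUX\t_\tAspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin\t4\tcop\t_\t_
4\tздесь\tздесь\tADV\t_\tDegree=Pos\t0\troot\t_\t_
"""


def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Return (total token count, Counter of UPOS tags for tokens with Gender)."""
    total = 0
    with_gender = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[cols[3]] += 1
    return total, with_gender


total, counts = gender_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos}: {n} of {total} tokens")
```

Whether the percentages on this page are computed against all tokens or against some other base is not stated here, so the sketch reports raw counts next to the total and leaves the choice of denominator open.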

NOUN

27194 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (23435; 86%), Number=Sing (20470; 75%).

NOUN tokens may have the following values of Gender:

Paradigm WINDOWS: Masc / Fem / Neut
Case=Acc|Number=Sing: Windows
Case=Loc|Number=Plur: Windows
Case=Nom|Number=Plur: Windows

Gender seems to be a lexical feature of NOUN: 99% of lemmas (6313) occur with only one value of Gender.
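The per-lemma check behind a statement like this can be sketched as follows. This is an illustration under assumptions, not the page's actual procedure: the (lemma, Gender) pairs are made up, and `single_value_share` is a hypothetical helper name. The idea is to collect the set of Gender values seen for each lemma and count the lemmas whose set has exactly one member.

```python
from collections import defaultdict

# Made-up (lemma, Gender) observations for illustration only.
OBSERVATIONS = [
    ("книга", "Fem"),
    ("книга", "Fem"),
    ("стол", "Masc"),
    ("окно", "Neut"),
    ("кофе", "Masc"),
    ("кофе", "Neut"),  # a lemma seen with two different values
]


def single_value_share(pairs):
    """Return (lemmas with exactly one Gender value, all lemmas with Gender)."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, total = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A high share suggests that the value is fixed by the lemma (lexical) rather than varying with inflection, which is what one would expect for nouns and not for adjectives.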

ADJ

9623 ADJ tokens (78% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9622; 100%), Animacy=Inan (8715; 91%).

ADJ tokens may have the following values of Gender:

Paradigm ПЕРВЫЙ: Masc / Fem / Neut
Animacy=Anim|Case=Gen: первого / первой
Animacy=Anim|Case=Ins: первым
Animacy=Anim|Case=Nom: первый
Animacy=Inan|Case=Acc: первый / первую / первое
Animacy=Inan|Case=Dat: первому
Animacy=Inan|Case=Gen: первого / первой / первого
Animacy=Inan|Case=Ins: первым / первой / первым
Animacy=Inan|Case=Loc: первом / первой / первом
Animacy=Inan|Case=Nom: первый / первая / первое

PROPN

7074 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (6837; 97%), Animacy=Inan (3654; 52%).

PROPN tokens may have the following values of Gender:

Paradigm ДЕ: Masc / Fem / Neut
Animacy=Anim|Case=Acc: де
Animacy=Anim|Case=Gen: де
Animacy=Anim|Case=Ins: де / де
Animacy=Anim|Case=Loc: де
Animacy=Anim|Case=Nom: де
Animacy=Inan|Case=Loc: Де
Animacy=Inan|Case=Nom: де

Gender seems to be a lexical feature of PROPN: 99% of lemmas (4836) occur with only one value of Gender.

VERB

3832 VERB tokens (46% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (3832; 100%), Person=EMPTY (3832; 100%), Tense=Past (3589; 94%), Variant=EMPTY (3078; 80%), Aspect=Perf (2613; 68%), Animacy=EMPTY (2410; 63%), Case=EMPTY (2410; 63%), Mood=Ind (2410; 63%), VerbForm=Fin (2410; 63%).
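A co-occurrence list like the one above can be approximated with the sketch below. It assumes that `EMPTY` marks a feature that is absent from the token, and it takes the feature inventory to be every feature name seen in the sample; both are assumptions made for illustration, as are the made-up tokens and the helper name `cooccurrence`.

```python
from collections import Counter

# Made-up feature sets for four VERB tokens (illustration only).
TOKENS = [
    {"Gender": "Masc", "Number": "Sing", "Tense": "Past", "Mood": "Ind", "VerbForm": "Fin"},
    {"Gender": "Fem", "Number": "Sing", "Tense": "Past", "Mood": "Ind", "VerbForm": "Fin"},
    {"Gender": "Neut", "Number": "Sing", "Tense": "Past", "VerbForm": "Part", "Case": "Nom"},
    {"Number": "Plur", "Person": "3", "Tense": "Pres", "Mood": "Ind", "VerbForm": "Fin"},
]


def cooccurrence(tokens, feature="Gender"):
    """Count other feature=value pairs on tokens that carry `feature`.

    A feature missing from a token is counted as '<name>=EMPTY' (assumed meaning).
    """
    inventory = {name for tok in tokens for name in tok} - {feature}
    selected = [tok for tok in tokens if feature in tok]
    counts = Counter()
    for tok in selected:
        for name in inventory:
            counts[f"{name}={tok.get(name, 'EMPTY')}"] += 1
    return len(selected), counts


n, counts = cooccurrence(TOKENS)
for pair, c in counts.most_common(5):
    print(f"{pair} ({c}; {100 * c // n}%)")
```

Under this reading, `Person=EMPTY (3832; 100%)` would mean that no VERB token with Gender also carries a Person value, which is consistent with Russian marking gender on past-tense and participial forms rather than on person-inflected ones.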

VERB tokens may have the following values of Gender:

Paradigm БЫТЬ: Masc / Fem / Neut
был / была / было

PRON

1415 PRON tokens (74% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1415; 100%), Person=EMPTY (777; 55%).

PRON tokens may have the following values of Gender:

Paradigm КОТОРЫЙ: Masc / Fem / Neut
Animacy=Anim|Case=Acc: которого, который / которую
Animacy=Anim|Case=Dat: которому
Animacy=Anim|Case=Gen: которого / которой
Animacy=Anim|Case=Ins: которым / которой
Animacy=Anim|Case=Nom: который / которая
Animacy=Inan|Case=Acc: который / которую / которое, которого
Animacy=Inan|Case=Dat: которому / которой / которому
Animacy=Inan|Case=Gen: которого / которой / которого
Animacy=Inan|Case=Ins: которым / которой
Animacy=Inan|Case=Loc: котором / которой / котором
Animacy=Inan|Case=Nom: который / которая / которое

Gender seems to be a lexical feature of PRON: 92% of lemmas (12) occur with only one value of Gender.

DET

850 DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (850; 100%), Person=EMPTY (813; 96%), Animacy=Inan (766; 90%), Reflex=EMPTY (650; 76%).

DET tokens may have the following values of Gender:

Paradigm ЭТОТ: Masc / Fem / Neut
Animacy=Anim|Case=Acc: этого
Animacy=Anim|Case=Gen: этого
Animacy=Anim|Case=Loc: этом
Animacy=Anim|Case=Nom: этот
Animacy=Inan|Case=Acc: этот / эту / это
Animacy=Inan|Case=Dat: этому / этой / этому
Animacy=Inan|Case=Gen: этого / этой, это / этого
Animacy=Inan|Case=Ins: этим / этой / этим
Animacy=Inan|Case=Loc: этом / этой / этом
Animacy=Inan|Case=Nom: этот / эта / это

AUX

625 AUX tokens (62% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (625; 100%), Number=Sing (624; 100%), Tense=Past (622; 100%), Mood=Ind (619; 99%), VerbForm=Fin (619; 99%), Voice=EMPTY (598; 96%), Aspect=Imp (595; 95%).

AUX tokens may have the following values of Gender:

Paradigm БЫТЬ: Masc / Fem / Neut
Animacy=Anim|Case=Gen|VerbForm=Part|Voice=Act: бывшего
Animacy=Anim|Case=Ins|VerbForm=Part|Voice=Act: бывшим
Mood=Ind|VerbForm=Fin: был / была / было

NUM

601 NUM tokens (30% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (601; 100%), Animacy=Inan (485; 81%), Number=Sing (303; 50%).

NUM tokens may have the following values of Gender:

Paradigm ОДИН: Masc / Fem / Neut
Animacy=Anim|Case=Acc|Number=Sing: одного
Animacy=Anim|Case=Dat|Number=Sing: одному
Animacy=Anim|Case=Gen|Number=Sing: одного / одного
Animacy=Anim|Case=Ins|Number=Sing: одним / одной
Animacy=Anim|Case=Nom|Number=Sing: один / одна
Animacy=Inan|Case=Acc|Number=Sing: один / одну / одно, одного
Animacy=Inan|Case=Dat|Number=Sing: одному / одной
Animacy=Inan|Case=Gen|Number=Sing: одного / одной / одного
Animacy=Inan|Case=Ins|Number=Sing: одним / одной / одним
Animacy=Inan|Case=Loc|Number=Sing: одном / одной / одном
Animacy=Inan|Case=Loc|Number=Plur: одних
Animacy=Inan|Case=Nom|Number=Sing: один / одна / одно

Gender seems to be a lexical feature of NUM: 93% of lemmas (124) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (7229; 73%), NOUN –[conj]–> NOUN (1118; 55%), PROPN –[flat]–> PROPN (985; 99%), NOUN –[appos]–> PROPN (837; 67%), NOUN –[det]–> DET (657; 51%), NOUN –[acl]–> VERB (516; 53%), NOUN –[appos]–> NOUN (475; 52%), VERB –[nsubj]–> PROPN (465; 68%), PROPN –[conj]–> PROPN (425; 74%), VERB –[aux:pass]–> AUX (402; 95%).
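Agreement counts of this kind can be sketched by walking each dependency edge and comparing the Gender value of parent and child. The example below is a simplified illustration: the sentence is made up, the helper names `gender_of` and `agreement_counts` are hypothetical, and it handles a single sentence. It also assumes that the percentage is taken over edges where both nodes carry Gender, which this page does not confirm.

```python
from collections import Counter

# Made-up CoNLL-U sentence for illustration (not taken from UD_Russian-GSD).
SAMPLE = """\
# text = Первая книга лежала на столе
1\tПервая\tпервый\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_
2\tкнига\tкнига\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tлежала\tлежать\tVERB\t_\tAspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin\t0\troot\t_\t_
4\tна\tна\tADP\t_\t_\t5\tcase\t_\t_
5\tстоле\tстол\tNOUN\t_\tAnimacy=Inan|Case=Loc|Gender=Masc|Number=Sing\t3\tobl\t_\t_
"""


def gender_of(feats):
    """Return the Gender value from a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(sentence):
    """Return (agreeing edges, edges where both nodes have Gender) per pattern."""
    rows = [ln.split("\t") for ln in sentence.splitlines()
            if ln and not ln.startswith("#")]
    tokens = {r[0]: r for r in rows if r[0].isdigit()}
    agree, both = Counter(), Counter()
    for child in tokens.values():
        parent = tokens.get(child[6])  # None for the root, whose head is 0
        if parent is None:
            continue
        parent_gender = gender_of(parent[5])
        child_gender = gender_of(child[5])
        if parent_gender is None or child_gender is None:
            continue
        pattern = f"{parent[3]} -[{child[7]}]-> {child[3]}"
        both[pattern] += 1
        if parent_gender == child_gender:
            agree[pattern] += 1
    return agree, both


agree, both = agreement_counts(SAMPLE)
for pattern, n in both.items():
    print(f"{pattern}: {agree[pattern]} of {n} agree")
```

In the sample, the `amod` and `nsubj` edges agree in Gender, while the `obl` edge links a feminine verb form to a masculine noun and so counts only toward the denominator.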