home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-GSD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

50178 tokens (51%) have a non-empty value of Gender. 24612 types (82%) occur at least once with a non-empty value of Gender. 15059 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (26754; 27% instances), ADJ (9543; 10% instances), PROPN (6584; 7% instances), VERB (3857; 4% instances), PRON (1428; 1% instances), DET (824; 1% instances), NUM (607; 1% instances), AUX (579; 1% instances), PART (1; 0% instances), PUNCT (1; 0% instances).

NOUN

26754 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (23057; 86%), Number=Sing (20138; 75%).

NOUN tokens may have the following values of Gender:

Paradigm годMascFem
Animacy=Anim|Case=Loc|Number=Singгоду
Animacy=Inan|Case=Acc|Number=Singгод, года
Animacy=Inan|Case=Acc|Number=Plurгоды, гг., лет, годовгоды
Animacy=Inan|Case=Dat|Number=Singгоду
Animacy=Inan|Case=Dat|Number=Plurгодам, гг.
Animacy=Inan|Case=Gen|Number=Singгода, г., гг.
Animacy=Inan|Case=Gen|Number=Plurлет, годов, гг.
Animacy=Inan|Case=Ins|Number=Singгодом
Animacy=Inan|Case=Ins|Number=Plurгодами
Animacy=Inan|Case=Loc|Number=Singгоду, г.
Animacy=Inan|Case=Loc|Number=Plurгодах, гг., годы
Animacy=Inan|Case=Nom|Number=Singгод, г.
Animacy=Inan|Case=Nom|Number=Plurгоды, гг.

Gender seems to be lexical feature of NOUN. 99% lemmas (6039) occur only with one value of Gender.

ADJ

9543 ADJ tokens (78% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9541; 100%), Degree=Pos (9485; 99%).

ADJ tokens may have the following values of Gender:

Paradigm первыйMascFemNeut
Animacy=Inan|Case=Accпервый
Case=Accпервуюпервое
Case=Datпервому
Case=Genпервогопервойпервого
Case=Insпервымпервойпервым
Case=Locпервомпервойпервом
Case=Nomпервыйперваяпервое

PROPN

6584 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (6382; 97%), Animacy=Anim (3299; 50%).

PROPN tokens may have the following values of Gender:

Paradigm НАТОMascFemNeut
Case=GenНАТОНАТОНАТО
Case=NomНАТО

Gender seems to be lexical feature of PROPN. 99% lemmas (4426) occur only with one value of Gender.

VERB

3857 VERB tokens (45% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (3857; 100%), Person=EMPTY (3857; 100%), Tense=Past (3617; 94%), Variant=EMPTY (3106; 81%), Aspect=Perf (2627; 68%), Animacy=EMPTY (2452; 64%), Case=EMPTY (2451; 64%), Mood=Ind (2451; 64%), VerbForm=Fin (2451; 64%), Voice=Act (2153; 56%).

VERB tokens may have the following values of Gender:

Paradigm статьMascFemNeut
Animacy=Anim|Case=Ins|VerbForm=Part|Voice=Actставшим
Animacy=Anim|Case=Nom|VerbForm=Part|Voice=Actставший
Animacy=Inan|Case=Nom|VerbForm=Part|Voice=ActСтавшая
Mood=Ind|VerbForm=Finсталсталастало
Mood=Ind|VerbForm=Fin|Voice=Actсталсталастало

PRON

1428 PRON tokens (74% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1428; 100%), Person=EMPTY (792; 55%).

PRON tokens may have the following values of Gender:

Paradigm которыйMascFemNeut
Animacy=Anim|Case=Accкоторого, которыйкоторую
Animacy=Anim|Case=Datкоторому
Animacy=Anim|Case=Genкоторогокоторой
Animacy=Anim|Case=Insкоторымкоторой
Animacy=Anim|Case=Nomкоторыйкоторая
Animacy=Inan|Case=Accкоторыйкоторуюкоторое, которого
Animacy=Inan|Case=Datкоторомукоторойкоторому
Animacy=Inan|Case=Genкоторогокоторойкоторого
Animacy=Inan|Case=Insкоторымкоторой
Animacy=Inan|Case=Locкоторомкоторойкотором
Animacy=Inan|Case=Nomкоторыйкотораякоторое

Gender seems to be lexical feature of PRON. 94% lemmas (15) occur only with one value of Gender.

DET

824 DET tokens (53% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (824; 100%), Animacy=EMPTY (729; 88%).

DET tokens may have the following values of Gender:

Paradigm этотMascFemNeut
Animacy=Anim|Case=Accэтого
Animacy=Inan|Case=Accэтот
Case=Accэтуэто
Case=Datэтомуэтойэтому
Case=Genэтогоэтой, этоэтого
Case=Insэтимэтойэтим
Case=Locэтомэтойэтом
Case=Nomэтотэтаэто

NUM

607 NUM tokens (29% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (590; 97%), Animacy=Inan (493; 81%).

NUM tokens may have the following values of Gender:

Paradigm одинMascFemNeut
Animacy=Anim|Case=Acc|Number=Singодного
Animacy=Anim|Case=Dat|Number=Singодному
Animacy=Anim|Case=Gen|Number=Singодногоодного
Animacy=Anim|Case=Ins|Number=Singоднимодной
Animacy=Anim|Case=Nom|Number=Singодинодна
Animacy=Inan|Case=Acc|Number=Singодиноднуодно, одного
Animacy=Inan|Case=Dat|Number=Singодномуодной
Animacy=Inan|Case=Gen|Number=Singодногооднойодного
Animacy=Inan|Case=Ins|Number=Singоднимоднойодним
Animacy=Inan|Case=Loc|Number=Singодномоднойодном
Animacy=Inan|Case=Loc|Number=Plurодних
Animacy=Inan|Case=Nom|Number=Singодиноднаодно

Gender seems to be lexical feature of NUM. 93% lemmas (125) occur only with one value of Gender.

AUX

579 AUX tokens (72% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (579; 100%), Number=Sing (579; 100%), Tense=Past (579; 100%), Mood=Ind (577; 100%), VerbForm=Fin (577; 100%).

AUX tokens may have the following values of Gender:

Paradigm бытьMascFemNeut
Animacy=Anim|Case=Gen|VerbForm=Part|Voice=Actбывшего
Animacy=Anim|Case=Ins|VerbForm=Part|Voice=Actбывшим
Mood=Ind|VerbForm=Finбылбылабыло
Mood=Ind|VerbForm=Fin|Voice=Actбылбыло

PART

1 PART tokens (0% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=EMPTY (1; 100%).

PART tokens may have the following values of Gender:

PUNCT

1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Gender.

PUNCT tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (7460; 76%), NOUN –[conj]–> NOUN (1107; 55%), PROPN –[flat:name]–> PROPN (964; 99%), NOUN –[appos]–> PROPN (790; 69%), NOUN –[det]–> DET (657; 51%), NOUN –[acl]–> VERB (509; 53%), VERB –[nsubj]–> PROPN (470; 69%), NOUN –[appos]–> NOUN (405; 52%), VERB –[aux:pass]–> AUX (405; 95%), VERB –[nsubj:pass]–> NOUN (388; 71%).