
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Poetry: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

25361 tokens (40%) have a non-empty value of Gender. 13577 types (75%) occur at least once with a non-empty value of Gender. 7149 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (15618; 24% of all tokens), ADJ (4419; 7%), VERB (2263; 4%), DET (1244; 2%), PRON (1121; 2%), PROPN (530; 1%), AUX (105; 0%), NUM (61; 0%).
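Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names and the four-token sample are invented for illustration, and it assumes the standard 10-column CoNLL-U layout with FEATS in the sixth column.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender, all tokens), each a Counter keyed by UPOS."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # malformed line, multiword range (1-2) or empty node (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Invented sample sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = белая ночь была здесь",
    "1\tбелая\tбелый\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Fem|Number=Sing\t2\tamod\t_\t_",
    "2\tночь\tночь\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_",
    "3\tбыла\tбыть\tVERB\t_\tGender=Fem|Number=Sing|Tense=Past\t0\troot\t_\t_",
    "4\tздесь\tздесь\tADV\t_\tDegree=Pos\t3\tadvmod\t_\t_",
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```

Dividing each per-tag count by the total number of tokens would give the per-tag percentages listed above.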

NOUN

15618 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (14003; 90%), Number=Sing (11310; 72%).

NOUN tokens may have the following values of Gender:

Paradigm полчаса   Masc      Fem       Neut
                   полчаса   полчаса   полчаса

Gender seems to be a lexical feature of NOUN. 99% of lemmas (3987) occur with only one value of Gender.
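The "lexical feature" judgment rests on how many lemmas are seen with exactly one Gender value. A minimal sketch of that check follows; the function name and the sample pairs are hypothetical, apart from полчаса, which this page reports with all three values.

```python
from collections import defaultdict


def single_gender_share(pairs):
    """pairs: iterable of (lemma, gender) for tokens of one part of speech.

    Returns (number of lemmas seen with exactly one Gender value,
             that number as a share of all lemmas that have Gender).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens without a Gender value
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    share = single / len(values) if values else 0.0
    return single, share


# Illustrative input only.
NOUN_PAIRS = [
    ("ночь", "Fem"), ("ночь", "Fem"),
    ("день", "Masc"),
    ("полчаса", "Masc"), ("полчаса", "Fem"), ("полчаса", "Neut"),
]

count, share = single_gender_share(NOUN_PAIRS)
print(f"{count} lemmas ({share:.0%}) occur with only one value of Gender")
```

A share close to 100%, as reported for NOUN, suggests that the value belongs to the lemma rather than to the inflected form.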

ADJ

4419 ADJ tokens (73% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (4414; 100%), Degree=Pos (4339; 98%), Variant=EMPTY (3735; 85%).

ADJ tokens may have the following values of Gender:

Paradigm белый          Masc     Fem            Neut
Animacy=Inan|Case=Acc   белый    -              -
Case=Acc                белый    белую          белое
Case=Gen                белого   белой          -
Case=Ins                белым    белою, белой   белым
Case=Loc                белом    белой          белом
Case=Nom                белый    белая          белое
Variant=Short           бел      бела           -

VERB

2263 VERB tokens (27% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (2263; 100%), Number=Sing (2262; 100%), Tense=Past (2117; 94%), Mood=Ind (1589; 70%), VerbForm=Fin (1589; 70%), Voice=Act (1511; 67%), Aspect=Perf (1471; 65%).

VERB tokens may have the following values of Gender:

Paradigm быть   Masc   Fem    Neut
                был    была   было

DET

1244 DET tokens (69% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1244; 100%), Animacy=EMPTY (1100; 88%), Poss=EMPTY (659; 53%).

DET tokens may have the following values of Gender:

Paradigm мой            Masc     Fem     Neut
Animacy=Inan|Case=Acc   мой      -       -
Case=Acc                -        мою     мое
Case=Dat                моему    моей    моему
Case=Gen                моего    моей    моего
Case=Ins                моим     моей    -
Case=Loc                моем     моей    моем
Case=Nom                мой      моя     мое

PRON

1121 PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1121; 100%), Person=EMPTY (658; 59%), Case=Nom (634; 57%).

PRON tokens may have the following values of Gender:

Paradigm что (Gender columns: Masc, Neut)
Features                                          Form
Animacy=Anim|Case=Nom|PronType=Rel                что
Animacy=Inan|Case=Acc|PronType=Int                что
Animacy=Inan|Case=Acc|PronType=Neg                что
Animacy=Inan|Case=Acc|PronType=Rel                что
Animacy=Inan|Case=Dat|PronType=Int                чему
Animacy=Inan|Case=Dat|PronType=Rel                чему
Animacy=Inan|Case=Gen|PronType=Int                чего
Animacy=Inan|Case=Gen|PronType=Rel                чего
Animacy=Inan|Case=Ins|PronType=Int                чем
Animacy=Inan|Case=Ins|PronType=Rel                чем
Animacy=Inan|Case=Loc|PronType=Int                чем, чём
Animacy=Inan|Case=Loc|PronType=Rel                чем
Animacy=Inan|Case=Nom|ExtPos=ADV|PronType=Rel     что
Animacy=Inan|Case=Nom|ExtPos=DET|PronType=Exc     Что
Animacy=Inan|Case=Nom|ExtPos=DET|PronType=Rel     что
Animacy=Inan|Case=Nom|PronType=Exc                что
Animacy=Inan|Case=Nom|PronType=Int                что
Animacy=Inan|Case=Nom|PronType=Rel                что

PROPN

530 PROPN tokens (90% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (520; 98%), Animacy=Anim (327; 62%).

PROPN tokens may have the following values of Gender:

Paradigm Кастусь   Masc      Fem
                   Кастусь   Кастусь

Gender seems to be a lexical feature of PROPN. 99% of lemmas (348) occur with only one value of Gender.

AUX

105 AUX tokens (31% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (105; 100%), Number=Sing (105; 100%), Person=EMPTY (105; 100%), Tense=Past (105; 100%), VerbForm=Fin (105; 100%), Voice=Act (105; 100%).

AUX tokens may have the following values of Gender:

Paradigm быть   Masc   Fem    Neut
                был    была   было

NUM

61 NUM tokens (24% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (61; 100%), NumType=Card (58; 95%), Case=Nom (33; 54%).

NUM tokens may have the following values of Gender:

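Paradigm два (Gender columns: Masc, Fem, Neut)
Features                Forms
Animacy=Anim|Case=Acc   двух
Animacy=Inan|Case=Acc   два (Masc), две (Fem)
Case=Gen                двух (Masc), двух (Fem), двух (Neut)
Case=Ins                двумя
Case=Loc                двух
Case=Nom                два (Masc), две (Fem), два (Neut)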

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (3037; 71%), NOUN –[det]–> DET (921; 65%), ADJ –[conj]–> ADJ (316; 90%), ADJ –[nsubj]–> NOUN (255; 68%), NOUN –[amod]–> VERB (197; 62%), NOUN –[acl]–> VERB (185; 63%), NOUN –[appos]–> NOUN (124; 70%), VERB –[nsubj:pass]–> NOUN (74; 61%), PROPN –[amod]–> ADJ (58; 97%), ADJ –[det]–> DET (45; 100%).
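Agreement counts like these can be approximated by comparing each node with its parent. The sketch below is illustrative only: the token dictionaries, the function name and the sample sentence are invented, and it assumes that the percentage is the share of all instances of a given parent-relation-child pattern in which both nodes carry the same Gender value, which this page does not state explicitly.

```python
from collections import Counter


def agreement_stats(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, gender.

    gender is None when the token has no Gender feature.
    Returns (agree, total), Counters keyed by (parent UPOS, deprel, child UPOS).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent node
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if child["gender"] is not None and child["gender"] == parent["gender"]:
            agree[key] += 1
    return agree, total


# Invented sample sentence, not taken from the treebank.
SENTENCE = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "gender": "Fem"},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "gender": "Fem"},
    {"id": 4, "head": 3, "upos": "ADV", "deprel": "advmod", "gender": None},
]

agree, total = agreement_stats(SENTENCE)
for (parent, rel, child), n in total.items():
    print(f"{parent} -[{rel}]-> {child}: {agree[(parent, rel, child)]} of {n}")
```

Summing the two counters over all sentences of a treebank and sorting by the agreement count would yield a ranking in the style of the list above.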