home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

47724 tokens (50%) have a non-empty value of Gender. 16199 types (80%) occur at least once with a non-empty value of Gender. 7080 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (21520; 23% instances), ADJ (7811; 8% instances), PROPN (7071; 7% instances), DET (4491; 5% instances), VERB (2907; 3% instances), PRON (2857; 3% instances), NUM (906; 1% instances), AUX (161; 0% instances).

NOUN

21520 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (15053; 70%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 99% lemmas (2582) occur only with one value of Gender.

ADJ

7811 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (7018; 90%), Variant=EMPTY (6936; 89%), Number=Sing (5687; 73%).

ADJ tokens may have the following values of Gender:

Paradigm великийMascFemNeut
Animacy=Anim|Case=Acc|Number=Singвеликого, великаго, великог[о], великѡг[о]
Animacy=Anim|Case=Acc|Number=Plurвеликихъ
Case=Acc|Number=Singвеликий, великии, великой, Великійвеликуювеликое
Case=Acc|Number=Sing|Variant=Shortвеликъвеликувелико
Case=Acc|Number=Plurвеликиевеликая, великаꙗ
Case=Dat|Number=Singвеликому, великомꙋвеликои, великойвеликому
Case=Dat|Number=Sing|Variant=Shortвеликувелику
Case=Dat|Number=Plurвеликимъ, великимвеликимъ
Case=Gen|Number=Singвеликого, великаго, великог[о]великія, великия, великои, великие, великое, великойвеликого, великаго, великог[о]
Case=Gen|Number=Plurвеликихъ, великих, великыхвеликихъвеликих, великихъ
Case=Gen|Number=Plur|Typo=Yesвеликии
Case=Ins|Number=Singвеликим, великимъвеликоювеликимъ
Case=Ins|Number=Plurвеликим, великими
Case=Loc|Number=Singвеликом, великомъ, Велікомъвеликой, велицеивеликом, великомъ, велицем
Case=Loc|Number=Plurвеликихвеликихъ
Case=Nom|Number=Singвеликии, великий, великій, великiй, великой, Великиⸯвеликая, великаꙗвеликое
Case=Nom|Number=Sing|Variant=Shortвелики, велик, великъвеликавелико
Case=Nom|Number=Plurвеликии, великиие, великия, великіе, велицыи
Case=Nom|Number=Plur|Variant=Shortвелики

PROPN

7071 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7001; 99%).

PROPN tokens may have the following values of Gender:

Paradigm ВасильевъMascFem
Animacy=Anim|Case=Acc|NameType=Sur|Number=SingВасильева
Case=Acc|NameType=Pat|Number=PlurВасильевы
Case=Dat|NameType=Pat|Number=SingВасильевѣ
Case=Gen|NameType=Pat|Number=PlurВасильевых
Case=Gen|NameType=Sur|Number=SingВасил(ь)ева
Case=Nom|NameType=Pat|Number=SingВасильев
Case=Nom|NameType=Sur|Number=SingВасильев, Васильевъ

Gender seems to be lexical feature of PROPN. 99% lemmas (1991) occur only with one value of Gender.

DET

4491 DET tokens (99% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Reflex=EMPTY (3972; 88%), Number=Sing (3010; 67%), Poss=EMPTY (2949; 66%).

DET tokens may have the following values of Gender:

Paradigm тотъMascFemNeut
Animacy=Anim|Case=Acc|Number=Singтого, тово, тог[о]
Animacy=Anim|Case=Acc|Number=Plurтѣхъ, тех, тѣх
Case=Acc|Number=Singтотъ, тот, тои, той, тыи, тѣитое, ту, тою, тои, тоѣ, туюто, того
Case=Acc|Number=Plurтѣ, тете, тѣ, тата, те, тѣ
Case=Dat|Number=Singтому, томꙋтой, тоитому, томꙋ
Case=Dat|Number=Plurтѣмъ, тем, тымътем, тѣмъ, тымътем, тѣмъ
Case=Gen|Number=Singтого, тово, таво, тог[о], тоетое, тои, тоя, той, то(й), тоейтого, тово, тог[о], таво
Case=Gen|Number=Plurтех, тѣхъ, тихъ, тѣх, тыхътѣхъ, техтех
Case=Ins|Number=Singтѣмъ, тѣм, тем, темъ, тимътою, тои, тойтем, темъ, тѣмъ, тѣм
Case=Ins|Number=Plurтѣми, теми, тымитѣми, теми, тѣнитѣми
Case=Loc|Number=Singтомъ, томтой, тои, тоітомъ, том, то(м)
Case=Loc|Number=Plurтех, тѣхътѣхъ, тех, тѣхтех, тѣх, тѣхъ
Case=Nom|Number=Singтот, той, тотъ, тыита, те, т[а]то
Case=Nom|Number=Plurте, тѣ, тии, тоете, тѣ, тата, те, тые

VERB

2907 VERB tokens (36% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (2907; 100%), Mood=EMPTY (2901; 100%), Tense=Past (2720; 94%), Number=Sing (2592; 89%), Variant=EMPTY (1835; 63%), Voice=Act (1593; 55%), Case=EMPTY (1513; 52%), VerbForm=PartRes (1500; 52%).

VERB tokens may have the following values of Gender:

Paradigm велѣтиMascFemNeut
Case=Loc|Number=Plur|Tense=Pres|VerbForm=Part|Voice=Actвелящих
Case=Nom|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passвелѣно, велено
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвелѣлъ, велел, велѣл, велелъвелела, велѣла

PRON

2857 PRON tokens (67% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2126; 74%), PronType=Prs (1808; 63%), Person=3 (1807; 63%).

PRON tokens may have the following values of Gender:

Paradigm ониMascFemNeut
Case=Acc|Number=Plurих, ихъ, нихъ, них, іхъ, iхъіхъіх
Case=Dat|Number=Singим
Case=Dat|Number=Plurимъ, им, нимъ, нимим, имъ, ним, нимъ
Case=Gen|Number=Plurих, ихъ, них, нихъ, іхъих, них, ихъ, нихъ
Case=Ins|Number=Singнимъ
Case=Ins|Number=Plurними, ими, ним[и], імиими
Case=Loc|Number=Plurних, нихъних, нихъ
Case=Nom|Number=Singани
Case=Nom|Number=Plurони, ани

NUM

906 NUM tokens (36% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (758; 84%), NumForm=Word (658; 73%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Animacy=Anim|Case=Accдву
Case=Accдва, дв[а]две, двѣ, [д]ведва
Case=Datдвумъ
Case=Genдву, двухъдву, дви, двух, двухъдву
Case=Insдвема, двома, двѣмя
Case=Locдвудву, двухъдву, двухъ
Case=Nomдва, д[ва], дв[а]две, два, двѣ, дв[е], двидва

AUX

161 AUX tokens (20% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (161; 100%), Voice=Act (161; 100%), Number=Sing (157; 98%), Tense=Past (155; 96%), Analyt=EMPTY (153; 95%), Mood=EMPTY (153; 95%), VerbForm=PartRes (150; 93%).

AUX tokens may have the following values of Gender:

Paradigm бытиMascFemNeut
Analyt=Yes|Mood=Cnd|Number=Sing|Tense=Past|VerbForm=PartResбылъбыло
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartResбыло
Case=Acc|Number=Plur|Tense=Past|VerbForm=Partбывшіꙗ
Case=Dat|Number=Sing|Tense=Past|Variant=Short|VerbForm=Partбывшу
Case=Dat|Number=Sing|Tense=Pres|Variant=Short|VerbForm=Partсушу, сущу, сꙋщꙋ
Case=Dat|Number=Sing|Tense=Pres|VerbForm=Partсущу
Case=Dat|Number=Plur|Tense=Past|Variant=Short|VerbForm=Partбывши
Case=Gen|Number=Plur|Tense=Past|VerbForm=Partбывших
Case=Nom|Number=Sing|Tense=Pres|Variant=Short|VerbForm=Partсущъ
Case=Nom|Number=Sing|Tense=Pres|VerbForm=Partсущий
Case=Nom|Number=Plur|Tense=Past|Variant=Short|VerbForm=Partбывше
Mood=Cnd|Number=Sing|Tense=Past|VerbForm=PartResбыло
Number=Sing|Tense=Past|VerbForm=PartResбылъ, былбылабыло, была

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6191; 98%), NOUN –[det]–> DET (3502; 98%), PROPN –[flat:name]–> PROPN (2069; 100%), NOUN –[conj]–> NOUN (1913; 59%), NOUN –[appos]–> PROPN (1449; 91%), NOUN –[appos]–> NOUN (585; 81%), PROPN –[conj]–> PROPN (478; 91%), ADJ –[conj]–> ADJ (384; 93%), PROPN –[appos]–> NOUN (278; 93%), PRON –[appos]–> PROPN (261; 91%).