home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

24379 tokens (50%) have a non-empty value of Gender. 9204 types (80%) occur at least once with a non-empty value of Gender. 4413 lemmas (76%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (11120; 23% instances), ADJ (4263; 9% instances), PROPN (3467; 7% instances), DET (2022; 4% instances), VERB (1537; 3% instances), PRON (1355; 3% instances), NUM (558; 1% instances), AUX (56; 0% instances), SCONJ (1; 0% instances).

NOUN

11120 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (7295; 66%).

NOUN tokens may have the following values of Gender:

Paradigm гривенкаMascFem
Case=Acc|Number=Singхривенку
Case=Acc|Number=Countгривенки
Case=Gen|Number=Singгривенки, [гри]венки
Case=Gen|Number=Plurгри{л._140_об.}венокгривенок, гривенак, гр[и]венок, гри{л._130_об.}венок, грив[е]нак, гриве[но]к, гривен[ок], гривено[к]
Case=Nom|Number=Singгривенка
Case=Nom|Number=Countгривенки, гри[ве]нки, гривенак

Gender seems to be lexical feature of NOUN. 99% lemmas (1770) occur only with one value of Gender.

ADJ

4263 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (3975; 93%), Number=Sing (2932; 69%).

ADJ tokens may have the following values of Gender:

Paradigm великийMascFemNeut
Animacy=Anim|Case=Acc|Number=Singвеликого, великаго, великог[о], великѡг[о]
Case=Acc|Number=SingВеликии, Великий, ВеликійВеликуювеликое
Case=Acc|Number=Sing|Variant=Shortвеликъвеликувелико
Case=Acc|Number=Plurвеликаꙗ
Case=Dat|Number=Singвеликому, великомꙋвеликому
Case=Dat|Number=Sing|Variant=Shortвеликувелику
Case=Dat|Number=Plurвеликимъвеликимъ
Case=Gen|Number=Singвеликого, великаго, Великіявеликіявеликаго, великог[о]
Case=Gen|Number=Plurвеликихъ, великих, великыхвеликихъвеликихъ
Case=Ins|Number=Singвеликим
Case=Ins|Number=Plurвеликим
Case=Loc|Number=Singвеликом, великомъ, Велікомъвеликом, великомъ
Case=Loc|Number=Plurвеликихъ
Case=Nom|Number=Singвеликий, великій, великии, Великиⸯвеликаꙗ, великая
Case=Nom|Number=Sing|Variant=Shortвелики, великъвелика
Case=Nom|Number=Plurвеликии, великіе, велицыи
Case=Nom|Number=Plur|Variant=Shortвелики

PROPN

3467 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3433; 99%), NameType=EMPTY (2382; 69%).

PROPN tokens may have the following values of Gender:

Paradigm ПсковъMascFem
Case=AccПсковъ
Case=DatПскову, ПъсковуПсковѣ
Case=GenПскова, Пъскова, Пьскова
Case=LocПскове, Пъскове, Пъсковѣ, Пьсковѣ

Gender seems to be lexical feature of PROPN. 99% lemmas (1065) occur only with one value of Gender.

DET

2022 DET tokens (98% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (1386; 69%), Number=Sing (1350; 67%).

DET tokens may have the following values of Gender:

Paradigm тотъMascFemNeut
Animacy=Anim|Case=Acc|Number=Singтого, тово
Animacy=Anim|Case=Acc|Number=Plurтѣхъ, тех, тѣх
Case=Acc|Number=Singтот, тотъ, тои, то, тыи, тѣитое, ту, тою, тоѣто
Case=Acc|Number=Plurтѣ, тетѣ, те, тате, та, тѣ
Case=Dat|Number=Singтому, томꙋ, тѣмътои, тойтому, томꙋ
Case=Dat|Number=Plurтѣмъ, темтымъ, тѣмъ
Case=Gen|Number=Singтого, тово, таво, тоетое, тои, той, тоятово, того
Case=Gen|Number=Plurтех, тѣхъ, тыхътѣхъ, тех
Case=Ins|Number=Singтѣмъ, тем, тѣм, темътою, тои, тойтемъ, тем, тѣмъ
Case=Ins|Number=Plurтеми, тымитеми, тѣми, тѣни
Case=Loc|Number=Singтомътои, той, тоітомъ, том, тѡм
Case=Loc|Number=Plurтѣхъ, техтѣхъ, тѣхтех
Case=Nom|Number=Singтот, той, тотъ, тыита, те, т[а]то
Case=Nom|Number=Plurте, тѣ, тии, тыете, тѣ, тата, те

VERB

1537 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (1537; 100%), Mood=EMPTY (1530; 100%), Tense=Past (1431; 93%), Number=Sing (1344; 87%), VerbForm=Part (892; 58%), Aspect=Perf (856; 56%), Variant=EMPTY (830; 54%), Voice=Pass (775; 50%).

VERB tokens may have the following values of Gender:

Paradigm взятиMascFemNeut
Animacy=Anim|Case=Acc|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passвзятыхъ
Case=Acc|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passвзꙗтыꙗ
Case=Gen|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passвзята
Case=Gen|Number=Plur|Tense=Past|VerbForm=Part|Voice=Passвзятыхъ
Case=Nom|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Actвзем
Case=Nom|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passвзят, взята, взятъ, взатвзятавзято, взат(о), взато, взета, взя[то]
Case=Nom|Number=Plur|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passвзяты, взатывзяты
Number=Sing|Tense=Fut|VerbForm=PartRes|Voice=Actвзял
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actвзял, взялъ, взѧл
Number=Plur|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passвзятывзяты

PRON

1355 PRON tokens (65% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (977; 72%), PronType=Prs (868; 64%), Person=3 (831; 61%).

PRON tokens may have the following values of Gender:

Paradigm онъMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|Person=3ево, его
Case=Acc|Number=Sing|Person=3ево, его, негоего
Case=Acc|Number=Plur|Person=3их
Case=Dat|Number=Sing|Person=3ему, емꙋ, нему
Case=Dat|Number=Plur|Person=3им
Case=Gen|Number=Sing|Person=3его, ево, нево, него, ег[о]еꙗ
Case=Ins|Number=Sing|Person=3ним, нимъ, имъ
Case=Loc|Number=Sing|Person=3нем, немъ, не(м), не[м]
Case=Loc|Number=Plur|Person=3них
Case=Nom|Number=Sing|Person=3он, онъ
Case=Nom|Number=Singон

NUM

558 NUM tokens (35% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (374; 67%), NumType=Card (293; 53%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Case=Accдвадве, двѣ
Case=Acc|NumForm=Word|NumType=Cardдва, дв[а]две, [д]ведва
Case=Genдву
Case=Gen|NumForm=Word|NumType=Cardдвудву, двух
Case=Insдвема, двома
Case=Locдвудву
Case=Loc|NumForm=Word|NumType=Cardдвудву
Case=Nomдвадве, двѣ
Case=Nom|NumForm=Word|NumType=Cardдва, д[ва], дв[а]две, два, дв[е]два

AUX

56 AUX tokens (16% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Person=EMPTY (56; 100%), Number=Sing (55; 98%), Tense=Past (54; 96%), VerbForm=PartRes (53; 95%), Analyt=EMPTY (48; 86%), Mood=EMPTY (48; 86%), Voice=Act (37; 66%).

AUX tokens may have the following values of Gender:

Paradigm бытиMascFemNeut
Analyt=Yes|Mood=Cnd|Number=Sing|Tense=Past|VerbForm=PartResбылъбыло
Analyt=Yes|Number=Sing|Tense=Past|VerbForm=PartResбыло
Case=Acc|Number=Plur|Tense=Past|VerbForm=Part|Voice=Actбывшіꙗ
Case=Dat|Number=Sing|Tense=Pres|Variant=Short|VerbForm=Part|Voice=Actсꙋщꙋ
Case=Dat|Number=Sing|Tense=Pres|VerbForm=Part|Voice=Actсущу
Mood=Cnd|Number=Sing|Tense=Past|VerbForm=PartResбыло
Number=Sing|Tense=Past|VerbForm=PartResбыл, былъбылабыло
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actбылъ, былбылабыло, была

SCONJ

1 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (3311; 96%), NOUN –[det]–> DET (1570; 96%), PROPN –[flat:name]–> PROPN (1054; 100%), NOUN –[conj]–> NOUN (974; 58%), NOUN –[appos]–> PROPN (567; 93%), NOUN –[appos]–> NOUN (259; 78%), PROPN –[conj]–> PROPN (232; 88%), ADJ –[conj]–> ADJ (204; 96%), NOUN –[nummod]–> NUM (147; 71%), PROPN –[appos]–> NOUN (119; 88%).