home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

8524 tokens (45%) have a non-empty value of Gender. 3741 types (77%) occur at least once with a non-empty value of Gender. 1884 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (3711; 20% instances), ADJ (1484; 8% instances), PROPN (1184; 6% instances), DET (912; 5% instances), PRON (612; 3% instances), VERB (534; 3% instances), NUM (64; 0% instances), AUX (23; 0% instances).

NOUN

3711 NOUN tokens (95% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2432; 66%).

NOUN tokens may have the following values of Gender:

Paradigm лещьMascNeut
Case=Acc|Number=Singлеща
Case=Gen|Number=Plurлещей, лещевлещ
Case=Ins|Number=Plurлещами
Case=Nom|Number=Singлещ, лещьлещ
Case=Nom|Number=Plurлещи
Case=Nom|Number=Adnumлеща

Gender seems to be lexical feature of NOUN. 99% lemmas (815) occur only with one value of Gender.

ADJ

1484 ADJ tokens (97% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (1414; 95%), Variant=Long (1266; 85%), Number=Sing (912; 61%).

ADJ tokens may have the following values of Gender:

Paradigm великийMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|Variant=Longвеликого, великог[о], великѡг[о]
Case=Acc|Number=Sing|Variant=LongВеликийВеликую
Case=Dat|Number=Sing|Variant=Longвеликомꙋ, великомувеликому
Case=Gen|Number=Sing|Variant=Longвеликого, великаго, ВеликіяВеликіявеликаго, великог[о]
Case=Gen|Number=Plur|Variant=Longвеликых
Case=Ins|Number=Sing|Variant=Longвеликим
Case=Ins|Number=Plur|Variant=Longвеликим
Case=Loc|Number=Sing|Variant=Longвеликом, Великомъвеликом
Case=Nom|Number=Sing|Variant=Longвеликий, великійвеликая
Case=Nom|Number=Sing|Variant=Shortвелики, великъвелика
Case=Nom|Number=Plur|Variant=Longвеликіе
Case=Nom|Number=Plur|Variant=Shortвелики

PROPN

1184 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1174; 99%).

PROPN tokens may have the following values of Gender:

Paradigm ГороденьMascFem
Case=AccГородень
Case=GenГоро[д]н[я], Городня, ГородяГородня

Gender seems to be lexical feature of PROPN. 100% lemmas (373) occur only with one value of Gender.

DET

912 DET tokens (97% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=EMPTY (762; 84%), Poss=EMPTY (623; 68%), Number=Sing (596; 65%).

DET tokens may have the following values of Gender:

Paradigm тотъMascFemNeut
Animacy=Anim|Case=Acc|Number=Singтого
Animacy=Anim|Case=Acc|Number=Plurтѣхъ
Case=Acc|Number=Singтот, тотътою, тое, тоѣто
Case=Acc|Number=Sing|PronType=Demтот, товотуто
Case=Acc|Number=Plurтѣ, те, тѣхтѣ, тете
Case=Acc|Number=Plur|PronType=Demтете
Case=Acc|Number=Plur|Variant=Shortтѣ
Case=Dat|Number=Singтому, томꙋ, тѣмътоитому, томꙋ
Case=Dat|Number=Sing|PronType=Demтомꙋ
Case=Dat|Number=Plurтѣмътымъ, тѣмъ
Case=Dat|Number=Plur|PronType=Demтем
Case=Gen|Number=Singтого, товотоятого, тово
Case=Gen|Number=Sing|PronType=Demтово, таво, тоетое, тойтово, того
Case=Gen|Number=Plurтѣхъ, техтѣхъ
Case=Gen|Number=Plur|PronType=Demтех
Case=Ins|Number=Singтѣм, тем, темъ, тѣмътоитемъ
Case=Ins|Number=Sing|PronType=Demтемтой
Case=Ins|Number=Plurтѣни
Case=Ins|Number=Plur|PronType=Demтеми
Case=Loc|Number=Singтомътой, тоитом, томъ, тѡм
Case=Loc|Number=Sing|PronType=Demтомъ
Case=Loc|Number=Plurтѣхътѣхъ, тѣх
Case=Loc|Number=Plur|PronType=Demтех
Case=Nom|Number=Singтоттато
Case=Nom|Number=Sing|PronType=Demтотта, тето
Case=Nom|Number=Plurтѣ, тетѣ, тета
Case=Nom|Number=Plur|PronType=Demтете

PRON

612 PRON tokens (55% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (414; 68%), PronType=Prs (367; 60%), Person=3 (338; 55%).

PRON tokens may have the following values of Gender:

Paradigm которыйMascFemNeut
Animacy=Anim|Case=Acc|Number=Plur|PronType=Relкоторыхъ
Case=Acc|Number=Sing|PronType=Relкоторую
Case=Acc|Number=Plur|PronType=Relкоторые
Case=Dat|Number=Plur|PronType=Relкоторым
Case=Ins|Number=Sing|PronType=Relкоторой
Case=Loc|Number=Sing|PronType=Indкоторомъ
Case=Loc|Number=Sing|PronType=Relкоторомъ
Case=Loc|Number=Plur|PronType=Indкоторыхъ
Case=Nom|Number=Sing|PronType=Relкоторои, которыи
Case=Nom|Number=Plur|PronType=Relкоторые, ко[то]рыекоторыекотораа

VERB

534 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (534; 100%), Mood=EMPTY (527; 99%), Tense=Past (496; 93%), Number=Sing (462; 87%), Case=EMPTY (328; 61%), Variant=EMPTY (322; 60%), VerbForm=PartRes (321; 60%), Aspect=Perf (283; 53%).

VERB tokens may have the following values of Gender:

Paradigm послатиMascFemNeut
Aspect=Perf|Case=Gen|Number=Plur|Tense=Past|Variant=Long|VerbForm=Part|Voice=Passпосланныхъ
Aspect=Perf|Case=Nom|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passпослан, посланъпосланапослано
Aspect=Perf|Case=Nom|Number=Plur|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passпосланыпосланы
Aspect=Perf|Number=Sing|Tense=Fut|VerbForm=PartResпослал
Aspect=Perf|Number=Sing|Tense=Past|VerbForm=PartResпослал, послалъпослала
Aspect=Perf|Number=Plur|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passпосланы
Case=Nom|Number=Sing|Tense=Past|Variant=Short|VerbForm=Part|Voice=Passпослано
Number=Sing|Tense=Past|VerbForm=PartRes|Voice=Actпослал

NUM

64 NUM tokens (30% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=EMPTY (40; 63%).

NUM tokens may have the following values of Gender:

Paradigm одинъMascFemNeut
Animacy=Anim|Case=Accодново
Case=Accаднꙋ, одну
Case=Genодиного, однова
Case=Locодномодном
Case=Nomодин, адин, ѡдинъодна

AUX

23 AUX tokens (14% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (23; 100%), Person=EMPTY (23; 100%), Tense=Past (23; 100%), VerbForm=PartRes (23; 100%), Voice=EMPTY (19; 83%), Analyt=EMPTY (15; 65%), Mood=EMPTY (15; 65%).

AUX tokens may have the following values of Gender:

Paradigm бытиMascFemNeut
Analyt=Yes|Mood=Cndбылъбыло
Analyt=Yesбыло
Mood=Cndбыло
был, былъбылабыло
Voice=Actбылбылабыло

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (1137; 95%), NOUN –[det]–> DET (685; 96%), PROPN –[flat:name]–> PROPN (389; 100%), NOUN –[conj]–> NOUN (298; 63%), NOUN –[appos]–> PROPN (274; 94%), NOUN –[nmod]–> NOUN (209; 52%), NOUN –[appos]–> NOUN (126; 88%), ADJ –[conj]–> ADJ (99; 95%), VERB –[nsubj]–> PROPN (45; 70%), PROPN –[appos]–> NOUN (40; 82%).