This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home cu/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Old_Church_Slavonic)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

24745 tokens (43%) have a non-empty value of Gender. 6465 types (65%) occur at least once with a non-empty value of Gender. 2152 lemmas (73%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (9618; 17% instances), PRON (5785; 10% instances), ADJ (3903; 7% instances), VERB (3278; 6% instances), PROPN (1554; 3% instances), NUM (596; 1% instances), DET (11; 0% instances).

NOUN

9618 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (6901; 72%).

NOUN tokens may have the following values of Gender:

Paradigm вьсьMascFemNeut
Case=Acc|Number=Singвесь
Case=Acc|Number=Plurвьсивьси
Case=Gen|Number=Singвьсивьсего
Case=Loc|Number=Singвьси
Case=Loc|Number=Plurвьсехъ
Case=Nom|Number=Plurвьси

Gender seems to be lexical feature of NOUN. 97% lemmas (963) occur only with one value of Gender.

PRON

5785 PRON tokens (61% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (5435; 94%), Poss=EMPTY (4760; 82%), Number=Sing (4178; 72%), PronType=Prs (4121; 71%), Person=3 (3348; 58%).

PRON tokens may have the following values of Gender:

Paradigm своиFem,MascFem,NeutMascMasc,NeutFemNeut
Case=Acc|Number=Singсвоисвоѭ, своѭ҄, своѭ҅свое, свое҅
Case=Acc|Number=Dualсвоисвоѣсвои
Case=Acc|Number=Plurсвоѩ, своѩ҅своѩсвоѩ, своѩ҅своѣ
Case=Dat|Number=Singсвоемоусвоеи, своеи҅
Case=Dat|Number=Plurсвоимъ
Case=Gen|Number=Singсвоегосвоегосвоеѩ, своеѩ҅своего
Case=Gen|Number=Plurсвоихъсвоихъсвоихъ
Case=Ins|Number=Singсвоимь, своимъсвоимь, своимъсвоеѭ, своеѭ҄
Case=Loc|Number=Singсвоемь, своемъсвоеи
Case=Loc|Number=Plurсвоихъ
Case=Nom|Number=Singсвоисвоѣ
Case=Nom|Number=Plurсвоисвоѩ҅

ADJ

3903 ADJ tokens (97% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2876; 74%), Degree=Pos (2338; 60%).

ADJ tokens may have the following values of Gender:

Paradigm исоусовъFem,MascFem,NeutMascMasc,NeutFemNeut
Case=Acc|Number=Singи҃свъ, и҅сѵсовъи҃сво
Case=Acc|Number=Dualи҃свѣ
Case=Acc|Number=Plurи҃свꙑ
Case=Dat|Number=Dualи҃свамаи҃свама
Case=Gen|Number=Singи҃сва
Case=Loc|Number=Singи҃свѣ
Case=Nom|Number=Singи҃свъи҃сваи҃сво

VERB

3278 VERB tokens (22% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (3278; 100%), Person=EMPTY (3278; 100%), Mood=EMPTY (3278; 100%), Aspect=EMPTY (3080; 94%), Voice=Act (2816; 86%), Strength=Strong (2618; 80%), Case=Nom (2516; 77%), Number=Sing (2203; 67%).

VERB tokens may have the following values of Gender:

Paradigm бꙑтиFem,MascMascMasc,NeutFemNeut
Aspect=Res|Case=Nom|Number=Sing|Strength=Strong|Voice=Actбꙑлъ, бъилъбꙑлабꙑло
Aspect=Res|Case=Nom|Number=Plur|Strength=Strong|Voice=Actбꙑлибꙑлꙑ
Case=Acc|Number=Sing|Strength=Strong|Tense=Past|Voice=Actбꙑвъшъ
Case=Acc|Number=Sing|Strength=Strong|Tense=Pres|Voice=Actсѫштъсѫштѫ, сѫщѫ
Case=Acc|Number=Sing|Strength=Weak|Tense=Fut|Voice=Actбѫдѫщии
Case=Acc|Number=Sing|Strength=Weak|Tense=Past|Voice=Actбꙑвъшии҅бꙑвъшѫѭбꙑвъшее
Case=Acc|Number=Plur|Strength=Strong|Tense=Past|Voice=Actбꙑвъша
Case=Acc|Number=Plur|Strength=Strong|Tense=Pres|Voice=Actсѫщѧѩ
Case=Acc|Number=Plur|Strength=Weak|Tense=Past|Voice=Actбꙑвъшаа
Case=Acc|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actсѫщѧѩсѫштаа
Case=Dat|Number=Sing|Strength=Strong|Tense=Past|Voice=Actбꙑвъшоу, бꙑвъшю, бъвъшюбꙑвъшибꙑвъшю, бꙑвъшоу
Case=Dat|Number=Sing|Strength=Strong|Tense=Pres|Voice=Actсѫштю, сѫштоу, сѫщю, сѫщоусѫшти
Case=Dat|Number=Sing|Strength=Weak|Tense=Past|Voice=Actбꙑвъшюмоу
Case=Dat|Number=Plur|Strength=Strong|Tense=Past|Voice=Actбꙑвъшамъ
Case=Dat|Number=Plur|Strength=Weak|Tense=Past|Voice=Actбꙑвъшимъ
Case=Dat|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actсѫштиимъ, сѫщимъ, сѫштимъ
Case=Gen|Number=Sing|Strength=Strong|Tense=Past|Voice=Actбꙑвъша
Case=Gen|Number=Sing|Strength=Strong|Tense=Pres|Voice=Actсѫштасѫщасѫштѧ
Case=Gen|Number=Sing|Strength=Weak|Tense=Past|Voice=Actбꙑвъшаагобꙑвъшааго, бꙑвъшаго
Case=Gen|Number=Plur|Strength=Strong|Tense=Pres|Voice=Actсѫшть
Case=Gen|Number=Plur|Strength=Weak|Tense=Past|Voice=Actбꙑвъшиихъ
Case=Gen|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actсѫщтихъ
Case=Ins|Number=Sing|Strength=Strong|Tense=Pres|Voice=Actсѫштеѭ
Case=Ins|Number=Sing|Strength=Weak|Tense=Pres|Voice=Actсѫштиими
Case=Nom|Number=Sing|Strength=Strong|Tense=Past|Voice=Actбꙑвъ
Case=Nom|Number=Sing|Strength=Strong|Tense=Pres|Voice=Actсꙑсꙑсѫшти
Case=Nom|Number=Sing|Strength=Weak|Tense=Past|Voice=Actбꙑвъшее
Case=Nom|Number=Sing|Strength=Weak|Tense=Pres|Voice=Actсꙙи, сѫи, сꙑисѫштее
Case=Nom|Number=Dual|Strength=Weak|Tense=Pres|Voice=Actсѫштаа
Case=Nom|Number=Plur|Strength=Strong|Tense=Past|Voice=Actбꙑвъшебꙑвьшѧ
Case=Nom|Number=Plur|Strength=Strong|Tense=Pres|Voice=Actсѫште, сѫще
Case=Nom|Number=Plur|Strength=Strong|Tense=Pres|Voice=Passсѫще
Case=Nom|Number=Plur|Strength=Weak|Tense=Past|Voice=Actбꙑвъшеибꙑвъшѧѩ
Case=Nom|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actсѫштеи, сѫщеи, сѫщи

PROPN

1554 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1544; 99%), Case=Nom (903; 58%).

PROPN tokens may have the following values of Gender:

Gender seems to be lexical feature of PROPN. 100% lemmas (108) occur only with one value of Gender.

NUM

596 NUM tokens (89% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (400; 67%).

NUM tokens may have the following values of Gender:

Paradigm ѥдинъFem,MascMascMasc,NeutFemNeut
Case=Acc|Number=Singединъединѫ, единѫѭ, единѫѭ҄едино
Case=Acc|Number=Plurединꙑединꙑ
Case=Dat|Number=Singединомоу, единоуемоуединомоуединои
Case=Dat|Number=Plurединѣмъ
Case=Gen|Number=Singединогоединого, единаагоединоѩ, единꙑединого
Case=Ins|Number=Singединѣмь, единѣмъ
Case=Loc|Number=Singединомъединомь, единомъ
Case=Nom|Number=Singединъ, единь, Е҅динꙑединаедино
Case=Nom|Number=Plurедини

DET

11 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (10; 91%), Case=Nom (7; 64%).

DET tokens may have the following values of Gender:

Paradigm ижеMascNeut
Case=Acc|Number=Singеже
Case=Nom|Number=Singеже
Case=Nom|Number=Plurиже

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> PRON (1068; 54%), NOUN –[amod]–> ADJ (893; 68%), NOUN –[conj]–> NOUN (308; 56%), NOUN –[nmod]–> ADJ (300; 68%), VERB –[conj]–> VERB (178; 77%), PROPN –[appos]–> NOUN (123; 98%), PROPN –[conj]–> PROPN (92; 94%), ADJ –[amod]–> ADJ (86; 96%), ADJ –[conj]–> ADJ (66; 94%), ADJ –[nmod]–> PRON (53; 58%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]