home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

126689 tokens (42%) have a non-empty value of Gender. 37278 types (72%) occur at least once with a non-empty value of Gender. 18160 lemmas (62%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: NOUN (71386; 23% instances), PROPN (18806; 6% instances), ADJ (17068; 6% instances), VERB (8180; 3% instances), PRON (5555; 2% instances), DET (4289; 1% instances), AUX (848; 0% instances), NUM (516; 0% instances), ADV (32; 0% instances), SYM (8; 0% instances), CCONJ (1; 0% instances).

NOUN

71386 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (59221; 83%), Number=Sing (51024; 71%).

NOUN tokens may have the following values of Gender:

Paradigm месцаMascFemNeut
Case=Acc|Number=Singмесца
Case=Acc|Number=Plurмесцымесцы
Case=Dat|Number=Singмесцу
Case=Dat|Number=Plurмесцам
Case=Gen|Number=Singмесца
Case=Gen|Number=Plurмесцаў, месц
Case=Ins|Number=Singмесцам
Case=Ins|Number=PlurМесцамі
Case=Loc|Number=Singмесцымесцы
Case=Loc|Number=Plurмесцах
Case=Nom|Number=Singмецамесца
Case=Nom|Number=Sing|Typo=Yesмесяца
Case=Nom|Number=Plurмесцы

Gender seems to be lexical feature of NOUN. 99% lemmas (8881) occur only with one value of Gender.

PROPN

18806 PROPN tokens (92% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (17673; 94%), Animacy=Anim (10653; 57%).

PROPN tokens may have the following values of Gender:

Paradigm КалядаMascFemNeut
Animacy=Anim|Case=Nom|NameType=Prs|Number=SingКаляда
Animacy=Anim|Case=Nom|NameType=Sur|Number=SingКаляда
Animacy=Inan|Case=Acc|NameType=Oth|Number=PlurКаляды

Gender seems to be lexical feature of PROPN. 98% lemmas (3641) occur only with one value of Gender.

ADJ

17068 ADJ tokens (64% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (17065; 100%), Degree=Pos (16851; 99%), Animacy=EMPTY (15302; 90%).

ADJ tokens may have the following values of Gender:

Paradigm беларускіMascFemNeut
Animacy=Anim|Case=Accбеларускага
Animacy=Inan|Case=Accбеларускібеларускае
Case=Accбеларускую
Case=Datбеларускамубеларускайбеларускаму
Case=Genбеларускагабеларускай, беларускае, беларускаябеларускага
Case=Gen|Typo=Yesбеларускай
Case=Insбеларускімбеларускайбеларускім
Case=Locбеларускімбеларускайбеларускім
Case=Nomбеларускі, Беларускiбеларускаябеларускае

VERB

8180 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (8178; 100%), Number=Sing (8175; 100%), Tense=Past (8152; 100%), Mood=Ind (6747; 82%), VerbForm=Fin (6747; 82%), Aspect=Perf (6534; 80%), Voice=Act (5260; 64%).

VERB tokens may have the following values of Gender:

Paradigm магчыMascFemNeut
могмагламагло

PRON

5555 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (5552; 100%), Person=EMPTY (4056; 73%), Case=Nom (2910; 52%).

PRON tokens may have the following values of Gender:

Paradigm якіMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|PronType=Relякогаякога
Animacy=Anim|Case=Ins|Number=Sing|PronType=Relякiм
Animacy=Inan|Case=Acc|Number=Sing|PronType=Relякі, якiякое
Animacy=Inan|Case=Dat|Number=Sing|PronType=Relякому
Case=Acc|Number=Singякую
Case=Acc|Number=Sing|PronType=Relякую
Case=Dat|Number=Sing|PronType=Relякомуякой
Case=Dat|Number=Plur|PronType=Relякім
Case=Gen|Number=Sing|PronType=Relякогаякойякога
Case=Ins|Number=Sing|PronType=Relякімякой, якоюЯкім
Case=Loc|Number=Sing|PronType=Relякімякойякім
Case=Nom|Number=Singякая
Case=Nom|Number=Sing|PronType=Relякі, якiякаяякое

DET

4289 DET tokens (64% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4285; 100%), Reflex=EMPTY (3737; 87%), Animacy=EMPTY (3452; 80%), Poss=EMPTY (2743; 64%).

DET tokens may have the following values of Gender:

Paradigm нашMascFemNeut
Animacy=Anim|Case=Accнашага
Animacy=Inan|Case=Accнаш, намнашае, наша
Case=Accнашу, нашую
Case=Datнашамунашай
Case=Genнашага, нашанашай, нашаенашага
Case=Insнашымнашайнашым
Case=Locнашымнашайнашым
Case=Nomнаш, НАШЫнаша, нашая, Нішанаша, нашае, Наше

AUX

848 AUX tokens (41% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (848; 100%), Person=EMPTY (848; 100%), Mood=Ind (845; 100%), Tense=Past (845; 100%), VerbForm=Fin (845; 100%), Voice=Act (845; 100%), Aspect=EMPTY (675; 80%).

AUX tokens may have the following values of Gender:

Paradigm быцьMascFemNeut
Animacy=Inan|Case=Accбуду
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Fin|Voice=Actбыўбылабыло, была
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Fin|Voice=ActБыў
Case=LocБУДЗЕ
Mood=Ind|Tense=Past|VerbForm=Fin|Voice=Actбыўбылабыло, была

NUM

516 NUM tokens (9% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (498; 97%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Animacy=Anim|Case=Acc|NumType=Cardдвухдзьвюх
Animacy=Inan|Case=Acc|NumType=Cardдвадзьве, дзве, две
Case=Dat|NumType=Cardдвум
Case=Gen|NumType=Cardдвух, дзвюхдзвюх, дзьвюх
Case=Ins|NumType=Cardдвумадзьвюма
Case=Loc|NumType=Cardдвухдзвюх, двух, дзьвюх
Case=Nomдвадзве
Case=Nom|NumType=Cardдвадзьве, дзвеДва

ADV

32 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Degree=Pos (32; 100%).

ADV tokens may have the following values of Gender:

Paradigm канчатковаFemNeut
Case=Genканчаткова
Case=Nomканчаткова

SYM

8 SYM tokens (0% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

CCONJ

1 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (13660; 69%), PROPN –[flat:name]–> PROPN (3630; 97%), NOUN –[det]–> DET (3150; 63%), NOUN –[conj]–> NOUN (2473; 51%), NOUN –[appos]–> PROPN (1650; 70%), VERB –[nsubj]–> PROPN (1029; 55%), PROPN –[conj]–> PROPN (731; 70%), VERB –[nsubj:pass]–> NOUN (477; 62%), ADJ –[conj]–> ADJ (440; 91%), PROPN –[amod]–> ADJ (412; 82%).