home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

126682 tokens (42%) have a non-empty value of Gender. 37280 types (72%) occur at least once with a non-empty value of Gender. 18191 lemmas (62%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: NOUN (71309; 23% instances), PROPN (18878; 6% instances), ADJ (17063; 6% instances), VERB (8178; 3% instances), PRON (5554; 2% instances), DET (4294; 1% instances), AUX (849; 0% instances), NUM (516; 0% instances), ADV (32; 0% instances), SYM (8; 0% instances), CCONJ (1; 0% instances).

NOUN

71309 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (59189; 83%), Number=Sing (50922; 71%).

NOUN tokens may have the following values of Gender:

Paradigm месцаMascFemNeut
Case=Acc|Number=Singмесца
Case=Acc|Number=Plurмесцымесцы
Case=Dat|Number=Singмесцу
Case=Dat|Number=Plurмесцам
Case=Gen|Number=Singмесца
Case=Gen|Number=Plurмесцаў, месц
Case=Ins|Number=Singмесцам
Case=Ins|Number=PlurМесцамі
Case=Loc|Number=Singмесцымесцы
Case=Loc|Number=Plurмесцах
Case=Nom|Number=Singмецамесца
Case=Nom|Number=Sing|Typo=Yesмесяца
Case=Nom|Number=Plurмесцы

Gender seems to be lexical feature of NOUN. 99% lemmas (8873) occur only with one value of Gender.

PROPN

18878 PROPN tokens (92% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (17746; 94%), Animacy=Anim (10698; 57%).

PROPN tokens may have the following values of Gender:

Paradigm КурапатыMascFemNeut
Animacy=Anim|Case=Nom|Number=SingКУРАПАТАЎ
Animacy=Inan|Case=Acc|Number=PlurКурапаты, КУРАПАТЫ
Animacy=Inan|Case=Gen|Number=PlurКУРАПАТАЎКурапатаў, Курапат, КУРАПАТАЎ
Animacy=Inan|Case=Ins|Number=PlurКурапатамі
Animacy=Inan|Case=Loc|Number=PlurКурапатах, КУРАПАТАХкурапатах
Animacy=Inan|Case=Nom|Number=PlurКурапаты, КУРАПАТЫ

Gender seems to be lexical feature of PROPN. 97% lemmas (3654) occur only with one value of Gender.

ADJ

17063 ADJ tokens (64% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (17061; 100%), Degree=Pos (16899; 99%), Animacy=EMPTY (15304; 90%).

ADJ tokens may have the following values of Gender:

Paradigm беларускіMascFemNeut
Animacy=Anim|Case=Accбеларускага
Animacy=Inan|Case=Accбеларускібеларускае
Case=Accбеларускую
Case=Datбеларускамубеларускайбеларускаму
Case=Genбеларускагабеларускай, беларускае, беларускаябеларускага
Case=Gen|Typo=Yesбеларускай
Case=Insбеларускімбеларускайбеларускім
Case=Locбеларускімбеларускайбеларускім
Case=Nomбеларускі, Беларускiбеларускаябеларускае

VERB

8178 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (8176; 100%), Number=Sing (8173; 100%), Tense=Past (8150; 100%), Mood=Ind (6746; 82%), VerbForm=Fin (6746; 82%), Aspect=Perf (6534; 80%), Voice=Act (5258; 64%).

VERB tokens may have the following values of Gender:

Paradigm магчыMascFemNeut
могмагламагло

PRON

5554 PRON tokens (54% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (5551; 100%), Person=EMPTY (4055; 73%), Case=Nom (2911; 52%).

PRON tokens may have the following values of Gender:

Paradigm якіMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|PronType=Relякогаякога
Animacy=Anim|Case=Ins|Number=Sing|PronType=Relякiм
Animacy=Inan|Case=Acc|Number=Sing|PronType=Relякі, якiякое
Animacy=Inan|Case=Dat|Number=Sing|PronType=Relякому
Case=Acc|Number=Singякую
Case=Acc|Number=Sing|PronType=Relякую
Case=Dat|Number=Sing|PronType=Relякомуякой
Case=Dat|Number=Plur|PronType=Relякім
Case=Gen|Number=Sing|PronType=Relякогаякойякога
Case=Ins|Number=Sing|PronType=Relякімякой, якоюЯкім
Case=Loc|Number=Sing|PronType=Relякімякойякім
Case=Nom|Number=Singякая
Case=Nom|Number=Sing|PronType=Relякі, якiякаяякое

DET

4294 DET tokens (64% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (4289; 100%), Reflex=EMPTY (3742; 87%), Animacy=EMPTY (3455; 80%), Poss=EMPTY (2748; 64%).

DET tokens may have the following values of Gender:

Paradigm нашMascFemNeut
Animacy=Anim|Case=Acc|Poss=Yes|PronType=Prsнашага
Animacy=Inan|Case=Accнам
Animacy=Inan|Case=Acc|Poss=Yes|PronType=Prsнашнашае, наша
Case=Acc|Poss=Yes|PronType=Prsнашу, нашую
Case=Dat|Poss=Yes|PronType=Prsнашамунашай
Case=Gen|Poss=Yes|PronType=Prsнашага, нашанашай, нашаенашага
Case=Ins|Poss=Yes|PronType=Prsнашымнашайнашым
Case=Loc|Poss=Yes|PronType=Prsнашымнашайнашым
Case=Nom|Poss=Yes|PronType=Prsнаш, НАШЫнаша, нашая, Нішанаша, нашае, Наше

AUX

849 AUX tokens (41% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (849; 100%), Person=EMPTY (849; 100%), Mood=Ind (846; 100%), Tense=Past (846; 100%), VerbForm=Fin (846; 100%), Voice=Act (846; 100%), Aspect=EMPTY (676; 80%).

AUX tokens may have the following values of Gender:

Paradigm быцьMascFemNeut
Animacy=Inan|Case=Accбуду
Aspect=Imp|Mood=Ind|Tense=Past|VerbForm=Fin|Voice=Actбыўбылабыло, была
Aspect=Perf|Mood=Ind|Tense=Past|VerbForm=Fin|Voice=ActБыў
Case=LocБУДЗЕ
Mood=Ind|Tense=Past|VerbForm=Fin|Voice=Actбыўбылабыло, была

NUM

516 NUM tokens (9% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (498; 97%).

NUM tokens may have the following values of Gender:

Paradigm дваMascFemNeut
Animacy=Anim|Case=Acc|NumType=Cardдвухдзьвюх
Animacy=Inan|Case=Acc|NumType=Cardдвадзьве, дзве, две
Case=Dat|NumType=Cardдвум
Case=Gen|NumType=Cardдвух, дзвюхдзвюх, дзьвюх
Case=Ins|NumType=Cardдвумадзьвюма
Case=Loc|NumType=Cardдвухдзвюх, двух, дзьвюх
Case=Nomдвадзве
Case=Nom|NumType=Cardдвадзьве, дзвеДва

ADV

32 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Degree=Pos (32; 100%).

ADV tokens may have the following values of Gender:

Paradigm канчатковаFemNeut
Case=Genканчаткова
Case=Nomканчаткова

SYM

8 SYM tokens (0% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

CCONJ

1 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Gender.

CCONJ tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (13644; 69%), PROPN –[flat:name]–> PROPN (3633; 97%), NOUN –[det]–> DET (3150; 63%), NOUN –[conj]–> NOUN (2471; 51%), NOUN –[appos]–> PROPN (1647; 70%), VERB –[nsubj]–> PROPN (1036; 55%), PROPN –[conj]–> PROPN (731; 70%), VERB –[nsubj:pass]–> NOUN (476; 62%), ADJ –[conj]–> ADJ (440; 91%), PROPN –[amod]–> ADJ (426; 82%).