
This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

3820 tokens (47%) have a non-empty value of Gender. 2163 types (71%) occur at least once with a non-empty value of Gender. 1521 lemmas (70%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (2087; 26% of instances), ADJ (615; 8% of instances), PROPN (582; 7% of instances), VERB (251; 3% of instances), PRON (151; 2% of instances), DET (81; 1% of instances), AUX (22; 0% of instances), NUM (22; 0% of instances), X (9; 0% of instances).
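Counts of this kind can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names `parse_feats`, `read_tokens` and `gender_counts_by_upos` are hypothetical, and the three-token sample is invented for illustration rather than taken from UD_Belarusian.

```python
from collections import Counter

# Invented three-token sample in CoNLL-U column order
# (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
ROWS = [
    ("1", "беларуская", "беларускі", "ADJ", "_",
     "Case=Nom|Gender=Fem|Number=Sing", "2", "amod", "_", "_"),
    ("2", "справа", "справа", "NOUN", "_",
     "Case=Nom|Gender=Fem|Number=Sing", "0", "root", "_", "_"),
    ("3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
]
SAMPLE = "# sent_id = demo\n" + "\n".join("\t".join(row) for row in ROWS)


def parse_feats(feats):
    """Turn 'Case=Nom|Gender=Fem' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for _form, _lemma, upos, feats in read_tokens(conllu_text):
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return with_gender, total


with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], "of", total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives a per-tag share comparable to the "100% of all NOUN tokens" figures in the sections below.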

NOUN

2087 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (1840; 88%), Number=Sing (1488; 71%).

NOUN tokens may have the following values of Gender:

Paradigm справа (columns: Masc, Fem)
Case=Acc|Number=Sing: справу
Case=Gen|Number=Sing: справы; справы
Case=Gen|Number=Plur: спраў
Case=Ins|Number=Sing: справай
Case=Loc|Number=Sing: справе
Case=Loc|Number=Plur: справах
Case=Nom|Number=Plur: справы

Gender seems to be a lexical feature of NOUN. 98% of lemmas (810) occur with only one value of Gender.
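The "lexical feature" judgment rests on how many lemmas are seen with a single Gender value. A rough sketch of that check follows. It assumes the share is taken over lemmas that occur at least once with Gender, which the page does not state explicitly. The function name `single_gender_share` and the sample pairs are hypothetical.

```python
from collections import defaultdict


def single_gender_share(lemma_gender_pairs):
    """Return (n_single, n_lemmas, share) over lemmas seen with Gender."""
    values = defaultdict(set)
    for lemma, gender in lemma_gender_pairs:
        values[lemma].add(gender)
    n_lemmas = len(values)
    n_single = sum(1 for genders in values.values() if len(genders) == 1)
    share = n_single / n_lemmas if n_lemmas else 0.0
    return n_single, n_lemmas, share


# Invented (lemma, Gender) observations for NOUN tokens.
pairs = [
    ("справа", "Fem"), ("справа", "Fem"), ("справа", "Masc"),
    ("горад", "Masc"), ("мова", "Fem"), ("поле", "Neut"),
]
n_single, n_lemmas, share = single_gender_share(pairs)
print(f"{n_single} of {n_lemmas} lemmas ({share:.0%}) have one Gender value")
```

A lemma such as справа, which appears in the paradigm above under both Masc and Fem, is what keeps the share below 100%.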

ADJ

615 ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (615; 100%), Degree=Pos (611; 99%), Animacy=EMPTY (574; 93%).

ADJ tokens may have the following values of Gender:

Paradigm беларускі (columns: Masc, Fem, Neut)
Animacy=Inan|Case=Acc: беларускі
Case=Acc: беларускую
Case=Dat: беларускаму; беларускай
Case=Gen: беларускага; беларускай; беларускага
Case=Ins: беларускім; беларускай; беларускім
Case=Loc: беларускім; беларускай; беларускім
Case=Nom: беларускі; беларуская; беларускае
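A paradigm table like the one above could be assembled by grouping the attested forms of one lemma by their Gender value and by their remaining features. The sketch below is illustrative only. `build_paradigm` is a hypothetical helper, the token list is a hand-made subset, and how the real page chooses which features label a row is not stated here.

```python
from collections import defaultdict


def build_paradigm(tokens, lemma, feature="Gender"):
    """Map a row label (the other features) to {feature value: sorted forms}."""
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        # Row label: every feature except the one the table is organised by.
        rest = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != feature)
        table[rest][feats[feature]].add(form)
    return {row: {val: sorted(forms) for val, forms in cols.items()}
            for row, cols in table.items()}


# Hand-made (form, lemma, features) triples for illustration.
tokens = [
    ("беларускі", "беларускі", {"Case": "Nom", "Gender": "Masc"}),
    ("беларуская", "беларускі", {"Case": "Nom", "Gender": "Fem"}),
    ("беларускае", "беларускі", {"Case": "Nom", "Gender": "Neut"}),
    ("беларускую", "беларускі", {"Case": "Acc", "Gender": "Fem"}),
]
paradigm = build_paradigm(tokens, "беларускі")
for row, cols in paradigm.items():
    print(row, cols)
```

Cells for which no form is attested are simply absent from the inner dictionary, which corresponds to the empty cells of the table.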

PROPN

582 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (573; 98%), Animacy=Inan (413; 71%).

PROPN tokens may have the following values of Gender:

Paradigm гродна (columns: Masc, Neut)
Case=Gen: Гродна
Case=Loc: Гродне

Gender seems to be a lexical feature of PROPN. 99% of lemmas (214) occur with only one value of Gender.

VERB

251 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (251; 100%), Person=EMPTY (251; 100%), Tense=Past (249; 99%), Mood=Ind (208; 83%), VerbForm=Fin (208; 83%), Aspect=Perf (198; 79%), Voice=Act (172; 69%).

VERB tokens may have the following values of Gender:

Paradigm паведаміць (columns: Masc, Fem)
Masc: паведаміў; Fem: паведаміла

PRON

151 PRON tokens (48% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (151; 100%), Person=EMPTY (121; 80%), Animacy=EMPTY (80; 53%).

PRON tokens may have the following values of Gender:

Paradigm які (columns: Masc, Fem, Neut)
Animacy=Inan|Case=Acc|PronType=Rel: які
Case=Acc: якую
Case=Gen|PronType=Rel: якога; якой
Case=Ins|PronType=Rel: якім
Case=Loc|PronType=Rel: якім; якой
Case=Nom|PronType=Rel: які; якая; якое

Gender seems to be a lexical feature of PRON. 92% of lemmas (11) occur with only one value of Gender.

DET

81 DET tokens (57% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (81; 100%), Animacy=EMPTY (69; 85%), Poss=EMPTY (68; 84%).

DET tokens may have the following values of Gender:

Paradigm гэты (columns: Masc, Fem, Neut)
Animacy=Inan|Case=Acc|PronType=Dem: гэты; гэта
Case=Acc|PronType=Dem: гэтую, гэтае
Case=Gen|PronType=Dem: гэтага; гэтай
Case=Ins|PronType=Dem: гэтым
Case=Loc: гэтым
Case=Loc|PronType=Dem: гэтым; гэтай
Case=Nom|PronType=Dem: гэты

AUX

22 AUX tokens (32% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (22; 100%), Mood=Ind (22; 100%), Number=Sing (22; 100%), Person=EMPTY (22; 100%), Tense=Past (22; 100%), VerbForm=Fin (22; 100%), Voice=Act (22; 100%).

AUX tokens may have the following values of Gender:

Paradigm быць (columns: Masc, Fem, Neut)
Masc: быў; Fem: была; Neut: было, была

NUM

22 NUM tokens (15% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (18; 82%).

NUM tokens may have the following values of Gender:

Paradigm два (columns: Masc, Fem)
Animacy=Anim|Case=Acc|NumType=Card: двух
Animacy=Inan|Case=Acc|NumType=Card: два
Case=Gen|NumType=Card: двух
Case=Ins|NumType=Card: двума
Case=Loc|NumType=Card: двух
Case=Nom: два; дзве

X

9 X tokens (20% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Animacy=Anim (9; 100%), Case=Gen (9; 100%), Number=Sing (9; 100%).

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (448; 62%), PROPN –[flat]–> PROPN (75; 94%), NOUN –[conj]–> NOUN (74; 56%), NOUN –[det]–> DET (61; 51%), PROPN –[conj]–> PROPN (57; 81%), VERB –[nsubj]–> PROPN (43; 51%), NOUN –[flat]–> PROPN (27; 63%), ADJ –[conj]–> ADJ (22; 92%), VERB –[conj]–> VERB (15; 60%), NOUN –[appos]–> PROPN (13; 81%).
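Agreement counts of this kind can be sketched by walking each dependency edge and comparing the Gender values of parent and child. The code below is an approximation under stated assumptions. The denominator used for the percentages above is not given on this page, so `total` simply counts every edge of a given type. The function names and the five-token sentence are invented for illustration.

```python
from collections import Counter


def feats_dict(feats):
    """Turn a FEATS string into a dict; '_' means no features."""
    return {} if feats == "_" else dict(p.split("=", 1) for p in feats.split("|"))


def gender_agreement(sentence_rows):
    """sentence_rows: list of (id, upos, feats, head, deprel) for one sentence.

    Returns two Counters keyed by (parent_upos, deprel, child_upos):
    edges whose ends share a Gender value, and all edges of that type.
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, total = Counter(), Counter()
    for _tok_id, upos, feats, head, deprel in sentence_rows:
        if head == 0:
            continue  # the root has no parent token
        _pid, parent_upos, parent_feats, _h, _d = by_id[head]
        key = (parent_upos, deprel, upos)
        total[key] += 1
        child_gender = feats_dict(feats).get("Gender")
        parent_gender = feats_dict(parent_feats).get("Gender")
        if child_gender is not None and child_gender == parent_gender:
            agree[key] += 1
    return agree, total


# Invented sentence: one agreeing det, one agreeing amod, one amod without Gender.
rows = [
    (1, "DET", "Case=Nom|Gender=Fem|Number=Sing", 3, "det"),
    (2, "ADJ", "Case=Nom|Gender=Fem|Number=Sing", 3, "amod"),
    (3, "NOUN", "Case=Nom|Gender=Fem|Number=Sing", 0, "root"),
    (4, "ADJ", "Case=Gen|Number=Plur", 5, "amod"),
    (5, "NOUN", "Case=Gen|Gender=Masc|Number=Plur", 3, "nmod"),
]
agree, total = gender_agreement(rows)
for key in sorted(total):
    print(key, agree[key], "of", total[key])
```

Plural adjectives carry no Gender in this treebank's ADJ statistics (every gendered ADJ token is Number=Sing), which is one plausible reason the amod agreement rate stays well below 100%.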