Gender: gender
Gender is usually a lexical feature of nouns and inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. In Bulgarian gender is grammatical.
There are three genders: masculine(m), feminine (f) and neuter (n).
Masc: masculine gender
Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.
Example: [bg] замък / zamak “castle”
Fem: feminine gender
Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.
Example: [bg] маса / masa “table”
Neut: neuter gender
Neither masculine nor feminine (grammatically).
Example: [bg] дете / dete “child”
Treebank Statistics (UD_Bulgarian)
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
58268 tokens (37%) have a non-empty value of Gender.
19359 types (73%) occur at least once with a non-empty value of Gender.
11215 lemmas (75%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: bg-pos/NOUN (33290; 21% instances), bg-pos/ADJ (9449; 6% instances), bg-pos/PROPN (8146; 5% instances), bg-pos/PRON (3210; 2% instances), bg-pos/VERB (1909; 1% instances), bg-pos/DET (1692; 1% instances), bg-pos/NUM (494; 0% instances), bg-pos/AUX (78; 0% instances).
NOUN
33290 bg-pos/NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (23776; 71%), Definite=Ind (20478; 62%).
NOUN tokens may have the following values of Gender:
Fem(11987; 36% of non-emptyGender): г., година, години, част, страната, страна, политика, пари, работа, страниMasc(14028; 42% of non-emptyGender): %, лв., млн., път, края, съвет, човек, президентът, министър, денNeut(7275; 22% of non-emptyGender): време, събрание, решение, място, правителството, право, началото, времето, решението, делотоEMPTY(859): хората, хора, души, президентът, преговори, преговорите, глава, стратегии, властта, собственост
| Paradigm глава | Masc | Fem |
|---|---|---|
| Definite=Def|Number=Sing | главата | |
| Definite=Def|Number=Plur | главите | |
| Definite=Ind|Number=Sing | глава | глава |
| Definite=Ind|Number=Plur | глави |
Gender seems to be lexical feature of NOUN. 100% lemmas (5476) occur only with one value of Gender.
ADJ
9449 bg-pos/ADJ tokens (70% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9449; 100%), VerbForm=EMPTY (8610; 91%), Aspect=EMPTY (8610; 91%), Voice=EMPTY (8610; 91%), Degree=Pos (7855; 83%), Definite=Ind (5206; 55%).
ADJ tokens may have the following values of Gender:
Fem(3819; 40% of non-emptyGender): българската, нова, 2001, цялата, 2000, европейската, голяма, миналата, националната, новатаMasc(3631; 38% of non-emptyGender): друг, новия, европейския, нов, 1, новият, българския, втори, първи, бившияNeut(1999; 21% of non-emptyGender): народното, същото, друго, цялото, народно, българското, ново, новото, голямо, първотоEMPTY(4140): други, другите, последните, нови, новите, първите, различни, българските, големи, въоръжените
| Paradigm нов | Masc | Fem | Neut |
|---|---|---|---|
| Case=Voc|Degree=Pos | Нови | ||
| Definite=Def|Degree=Pos | новия, новият | новата | новото |
| Definite=Def|Degree=Sup | най-новият | най-новата | Най-новото |
| Definite=Ind|Degree=Pos | нов | нова | ново |
PROPN
8146 bg-pos/PROPN tokens (97% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (8018; 98%), Definite=Ind (7868; 97%).
PROPN tokens may have the following values of Gender:
Fem(2669; 33% of non-emptyGender): България, София, Европа, Турция, Югославия, Русия, БСП, Франция, Германия, РумънияMasc(5113; 63% of non-emptyGender): Иван, ЕС, СДС, Костов, Петър, Георги, Стоянов, САЩ, Димитър, ЙорданNeut(364; 4% of non-emptyGender): МВР, Косово, ДПС, Русе, БНР, НС, Панчарево, РМД, Търново, АудиEMPTY(282): Стоянов, Петър, България, Македония, де, Р-300, ван, -, 2000, Велико
| Paradigm а | Masc | Fem | Neut |
|---|---|---|---|
| А | А | а |
Gender seems to be lexical feature of PROPN. 99% lemmas (2820) occur only with one value of Gender.
PRON
3210 bg-pos/PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (3210; 100%), Number=Sing (3210; 100%), Reflex=EMPTY (3210; 100%), Case=Nom (2206; 69%), PronType=Prs (1765; 55%), Person=3 (1765; 55%).
PRON tokens may have the following values of Gender:
Fem(680; 21% of non-emptyGender): тя, която, я, нея, й, коя, Едната, ТазиMasc(1525; 48% of non-emptyGender): той, го, който, му, него, кой, никой, някой, кого, всекиNeut(1005; 31% of non-emptyGender): това, което, го, то, всичко, нищо, нещо, него, кое, всичкотоEMPTY(6884): се, си, които, му, ни, те, им, ми, аз, ти
| Paradigm аз | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | го, него | я, нея | го, него |
| Case=Dat | му, нему | й | му |
| Case=Nom | той | тя | то |
| й |
VERB
1909 bg-pos/VERB tokens (10% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Definite=Ind (1909; 100%), Number=Sing (1909; 100%), Person=EMPTY (1909; 100%), VerbForm=Fin (1808; 95%), Mood=EMPTY (1808; 95%), Aspect=Perf (1366; 72%), Voice=Act (1206; 63%).
VERB tokens may have the following values of Gender:
Fem(522; 27% of non-emptyGender): била, могла, можела, получила, представена, приета, започнала, избрана, поставена, щялаMasc(1014; 53% of non-emptyGender): бил, направил, дал, заминал, искал, казал, дошъл, избран, станал, ималNeut(373; 20% of non-emptyGender): имало, било, трябвало, направено, станало, нямало, налагало, могло, прието, свързаноEMPTY(17643): е, са, има, няма, може, трябва, беше, каза, могат, съобщи
| Paradigm съм | Masc | Fem | Neut |
|---|---|---|---|
| бил | била | било |
DET
1692 bg-pos/DET tokens (70% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1692; 100%), Person=EMPTY (1444; 85%), Poss=EMPTY (1366; 81%), Definite=EMPTY (1105; 65%).
DET tokens may have the following values of Gender:
Fem(584; 35% of non-emptyGender): тази, една, всяка, каква, нашата, такава, тая, неговата, своята, някакваMasc(610; 36% of non-emptyGender): този, един, всеки, такъв, своя, какъв, тоя, никакъв, някой, някакъвNeut(498; 29% of non-emptyGender): това, какво, едно, всяко, такова, своето, негово, някакво, тяхното, нашетоEMPTY(741): тези, всички, нашите, някои, какви, своите, такива, техните, наши, тия
| Paradigm този | Masc | Fem | Neut |
|---|---|---|---|
| този, тоя, оня, онзи | тази, тая, онази, тeзи | това, онова, туй |
NUM
494 bg-pos/NUM tokens (23% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (494; 100%), Definite=Ind (433; 88%), Number=Plur (266; 54%).
NUM tokens may have the following values of Gender:
Fem(194; 39% of non-emptyGender): две, една, двете, 2, 1, 0, 22, 52, 42, 0.00Masc(204; 41% of non-emptyGender): един, два, 2, двата, 1, 22, Единият, 32, 4162, 62Neut(96; 19% of non-emptyGender): едно, 1, две, двете, едното, 42EMPTY(1610): 3, три, 10, двамата, 20, 000, 15, 4, 5, 6
| Paradigm два | Masc | Fem | Neut |
|---|---|---|---|
| Definite=Def | двата | двете | двете |
| Definite=Ind | два, 2 | две, 2 | две |
AUX
78 bg-pos/AUX tokens (4% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Tense=Past (78; 100%), Aspect=Imp (78; 100%), Number=Sing (78; 100%), Voice=Act (78; 100%), VerbForm=Part (78; 100%), Person=EMPTY (78; 100%), Mood=EMPTY (78; 100%).
AUX tokens may have the following values of Gender:
Fem(20; 26% of non-emptyGender): билаMasc(46; 59% of non-emptyGender): билNeut(12; 15% of non-emptyGender): билоEMPTY(1953): е, са, бе, бъде, бъдат, бяха, беше, съм, би, сме
| Paradigm съм | Masc | Fem | Neut |
|---|---|---|---|
| бил | била | било |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (7993; 71%),
NOUN –[nmod]–> PROPN (1775; 56%),
NOUN –[det]–> DET (1360; 70%),
PROPN –[name]–> PROPN (1160; 93%),
PROPN –[nmod]–> PROPN (528; 85%),
PROPN –[conj]–> PROPN (405; 71%),
VERB –[nsubjpass]–> NOUN (365; 54%),
PROPN –[amod]–> ADJ (256; 82%),
ADJ –[conj]–> ADJ (244; 98%),
PROPN –[nmod]–> NOUN (225; 66%).
Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]