Treebank Statistics: UD_Serbian-SET: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
50382 tokens (52%) have a non-empty value of Gender.
16568 types (90%) occur at least once with a non-empty value of Gender.
8063 lemmas (84%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (23810; 24% instances), ADJ (10967; 11% instances), PROPN (7408; 8% instances), DET (3490; 4% instances), VERB (3352; 3% instances), PRON (749; 1% instances), AUX (304; 0% instances), NUM (302; 0% instances).
NOUN
23810 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (17402; 73%).
NOUN tokens may have the following values of Gender:
Fem(9516; 40% of non-emptyGender): godine, zemlje, godina, vlada, stranke, zemalja, vlade, zemlja, vlasti, nedeljeMasc(11136; 47% of non-emptyGender): evra, predsednik, ministar, poslova, ljudi, miliona, ponedeljak, premijer, dana, utorakNeut(3158; 13% of non-emptyGender): prava, vreme, pitanja, članstvo, pitanje, mesto, nasilje, saopštenju, pitanju, mestaEMPTY(7): km, br., cm, m
| Paradigm delo | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Plur | dela | ||
| Case=Gen|Number=Plur | dela | dela | |
| Case=Loc|Number=Plur | delima | ||
| Case=Nom|Number=Sing | dela | delo | |
| Case=Nom|Number=Plur | dela |
Gender seems to be lexical feature of NOUN. 99% lemmas (3201) occur only with one value of Gender.
ADJ
10967 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (10538; 96%), Definite=Def (9885; 90%), Number=Sing (7441; 68%).
ADJ tokens may have the following values of Gender:
Fem(4604; 42% of non-emptyGender): prošle, srpske, Crne, evropske, političke, druge, demokratske, nove, Crna, jugoistočneMasc(5074; 46% of non-emptyGender): novi, drugi, inostranih, bivši, prvi, glavni, mnogi, novog, veliki, unutrašnjihNeut(1289; 12% of non-emptyGender): potrebno, drugo, moguće, ljudskih, sve, ljudska, održano, radnih, važno, CrnogEMPTY(694): 2007., 2004., 21., 1., 9., 12., 2008., 28., 17., 14.
| Paradigm nov | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim|Case=Acc|Definite=Def|Degree=Pos|Number=Sing | novog | ||
| Animacy=Inan|Case=Acc|Definite=Def|Degree=Pos|Number=Sing | novi | ||
| Animacy=Inan|Case=Acc|Definite=Ind|Degree=Pos|Number=Sing | nov | ||
| Case=Acc|Definite=Def|Degree=Pos|Number=Sing | novu | novo | |
| Case=Acc|Definite=Def|Degree=Pos|Number=Plur | nove | nove | nova |
| Case=Acc|Definite=Def|Degree=Cmp|Number=Sing | novije | ||
| Case=Acc|Definite=Def|Degree=Sup|Number=Sing | najnoviju | ||
| Case=Acc|Definite=Def|Degree=Sup|Number=Plur | najnovije | najnovije | |
| Case=Dat|Definite=Def|Degree=Pos|Number=Sing | novom | Novoj | |
| Case=Gen|Definite=Def|Degree=Pos|Number=Sing | novog | nove | novog |
| Case=Gen|Definite=Def|Degree=Pos|Number=Plur | novih | novih | novih |
| Case=Gen|Definite=Def|Degree=Sup|Number=Sing | najnovije | ||
| Case=Gen|Definite=Def|Degree=Sup|Number=Plur | najnovijih | najnovijih | |
| Case=Ins|Definite=Def|Degree=Pos|Number=Sing | novim | novim | |
| Case=Ins|Definite=Def|Degree=Pos|Number=Plur | novim | Novim | |
| Case=Loc|Definite=Def|Degree=Pos|Number=Sing | novom | novoj | Novom |
| Case=Loc|Definite=Def|Degree=Pos|Number=Plur | novim | novim | |
| Case=Loc|Definite=Def|Degree=Cmp|Number=Sing | novijoj | ||
| Case=Loc|Definite=Def|Degree=Sup|Number=Sing | najnovijem | ||
| Case=Loc|Definite=Def|Degree=Sup|Number=Plur | najnovijim | ||
| Case=Nom|Definite=Def|Degree=Pos|Number=Sing | novi | nova | novo |
| Case=Nom|Definite=Def|Degree=Pos|Number=Plur | novi | nove | nova |
| Case=Nom|Definite=Def|Degree=Sup|Number=Sing | najnoviji | najnovija | |
| Case=Nom|Definite=Def|Degree=Sup|Number=Plur | najnoviji | ||
| Case=Nom|Definite=Ind|Degree=Pos|Number=Sing | nov |
PROPN
7408 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7207; 97%), Case=Nom (4147; 56%).
PROPN tokens may have the following values of Gender:
Fem(2188; 30% of non-emptyGender): Srbije, Srbija, Srbiji, Makedonija, Turska, Turske, Makedoniji, Bugarska, Evrope, HrvatskaMasc(4842; 65% of non-emptyGender): EU, BiH, UN, Beogradu, NATO, UN-a, SETimes, NATO-u, Balkanu, EBRDNeut(378; 5% of non-emptyGender): Kosova, Kosovo, Kosovu, Skoplju, Sarajevu, Belene, Kosovom, Skoplja, Skoplje, VetvendosjeEMPTY(4): V., Dž.
| Paradigm INA | Masc | Fem | Neut |
|---|---|---|---|
| Case=Gen | INA-e, INE | ||
| Case=Nom | INA | INA | INA |
Gender seems to be lexical feature of PROPN. 98% lemmas (2087) occur only with one value of Gender.
DET
3490 DET tokens (96% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (3119; 89%), Person=EMPTY (3118; 89%), Poss=EMPTY (2728; 78%), Number=Sing (2383; 68%).
DET tokens may have the following values of Gender:
Fem(1240; 36% of non-emptyGender): koja, koje, ove, koju, svoje, svoju, te, kojoj, ta, sveMasc(1490; 43% of non-emptyGender): koji, taj, svoj, neki, svog, ovog, tog, koje, svoje, kojimNeut(760; 22% of non-emptyGender): to, toga, tome, koja, koje, ovo, sve, svoje, tom, timEMPTY(149): nekoliko, više, puno, koliko, mnogo, toliko, malo, manje, odsto, Oko
| Paradigm koji | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim|Case=Acc|Number=Sing | koji, kojeg | ||
| Animacy=Inan|Case=Acc|Number=Sing | koji | ||
| Case=Acc|Number=Sing | koju | koje | |
| Case=Acc|Number=Plur | koje | koje | koja |
| Case=Dat|Number=Sing | kojem | kojoj | |
| Case=Dat|Number=Plur | kojima, koji | kojima | |
| Case=Gen|Number=Sing | kojeg, kog | koje | kojeg |
| Case=Gen|Number=Plur | kojih | kojih | kojih |
| Case=Ins|Number=Sing | kojim | kojom | kojim |
| Case=Ins|Number=Plur | kojima | kojima | kojima |
| Case=Loc|Number=Sing | kojem, kom, kome | kojoj | kojem, kome, kom |
| Case=Loc|Number=Plur | kojima | kojima | kojima |
| Case=Nom|Number=Sing | koji | koja | koje |
| Case=Nom|Number=Plur | koji | koje | koja, koje |
VERB
3352 VERB tokens (40% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3352; 100%), Person=EMPTY (3352; 100%), Tense=Past (3352; 100%), VerbForm=Part (3352; 100%), Voice=Act (3352; 100%), Number=Sing (2577; 77%).
VERB tokens may have the following values of Gender:
Fem(1010; 30% of non-emptyGender): rekla, mogla, saopštila, dobila, postala, imala, osvojila, povećala, objavila, potpisalaMasc(1986; 59% of non-emptyGender): rekao, izjavio, dodao, sastao, pozvao, ukazao, izrazio, dobio, mogao, postaoNeut(356; 11% of non-emptyGender): trebalo, moglo, došlo, pokazalo, omogućilo, postalo, dobilo, okupilo, prisustvovalo, dogodiloEMPTY(5061): kaže, ima, može, treba, mora, mogu, navodi, postoji, kažu, očekuje
| Paradigm reći | Masc | Fem | Neut |
|---|---|---|---|
| Number=Sing | rekao | rekla | reklo |
| Number=Plur | rekli | rekle |
PRON
749 PRON tokens (31% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (749; 100%), Case=Nom (518; 69%), Person=3 (480; 64%), PronType=Prs (480; 64%), Number=Sing (416; 56%).
PRON tokens may have the following values of Gender:
Fem(98; 13% of non-emptyGender): ona, je, joj, one, nje, ju, njoj, njom, njuMasc(423; 56% of non-emptyGender): on, oni, ga, njega, ko, mu, niko, neko, njemu, kogaNeut(228; 30% of non-emptyGender): što, šta, ništa, ono, nešto, čime, čega, čemu, ona, komeEMPTY(1654): se, mi, ih, im, njih, nam, nas, njima, sebe, ja
| Paradigm on | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | ga, njega | je, ju, nju | ono |
| Case=Dat|Number=Sing | mu, njemu | joj | |
| Case=Gen|Number=Sing | njega | nje | |
| Case=Ins|Number=Sing | njim, Njime | njom | |
| Case=Loc|Number=Sing | njemu | njoj | |
| Case=Nom|Number=Sing | on | ona | ono |
| Case=Nom|Number=Plur | ona |
AUX
304 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (304; 100%), Person=EMPTY (304; 100%), Tense=Past (304; 100%), VerbForm=Part (304; 100%), Number=Sing (247; 81%).
AUX tokens may have the following values of Gender:
Fem(79; 26% of non-emptyGender): bila, bileMasc(146; 48% of non-emptyGender): bio, biliNeut(79; 26% of non-emptyGender): bilo, bilaEMPTY(5899): je, su, će, bi, nije, biti, bude, smo, nisu, neće
| Paradigm biti | Masc | Fem | Neut |
|---|---|---|---|
| Number=Sing | bio | bila | bilo |
| Number=Plur | bili | bile | bila |
NUM
302 NUM tokens (24% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (277; 92%), Number=Sing (206; 68%), Case=Nom (173; 57%).
NUM tokens may have the following values of Gender:
Fem(153; 51% of non-emptyGender): dve, jedna, jedne, obe, jednoj, jednu, dveju, jednom, obejuMasc(136; 45% of non-emptyGender): jedan, jednog, jednom, oba, jednim, dva, jedni, nijedanNeut(13; 4% of non-emptyGender): jedno, dva, jednomEMPTY(976): tri, dva, pet, četiri, 20, deset, šest, 50, 10, sedam
| Paradigm jedan | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim|Case=Acc|Number=Sing | jednog | ||
| Animacy=Inan|Case=Acc|Number=Sing | jedan | ||
| Case=Acc|Number=Sing | jednu | jedno | |
| Case=Dat|Number=Sing | jednom | jednoj | |
| Case=Gen|Number=Sing | jednog | jedne | |
| Case=Ins|Number=Sing | jednim | jednom | |
| Case=Loc|Number=Sing | jednom | jednoj | jednom |
| Case=Nom|Number=Sing | jedan | jedna | jedno |
| Case=Nom|Number=Plur | jedni |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (8276; 93%),
NOUN –[det]–> DET (1659; 98%),
PROPN –[flat]–> PROPN (1313; 99%),
NOUN –[flat]–> PROPN (679; 82%),
ADJ –[nsubj]–> NOUN (672; 91%),
VERB –[nsubj]–> PROPN (655; 56%),
NOUN –[acl]–> ADJ (378; 83%),
PROPN –[conj]–> PROPN (338; 72%),
ADJ –[conj]–> ADJ (297; 87%),
VERB –[nsubj]–> PRON (240; 58%).