Treebank Statistics: UD_Serbian-SET: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
50383 tokens (52%) have a non-empty value of Gender.
16569 types (90%) occur at least once with a non-empty value of Gender.
8064 lemmas (84%) occur at least once with a non-empty value of Gender.
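Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is made up, the helper names (`parse_feats`, `read_tokens`, `gender_coverage`) are hypothetical, and it assumes that types are compared case-sensitively. Because `Gender[psor]` is stored under its own key, testing for `Gender` only matches the base layer.

```python
from collections import Counter

# Tiny made-up CoNLL-U fragment (not taken from UD_Serbian-SET).
SAMPLE = """\
1\tNova\tnov\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_
2\tvlada\tvlada\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\tkaže\tkazati\tVERB\t_\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_
"""

def parse_feats(feats):
    """Turn 'A=x|B=y' into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def read_tokens(text):
    """Yield (form, lemma, upos, feats) for regular token lines."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (1-2) and empty nodes (1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def gender_coverage(text):
    """Return (with Gender, total) pairs for tokens, types and lemmas."""
    tokens = list(read_tokens(text))
    with_gender = [t for t in tokens if "Gender" in t[3]]
    return {
        "tokens": (len(with_gender), len(tokens)),
        "types": (len({t[0] for t in with_gender}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in with_gender}), len({t[1] for t in tokens})),
        "by_upos": Counter(t[2] for t in with_gender),
    }

stats = gender_coverage(SAMPLE)
print(stats["tokens"])  # (2, 3)
```

Passing the contents of a real treebank file instead of `SAMPLE` should give figures comparable to the ones above, provided the assumptions about types and token lines match those of the original tool.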
The feature is used with 8 part-of-speech tags: NOUN (23811; 24% instances), ADJ (10967; 11% instances), PROPN (7408; 8% instances), DET (3490; 4% instances), VERB (3352; 3% instances), PRON (749; 1% instances), AUX (304; 0% instances), NUM (302; 0% instances).
NOUN
23811 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (17402; 73%).
NOUN tokens may have the following values of Gender:
Fem (9516; 40% of non-empty Gender): godine, zemlje, godina, vlada, stranke, zemalja, vlade, zemlja, vlasti, nedelje
Masc (11137; 47% of non-empty Gender): evra, predsednik, ministar, poslova, ljudi, miliona, ponedeljak, premijer, dana, utorak
Neut (3158; 13% of non-empty Gender): prava, vreme, pitanja, članstvo, pitanje, mesto, nasilje, saopštenju, pitanju, mesta
EMPTY (7): km, br., cm, m
Paradigm delo | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Plur | dela | ||
Case=Gen|Number=Plur | dela | dela | |
Case=Loc|Number=Plur | delima | ||
Case=Nom|Number=Sing | dela | delo | |
Case=Nom|Number=Plur | dela |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (3202) occur with only one value of Gender.
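The "lexical feature" test above amounts to asking what share of lemmas is ever seen with more than one Gender value. A minimal sketch of that check, using made-up (lemma, Gender) observations and a hypothetical helper `single_value_share`; the threshold the original tool uses to call a feature lexical is not stated on this page:

```python
from collections import defaultdict

# Hypothetical (lemma, Gender) observations for one part of speech.
OBSERVATIONS = [
    ("vlada", "Fem"), ("vlada", "Fem"),
    ("predsednik", "Masc"),
    ("delo", "Neut"), ("delo", "Masc"),
    ("pitanje", "Neut"),
]

def single_value_share(observations):
    """Return (number of lemmas with exactly one value, their share of all lemmas)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, single / len(values)

count, share = single_value_share(OBSERVATIONS)
print(count, f"{share:.0%}")  # 3 75%
```

In the sample, only `delo` occurs with two values, so three of the four lemmas count as having a single Gender.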
ADJ
10967 ADJ tokens (94% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (10538; 96%), Definite=Def (9885; 90%), Number=Sing (7441; 68%).
ADJ tokens may have the following values of Gender:
Fem (4604; 42% of non-empty Gender): prošle, srpske, Crne, evropske, političke, druge, demokratske, nove, Crna, jugoistočne
Masc (5074; 46% of non-empty Gender): novi, drugi, inostranih, bivši, prvi, glavni, mnogi, novog, veliki, unutrašnjih
Neut (1289; 12% of non-empty Gender): potrebno, drugo, moguće, ljudskih, sve, ljudska, održano, radnih, važno, Crnog
EMPTY (694): 2007., 2004., 21., 1., 9., 12., 2008., 28., 17., 14.
Paradigm nov | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Definite=Def|Degree=Pos|Number=Sing | novog | ||
Animacy=Inan|Case=Acc|Definite=Def|Degree=Pos|Number=Sing | novi | ||
Animacy=Inan|Case=Acc|Definite=Ind|Degree=Pos|Number=Sing | nov | ||
Case=Acc|Definite=Def|Degree=Pos|Number=Sing | novu | novo | |
Case=Acc|Definite=Def|Degree=Pos|Number=Plur | nove | nove | nova |
Case=Acc|Definite=Def|Degree=Cmp|Number=Sing | novije | ||
Case=Acc|Definite=Def|Degree=Sup|Number=Sing | najnoviju | ||
Case=Acc|Definite=Def|Degree=Sup|Number=Plur | najnovije | najnovije | |
Case=Dat|Definite=Def|Degree=Pos|Number=Sing | novom | Novoj | |
Case=Gen|Definite=Def|Degree=Pos|Number=Sing | novog | nove | novog |
Case=Gen|Definite=Def|Degree=Pos|Number=Plur | novih | novih | novih |
Case=Gen|Definite=Def|Degree=Sup|Number=Sing | najnovije | ||
Case=Gen|Definite=Def|Degree=Sup|Number=Plur | najnovijih | najnovijih | |
Case=Ins|Definite=Def|Degree=Pos|Number=Sing | novim | novim | |
Case=Ins|Definite=Def|Degree=Pos|Number=Plur | novim | Novim | |
Case=Loc|Definite=Def|Degree=Pos|Number=Sing | novom | novoj | Novom |
Case=Loc|Definite=Def|Degree=Pos|Number=Plur | novim | novim | |
Case=Loc|Definite=Def|Degree=Cmp|Number=Sing | novijoj | ||
Case=Loc|Definite=Def|Degree=Sup|Number=Sing | najnovijem | ||
Case=Loc|Definite=Def|Degree=Sup|Number=Plur | najnovijim | ||
Case=Nom|Definite=Def|Degree=Pos|Number=Sing | novi | nova | novo |
Case=Nom|Definite=Def|Degree=Pos|Number=Plur | novi | nove | nova |
Case=Nom|Definite=Def|Degree=Sup|Number=Sing | najnoviji | najnovija | |
Case=Nom|Definite=Def|Degree=Sup|Number=Plur | najnoviji | ||
Case=Nom|Definite=Ind|Degree=Pos|Number=Sing | nov |
PROPN
7408 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7207; 97%), Case=Nom (4147; 56%).
PROPN tokens may have the following values of Gender:
Fem (2188; 30% of non-empty Gender): Srbije, Srbija, Srbiji, Makedonija, Turska, Turske, Makedoniji, Bugarska, Evrope, Hrvatska
Masc (4842; 65% of non-empty Gender): EU, BiH, UN, Beogradu, NATO, UN-a, SETimes, NATO-u, Balkanu, EBRD
Neut (378; 5% of non-empty Gender): Kosova, Kosovo, Kosovu, Skoplju, Sarajevu, Belene, Kosovom, Skoplja, Skoplje, Vetvendosje
EMPTY (4): V., Dž.
Paradigm INA | Masc | Fem | Neut |
---|---|---|---|
Case=Gen | INA-e, INE | ||
Case=Nom | INA | INA | INA |
Gender seems to be a lexical feature of PROPN: 98% of lemmas (2087) occur with only one value of Gender.
DET
3490 DET tokens (96% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (3119; 89%), Person=EMPTY (3118; 89%), Poss=EMPTY (2728; 78%), Number=Sing (2383; 68%).
DET tokens may have the following values of Gender:
Fem (1240; 36% of non-empty Gender): koja, koje, ove, koju, svoje, svoju, te, kojoj, ta, sve
Masc (1490; 43% of non-empty Gender): koji, taj, svoj, neki, svog, ovog, tog, koje, svoje, kojim
Neut (760; 22% of non-empty Gender): to, toga, tome, koja, koje, ovo, sve, svoje, tom, tim
EMPTY (149): nekoliko, više, puno, koliko, mnogo, toliko, malo, manje, odsto, Oko
Paradigm koji | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Number=Sing | koji, kojeg | ||
Animacy=Inan|Case=Acc|Number=Sing | koji | ||
Case=Acc|Number=Sing | koju | koje | |
Case=Acc|Number=Plur | koje | koje | koja |
Case=Dat|Number=Sing | kojem | kojoj | |
Case=Dat|Number=Plur | kojima, koji | kojima | |
Case=Gen|Number=Sing | kojeg, kog | koje | kojeg |
Case=Gen|Number=Plur | kojih | kojih | kojih |
Case=Ins|Number=Sing | kojim | kojom | kojim |
Case=Ins|Number=Plur | kojima | kojima | kojima |
Case=Loc|Number=Sing | kojem, kom, kome | kojoj | kojem, kome, kom |
Case=Loc|Number=Plur | kojima | kojima | kojima |
Case=Nom|Number=Sing | koji | koja | koje |
Case=Nom|Number=Plur | koji | koje | koja, koje |
VERB
3352 VERB tokens (40% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3352; 100%), Person=EMPTY (3352; 100%), Tense=Past (3352; 100%), VerbForm=Part (3352; 100%), Voice=Act (3352; 100%), Number=Sing (2577; 77%).
VERB tokens may have the following values of Gender:
Fem (1010; 30% of non-empty Gender): rekla, mogla, saopštila, dobila, postala, imala, osvojila, povećala, objavila, potpisala
Masc (1986; 59% of non-empty Gender): rekao, izjavio, dodao, sastao, pozvao, ukazao, izrazio, dobio, mogao, postao
Neut (356; 11% of non-empty Gender): trebalo, moglo, došlo, pokazalo, omogućilo, postalo, dobilo, okupilo, prisustvovalo, dogodilo
EMPTY (5061): kaže, ima, može, treba, mora, mogu, navodi, postoji, kažu, očekuje
Paradigm reći | Masc | Fem | Neut |
---|---|---|---|
Number=Sing | rekao | rekla | reklo |
Number=Plur | rekli | rekle |
PRON
749 PRON tokens (31% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (749; 100%), Case=Nom (518; 69%), Person=3 (480; 64%), PronType=Prs (480; 64%), Number=Sing (416; 56%).
PRON tokens may have the following values of Gender:
Fem (98; 13% of non-empty Gender): ona, je, joj, one, nje, ju, njoj, njom, nju
Masc (423; 56% of non-empty Gender): on, oni, ga, njega, ko, mu, niko, neko, njemu, koga
Neut (228; 30% of non-empty Gender): što, šta, ništa, ono, nešto, čime, čega, čemu, ona, kome
EMPTY (1654): se, mi, ih, im, njih, nam, nas, njima, sebe, ja
Paradigm on | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | ga, njega | je, ju, nju | ono |
Case=Dat|Number=Sing | mu, njemu | joj | |
Case=Gen|Number=Sing | njega | nje | |
Case=Ins|Number=Sing | njim, Njime | njom | |
Case=Loc|Number=Sing | njemu | njoj | |
Case=Nom|Number=Sing | on | ona | ono |
Case=Nom|Number=Plur | ona |
AUX
304 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (304; 100%), Person=EMPTY (304; 100%), Tense=Past (304; 100%), VerbForm=Part (304; 100%), Number=Sing (247; 81%).
AUX tokens may have the following values of Gender:
Fem (79; 26% of non-empty Gender): bila, bile
Masc (146; 48% of non-empty Gender): bio, bili
Neut (79; 26% of non-empty Gender): bilo, bila
EMPTY (5899): je, su, će, bi, nije, biti, bude, smo, nisu, neće
Paradigm biti | Masc | Fem | Neut |
---|---|---|---|
Number=Sing | bio | bila | bilo |
Number=Plur | bili | bile | bila |
NUM
302 NUM tokens (24% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (277; 92%), Number=Sing (206; 68%), Case=Nom (173; 57%).
NUM tokens may have the following values of Gender:
Fem (153; 51% of non-empty Gender): dve, jedna, jedne, obe, jednoj, jednu, dveju, jednom, obeju
Masc (136; 45% of non-empty Gender): jedan, jednog, jednom, oba, jednim, dva, jedni, nijedan
Neut (13; 4% of non-empty Gender): jedno, dva, jednom
EMPTY (976): tri, dva, pet, četiri, 20, deset, šest, 50, 10, sedam
Paradigm jedan | Masc | Fem | Neut |
---|---|---|---|
Animacy=Anim|Case=Acc|Number=Sing | jednog | ||
Animacy=Inan|Case=Acc|Number=Sing | jedan | ||
Case=Acc|Number=Sing | jednu | jedno | |
Case=Dat|Number=Sing | jednom | jednoj | |
Case=Gen|Number=Sing | jednog | jedne | |
Case=Ins|Number=Sing | jednim | jednom | |
Case=Loc|Number=Sing | jednom | jednoj | jednom |
Case=Nom|Number=Sing | jedan | jedna | jedno |
Case=Nom|Number=Plur | jedni |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (8276; 93%),
NOUN –[det]–> DET (1659; 98%),
PROPN –[flat]–> PROPN (1313; 99%),
NOUN –[flat]–> PROPN (679; 82%),
ADJ –[nsubj]–> NOUN (672; 91%),
VERB –[nsubj]–> PROPN (655; 56%),
NOUN –[acl]–> ADJ (378; 83%),
PROPN –[conj]–> PROPN (338; 72%),
ADJ –[conj]–> ADJ (297; 87%),
VERB –[nsubj]–> PRON (240; 58%).
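Agreement figures like those listed above can be approximated by comparing the Gender of each node with that of its parent, grouped by parent tag, relation and child tag. The sketch below is illustrative only: the sentence is made up, `agreement_by_relation` is a hypothetical helper, and it assumes that only pairs in which both nodes carry a Gender value enter the denominator. This page does not say how the original percentages are computed.

```python
from collections import Counter

# Hypothetical sentence: (id, upos, gender or None, head id, deprel).
SENTENCE = [
    (1, "ADJ", "Fem", 2, "amod"),
    (2, "NOUN", "Fem", 3, "nsubj"),
    (3, "VERB", "Fem", 0, "root"),
    (4, "DET", "Masc", 5, "det"),
    (5, "NOUN", "Neut", 3, "obj"),
]

def agreement_by_relation(sentence):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, all pairs).

    Assumption: a pair is counted only if both nodes have a Gender value.
    """
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for _tid, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head id 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)
        total[key] += 1
        if gender == parent[2]:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}

result = agreement_by_relation(SENTENCE)
for (parent_upos, deprel, child_upos), (ok, n) in result.items():
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {ok}/{n}")
```

Summing these per-sentence counts over a whole treebank and sorting by the number of agreeing pairs would give a ranking in the same shape as the list above.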