home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Serbian-SET: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

50400 tokens (52%) have a non-empty value of Gender. 16566 types (90%) occur at least once with a non-empty value of Gender. 8063 lemmas (84%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (23811; 24% instances), ADJ (10699; 11% instances), PROPN (7407; 8% instances), DET (3501; 4% instances), VERB (3352; 3% instances), PRON (772; 1% instances), NUM (554; 1% instances), AUX (304; 0% instances).

NOUN

23811 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (17402; 73%).

NOUN tokens may have the following values of Gender:

Paradigm deloMascFemNeut
Case=Acc|Number=Plurdela
Case=Gen|Number=Plurdeladela
Case=Loc|Number=Plurdelima
Case=Nom|Number=Singdeladelo
Case=Nom|Number=Plurdela

Gender seems to be lexical feature of NOUN. 99% lemmas (3202) occur only with one value of Gender.

ADJ

10699 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (10270; 96%), Definite=Def (9870; 92%), VerbForm=EMPTY (9605; 90%), Voice=EMPTY (9605; 90%), Number=Sing (7233; 68%).

ADJ tokens may have the following values of Gender:

Paradigm novMascFemNeut
Animacy=Anim|Case=Acc|Definite=Def|Degree=Pos|Number=Singnovog
Animacy=Inan|Case=Acc|Definite=Def|Degree=Pos|Number=Singnovi
Animacy=Inan|Case=Acc|Definite=Ind|Degree=Pos|Number=Singnov
Case=Acc|Definite=Def|Degree=Pos|Number=Singnovunovo
Case=Acc|Definite=Def|Degree=Pos|Number=Plurnovenovenova
Case=Acc|Definite=Def|Degree=Cmp|Number=Singnovije
Case=Acc|Definite=Def|Degree=Sup|Number=Singnajnoviju
Case=Acc|Definite=Def|Degree=Sup|Number=Plurnajnovijenajnovije
Case=Dat|Definite=Def|Degree=Pos|Number=SingnovomNovoj
Case=Gen|Definite=Def|Degree=Pos|Number=Singnovognovenovog
Case=Gen|Definite=Def|Degree=Pos|Number=Plurnovihnovihnovih
Case=Gen|Definite=Def|Degree=Sup|Number=Singnajnovije
Case=Gen|Definite=Def|Degree=Sup|Number=Plurnajnovijihnajnovijih
Case=Ins|Definite=Def|Degree=Pos|Number=Singnovimnovim
Case=Ins|Definite=Def|Degree=Pos|Number=PlurnovimNovim
Case=Loc|Definite=Def|Degree=Pos|Number=SingnovomnovojNovom
Case=Loc|Definite=Def|Degree=Pos|Number=Plurnovimnovim
Case=Loc|Definite=Def|Degree=Cmp|Number=Singnovijoj
Case=Loc|Definite=Def|Degree=Sup|Number=Singnajnovijem
Case=Loc|Definite=Def|Degree=Sup|Number=Plurnajnovijim
Case=Nom|Definite=Def|Degree=Pos|Number=Singnovinovanovo
Case=Nom|Definite=Def|Degree=Pos|Number=Plurnovinovenova
Case=Nom|Definite=Def|Degree=Sup|Number=Singnajnovijinajnovija
Case=Nom|Definite=Def|Degree=Sup|Number=Plurnajnoviji
Case=Nom|Definite=Ind|Degree=Pos|Number=Singnov

PROPN

7407 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7206; 97%), Case=Nom (4146; 56%).

PROPN tokens may have the following values of Gender:

Paradigm INAMascFemNeut
Case=GenINA-e, INE
Case=NomINAINAINA

Gender seems to be lexical feature of PROPN. 98% lemmas (2086) occur only with one value of Gender.

DET

3501 DET tokens (96% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (3134; 90%), Poss=EMPTY (2744; 78%), Number=Sing (2387; 68%).

DET tokens may have the following values of Gender:

Paradigm kojiMascFemNeut
Animacy=Anim|Case=Acc|Number=Singkoji, kojeg
Animacy=Inan|Case=Acc|Number=Singkoji
Case=Acc|Number=Singkojukoje
Case=Acc|Number=Plurkojekojekoja
Case=Dat|Number=Singkojemkojoj
Case=Dat|Number=Plurkojima, kojikojima
Case=Gen|Number=Singkojeg, kogkojekojeg
Case=Gen|Number=Plurkojihkojihkojih
Case=Ins|Number=Singkojimkojomkojim
Case=Ins|Number=Plurkojimakojimakojima
Case=Loc|Number=Singkojem, kom, komekojojkojem, kome, kom
Case=Loc|Number=Plurkojimakojimakojima
Case=Nom|Number=Singkojikojakoje
Case=Nom|Number=Plurkojikojekoja, koje

VERB

3352 VERB tokens (40% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3352; 100%), Person=EMPTY (3352; 100%), Tense=Past (3352; 100%), VerbForm=Part (3352; 100%), Voice=Act (3352; 100%), Number=Sing (2577; 77%).

VERB tokens may have the following values of Gender:

Paradigm rećiMascFemNeut
Number=Singrekaoreklareklo
Number=Plurreklirekle

PRON

772 PRON tokens (32% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (772; 100%), Case=Nom (540; 70%), Person=3 (480; 62%), PronType=Prs (480; 62%), Number=Sing (416; 54%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Acc|Number=Singga, njegaje, ju, njuono
Case=Dat|Number=Singmu, njemujoj
Case=Gen|Number=Singnjeganje
Case=Ins|Number=Singnjim, Njimenjom
Case=Loc|Number=Singnjemunjoj
Case=Nom|Number=Singononaono
Case=Nom|Number=Plurona

NUM

554 NUM tokens (27% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (413; 75%), Degree=EMPTY (298; 54%).

NUM tokens may have the following values of Gender:

Paradigm jedanMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|NumType=Cardjednog
Animacy=Inan|Case=Acc|Definite=Ind|Degree=Pos|Number=Singjedan
Animacy=Inan|Case=Acc|Number=Sing|NumType=Cardjedan
Case=Acc|Number=Sing|NumType=Cardjednujedno
Case=Dat|Number=Sing|NumType=Cardjednomjednoj
Case=Gen|Number=Sing|NumType=Cardjednogjedne
Case=Ins|Number=Sing|NumType=Cardjednimjednom
Case=Loc|Definite=Def|Degree=Pos|Number=Singjednoj
Case=Loc|Number=Sing|NumType=Cardjednomjednojjednom
Case=Nom|Number=Sing|NumType=Cardjedanjednajedno
Case=Nom|Number=Plur|NumType=Cardjedni

AUX

304 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (304; 100%), Person=EMPTY (304; 100%), Tense=Past (304; 100%), VerbForm=Part (304; 100%), Number=Sing (247; 81%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbiobilabilo
Number=Plurbilibilebila

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (8037; 99%), NOUN –[det]–> DET (1663; 98%), PROPN –[flat]–> PROPN (1312; 99%), NOUN –[flat]–> PROPN (679; 82%), ADJ –[nsubj]–> NOUN (672; 91%), VERB –[nsubj]–> PROPN (655; 56%), NOUN –[acl]–> ADJ (378; 83%), PROPN –[conj]–> PROPN (338; 72%), ADJ –[conj]–> ADJ (292; 87%), VERB –[nsubj]–> PRON (240; 56%).