home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Croatian: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

99291 tokens (50%) have a non-empty value of Gender. 32250 types (91%) occur at least once with a non-empty value of Gender. 15821 lemmas (85%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (47808; 24% instances), ADJ (22390; 11% instances), PROPN (12691; 6% instances), DET (7122; 4% instances), VERB (6021; 3% instances), PRON (2010; 1% instances), AUX (647; 0% instances), NUM (601; 0% instances), ADV (1; 0% instances).

NOUN

47808 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (34157; 71%).

NOUN tokens may have the following values of Gender:

Paradigm našMascFemNeut
Case=Gen|Number=Singnašeganašega
Case=Loc|Number=Plurnašim

Gender seems to be lexical feature of NOUN. 99% lemmas (6303) occur only with one value of Gender.

ADJ

22390 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (20825; 93%), Number=Sing (14913; 67%).

ADJ tokens may have the following values of Gender:

Paradigm velikMascFemNeut
Animacy=Inan|Case=Acc|Definite=Def|Degree=Pos|Number=Singveliki
Animacy=Inan|Case=Acc|Definite=Def|Degree=Cmp|Number=Singveći
Animacy=Inan|Case=Acc|Definite=Def|Degree=Sup|Number=Singnajveći
Animacy=Inan|Case=Acc|Definite=Ind|Degree=Pos|Number=Singvelik, veći
Case=Acc|Definite=Def|Degree=Pos|Number=Singveliku
Case=Acc|Definite=Def|Degree=Pos|Number=Plurvelikevelikeveća
Case=Acc|Definite=Def|Degree=Cmp|Number=Singvećivećuveće
Case=Acc|Definite=Def|Degree=Cmp|Number=Plurvećeveće
Case=Acc|Definite=Def|Degree=Sup|Number=Singnajvećunajveće
Case=Acc|Definite=Def|Degree=Sup|Number=Plurnajveće
Case=Acc|Definite=Ind|Degree=Pos|Number=Singveliki, velik
Case=Acc|Definite=Ind|Degree=Cmp|Number=Singveći
Case=Acc|Definite=Ind|Degree=Sup|Number=Singnajveći
Case=Acc|Degree=Pos|Number=Singveliku
Case=Acc|Degree=Pos|Number=Plurvelike
Case=Acc|Degree=Cmp|Number=Singvećuveće
Case=Acc|Degree=Cmp|Number=Plurveće
Case=Acc|Degree=Sup|Number=Singnajveću
Case=Acc|Degree=Sup|Number=Plurnajvećenajveća
Case=Dat|Definite=Def|Degree=Pos|Number=Singvelikomvelikoj
Case=Dat|Definite=Def|Degree=Pos|Number=Plurvelikim
Case=Dat|Definite=Def|Degree=Cmp|Number=Singvećoj
Case=Dat|Definite=Def|Degree=Sup|Number=Singnajvećim
Case=Dat|Definite=Def|Degree=Sup|Number=Plurnajvećim
Case=Dat|Degree=Pos|Number=Singvelikom
Case=Dat|Degree=Cmp|Number=Singvećoj
Case=Dat|Degree=Sup|Number=Singnajvećem
Case=Gen|Definite=Def|Degree=Pos|Number=Singvelikog, velikogavelikenajvećeg, velikog
Case=Gen|Definite=Def|Degree=Pos|Number=Plurvelikihvelikihvelikih
Case=Gen|Definite=Def|Degree=Cmp|Number=Singvećegvećevećeg
Case=Gen|Definite=Def|Degree=Cmp|Number=Plurvećih
Case=Gen|Definite=Def|Degree=Sup|Number=Singnajvećegnajveće
Case=Gen|Definite=Def|Degree=Sup|Number=Plurnajvećihnajvećihnajvećih
Case=Gen|Definite=Ind|Degree=Sup|Number=Singnajveća
Case=Gen|Degree=Pos|Number=Singvelikog, velikavelikevelikog
Case=Gen|Degree=Pos|Number=Plurvelikihvelikihvelikih
Case=Gen|Degree=Cmp|Number=Singvećegveće
Case=Gen|Degree=Cmp|Number=Plurvećihvećih
Case=Gen|Degree=Sup|Number=Singnajvećegnajveće
Case=Gen|Degree=Sup|Number=Plurnajvećihnajvećihnajvećih
Case=Ins|Definite=Def|Degree=Pos|Number=Singvelikimvelikomnajvećim
Case=Ins|Definite=Def|Degree=Pos|Number=Plurvelikim
Case=Ins|Definite=Def|Degree=Cmp|Number=Singvećimvećom
Case=Ins|Definite=Def|Degree=Cmp|Number=Plurvećim
Case=Ins|Definite=Def|Degree=Sup|Number=Singnajvećimnajvećom
Case=Ins|Degree=Pos|Number=Singvelikimvelikom
Case=Ins|Degree=Pos|Number=Plurvelikimvelikim
Case=Ins|Degree=Cmp|Number=Singvećim
Case=Ins|Degree=Sup|Number=Singnajvećimnajvećom
Case=Ins|Degree=Sup|Number=Plurnajvećimnajvećima
Case=Loc|Definite=Def|Degree=Pos|Number=Singvelikomvelikoj
Case=Loc|Definite=Def|Degree=Pos|Number=Plurvelikimvelikimvelikim
Case=Loc|Definite=Def|Degree=Cmp|Number=Singvećem
Case=Loc|Definite=Def|Degree=Sup|Number=Singnajvećemnajvećoj
Case=Loc|Definite=Def|Degree=Sup|Number=Plurnajvećim
Case=Loc|Degree=Pos|Number=Singvelikomvelikojvelikom
Case=Loc|Degree=Pos|Number=Plurvelikim
Case=Loc|Degree=Cmp|Number=Singvećemvećoj
Case=Loc|Degree=Sup|Number=Singnajvećem
Case=Nom|Definite=Def|Degree=Pos|Number=Singvelikivelikaveliko
Case=Nom|Definite=Def|Degree=Pos|Number=Plurvelikivelikevelika
Case=Nom|Definite=Def|Degree=Cmp|Number=Singvećivećaveće
Case=Nom|Definite=Def|Degree=Sup|Number=Singnajvećinajvećanajveće
Case=Nom|Definite=Def|Degree=Sup|Number=Plurnajveći
Case=Nom|Definite=Ind|Degree=Pos|Number=Singvelik
Case=Nom|Degree=Pos|Number=Singvelik, velikivelikaVeliko
Case=Nom|Degree=Pos|Number=Plurvelikivelikevelika
Case=Nom|Degree=Cmp|Number=Singvećiveća
Case=Nom|Degree=Sup|Number=Singnajvećinajveća
Case=Nom|Degree=Sup|Number=Plurnajvećinajvećenajveća

PROPN

12691 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (12354; 97%), Case=Nom (6445; 51%).

PROPN tokens may have the following values of Gender:

Paradigm EUMascFem
Animacy=Inan|Case=Acc|Number=SingEU
Case=Acc|Number=SingEU
Case=Dat|Number=SingEU
Case=Gen|Number=SingEU, EU-aEU, EU-a
Case=Gen|Number=PlurEU
Case=Ins|Number=SingEU-omEU
Case=Loc|Number=SingEU, EU-uEU
Case=Nom|Number=SingEUEU

Gender seems to be lexical feature of PROPN. 98% lemmas (4432) occur only with one value of Gender.

DET

7122 DET tokens (98% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (6325; 89%), Person=EMPTY (6325; 89%), Poss=EMPTY (5581; 78%), Number=Sing (4828; 68%).

DET tokens may have the following values of Gender:

Paradigm kojiMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|PronType=Int,Relkojeg, kojega
Animacy=Inan|Case=Acc|Number=Sing|PronType=Int,Relkoji
Case=Acc|Number=Sing|PronType=Int,Relkojikojukoje
Case=Acc|Number=Plur|PronType=Int,Relkojekojekoja, koje
Case=Dat|Number=Sing|PronType=Int,Relkojemu, kojemkojojkojem
Case=Dat|Number=Plur|PronType=Int,Relkojimakojimakojima
Case=Gen|Number=Sing|PronType=Int,Relkojeg, kojegakojekojeg, kojega
Case=Gen|Number=Plur|PronType=Int,Relkojihkojihkojih
Case=Ins|Number=Sing|PronType=Int,Relkojimkojomkojim
Case=Ins|Number=Plur|PronType=Int,Relkojimakojimakojima
Case=Loc|Number=Sing|PronType=Int,Relkojem, kojemu, komkojojkojem, kojemu
Case=Loc|Number=Plur|PronType=Int,Relkojima, kojimkojima, kojimkojima
Case=Nom|Number=Sing|PronType=IntKoji
Case=Nom|Number=Sing|PronType=Int,Relkojikojakoje
Case=Nom|Number=Plur|PronType=IntKojiKoje
Case=Nom|Number=Plur|PronType=Int,Relkojikojekoja

VERB

6021 VERB tokens (35% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6021; 100%), Person=EMPTY (6021; 100%), Tense=Past (6021; 100%), VerbForm=Part (6021; 100%), Voice=Act (6021; 100%), Number=Sing (4356; 72%).

VERB tokens may have the following values of Gender:

Paradigm moćiMascFemNeut
Number=Singmogaomoglamoglo
Number=Plurmoglimoglemogla

PRON

2010 PRON tokens (35% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (2010; 100%), Person=EMPTY (1119; 56%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Accga, njegaje, ju, njuga, nj, njega
Case=Datmu, njemujoj, njoj
Case=Gennjeganje, je
Case=Insnjim, njimenjom, njomenjime, njim
Case=Locnjemunjoj
Case=Nomononaono

AUX

647 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (647; 100%), Person=EMPTY (647; 100%), Tense=Past (645; 100%), VerbForm=Part (645; 100%), Number=Sing (524; 81%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbiobilabilo
Number=Plurbilibilebila

NUM

601 NUM tokens (19% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (564; 94%), Number=Sing (404; 67%).

NUM tokens may have the following values of Gender:

Paradigm jedanMascFemNeut
Animacy=Anim|Case=Acc|Number=Singjednog
Animacy=Inan|Case=Acc|Number=Singjedan
Case=Acc|Number=Singjednujedno
Case=Acc|Number=Plurjedne
Case=Dat|Number=Singjednoj
Case=Gen|Number=Singjednogjednejednog, jednoga
Case=Ins|Number=Singjednimjednom
Case=Loc|Number=Singjednom, jednomejednojjednom
Case=Nom|Number=Singjedanjednajedno
Case=Nom|Number=Plurjedni

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Degree=Sup (1; 100%), PronType=Ind (1; 100%).

ADV tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (15906; 95%), NOUN –[det]–> DET (2919; 96%), PROPN –[flat]–> PROPN (1986; 98%), NOUN –[appos]–> PROPN (1293; 74%), VERB –[nsubj]–> PROPN (1130; 57%), ADJ –[nsubj]–> NOUN (843; 94%), ADJ –[conj]–> ADJ (766; 94%), PROPN –[conj]–> PROPN (696; 73%), NOUN –[acl]–> ADJ (664; 85%), ADJ –[nsubj:pass]–> NOUN (565; 89%).