This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home hr/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Croatian)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

71964 tokens (52%) have a non-empty value of Gender. 25422 types (91%) occur at least once with a non-empty value of Gender. 12912 lemmas (85%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (34357; 25% instances), ADJ (16024; 12% instances), PROPN (10020; 7% instances), PRON (5878; 4% instances), VERB (4558; 3% instances), NUM (634; 0% instances), AUX (493; 0% instances).

NOUN

34357 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (24914; 73%).

NOUN tokens may have the following values of Gender:

Paradigm narodMascNeut
Animacy=Inan|Case=Acc|Number=Singnarod
Case=Acc|Number=Plurnarode
Case=Dat|Number=Singnarodu
Case=Dat|Number=Plurnarodima
Case=Gen|Number=Singnaroda
Case=Gen|Number=Plurnaroda
Case=Nom|Number=Singnarod
Case=Nom|Number=Plurnarodi

Gender seems to be lexical feature of NOUN. 99% lemmas (5175) occur only with one value of Gender.

ADJ

16024 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (15383; 96%), Number=Sing (10580; 66%), Definite=EMPTY (9718; 61%).

ADJ tokens may have the following values of Gender:

Paradigm novMascFemNeut
Animacy=Anim|Case=Acc|Definite=Def|Degree=Pos|Number=Singnovog, novoga
Animacy=Inan|Case=Acc|Definite=Def|Degree=Pos|Number=Singnovi, nove
Animacy=Inan|Case=Acc|Definite=Ind|Degree=Pos|Number=Singnov
Case=Acc|Definite=Def|Degree=Pos|Number=Singnovognovunovo
Case=Acc|Definite=Def|Degree=Pos|Number=Plurnovenovenova
Case=Acc|Definite=Def|Degree=Cmp|Number=Singnovije
Case=Acc|Definite=Ind|Degree=Pos|Number=Singnovi, nov
Case=Acc|Degree=Pos|Number=Singnovunovo
Case=Acc|Degree=Pos|Number=Plurnovenovenova
Case=Acc|Degree=Cmp|Number=Singnovije
Case=Acc|Degree=Sup|Number=Singnajnoviju
Case=Acc|Degree=Sup|Number=Plurnajnovije
Case=Dat|Definite=Def|Degree=Pos|Number=Singnovoj
Case=Dat|Definite=Def|Degree=Pos|Number=Plurnovim
Case=Dat|Degree=Pos|Number=SingnovomNovoj
Case=Gen|Definite=Def|Degree=Pos|Number=Singnovog, novoganovenovog
Case=Gen|Definite=Def|Degree=Pos|Number=Plurnovihnovihnovih
Case=Gen|Definite=Def|Degree=Pos|Number=Plur|Poss=YesNovih
Case=Gen|Degree=Pos|Number=Singnovog, novanovenovog
Case=Gen|Degree=Pos|Number=Plurnovihnovihnovih
Case=Gen|Degree=Sup|Number=Singnajnovije
Case=Gen|Degree=Sup|Number=Plurnajnovijih
Case=Ins|Definite=Def|Degree=Pos|Number=Singnovimnovom
Case=Ins|Degree=Pos|Number=Singnovimnovom
Case=Ins|Degree=Sup|Number=Singnajnovijim
Case=Loc|Definite=Def|Degree=Pos|Number=Singnovom, novomenovojNovom
Case=Loc|Degree=Pos|Number=Singnovomnovojnovom
Case=Loc|Degree=Pos|Number=Plurnovimnovim
Case=Loc|Degree=Sup|Number=Singnajnovijemnajnovijemnov
Case=Loc|Degree=Sup|Number=Plurnajnovijimnajnovijim
Case=Nom|Definite=Def|Degree=Pos|Number=Singnovinovanovo
Case=Nom|Definite=Def|Degree=Pos|Number=Plurnovinove
Case=Nom|Definite=Def|Degree=Sup|Number=SingNajnovije
Case=Nom|Degree=Pos|Number=Singnovi, novnovanovo
Case=Nom|Degree=Pos|Number=Plurnovinovenova
Case=Nom|Degree=Sup|Number=Singnajnovijinajnovija
Case=Nom|Degree=Sup|Number=Plurnajnoviji

PROPN

10020 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (9746; 97%).

PROPN tokens may have the following values of Gender:

Paradigm EUMascFem
Animacy=Inan|Case=AccEU
Case=AccEU
Case=DatEU
Case=GenEU, EU-aEU, EU-a
Case=InsEU-omEU
Case=LocEU, EU-uEU
Case=NomEUEU

Gender seems to be lexical feature of PROPN. 98% lemmas (3591) occur only with one value of Gender.

PRON

5878 PRON tokens (70% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (5394; 92%), Person=EMPTY (4258; 72%), Number=Sing (3803; 65%).

PRON tokens may have the following values of Gender:

Paradigm kojiMascFemNeut
Animacy=Anim|Case=Acc|Number=Sing|PronType=Indkojeg, kojega
Animacy=Inan|Case=Acc|Number=Sing|PronType=Indkoji
Case=Acc|Number=Sing|PronType=Indkojikojukoje
Case=Acc|Number=Plur|PronType=Indkojekojekoja, koje
Case=Dat|Number=Sing|PronType=Indkojem, kojemukojojkojem
Case=Dat|Number=Plur|PronType=Indkojimakojimakojima
Case=Gen|Number=Sing|PronType=Indkojegkojekojeg
Case=Gen|Number=Plur|PronType=Indkojihkojihkojih
Case=Ins|Number=Sing|PronType=Indkojimkojomkojim
Case=Ins|Number=Plur|PronType=Indkojimakojimakojima
Case=Loc|Number=Sing|PronType=Indkojem, kojemu, komkojojkojem, kojemu
Case=Loc|Number=Plur|PronType=Indkojimakojima, kojimkojima
Case=Nom|Number=Sing|PronType=Indkojikojakoje
Case=Nom|Number=Plur|PronType=Indkojikojekoja
Case=Nom|Number=Plur|PronType=IntKojiKoje

VERB

4558 VERB tokens (39% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Tense=EMPTY (4558; 100%), Person=EMPTY (4558; 100%), VerbForm=Part (4558; 100%), Number=Sing (3370; 74%).

VERB tokens may have the following values of Gender:

Paradigm moćiMascFemNeut
Number=Singmogaomoglamoglo
Number=Plurmoglimoglemogla

NUM

634 NUM tokens (20% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (406; 64%), NumType=Card (364; 57%).

NUM tokens may have the following values of Gender:

Paradigm jedanMascFemNeut
Animacy=Anim|Case=Acc|Number=Singjednog
Animacy=Inan|Case=Acc|Number=Singjedan
Case=Acc|Number=Singjednujedno
Case=Acc|Number=Plurjedne
Case=Dat|Number=Singjednoj
Case=Gen|Number=Singjednogjednejednog, jednoga
Case=Ins|Number=Singjednimjednom
Case=Loc|Number=Singjednom, jednomejednojjednom
Case=Nom|Number=Singjedanjednajedno
Case=Nom|Number=Plurjedni

AUX

493 AUX tokens (5% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Tense=EMPTY (493; 100%), Person=EMPTY (493; 100%), Number=Sing (406; 82%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbiobilabilo
Number=Plurbilibilebila

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (10973; 100%), PROPN –[name]–> PROPN (1621; 99%), NOUN –[appos]–> PROPN (1058; 76%), NOUN –[det]–> PRON (1035; 97%), NOUN –[nmod]–> PRON (1002; 91%), VERB –[nsubj]–> PROPN (918; 61%), ADJ –[nsubj]–> NOUN (529; 93%), ADJ –[conj]–> ADJ (526; 94%), NOUN –[compound]–> ADJ (521; 99%), NOUN –[acl]–> ADJ (489; 85%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]