home cs/feat edit page issue tracker

Gender: gender

Gender is a lexical feature of nouns and inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. There are three values of gender: masculine, feminine, and neuter.

See also the related feature of Animacy.

Masc: masculine gender

Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.

Examples

Fem: feminine gender

Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.

Examples

Neut: neuter gender

This third gender is for nouns that are neither masculine nor feminine (grammatically). Nouns whose nominative suffix is -o  or -í  (including a large group of deverbative nouns denoting actions) are usually neuter.

Examples


Treebank Statistics (UD_Czech)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

753255 tokens (50%) have a non-empty value of Gender. 122285 types (95%) occur at least once with a non-empty value of Gender. 49852 lemmas (86%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: cs-pos/NOUN (371970; 25% instances), cs-pos/ADJ (176190; 12% instances), cs-pos/PROPN (82083; 5% instances), cs-pos/VERB (63395; 4% instances), cs-pos/PRON (33610; 2% instances), cs-pos/DET (17996; 1% instances), cs-pos/NUM (4759; 0% instances), cs-pos/AUX (3252; 0% instances).

NOUN

371970 cs-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Negative=Pos (371406; 100%), Number=Sing (259816; 70%), Animacy=EMPTY (208424; 56%).

NOUN tokens may have the following values of Gender:

Paradigm imageMascFemNeut
Animacy=Inanimage
imageimage

Gender seems to be lexical feature of NOUN. 99% lemmas (17613) occur only with one value of Gender.

ADJ

176190 cs-pos/ADJ tokens (97% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Negative=Pos (164176; 93%), Degree=Pos (155117; 88%), Number=Sing (122021; 69%), Animacy=EMPTY (102266; 58%).

ADJ tokens may have the following values of Gender:

Paradigm známýFem,MascFem,NeutMascFemNeut
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Neg|Number=Plurneznámé
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Anim|Case=Acc|Degree=Sup|Negative=Pos|Number=Plurnejznámější
Animacy=Anim|Case=Dat|Degree=Pos|Negative=Pos|Number=Singznámému
Animacy=Anim|Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Animacy=Anim|Case=Dat|Degree=Sup|Negative=Pos|Number=Plurnejznámějším
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Anim|Case=Gen|Degree=Sup|Negative=Pos|Number=Plurnejznámějších
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámým
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Neg|Number=Plurneznámými
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámým
Animacy=Anim|Case=Ins|Degree=Sup|Negative=Pos|Number=Singnejznámějším
Animacy=Anim|Case=Loc|Degree=Pos|Negative=Neg|Number=Singneznámém
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámí
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznámí
Animacy=Anim|Case=Nom|Degree=Sup|Negative=Pos|Number=Singnejznámější
Animacy=Anim|Negative=Pos|Number=Plur|Variant=Shortznámi
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Inan|Case=Acc|Degree=Sup|Negative=Pos|Number=Plurnejznámější
Animacy=Inan|Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Inan|Case=Gen|Degree=Sup|Negative=Pos|Number=Plurnejznámějších
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámým
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Neg|Number=Plurneznámými
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámým
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Pos|Number=Plurznámými
Animacy=Inan|Case=Ins|Degree=Sup|Negative=Pos|Number=Singnejznámějším
Animacy=Inan|Case=Loc|Degree=Pos|Negative=Pos|Number=Singznámém
Animacy=Inan|Case=Loc|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámé
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Inan|Case=Nom|Degree=Sup|Negative=Pos|Number=Singnejznámější
Animacy=Inan|Case=Nom|Degree=Sup|Negative=Pos|Number=PlurNejznámější
Animacy=Inan|Negative=Pos|Number=Plur|Variant=Shortznámy
Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámouneznámé
Case=Acc|Degree=Pos|Negative=Neg|Number=Plurneznámá
Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámouznámé
Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámá
Case=Dat|Degree=Pos|Negative=Pos|Number=Singznámé
Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Case=Dat|Degree=Sup|Negative=Pos|Number=Plurnejznámějšímnejznámějším
Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámé
Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámýchneznámých
Case=Gen|Degree=Pos|Negative=Pos|Number=Singznáméznámého
Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámýchznámých
Case=Gen|Degree=Cmp|Negative=Pos|Number=Singznámější
Case=Gen|Degree=Sup|Negative=Pos|Number=Plurnejznámějších
Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámouneznámým
Case=Ins|Degree=Pos|Negative=Neg|Number=Plurneznámými
Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámouznámým
Case=Ins|Degree=Pos|Negative=Pos|Number=Plurznámými
Case=Loc|Degree=Pos|Negative=Neg|Number=Singneznáméneznámém
Case=Loc|Degree=Pos|Negative=Pos|Number=Singznámé
Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámáneznámé
Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámé
Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámáznámé
Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznáméznámá
Case=Nom|Degree=Cmp|Negative=Pos|Number=Plurznámější
Case=Nom|Degree=Sup|Negative=Pos|Number=Singnejznámějšínejznámější
Case=Nom|Degree=Sup|Negative=Pos|Number=Plurnejznámější
Case=Voc|Degree=Pos|Negative=Neg|Number=Singneznámá
Negative=Neg|Number=Sing|Variant=Shortneznámo
Negative=Pos|Number=Sing|Variant=Shortznámznámo
Negative=Pos|Number=Plur,Sing|Variant=Shortznáma, neznáma

PROPN

82083 cs-pos/PROPN tokens (98% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Negative=Pos (82083; 100%), Abbr=EMPTY (70671; 86%), Number=Sing (63175; 77%).

PROPN tokens may have the following values of Gender:

Paradigm MMascFemNeut
Animacy=Anim|NameType=GivM
Animacy=Anim|NameType=SurM
NameType=ComM
NameType=GivM
NameType=SurM

Gender seems to be lexical feature of PROPN. 98% lemmas (14605) occur only with one value of Gender.

VERB

63395 cs-pos/VERB tokens (38% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (63395; 100%), Person=EMPTY (63393; 100%), VerbForm=Part (63251; 100%), Negative=Pos (58700; 93%), Voice=Act (53869; 85%), Tense=Past (53727; 85%), Number=Sing (35102; 55%).

VERB tokens may have the following values of Gender:

Paradigm dátFem,MascFem,NeutMascFemNeut
Animacy=Anim|Negative=Neg|Number=Plur|Tense=Past|Voice=Actnedali
Animacy=Anim|Negative=Pos|Number=Plur|Tense=Past|Voice=Actdali
Animacy=Inan|Negative=Neg|Number=Plur|Tense=Past|Voice=Actnedaly
Animacy=Inan|Negative=Pos|Number=Plur|Tense=Past|Voice=Actdaly
Animacy=Inan|Negative=Pos|Number=Plur|Voice=Passdány
Case=Acc|Negative=Pos|Number=Sing|Voice=Passdánu
Negative=Neg|Number=Sing|Tense=Past|Voice=Actnedalnedalo
Negative=Neg|Number=Plur,Sing|Tense=Past|Voice=Actnedala
Negative=Pos|Number=Sing|Tense=Past|Voice=Actdaldalo
Negative=Pos|Number=Sing|Voice=Passdándáno
Negative=Pos|Number=Plur,Sing|Tense=Past|Voice=Actdala
Negative=Pos|Number=Plur,Sing|Voice=Passdána

PRON

33610 cs-pos/PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (33519; 100%), Variant=EMPTY (32377; 96%), Person=EMPTY (29028; 86%), Number=Sing (26099; 78%), Case=Nom (17079; 51%).

PRON tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plurnašeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plurnáš
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=SingMůj
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Singmoje
Case=Acc|Number=Sing|Number[psor]=Plurnaši
Case=Acc|Number=Sing|Number[psor]=Singmoumoje
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Dat|Number=Sing|Number[psor]=Singmému
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Gen|Number=Sing|Number[psor]=Singmého
Case=Ins|Number=Sing|Number[psor]=Plurnaším
Case=Ins|Number=Sing|Number[psor]=Singmým
Case=Loc|Number=Sing|Number[psor]=Plurnašem
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Sing|Number[psor]=Singmojemůj
Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Voc|Number=Sing|Number[psor]=Singmoje

DET

17996 cs-pos/DET tokens (65% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Gender[psor]=EMPTY (16857; 94%), Number=Sing (14993; 83%), Number[psor]=EMPTY (14610; 81%), Person=EMPTY (14610; 81%), Reflex=EMPTY (14008; 78%), Poss=EMPTY (10622; 59%).

DET tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Abbr=Yes|Case=Ins|Number=Sing|Number[psor]=Plurn
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plurnašeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Singmoji, Mí
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plurnáš
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Singmůj
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Plurnaše
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Singmé, moje
Case=Acc|Number=Sing|Number[psor]=Plurnašinaše
Case=Acc|Number=Sing|Number[psor]=Plur|Style=Collnaší
Case=Acc|Number=Sing|Number[psor]=Singmou, mojimé, moje
Case=Acc|Number=Plur|Number[psor]=Sing
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Dat|Number=Sing|Number[psor]=Singmémumé, mojí
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Gen|Number=Sing|Number[psor]=Singméhomé, mojí
Case=Gen|Number=Sing|Number[psor]=Sing|Style=Collmýho
Case=Ins|Number=Sing|Number[psor]=Plurnašímnaší
Case=Ins|Number=Sing|Number[psor]=Plur|Style=Collnašim
Case=Ins|Number=Sing|Number[psor]=Singmýmmou, mojí
Case=Loc|Number=Sing|Number[psor]=Plurnašemnaší
Case=Loc|Number=Sing|Number[psor]=Singmém
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Sing|Number[psor]=Singmojemůj
Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Nom|Number=Plur|Number[psor]=Singmoje
Case=Voc|Number=Sing|Number[psor]=Plurnáš
Case=Voc|Number=Sing|Number[psor]=Singmůj

NUM

4759 cs-pos/NUM tokens (11% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (4417; 93%), NumValue=1,2,3 (4417; 93%), NumForm=Word (4417; 93%), Number=Sing (2795; 59%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Anim|Case=Accjednoho
Animacy=Inan|Case=Accjeden
Case=Accjednujedno
Case=Datjednomujedné
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednomjedné
Case=Nomjedenjednajedno

AUX

3252 cs-pos/AUX tokens (16% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Voice=Act (3252; 100%), VerbForm=Part (3252; 100%), Mood=EMPTY (3252; 100%), Tense=Past (3252; 100%), Person=EMPTY (3252; 100%), Negative=Pos (2982; 92%), Number=Sing (1704; 52%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascNeut
Animacy=Anim|Negative=Neg|Number=Plurnebyli
Animacy=Anim|Negative=Pos|Number=Plurbyli
Animacy=Inan|Negative=Neg|Number=Plurnebyly
Animacy=Inan|Negative=Pos|Number=Plurbyly
Negative=Neg|Number=Singnebylnebylo
Negative=Neg|Number=Plur,Singnebyla
Negative=Pos|Number=Singbylbylo
Negative=Pos|Number=Plur,Singbyla

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (148372; 99%), PROPN –[name]–> PROPN (13443; 99%), PROPN –[nmod]–> NOUN (8330; 89%), VERB –[nsubj]–> PROPN (7301; 53%), ADJ –[conj]–> ADJ (5336; 92%), VERB –[conj]–> VERB (4627; 50%), PROPN –[conj]–> PROPN (4434; 68%), PROPN –[amod]–> ADJ (4189; 82%), ADJ –[nsubj]–> NOUN (4003; 94%), VERB –[auxpass]–> AUX (3122; 51%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]