This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home cs/feat issue tracker

Gender: gender

Gender is a lexical feature of nouns and inflectional feature of other parts of speech (adjectives, verbs) that mark agreement with nouns. There are three values of gender: masculine, feminine, and neuter.

See also the related feature of Animacy.

Masc: masculine gender

Nouns denoting male persons are masculine. Other nouns may be also grammatically masculine, without any relation to sex.

Examples

Fem: feminine gender

Nouns denoting female persons are feminine. Other nouns may be also grammatically feminine, without any relation to sex.

Examples

Neut: neuter gender

This third gender is for nouns that are neither masculine nor feminine (grammatically). Nouns whose nominative suffix is -o  or -í  (including a large group of deverbative nouns denoting actions) are usually neuter.

Examples


Treebank Statistics (UD_Czech)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

753255 tokens (50%) have a non-empty value of Gender. 122285 types (95%) occur at least once with a non-empty value of Gender. 49852 lemmas (86%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: cs-pos/NOUN (371970; 25% instances), cs-pos/ADJ (176190; 12% instances), cs-pos/PROPN (82083; 5% instances), cs-pos/VERB (63395; 4% instances), cs-pos/PRON (33608; 2% instances), cs-pos/DET (17998; 1% instances), cs-pos/NUM (4759; 0% instances), cs-pos/AUX (3252; 0% instances).

NOUN

371970 cs-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Negative=Pos (371406; 100%), Number=Sing (259816; 70%), Animacy=EMPTY (208424; 56%).

NOUN tokens may have the following values of Gender:

Paradigm imageMascFemNeut
Animacy=Inanimage
imageimage

Gender seems to be lexical feature of NOUN. 99% lemmas (17613) occur only with one value of Gender.

ADJ

176190 cs-pos/ADJ tokens (97% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Negative=Pos (164176; 93%), Degree=Pos (155117; 88%), Number=Sing (122021; 69%), Animacy=EMPTY (102266; 58%).

ADJ tokens may have the following values of Gender:

Paradigm známýFem,MascFem,NeutMascFemNeut
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Neg|Number=Plurneznámé
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Anim|Case=Acc|Degree=Sup|Negative=Pos|Number=Plurnejznámější
Animacy=Anim|Case=Dat|Degree=Pos|Negative=Pos|Number=Singznámému
Animacy=Anim|Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Animacy=Anim|Case=Dat|Degree=Sup|Negative=Pos|Number=Plurnejznámějším
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Anim|Case=Gen|Degree=Sup|Negative=Pos|Number=Plurnejznámějších
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámým
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Neg|Number=Plurneznámými
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámým
Animacy=Anim|Case=Ins|Degree=Sup|Negative=Pos|Number=Singnejznámějším
Animacy=Anim|Case=Loc|Degree=Pos|Negative=Neg|Number=Singneznámém
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámí
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznámí
Animacy=Anim|Case=Nom|Degree=Sup|Negative=Pos|Number=Singnejznámější
Animacy=Anim|Negative=Pos|Number=Plur|Variant=Shortznámi
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Inan|Case=Acc|Degree=Sup|Negative=Pos|Number=Plurnejznámější
Animacy=Inan|Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Inan|Case=Gen|Degree=Sup|Negative=Pos|Number=Plurnejznámějších
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámým
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Neg|Number=Plurneznámými
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámým
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Pos|Number=Plurznámými
Animacy=Inan|Case=Ins|Degree=Sup|Negative=Pos|Number=Singnejznámějším
Animacy=Inan|Case=Loc|Degree=Pos|Negative=Pos|Number=Singznámém
Animacy=Inan|Case=Loc|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámé
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Inan|Case=Nom|Degree=Sup|Negative=Pos|Number=Singnejznámější
Animacy=Inan|Case=Nom|Degree=Sup|Negative=Pos|Number=PlurNejznámější
Animacy=Inan|Negative=Pos|Number=Plur|Variant=Shortznámy
Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámouneznámé
Case=Acc|Degree=Pos|Negative=Neg|Number=Plurneznámá
Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámouznámé
Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámá
Case=Dat|Degree=Pos|Negative=Pos|Number=Singznámé
Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Case=Dat|Degree=Sup|Negative=Pos|Number=Plurnejznámějšímnejznámějším
Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámé
Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámýchneznámých
Case=Gen|Degree=Pos|Negative=Pos|Number=Singznáméznámého
Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámýchznámých
Case=Gen|Degree=Cmp|Negative=Pos|Number=Singznámější
Case=Gen|Degree=Sup|Negative=Pos|Number=Plurnejznámějších
Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámouneznámým
Case=Ins|Degree=Pos|Negative=Neg|Number=Plurneznámými
Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámouznámým
Case=Ins|Degree=Pos|Negative=Pos|Number=Plurznámými
Case=Loc|Degree=Pos|Negative=Neg|Number=Singneznáméneznámém
Case=Loc|Degree=Pos|Negative=Pos|Number=Singznámé
Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámáneznámé
Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámé
Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámáznámé
Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznáméznámá
Case=Nom|Degree=Cmp|Negative=Pos|Number=Plurznámější
Case=Nom|Degree=Sup|Negative=Pos|Number=Singnejznámějšínejznámější
Case=Nom|Degree=Sup|Negative=Pos|Number=Plurnejznámější
Case=Voc|Degree=Pos|Negative=Neg|Number=Singneznámá
Negative=Neg|Number=Sing|Variant=Shortneznámo
Negative=Pos|Number=Sing|Variant=Shortznámznámo
Negative=Pos|Number=Plur,Sing|Variant=Shortznáma, neznáma

PROPN

82083 cs-pos/PROPN tokens (98% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Negative=Pos (82083; 100%), Abbr=EMPTY (70671; 86%), Number=Sing (63175; 77%).

PROPN tokens may have the following values of Gender:

Paradigm MMascFemNeut
Animacy=Anim|NameType=GivM
Animacy=Anim|NameType=SurM
NameType=ComM
NameType=GivM
NameType=SurM

Gender seems to be lexical feature of PROPN. 98% lemmas (14605) occur only with one value of Gender.

VERB

63395 cs-pos/VERB tokens (38% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (63395; 100%), Person=EMPTY (63393; 100%), VerbForm=Part (63251; 100%), Negative=Pos (58700; 93%), Voice=Act (53869; 85%), Tense=Past (53727; 85%), Number=Sing (35102; 55%).

VERB tokens may have the following values of Gender:

Paradigm dátFem,MascFem,NeutMascFemNeut
Animacy=Anim|Negative=Neg|Number=Plur|Tense=Past|Voice=Actnedali
Animacy=Anim|Negative=Pos|Number=Plur|Tense=Past|Voice=Actdali
Animacy=Inan|Negative=Neg|Number=Plur|Tense=Past|Voice=Actnedaly
Animacy=Inan|Negative=Pos|Number=Plur|Tense=Past|Voice=Actdaly
Animacy=Inan|Negative=Pos|Number=Plur|Voice=Passdány
Case=Acc|Negative=Pos|Number=Sing|Voice=Passdánu
Negative=Neg|Number=Sing|Tense=Past|Voice=Actnedalnedalo
Negative=Neg|Number=Plur,Sing|Tense=Past|Voice=Actnedala
Negative=Pos|Number=Sing|Tense=Past|Voice=Actdaldalo
Negative=Pos|Number=Sing|Voice=Passdándáno
Negative=Pos|Number=Plur,Sing|Tense=Past|Voice=Actdala
Negative=Pos|Number=Plur,Sing|Voice=Passdána

PRON

33608 cs-pos/PRON tokens (46% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (33517; 100%), Variant=EMPTY (32375; 96%), Person=EMPTY (29026; 86%), Number=Sing (26097; 78%), Case=Nom (17078; 51%).

PRON tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plurnašeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plurnáš
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=SingMůj
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Singmoje
Case=Acc|Number=Sing|Number[psor]=Plurnaši
Case=Acc|Number=Sing|Number[psor]=Singmoumoje
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Dat|Number=Sing|Number[psor]=Singmému
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Gen|Number=Sing|Number[psor]=Singmého
Case=Ins|Number=Sing|Number[psor]=Plurnaším
Case=Ins|Number=Sing|Number[psor]=Singmým
Case=Loc|Number=Sing|Number[psor]=Plurnašem
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Sing|Number[psor]=Singmojemůj
Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Voc|Number=Sing|Number[psor]=Singmoje

DET

17998 cs-pos/DET tokens (65% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Gender[psor]=EMPTY (16859; 94%), Number=Sing (14995; 83%), Number[psor]=EMPTY (14612; 81%), Person=EMPTY (14612; 81%), Reflex=EMPTY (14010; 78%), Poss=EMPTY (10624; 59%).

DET tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Abbr=Yes|Case=Ins|Number=Sing|Number[psor]=Plurn
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plurnašeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Singmoji, Mí
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plurnáš
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Singmůj
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Plurnaše
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Singmé, moje
Case=Acc|Number=Sing|Number[psor]=Plurnašinaše
Case=Acc|Number=Sing|Number[psor]=Plur|Style=Collnaší
Case=Acc|Number=Sing|Number[psor]=Singmou, mojimé, moje
Case=Acc|Number=Plur|Number[psor]=Sing
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Dat|Number=Sing|Number[psor]=Singmémumé, mojí
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Gen|Number=Sing|Number[psor]=Singméhomé, mojí
Case=Gen|Number=Sing|Number[psor]=Sing|Style=Collmýho
Case=Ins|Number=Sing|Number[psor]=Plurnašímnaší
Case=Ins|Number=Sing|Number[psor]=Plur|Style=Collnašim
Case=Ins|Number=Sing|Number[psor]=Singmýmmou, mojí
Case=Loc|Number=Sing|Number[psor]=Plurnašemnaší
Case=Loc|Number=Sing|Number[psor]=Singmém
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Sing|Number[psor]=Singmojemůj
Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Nom|Number=Plur|Number[psor]=Singmoje
Case=Voc|Number=Sing|Number[psor]=Plurnáš
Case=Voc|Number=Sing|Number[psor]=Singmůj

NUM

4759 cs-pos/NUM tokens (11% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumValue=1,2,3 (4417; 93%), NumForm=Word (4417; 93%), NumType=Card (4417; 93%), Number=Sing (2795; 59%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Anim|Case=Accjednoho
Animacy=Inan|Case=Accjeden
Case=Accjednujedno
Case=Datjednomujedné
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednomjedné
Case=Nomjedenjednajedno

AUX

3252 cs-pos/AUX tokens (16% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Tense=Past (3252; 100%), VerbForm=Part (3252; 100%), Voice=Act (3252; 100%), Mood=EMPTY (3252; 100%), Person=EMPTY (3252; 100%), Negative=Pos (2982; 92%), Number=Sing (1704; 52%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascNeut
Animacy=Anim|Negative=Neg|Number=Plurnebyli
Animacy=Anim|Negative=Pos|Number=Plurbyli
Animacy=Inan|Negative=Neg|Number=Plurnebyly
Animacy=Inan|Negative=Pos|Number=Plurbyly
Negative=Neg|Number=Singnebylnebylo
Negative=Neg|Number=Plur,Singnebyla
Negative=Pos|Number=Singbylbylo
Negative=Pos|Number=Plur,Singbyla

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (148457; 99%), PROPN –[name]–> PROPN (13445; 99%), PROPN –[nmod]–> NOUN (8340; 89%), VERB –[nsubj]–> PROPN (7264; 53%), ADJ –[conj]–> ADJ (5335; 92%), VERB –[conj]–> VERB (4629; 50%), PROPN –[conj]–> PROPN (4435; 68%), PROPN –[amod]–> ADJ (4177; 82%), ADJ –[nsubj]–> NOUN (4004; 94%), VERB –[auxpass]–> AUX (3125; 51%).


Treebank Statistics (UD_Czech-CAC)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

252494 tokens (51%) have a non-empty value of Gender. 58315 types (93%) occur at least once with a non-empty value of Gender. 25116 lemmas (89%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: cs-pos/NOUN (136143; 28% instances), cs-pos/ADJ (70223; 14% instances), cs-pos/VERB (15966; 3% instances), cs-pos/PRON (10869; 2% instances), cs-pos/PROPN (9803; 2% instances), cs-pos/DET (6978; 1% instances), cs-pos/AUX (1313; 0% instances), cs-pos/NUM (1199; 0% instances).

NOUN

136143 cs-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Negative=Pos (135949; 100%), Number=Sing (95308; 70%), Animacy=EMPTY (79760; 59%).

NOUN tokens may have the following values of Gender:

Paradigm rokMascNeut
Animacy=Inan|Case=Acc|Number=Singrok
Animacy=Inan|Case=Acc|Number=Plurroky
Animacy=Inan|Case=Dat|Number=Singroku
Animacy=Inan|Case=Gen|Number=Singroku, roka
Animacy=Inan|Case=Gen|Number=Plurroků
Animacy=Inan|Case=Ins|Number=Singrokem
Animacy=Inan|Case=Ins|Number=Plurroky
Animacy=Inan|Case=Loc|Number=Singroce
Animacy=Inan|Case=Nom|Number=Singrok
Animacy=Inan|Case=Nom|Number=Plurroky
Case=Gen|Number=Plurlet
Case=Ins|Number=Plurlety
Case=Loc|Number=Plurletech

Gender seems to be lexical feature of NOUN. 100% lemmas (11084) occur only with one value of Gender.

ADJ

70223 cs-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Negative=Pos (67398; 96%), Degree=Pos (63126; 90%), Number=Sing (45755; 65%), Animacy=EMPTY (42160; 60%).

ADJ tokens may have the following values of Gender:

Paradigm známýFem,MascFem,NeutMascFemNeut
Animacy=Anim|Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Anim|Case=Dat|Degree=Pos|Negative=Neg|Number=Singneznámému
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Anim|Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámým
Animacy=Anim|Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámým
Animacy=Anim|Case=Ins|Degree=Sup|Negative=Pos|Number=SingNejznámějším
Animacy=Anim|Case=Loc|Degree=Pos|Negative=Neg|Number=Singneznámém
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Anim|Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Inan|Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznámého
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Pos|Number=Singznámého
Animacy=Inan|Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Inan|Case=Gen|Degree=Sup|Negative=Pos|Number=PlurNejznámějších
Animacy=Inan|Case=Ins|Degree=Pos|Negative=Pos|Number=Plurznámými
Animacy=Inan|Case=Ins|Degree=Sup|Negative=Pos|Number=SingNejznámějším
Animacy=Inan|Case=Loc|Degree=Pos|Negative=Pos|Number=Plurznámých
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámý
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámý
Animacy=Inan|Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznámé
Animacy=Inan|Negative=Neg|Number=Plur|Variant=Shortneznámy
Animacy=Inan|Negative=Pos|Number=Plur|Variant=Shortznámy
Case=Acc|Degree=Pos|Negative=Neg|Number=Singneznámou
Case=Acc|Degree=Pos|Negative=Neg|Number=Plurneznámé
Case=Acc|Degree=Pos|Negative=Pos|Number=Singznámouznámé
Case=Acc|Degree=Pos|Negative=Pos|Number=Plurznámá
Case=Dat|Degree=Pos|Negative=Neg|Number=Singneznámému
Case=Dat|Degree=Pos|Negative=Pos|Number=Singznámé
Case=Dat|Degree=Pos|Negative=Pos|Number=Plurznámým
Case=Gen|Degree=Pos|Negative=Neg|Number=Singneznáméneznámého
Case=Gen|Degree=Pos|Negative=Neg|Number=Plurneznámých
Case=Gen|Degree=Pos|Negative=Pos|Number=Singznáméznámého
Case=Gen|Degree=Pos|Negative=Pos|Number=Plurznámýchznámých
Case=Ins|Degree=Pos|Negative=Neg|Number=Singneznámouneznámým
Case=Ins|Degree=Pos|Negative=Pos|Number=Singznámou
Case=Ins|Degree=Pos|Negative=Pos|Number=Plurznámými
Case=Loc|Degree=Pos|Negative=Pos|Number=Singznáméznámém
Case=Nom|Degree=Pos|Negative=Neg|Number=Singneznámáneznámé
Case=Nom|Degree=Pos|Negative=Neg|Number=Plurneznámé
Case=Nom|Degree=Pos|Negative=Pos|Number=Singznámáznámé
Case=Nom|Degree=Pos|Negative=Pos|Number=Plurznámá
Case=Nom|Degree=Cmp|Negative=Pos|Number=Singznámější
Case=Nom|Degree=Sup|Negative=Pos|Number=SingNejznámějšíNejznámější
Negative=Pos|Number=Sing|Variant=Shortznámznámo
Negative=Pos|Number=Plur,Sing|Variant=Shortznáma

VERB

15966 cs-pos/VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (15966; 100%), Person=EMPTY (15966; 100%), VerbForm=Part (15931; 100%), Negative=Pos (15012; 94%), Voice=Act (11700; 73%), Tense=Past (11666; 73%).

VERB tokens may have the following values of Gender:

Paradigm zajistitFem,MascFem,NeutMascFemNeut
Animacy=Anim|Aspect=Perf|Negative=Pos|Number=Plur|Tense=Past|Voice=Actzajistili
Animacy=Inan|Aspect=Perf|Negative=Neg|Number=Plur|Tense=Past|Voice=Actnezajistily
Animacy=Inan|Negative=Pos|Number=Plur|Voice=Passzajištěny
Aspect=Perf|Negative=Neg|Number=Sing|Tense=Past|Voice=Actnezajistil
Aspect=Perf|Negative=Pos|Number=Sing|Tense=Past|Voice=Actzajistilzajistilo
Aspect=Perf|Negative=Pos|Number=Plur,Sing|Tense=Past|Voice=Actzajistila
Case=Acc|Negative=Pos|Number=Sing|Voice=Passzajištěnu
Negative=Neg|Number=Sing|Voice=Passnezajištěn
Negative=Pos|Number=Sing|Voice=Passzajištěnzajištěno
Negative=Pos|Number=Plur,Sing|Voice=Passzajištěna

PRON

10869 cs-pos/PRON tokens (45% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (10846; 100%), Variant=EMPTY (10557; 97%), Person=EMPTY (9565; 88%), Number=Sing (8206; 75%), Case=Nom (5491; 51%).

PRON tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=SingMoji
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=PlurNáš
Case=Acc|Number=Sing|Number[psor]=Plurnašinaše
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Loc|Number=Sing|Number[psor]=Plurnašem
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Sing|Number[psor]=Singmoje
Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Nom|Number=Plur|Number[psor]=Singmoje

PROPN

9803 cs-pos/PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Negative=Pos (9803; 100%), Abbr=EMPTY (7931; 81%), Number=Sing (7187; 73%).

PROPN tokens may have the following values of Gender:

Paradigm KSČMascFem
Animacy=InanKSČ
KSČ

Gender seems to be lexical feature of PROPN. 99% lemmas (3428) occur only with one value of Gender.

DET

6978 cs-pos/DET tokens (63% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Gender[psor]=EMPTY (6514; 93%), Reflex=EMPTY (5833; 84%), Number=Sing (5772; 83%), Person=EMPTY (5486; 79%), Number[psor]=EMPTY (5486; 79%), Poss=EMPTY (4341; 62%), PronType=Dem (3494; 50%).

DET tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plurnašeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plurnaši
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Singmoji
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plurnáš
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Singmůj
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Acc|Number=Sing|Number[psor]=Plurnašinaše
Case=Acc|Number=Sing|Number[psor]=Singmoumoje
Case=Acc|Number=Plur|Number[psor]=Sing
Case=Dat|Number=Sing|Number[psor]=Plurnašemunaší
Case=Dat|Number=Sing|Number[psor]=Singmému
Case=Gen|Number=Sing|Number[psor]=Plurnašehonaší
Case=Gen|Number=Sing|Number[psor]=Singméhomé, mojí
Case=Ins|Number=Sing|Number[psor]=Plurnašímnaší
Case=Ins|Number=Sing|Number[psor]=Singmýmmou, mojí
Case=Ins|Number=Dual|Number[psor]=Plurnašima
Case=Ins|Number=Dual|Number[psor]=Singmýma
Case=Loc|Number=Sing|Number[psor]=Plurnašemnaší
Case=Loc|Number=Sing|Number[psor]=Singmém
Case=Nom|Number=Sing|Number[psor]=Plurnašenáš
Case=Nom|Number=Sing|Number[psor]=Singmojemůj
Case=Nom|Number=Plur|Number[psor]=Plurnaše
Case=Nom|Number=Plur|Number[psor]=Sing

AUX

1313 cs-pos/AUX tokens (21% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=Part (1313; 100%), Voice=Act (1313; 100%), Tense=Past (1313; 100%), Person=EMPTY (1313; 100%), Mood=EMPTY (1313; 100%), Negative=Pos (1244; 95%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascNeut
Animacy=Anim|Negative=Pos|Number=Plurbyli
Animacy=Inan|Negative=Neg|Number=Plurnebyly
Animacy=Inan|Negative=Pos|Number=Plurbyly
Negative=Neg|Number=Singnebylnebylo
Negative=Neg|Number=Plur,Singnebyla
Negative=Pos|Number=Singbylbylo
Negative=Pos|Number=Plur,Singbyla

NUM

1199 cs-pos/NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1139; 95%), NumForm=Word (1139; 95%), NumValue=1,2,3 (1139; 95%), Number=Sing (798; 67%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Anim|Case=Accjednoho
Animacy=Inan|Case=Accjeden
Case=Accjednujedno
Case=Datjednomujedné
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednomjedné
Case=Nomjedenjednajedno

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (59660; 99%), NOUN –[conj]–> NOUN (7204; 50%), ADJ –[conj]–> ADJ (3444; 95%), ADJ –[nsubj]–> NOUN (1550; 95%), VERB –[conj]–> VERB (1429; 50%), PROPN –[name]–> PROPN (838; 99%), PROPN –[conj]–> PROPN (807; 64%), PROPN –[nmod]–> NOUN (756; 85%), VERB –[nsubj]–> PROPN (738; 54%), NOUN –[appos]–> NOUN (692; 50%).


Treebank Statistics (UD_Czech-CLTT)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

19272 tokens (55%) have a non-empty value of Gender. 3629 types (77%) occur at least once with a non-empty value of Gender. 1647 lemmas (62%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: cs-pos/NOUN (11303; 32% instances), cs-pos/ADJ (6524; 19% instances), cs-pos/PRON (610; 2% instances), cs-pos/VERB (384; 1% instances), cs-pos/DET (354; 1% instances), cs-pos/AUX (51; 0% instances), cs-pos/NUM (46; 0% instances).

NOUN

11303 cs-pos/NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Negative=Pos (11288; 100%), Number=Sing (7969; 71%), Animacy=EMPTY (6755; 60%).

NOUN tokens may have the following values of Gender:

Paradigm rokMascNeut
Animacy=Inan|Case=Acc|Number=Singrok
Animacy=Inan|Case=Gen|Number=Singroku
Animacy=Inan|Case=Ins|Number=Singrokem
Animacy=Inan|Case=Loc|Number=Singroce
Animacy=Inan|Case=Nom|Number=Singrok
Case=Gen|Number=Plurlet
Case=Loc|Number=Plurletech

Gender seems to be lexical feature of NOUN. 100% lemmas (859) occur only with one value of Gender.

ADJ

6524 cs-pos/ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Negative=Pos (6344; 97%), Degree=Pos (6028; 92%), Number=Sing (4114; 63%), Animacy=EMPTY (3902; 60%).

ADJ tokens may have the following values of Gender:

Paradigm známýFem,MascFem,NeutMascFemNeut
Animacy=Inan|Case=Nom|Degree=Pos|Number=Singznámý
Animacy=Inan|Number=Plur|Variant=Shortznámy
Case=Acc|Degree=Pos|Number=Singznámou
Number=Sing|Variant=Shortznámznámo
Number=Plur,Sing|Variant=Shortznáma

PRON

610 cs-pos/PRON tokens (50% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (610; 100%), Variant=EMPTY (609; 100%), PronType=Int,Rel (412; 68%), Number=Sing (400; 66%), Case=Nom (323; 53%).

PRON tokens may have the following values of Gender:

Paradigm kterýMascMasc,NeutFemNeut
Animacy=Inan|Case=Acc|Number=Singkterý
Animacy=Inan|Case=Nom|Number=Plurkteré
Case=Acc|Number=Singkteroukteré
Case=Acc|Number=Plurkterékteré
Case=Dat|Number=Singkterémukteré
Case=Gen|Number=Singkteréhokteré
Case=Ins|Number=Singkterýmkterou
Case=Loc|Number=Singkterémkteré
Case=Nom|Number=Singkterýkterákteré
Case=Nom|Number=Plurkterékterá

VERB

384 cs-pos/VERB tokens (15% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (384; 100%), Person=EMPTY (384; 100%), VerbForm=Part (383; 100%), Negative=Pos (371; 97%), Tense=EMPTY (260; 68%), Voice=Pass (260; 68%).

VERB tokens may have the following values of Gender:

Paradigm uvéstFem,MascFem,NeutMascNeut
Animacy=Inan|Number=Pluruvedeny
Number=Singuvedenuvedeno
Number=Plur,Singuvedena

DET

354 cs-pos/DET tokens (59% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=EMPTY (322; 91%), Gender[psor]=EMPTY (322; 91%), Number[psor]=EMPTY (322; 91%), Number=Sing (306; 86%), Poss=EMPTY (301; 85%), PronType=Dem (292; 82%).

DET tokens may have the following values of Gender:

Paradigm tentoMascMasc,NeutFemNeut
Animacy=Inan|Case=Acc|Number=Singtento
Animacy=Inan|Case=Acc|Number=Plurtyto
Animacy=Inan|Case=Nom|Number=Plurtyto
Case=Acc|Number=Singtutototo
Case=Acc|Number=Plurtyto
Case=Dat|Number=Singtomutotéto
Case=Gen|Number=Singtohototéto
Case=Ins|Number=Singtímtotouto
Case=Loc|Number=Singtomtotéto
Case=Nom|Number=Singtentotatototo
Case=Nom|Number=Plurtyto

AUX

51 cs-pos/AUX tokens (30% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (51; 100%), Person=EMPTY (51; 100%), Tense=Past (51; 100%), Voice=Act (51; 100%), VerbForm=Part (51; 100%), Negative=Pos (41; 80%), Animacy=EMPTY (26; 51%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascNeut
Animacy=Inan|Negative=Neg|Number=Plurnebyly
Animacy=Inan|Negative=Pos|Number=Plurbyly
Negative=Neg|Number=Singnebyl
Negative=Neg|Number=Plur,Singnebyla
Negative=Pos|Number=Singbylbylo
Negative=Pos|Number=Plur,Singbyla

NUM

46 cs-pos/NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (46; 100%), NumValue=1,2,3 (46; 100%), NumForm=Word (46; 100%), Number=Sing (38; 83%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Inan|Case=Accjeden
Case=Accjednujedno
Case=Datjednomu
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednom
Case=Nomjeden

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (5941; 98%), NOUN –[conj]–> NOUN (983; 54%), ADJ –[conj]–> ADJ (177; 86%), NOUN –[appos]–> NOUN (48; 72%), NOUN –[xcomp]–> ADJ (12; 92%), ADJ –[dobj]–> PRON (7; 58%), PRON –[nmod]–> NOUN (6; 75%), ADJ –[nsubj]–> PRON (6; 55%), ADJ –[dep]–> NOUN (6; 100%), ADJ –[amod]–> ADJ (5; 83%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]