This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].
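
To make the notions of combined values and layers concrete, here is a minimal sketch of how a CoNLL-U FEATS string could be split into features, values, and layers. The function names `parse_feats` and `split_layer` and the example string are illustrative assumptions, not part of the tooling that generated this page.

```python
from __future__ import annotations


def parse_feats(feats: str) -> dict[str, list[str]]:
    """Parse a CoNLL-U FEATS string into {feature_name: [values]}.

    Features are separated by '|'; several values of one feature
    are separated by ',' (e.g. Gender=Fem,Neut).
    """
    if feats in ("", "_"):
        return {}
    parsed = {}
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        parsed[name] = value.split(",")
    return parsed


def split_layer(name: str) -> tuple[str, str | None]:
    """Split 'Gender[psor]' into ('Gender', 'psor'); plain names get layer None."""
    if name.endswith("]") and "[" in name:
        base, layer = name[:-1].split("[", 1)
        return base, layer
    return name, None


# Hand-made example string (an assumption for illustration only).
example = parse_feats("Case=Nom|Gender=Fem,Neut|Gender[psor]=Masc|Number=Sing")
layers = {name: split_layer(name) for name in example}
```

In this sketch `Gender` and `Gender[psor]` stay separate keys, so statistics for the default layer are not mixed with those for the possessor layer.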

252252 tokens (51%) have a non-empty value of Gender. 58315 types (93%) occur at least once with a non-empty value of Gender. 25196 lemmas (88%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags; counts are followed by their share of all tokens in the treebank: NOUN (136144; 28%), ADJ (73917; 15%), DET (15353; 3%), VERB (10199; 2%), PROPN (9808; 2%), AUX (2821; 1%), PRON (2811; 1%), NUM (1199; 0%).
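
Counts of this kind can be approximated directly from a CoNLL-U file. The sketch below is one way to do it, assuming that "tokens" means syntactic words (multiword-token ranges and empty nodes are skipped) and that only the default `Gender` layer counts, not `Gender[psor]`. The helper names and the toy rows are assumptions made for illustration.

```python
from collections import Counter


def words(lines):
    """Yield the CoNLL-U columns of each syntactic word, skipping comments,
    blank lines, multiword-token ranges (1-2) and empty nodes (1.1)."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols


def has_feature(feats, name):
    """True if FEATS contains a feature whose name is exactly `name`."""
    return any(pair.split("=", 1)[0] == name for pair in feats.split("|"))


def gender_coverage(lines):
    """Return {UPOS: (words with Gender, all words)}."""
    total, with_gender = Counter(), Counter()
    for cols in words(lines):
        upos, feats = cols[3], cols[5]
        total[upos] += 1
        if has_feature(feats, "Gender"):
            with_gender[upos] += 1
    return {upos: (with_gender[upos], total[upos]) for upos in total}


# Toy rows (hand-made, not taken from the treebank).
SAMPLE_ROWS = [
    ["1", "uvedený", "uvedený", "ADJ", "_",
     "Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing", "2", "amod", "_", "_"],
    ["2", "rok", "rok", "NOUN", "_",
     "Animacy=Inan|Case=Nom|Gender=Masc|Number=Sing", "0", "root", "_", "_"],
    ["3", "a", "a", "CCONJ", "_", "_", "4", "cc", "_", "_"],
    ["4", "jeho", "jeho", "DET", "_",
     "Gender[psor]=Masc,Neut|Poss=Yes", "2", "conj", "_", "_"],
]
sample_lines = ["# toy sentence"] + ["\t".join(r) for r in SAMPLE_ROWS] + [""]
coverage = gender_coverage(sample_lines)
```

On a real treebank, `gender_coverage(open(path, encoding="utf-8"))` would give the per-tag counts; dividing the first number by the second gives percentages like those in the per-tag sections below.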

NOUN

136144 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (95309; 70%), Animacy=EMPTY (79760; 59%).

NOUN tokens may have the following values of Gender:

Paradigm rok (Gender columns in order: Masc; Neut)
Animacy=Inan|Case=Acc|Number=Sing: rok
Animacy=Inan|Case=Acc|Number=Plur: roky
Animacy=Inan|Case=Dat|Number=Sing: roku
Animacy=Inan|Case=Gen|Number=Sing: roku, roka
Animacy=Inan|Case=Gen|Number=Plur: roků
Animacy=Inan|Case=Ins|Number=Sing: rokem
Animacy=Inan|Case=Ins|Number=Plur: roky
Animacy=Inan|Case=Loc|Number=Sing: roce
Animacy=Inan|Case=Nom|Number=Sing: rok
Animacy=Inan|Case=Nom|Number=Plur: roky
Case=Gen|Number=Plur: let
Case=Ins|Number=Plur: lety
Case=Loc|Number=Plur: letech

Gender seems to be a lexical feature of NOUN. 100% of lemmas (11129) occur with only one value of Gender.
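
The test behind this statement can be sketched as follows: collect the set of Gender values seen with each lemma and count the lemmas that have exactly one. The function name and the toy observations are assumptions. Since the paradigm of rok above shows both Masc and Neut forms, the 100% figure is presumably rounded.

```python
from collections import defaultdict


def single_value_share(observations):
    """observations: iterable of (lemma, gender_value) pairs for one UPOS tag.

    Returns (number of lemmas seen with exactly one value, number of lemmas).
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy observations (hand-made): rok occurs with two values, the others with one.
toy = [
    ("rok", "Masc"),
    ("rok", "Neut"),
    ("žena", "Fem"),
    ("žena", "Fem"),
    ("město", "Neut"),
]
single, total = single_value_share(toy)
```

A share close to 100% suggests the feature is a property of the lemma; a low share, as would be expected for ADJ, suggests it is inflectional.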

ADJ

73917 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (71070; 96%), Degree=Pos (67618; 91%), VerbForm=EMPTY (61724; 84%), Voice=EMPTY (61724; 84%), Number=Sing (47115; 64%), Animacy=EMPTY (44952; 61%).

ADJ tokens may have the following values of Gender:

Paradigm uvedený (Gender columns in order: Fem,Masc; Fem,Neut; Masc; Fem; Neut)
Animacy=Anim|Case=Gen|Number=Plur|Polarity=Pos: uvedených
Animacy=Anim|Case=Nom|Number=Sing|Polarity=Pos: uvedený
Animacy=Anim|Case=Nom|Number=Plur|Polarity=Pos: uvedení
Animacy=Anim|Number=Plur|Polarity=Pos|Variant=Short: uvedeni
Animacy=Inan|Case=Acc|Number=Sing|Polarity=Pos: uvedený
Animacy=Inan|Case=Acc|Number=Plur|Polarity=Neg: neuvedené
Animacy=Inan|Case=Acc|Number=Plur|Polarity=Pos: uvedené
Animacy=Inan|Case=Dat|Number=Plur|Polarity=Pos: uvedeným
Animacy=Inan|Case=Gen|Number=Sing|Polarity=Pos: uvedeného
Animacy=Inan|Case=Gen|Number=Plur|Polarity=Pos: uvedených
Animacy=Inan|Case=Ins|Number=Sing|Polarity=Pos: uvedeným
Animacy=Inan|Case=Ins|Number=Plur|Polarity=Pos: uvedenými
Animacy=Inan|Case=Loc|Number=Sing|Polarity=Pos: uvedeném
Animacy=Inan|Case=Loc|Number=Plur|Polarity=Pos: uvedených
Animacy=Inan|Case=Nom|Number=Sing|Polarity=Pos: uvedený
Animacy=Inan|Case=Nom|Number=Plur|Polarity=Pos: uvedené
Animacy=Inan|Number=Plur|Polarity=Pos|Variant=Short: uvedeny
Case=Acc|Number=Sing|Polarity=Pos: uvedenou; uvedené
Case=Acc|Number=Plur|Polarity=Pos: uvedené; Uvedená
Case=Dat|Number=Sing|Polarity=Pos: uvedené; uvedenému
Case=Dat|Number=Plur|Polarity=Pos: uvedeným
Case=Gen|Number=Sing|Polarity=Pos: uvedené; uvedeného
Case=Gen|Number=Plur|Polarity=Pos: uvedených; uvedených
Case=Ins|Number=Sing|Polarity=Pos: uvedenou; uvedeným
Case=Ins|Number=Plur|Polarity=Pos: uvedenými; uvedenými
Case=Loc|Number=Sing|Polarity=Pos: uvedené; uvedeném
Case=Loc|Number=Plur|Polarity=Pos: uvedených; uvedených
Case=Nom|Number=Sing|Polarity=Pos: uvedená; uvedené
Case=Nom|Number=Plur|Polarity=Pos: uvedené; uvedená
Number=Sing|Polarity=Pos|Variant=Short: uveden; uvedeno
Number=Plur,Sing|Polarity=Pos|Variant=Short: uvedena

DET

15353 DET tokens (77% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (13829; 90%), Person=EMPTY (13829; 90%), Animacy=EMPTY (12927; 84%), Poss=EMPTY (12641; 82%), Number=Sing (12219; 80%).

DET tokens may have the following values of Gender:

Paradigm můj (Gender columns in order: Fem,Neut; Masc; Masc,Neut; Fem; Neut)
Animacy=Anim|Case=Acc|Number=Sing|Number[psor]=Plur: našeho
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Sing: moji
Animacy=Anim|Case=Nom|Number=Plur|Number[psor]=Plur: naši
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Sing: můj
Animacy=Inan|Case=Acc|Number=Sing|Number[psor]=Plur: náš
Animacy=Inan|Case=Nom|Number=Plur|Number[psor]=Plur: naše
Case=Acc|Number=Sing|Number[psor]=Sing: mou; moje
Case=Acc|Number=Sing|Number[psor]=Plur: naši; naše
Case=Acc|Number=Plur|Number[psor]=Sing
Case=Dat|Number=Sing|Number[psor]=Sing: mému
Case=Dat|Number=Sing|Number[psor]=Plur: našemu; naší
Case=Gen|Number=Sing|Number[psor]=Sing: mého; mé, mojí
Case=Gen|Number=Sing|Number[psor]=Plur: našeho; naší
Case=Ins|Number=Sing|Number[psor]=Sing: mým; mojí, mou
Case=Ins|Number=Sing|Number[psor]=Plur: naším; naší
Case=Ins|Number=Dual|Number[psor]=Sing: mýma
Case=Ins|Number=Dual|Number[psor]=Plur: našima
Case=Loc|Number=Sing|Number[psor]=Sing: mém
Case=Loc|Number=Sing|Number[psor]=Plur: našem; naší
Case=Nom|Number=Sing|Number[psor]=Sing: moje; můj
Case=Nom|Number=Sing|Number[psor]=Plur: naše; náš
Case=Nom|Number=Plur|Number[psor]=Sing: moje
Case=Nom|Number=Plur|Number[psor]=Plur: naše

VERB

10199 VERB tokens (26% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (10199; 100%), Person=EMPTY (10199; 100%), Voice=Act (10199; 100%), Tense=Past (10166; 100%), VerbForm=Part (10165; 100%), Polarity=Pos (9431; 92%).

VERB tokens may have the following values of Gender:

Paradigm chtít (Gender columns in order: Fem,Masc; Fem,Neut; Masc; Fem; Neut)
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: nechtěli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: chtěli
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: Nechtěly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: chtěly
Aspect=Imp|Number=Sing|Polarity=Neg|Tense=Pres|VerbForm=Conv: nechtíc
Aspect=Imp|Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Conv: chtíc
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Part: nechtěl; nechtělo
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: chtěl; chtělo
Number=Plur,Sing|Polarity=Pos|Tense=Past|VerbForm=Part: chtěla

PROPN

9808 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Abbr=EMPTY (7936; 81%), Number=Sing (7192; 73%).

PROPN tokens may have the following values of Gender:

Paradigm KSČ (Gender columns in order: Masc; Fem)
Animacy=Inan: KSČ
KSČ

Gender seems to be a lexical feature of PROPN. 99% of lemmas (3427) occur with only one value of Gender.

AUX

2821 AUX tokens (18% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (2821; 100%), Mood=EMPTY (2821; 100%), Person=EMPTY (2821; 100%), Voice=Act (2821; 100%), Tense=Past (2820; 100%), VerbForm=Part (2820; 100%), Polarity=Pos (2588; 92%), Number=Sing (1549; 55%).

AUX tokens may have the following values of Gender:

Paradigm být (Gender columns in order: Fem,Masc; Fem,Neut; Masc; Fem; Neut)
Animacy=Anim|Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byl
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: nebyli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: byli
Animacy=Inan|Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byl
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: nebyly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: byly
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Part: nebyl; nebylo
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byl; byla; bylo
Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Conv: jsouc
Number=Plur,Sing|Polarity=Neg|Tense=Past|VerbForm=Part: nebyla
Number=Plur,Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byla
Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: byly; byla

PRON

2811 PRON tokens (18% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (2811; 100%), Variant=EMPTY (2499; 89%), Number=Sing (2067; 74%), Person=EMPTY (1539; 55%), PrepCase=EMPTY (1418; 50%).

PRON tokens may have the following values of Gender:

Paradigm on (Gender columns in order: Masc; Masc,Neut; Fem; Neut)
Animacy=Anim|Case=Nom|Number=Plur: oni
Case=Acc|Number=Sing|PrepCase=Npr: jeho; jej; ji; je
Case=Acc|Number=Sing|PrepCase=Pre: něj, něho; ni
Case=Acc|Number=Sing|Variant=Short: ho
Case=Dat|Number=Sing|PrepCase=Npr: jemu
Case=Dat|Number=Sing|PrepCase=Pre: němu
Case=Dat|Number=Sing|Variant=Short: mu
Case=Gen|Number=Sing|PrepCase=Npr: jeho; jej
Case=Gen|Number=Sing|PrepCase=Pre: něho, něj
Case=Ins|Number=Sing|PrepCase=Npr: jím
Case=Ins|Number=Sing|PrepCase=Pre: ním
Case=Loc|Number=Sing|PrepCase=Pre: něm
Case=Nom|Number=Sing: on; ona; ono
Case=Nom|Number=Plur: ony

NUM

1199 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (1139; 95%), NumType=Card (1139; 95%), Number=Sing (798; 67%).

NUM tokens may have the following values of Gender:

Paradigm jeden (Gender columns in order: Masc; Masc,Neut; Fem; Neut)
Animacy=Anim|Case=Acc: jednoho
Animacy=Inan|Case=Acc: jeden
Case=Acc: jednu; jedno
Case=Dat: jednomu; jedné
Case=Gen: jednoho; jedné
Case=Ins: jedním; jednou
Case=Loc: jednom; jedné
Case=Nom: jeden; jedna; jedno

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (59362; 99%), NOUN –[conj]–> NOUN (7222; 50%), ADJ –[conj]–> ADJ (3669; 92%), ADJ –[nsubj]–> NOUN (1920; 77%), VERB –[conj]–> VERB (1077; 61%), NOUN –[flat]–> PROPN (855; 100%), PROPN –[conj]–> PROPN (763; 64%), NOUN –[appos]–> NOUN (750; 53%), PROPN –[flat]–> PROPN (613; 99%), PROPN –[amod]–> ADJ (534; 85%).
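
Agreement figures like these can be sketched as a count over dependency edges. The sketch below assumes that only edges where both parent and child carry Gender are considered, and that agreement means identical value strings. The page does not state either definition, so both are assumptions, as are the function name and the toy sentence.

```python
from collections import Counter


def agreement_stats(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, gender
    (gender is None when the word has no Gender feature).

    Returns {(parent UPOS, deprel, child UPOS): (agreeing edges, edges with
    Gender on both ends)}.
    """
    by_id = {word["id"]: word for word in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # head 0 (root) has no parent word
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Toy sentence (hand-made): an agreeing amod edge and a non-agreeing conj edge.
toy_sentence = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "gender": "Masc"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Masc"},
    {"id": 3, "head": 2, "upos": "NOUN", "deprel": "conj", "gender": "Fem"},
    {"id": 4, "head": 3, "upos": "CCONJ", "deprel": "cc", "gender": None},
]
stats = agreement_stats(toy_sentence)
```

Summing these per-sentence counters over a treebank and sorting by the number of agreeing edges would give a ranking of the same shape as the list above.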