
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

19272 tokens (54%) have a non-empty value of Gender. 3629 types (77%) occur at least once with a non-empty value of Gender. 1610 lemmas (60%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (11303; 32% instances), ADJ (6766; 19% instances), DET (892; 3% instances), VERB (116; 0% instances), PRON (90; 0% instances), AUX (59; 0% instances), NUM (46; 0% instances).
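Counts of this kind can be recomputed from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout (UPOS in column 4, FEATS in column 6), and the function names and the three-token sample are hypothetical illustrations rather than data from UD_Czech-CLTT.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with a Gender value per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

# Hypothetical sample sentence, for illustration only.
SAMPLE = "\n".join([
    "# text = uvedený rok .",
    "1\tuvedený\tuvedený\tADJ\t_\tAnimacy=Inan|Case=Nom|Degree=Pos|Gender=Masc|Number=Sing|Polarity=Pos\t2\tamod\t_\t_",
    "2\trok\trok\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in total:
    print(upos, with_gender[upos], total[upos])
```

Dividing the per-tag count by the tag's total gives figures comparable to the "% of all NOUN tokens" statements in the per-tag sections below.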

NOUN

11303 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Polarity=Pos (11288; 100%), Number=Sing (7969; 71%), Animacy=EMPTY (6755; 60%).

NOUN tokens may have the following values of Gender:

Paradigm rok (Gender values: Masc; Neut)
Animacy=Inan|Case=Acc|Number=Sing: rok
Animacy=Inan|Case=Gen|Number=Sing: roku
Animacy=Inan|Case=Ins|Number=Sing: rokem
Animacy=Inan|Case=Loc|Number=Sing: roce
Animacy=Inan|Case=Nom|Number=Sing: rok
Case=Gen|Number=Plur: let
Case=Loc|Number=Plur: letech

Gender seems to be a lexical feature of NOUN: 100% of lemmas (859) occur with only one value of Gender.
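The lexicality check above amounts to grouping Gender values by lemma and counting the lemmas that show exactly one value. A minimal sketch under that assumption follows; the function name and the toy (lemma, Gender) pairs are hypothetical and are not drawn from the treebank counts.

```python
from collections import defaultdict

def single_gender_share(lemma_gender_pairs):
    """Return (lemmas seen with exactly one Gender value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, gender in lemma_gender_pairs:
        values[lemma].add(gender)
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values)

# Toy input for illustration only.
pairs = [
    ("rok", "Masc"), ("rok", "Neut"),      # one lemma, two Gender values
    ("zákon", "Masc"), ("zákon", "Masc"),  # repeated value counts once
    ("osoba", "Fem"),
]
single, total_lemmas = single_gender_share(pairs)
print(f"{single} of {total_lemmas} lemmas occur with only one value of Gender")
```

A percentage rounded to whole numbers can still read 100% when a handful of lemmas, such as rok in the paradigm above, occur with more than one value.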

ADJ

6766 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (6585; 97%), Degree=Pos (6010; 89%), Number=Sing (4201; 62%), Animacy=EMPTY (4043; 60%).

ADJ tokens may have the following values of Gender:

Paradigm uvedený (Gender values: Fem,Masc; Fem,Neut; Masc; Fem; Neut)
Animacy=Anim|Case=Acc|Degree=Pos|Number=Plur|Polarity=Neg: neuvedené
Animacy=Inan|Case=Acc|Degree=Pos|Number=Sing|Polarity=Pos: uvedený
Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|Polarity=Pos: uvedené
Animacy=Inan|Case=Dat|Degree=Pos|Number=Sing|Polarity=Pos: uvedenému
Animacy=Inan|Case=Gen|Degree=Pos|Number=Sing|Polarity=Pos: uvedeného
Animacy=Inan|Case=Gen|Degree=Pos|Number=Plur|Polarity=Pos: uvedených
Animacy=Inan|Case=Ins|Degree=Pos|Number=Sing|Polarity=Pos: uvedeným
Animacy=Inan|Case=Loc|Degree=Pos|Number=Sing|Polarity=Pos: uvedeném
Animacy=Inan|Case=Loc|Degree=Pos|Number=Plur|Polarity=Pos: uvedených
Animacy=Inan|Case=Nom|Degree=Pos|Number=Sing|Polarity=Pos: uvedený
Animacy=Inan|Case=Nom|Degree=Pos|Number=Plur|Polarity=Pos: uvedené
Animacy=Inan|Number=Plur|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Pass: uvedeny
Case=Acc|Degree=Pos|Number=Sing|Polarity=Pos: uvedené
Case=Acc|Degree=Pos|Number=Plur|Polarity=Pos: uvedené, uvedená
Case=Dat|Degree=Pos|Number=Sing|Polarity=Pos: uvedené
Case=Dat|Degree=Pos|Number=Plur|Polarity=Pos: uvedeným
Case=Gen|Degree=Pos|Number=Sing|Polarity=Neg: neuvedené
Case=Gen|Degree=Pos|Number=Sing|Polarity=Pos: uvedené
Case=Gen|Degree=Pos|Number=Plur|Polarity=Pos: uvedených, uvedených
Case=Ins|Degree=Pos|Number=Sing|Polarity=Pos: uvedenou
Case=Ins|Degree=Pos|Number=Plur|Polarity=Pos: uvedenými
Case=Loc|Degree=Pos|Number=Sing|Polarity=Pos: uvedené
Case=Loc|Degree=Pos|Number=Plur|Polarity=Pos: uvedených
Case=Nom|Degree=Pos|Number=Sing|Polarity=Neg: neuvedená
Case=Nom|Degree=Pos|Number=Sing|Polarity=Pos: uvedená
Case=Nom|Degree=Pos|Number=Plur|Polarity=Pos: uvedené, uvedená
Number=Sing|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Pass: uveden, uvedeno
Number=Plur,Sing|Polarity=Pos|Variant=Short|VerbForm=Part|Voice=Pass: uvedena

DET

892 DET tokens (75% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (860; 96%), Person=EMPTY (860; 96%), Poss=EMPTY (839; 94%), Number=Sing (639; 72%).

DET tokens may have the following values of Gender:

Paradigm který (Gender values: Masc; Masc,Neut; Fem; Neut)
Animacy=Inan|Case=Acc|Number=Sing: který
Animacy=Inan|Case=Nom|Number=Plur: které
Case=Acc|Number=Sing: kterou, které
Case=Acc|Number=Plur: které, které
Case=Dat|Number=Sing: kterému, které
Case=Gen|Number=Sing: kterého, které
Case=Ins|Number=Sing: kterým, kterou
Case=Loc|Number=Sing: kterém, které
Case=Nom|Number=Sing: který, která, které
Case=Nom|Number=Plur: které, která

VERB

116 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (116; 100%), Person=EMPTY (116; 100%), Voice=Act (116; 100%), Tense=Past (115; 99%), VerbForm=Part (115; 99%), Polarity=Pos (106; 91%).

VERB tokens may have the following values of Gender:

Paradigm moci (Gender values: Fem,Neut; Masc; Neut)
Number=Sing: mohl, mohlo
Number=Plur,Sing: mohla

PRON

90 PRON tokens (14% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (90; 100%), Variant=EMPTY (89; 99%), Number=Sing (85; 94%), Person=EMPTY (54; 60%), PronType=Rel (46; 51%).

PRON tokens may have the following values of Gender:

Paradigm veškerý (Gender values: Masc; Masc,Neut; Fem; Neut)
Animacy=Inan|Case=Nom|Number=Plur: veškeré
Case=Acc|Number=Sing: veškeré
Case=Acc|Number=Plur: veškeré, veškeré
Case=Gen|Number=Sing: veškerého
Case=Nom|Number=Plur: veškeré

AUX

59 AUX tokens (10% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (59; 100%), Person=EMPTY (59; 100%), Tense=Past (59; 100%), VerbForm=Part (59; 100%), Voice=Act (59; 100%), Polarity=Pos (47; 80%).

AUX tokens may have the following values of Gender:

Paradigm být (Gender values: Fem,Masc; Fem,Neut; Masc; Neut)
Animacy=Inan|Number=Plur|Polarity=Neg: nebyly
Animacy=Inan|Number=Plur|Polarity=Pos: byly
Number=Sing|Polarity=Neg: nebyl
Number=Sing|Polarity=Pos: byl, bylo
Number=Plur,Sing|Polarity=Neg: nebyla
Number=Plur,Sing|Polarity=Pos: byla

NUM

46 NUM tokens (10% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (46; 100%), NumType=Card (46; 100%), NumValue=1,2,3 (46; 100%), Number=Sing (38; 83%).

NUM tokens may have the following values of Gender:

Paradigm jeden (Gender values: Masc; Masc,Neut; Fem; Neut)
Animacy=Inan|Case=Acc: jeden
Case=Acc: jednu, jedno
Case=Dat: jednomu
Case=Gen: jednoho, jedné
Case=Ins: jedním, jednou
Case=Loc: jednom
Case=Nom: jeden

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (5933; 97%), NOUN –[conj]–> NOUN (979; 54%), ADJ –[conj]–> ADJ (190; 83%), NOUN –[appos]–> NOUN (48; 72%), NOUN –[xcomp]–> ADJ (15; 94%), VERB –[conj]–> VERB (11; 52%), DET –[nmod]–> NOUN (9; 82%), ADJ –[dep]–> NOUN (6; 75%), ADJ –[obj]–> PRON (6; 55%), ADJ –[amod]–> ADJ (5; 83%).
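An agreement table of this shape can be approximated by comparing each node's Gender with that of its parent. The sketch below is one possible reading, not the procedure used to generate this page: it assumes the percentage is taken over parent-child pairs in which both nodes carry a Gender value, it compares the values as plain strings (so Fem,Masc does not match Fem), and its function names and sample sentence are hypothetical.

```python
from collections import Counter

def gender_of(feats):
    """Return the value of the plain Gender feature, ignoring Gender[psor]."""
    for item in feats.split("|"):
        if item.startswith("Gender="):
            return item.split("=", 1)[1]
    return None

def agreement_by_relation(sentence_lines):
    """Count (parent UPOS, deprel, child UPOS) triples for one sentence.

    Returns (agree, both): pairs whose Gender values match, and all pairs
    where parent and child both have a Gender value.
    """
    rows = [l.split("\t") for l in sentence_lines if l and not l.startswith("#")]
    nodes = {cols[0]: cols for cols in rows if cols[0].isdigit()}
    agree, both = Counter(), Counter()
    for cols in nodes.values():
        parent = nodes.get(cols[6])  # HEAD "0" (root) has no parent node
        if parent is None:
            continue
        child_g, parent_g = gender_of(cols[5]), gender_of(parent[5])
        if child_g is None or parent_g is None:
            continue
        key = (parent[3], cols[7], cols[3])
        both[key] += 1
        if child_g == parent_g:
            agree[key] += 1
    return agree, both

# Hypothetical sentence, for illustration only.
SENTENCE = [
    "# text = uvedená osoba a zákon",
    "1\tuvedená\tuvedený\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Fem|Number=Sing|Polarity=Pos\t2\tamod\t_\t_",
    "2\tosoba\tosoba\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing|Polarity=Pos\t0\troot\t_\t_",
    "3\ta\ta\tCCONJ\t_\t_\t4\tcc\t_\t_",
    "4\tzákon\tzákon\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Masc|Number=Sing|Polarity=Pos\t2\tconj\t_\t_",
]

agree, both = agreement_by_relation(SENTENCE)
for (parent, rel, child), n in both.items():
    print(f"{parent} -[{rel}]-> {child}: {agree[(parent, rel, child)]} of {n} agree")
```

Summing the two counters over every sentence of a treebank and sorting by the agreement count would give a ranking comparable to the list above, under the stated assumptions.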