home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDTC: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

1583528 tokens (46%) have a non-empty value of Gender. 170910 types (92%) occur at least once with a non-empty value of Gender. 65307 lemmas (78%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (760311; 22% instances), ADJ (358829; 10% instances), VERB (144213; 4% instances), DET (129582; 4% instances), PROPN (121319; 4% instances), AUX (33583; 1% instances), NUM (21697; 1% instances), PRON (13980; 0% instances), SYM (14; 0% instances).

NOUN

760311 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (529889; 70%), Animacy=EMPTY (422453; 56%).

NOUN tokens may have the following values of Gender:

Paradigm imageMascFemNeut
_imageimage
Animacy=Inanimage
Animacy=Inan|Case=Acc|Number=Singimage
Animacy=Inan|Case=Gen|Number=Singimage
Animacy=Inan|Case=Nom|Number=Singimage
Animacy=Inan|Case=Nom|Number=Plurimage
Case=Gen|Number=Singimage
Case=Loc|Number=Singimagi
Case=Nom|Number=Singimage
Case=Nom|Number=PlurImage

Gender seems to be lexical feature of NOUN. 100% lemmas (21527) occur only with one value of Gender.

ADJ

358829 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (335939; 94%), Degree=Pos (319362; 89%), VerbForm=EMPTY (303217; 85%), Voice=EMPTY (303217; 85%), Number=Sing (235891; 66%), Animacy=EMPTY (207172; 58%).

ADJ tokens may have the following values of Gender:

Paradigm spojenýFem,MascFem,NeutMascFemNeut
Animacy=Anim|Aspect=Perf|Number=Plur|Variant=Shortspojeni
Animacy=Anim|Case=Acc|Number=Plurspojené
Animacy=Anim|Case=Dat|Number=Plurspojeným
Animacy=Anim|Case=Gen|Number=Plurspojených
Animacy=Anim|Case=Ins|Number=Singspojeným
Animacy=Anim|Case=Nom|Number=Singspojený
Animacy=Anim|Case=Nom|Number=Plurspojení
Animacy=Inan|Aspect=Perf|Number=Plur|Variant=Shortspojeny
Animacy=Inan|Case=Acc|Number=Singspojený
Animacy=Inan|Case=Acc|Number=Plurspojené
Animacy=Inan|Case=Dat|Number=Singspojenému
Animacy=Inan|Case=Dat|Number=Plurspojeným
Animacy=Inan|Case=Gen|Number=Singspojeného
Animacy=Inan|Case=Gen|Number=Plurspojených
Animacy=Inan|Case=Ins|Number=Plurspojenými
Animacy=Inan|Case=Loc|Number=SingSpojeném
Animacy=Inan|Case=Loc|Number=Plurspojených
Animacy=Inan|Case=Nom|Number=Singspojený
Animacy=Inan|Case=Nom|Number=Plurspojené
Aspect=Perf|Number=Sing|Variant=Shortspojenspojeno
Aspect=Perf|Number=Plur,Sing|Variant=Shortspojena
Case=Acc|Number=Singspojenouspojené
Case=Acc|Number=Plurspojené
Case=Dat|Number=Singspojenéspojenému
Case=Dat|Number=Plurspojenýmspojeným
Case=Gen|Number=Singspojenéspojeného
Case=Gen|Number=Plurspojenýchspojených
Case=Ins|Number=Singspojenouspojeným
Case=Ins|Number=Plurspojenými
Case=Loc|Number=Singspojeném
Case=Loc|Number=Plurspojenýchspojených
Case=Nom|Number=Singspojenáspojené
Case=Nom|Number=Plur|Style=Collspojené
Case=Nom|Number=Plurspojenéspojená

VERB

144213 VERB tokens (45% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (144213; 100%), Voice=Act (144213; 100%), Person=EMPTY (144212; 100%), Tense=Past (144052; 100%), VerbForm=Part (144050; 100%), Polarity=Pos (133565; 93%), Animacy=EMPTY (102241; 71%), Aspect=Perf (82832; 57%).

VERB tokens may have the following values of Gender:

Paradigm chtítFem,MascFem,NeutMascFemNeut
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnechtěli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partchtěli
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnechtěly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partchtěly
ExtPos=ADV|Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Convchtěchtíc
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Partnechtělnechtělo
Number=Sing|Polarity=Neg|Tense=Pres|VerbForm=Convnechtěnechtíc
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Partchtělchtělo
Number=Plur,Sing|Polarity=Neg|Tense=Past|VerbForm=Partnechtěla
Number=Plur,Sing|Polarity=Pos|Tense=Past|VerbForm=Partchtěla

DET

129582 DET tokens (84% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (119330; 92%), Person=EMPTY (119330; 92%), Animacy=EMPTY (111516; 86%), Poss=EMPTY (110856; 86%), Number=Sing (105808; 82%), PronType=Dem (67649; 52%), Case=Nom (65716; 51%).

DET tokens may have the following values of Gender:

Paradigm můjFem,NeutMascMasc,NeutFemNeut
Animacy=Anim|Case=Acc|Number=Singmého
Animacy=Anim|Case=Nom|Number=Plurmoji, mí
Animacy=Anim|Case=Voc|Number=Plur
Animacy=Inan|Case=Acc|Number=Singmůj
Animacy=Inan|Case=Nom|Number=Plurmoje, mé
Case=Acc|Number=Singmoji, moumoje, mé
Case=Acc|Number=Sing|Style=Collmojí
Case=Acc|Number=Plur
Case=Dat|Number=Singmémumojí, mé
Case=Gen|Number=Singméhomé, mojí
Case=Gen|Number=Sing|Style=Collmýho
Case=Ins|Number=Singmýmmojí, mou
Case=Ins|Number=Dualmýma
Case=Loc|Number=Singmémmé, mojí
Case=Loc|Number=Sing|Style=Collmým
Case=Nom|Number=Singmojemůj
Case=Nom|Number=Sing|Style=Collmuj
Case=Nom|Number=Plurmoje
Case=Nom|Number=Plur|Style=Coll
Case=Voc|Number=Singmojemůj

PROPN

121319 PROPN tokens (93% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (109966; 91%), Case=Nom (65848; 54%), NameType=Giv (62169; 51%).

PROPN tokens may have the following values of Gender:

Paradigm AntonioMascFemNeut
Animacy=Anim|Case=Gen|NameType=Giv|Number=SingAntonia
Animacy=Anim|Case=Ins|NameType=Giv|Number=SingAntoniem
Animacy=Anim|Case=Nom|NameType=Giv|Number=SingAntonio
Case=Acc|NameType=Geo|Number=SingAntonio
Case=Gen|NameType=Geo|Number=SingAntonia
Case=Loc|NameType=Geo|Number=SingAntoniu
NameType=GivAntonio

Gender seems to be lexical feature of PROPN. 99% lemmas (20426) occur only with one value of Gender.

AUX

33583 AUX tokens (23% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (33583; 100%), Mood=EMPTY (33583; 100%), Person=EMPTY (33583; 100%), Voice=Act (33583; 100%), Tense=Past (33577; 100%), VerbForm=Part (33577; 100%), Polarity=Pos (30603; 91%), Number=Sing (18156; 54%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascFemNeut
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnebyli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partbyli
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnebyly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partbyly
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Partnebylnebylo
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Partbylbylo
Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Convjsajsouc
Number=Plur,Sing|Polarity=Neg|Tense=Past|VerbForm=Partnebyla
Number=Plur,Sing|Polarity=Pos|Tense=Past|VerbForm=Partbyla
Number=Plur|Polarity=Pos|Style=Coll|Tense=Past|VerbForm=Partbyly

NUM

21697 NUM tokens (21% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (21620; 100%), NumForm=EMPTY (11252; 52%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Anim|Case=Accjednoho
Animacy=Inan|Case=Accjeden
Case=Accjednu, jednajedno
Case=Datjednomujedné
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednomjedné
Case=Nomjedenjednajedno

PRON

13980 PRON tokens (12% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (13980; 100%), Animacy=EMPTY (13454; 96%), Number=Sing (13236; 95%), Person=3 (11338; 81%), PronType=Prs (11338; 81%), Variant=EMPTY (10559; 76%).

PRON tokens may have the following values of Gender:

Paradigm onMascMasc,NeutFemNeut
Animacy=Anim|Case=Nom|Number=Pluroni
Animacy=Inan|Case=Nom|Number=Plurony
Case=Acc|Number=Sing|PrepCase=Nprjehojejjije
Case=Acc|Number=Sing|PrepCase=Npr|Style=Coll
Case=Acc|Number=Sing|PrepCase=Preněj, něhoni
Case=Acc|Number=Sing|PrepCase=Pre|Style=Coll
Case=Acc|Number=Sing|Variant=Shortho
Case=Dat|Number=Sing|PrepCase=Nprjemu
Case=Dat|Number=Sing|PrepCase=Preněmu
Case=Dat|Number=Sing|Variant=Shortmu
Case=Gen|Number=Sing|PrepCase=Nprjehojej
Case=Gen|Number=Sing|PrepCase=Preněj, něho
Case=Gen|Number=Sing|Variant=Shortho
Case=Ins|Number=Sing|PrepCase=Nprjím
Case=Ins|Number=Sing|PrepCase=Prením, nim
Case=Loc|Number=Sing|PrepCase=Preněm
Case=Nom|Number=Singononaono
Case=Nom|Number=Sing|Style=Collvonvona
Case=Nom|Number=Plurony

SYM

14 SYM tokens (0% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (285828; 99%), NOUN –[det]–> DET (41255; 58%), NOUN –[flat]–> PROPN (17268; 100%), PROPN –[flat]–> PROPN (13388; 99%), VERB –[conj]–> VERB (12658; 61%), ADJ –[conj]–> ADJ (9590; 94%), ADJ –[nsubj]–> NOUN (8771; 93%), PROPN –[amod]–> ADJ (5692; 92%), PROPN –[conj]–> PROPN (5000; 64%), ADJ –[nsubj]–> DET (4203; 96%).