home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

160378 tokens (48%) have a non-empty value of Gender. 47347 types (89%) occur at least once with a non-empty value of Gender. 21607 lemmas (80%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (79711; 24% instances), ADJ (40442; 12% instances), PROPN (14282; 4% instances), VERB (10879; 3% instances), DET (10231; 3% instances), AUX (1745; 1% instances), NUM (1691; 1% instances), PRON (1397; 0% instances).

NOUN

79711 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (56889; 71%), Animacy=EMPTY (44880; 56%).

NOUN tokens may have the following values of Gender:

Paradigm imageMascFemNeut
_image
Animacy=Inanimage
Animacy=Inan|Case=Acc|Number=Singimage
Animacy=Inan|Case=Gen|Number=Singimage
Animacy=Inan|Case=Nom|Number=Singimage
Animacy=Inan|Case=Nom|Number=Plurimage
Case=Gen|Number=Singimage

Gender seems to be lexical feature of NOUN. 100% lemmas (8542) occur only with one value of Gender.

ADJ

40442 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (37751; 93%), Degree=Pos (36176; 89%), VerbForm=EMPTY (34523; 85%), Voice=EMPTY (34523; 85%), Number=Sing (27654; 68%), Animacy=EMPTY (24045; 59%).

ADJ tokens may have the following values of Gender:

Paradigm známýFem,MascFem,NeutMascFemNeut
Animacy=Anim|Case=Acc|Degree=Pos|Number=Plur|Polarity=Negneznámé
Animacy=Anim|Case=Acc|Degree=Sup|Number=Plur|Polarity=Posnejznámější
Animacy=Anim|Case=Dat|Degree=Pos|Number=Plur|Polarity=Posznámým
Animacy=Anim|Case=Gen|Degree=Pos|Number=Sing|Polarity=Negneznámého
Animacy=Anim|Case=Gen|Degree=Pos|Number=Sing|Polarity=Posznámého
Animacy=Anim|Case=Gen|Degree=Sup|Number=Plur|Polarity=Posnejznámějších
Animacy=Anim|Case=Ins|Degree=Pos|Number=Sing|Polarity=Posznámým
Animacy=Anim|Case=Ins|Degree=Sup|Number=Sing|Polarity=Posnejznámějším
Animacy=Anim|Case=Nom|Degree=Pos|Number=Sing|Polarity=Negneznámý
Animacy=Anim|Case=Nom|Degree=Pos|Number=Sing|Polarity=Posznámý
Animacy=Anim|Case=Nom|Degree=Pos|Number=Plur|Polarity=Negneznámí
Animacy=Anim|Case=Nom|Degree=Pos|Number=Plur|Polarity=Posznámí
Animacy=Anim|Case=Nom|Degree=Sup|Number=Sing|Polarity=Posnejznámější
Animacy=Inan|Case=Acc|Degree=Pos|Number=Sing|Polarity=Posznámý
Animacy=Inan|Case=Acc|Degree=Pos|Number=Plur|Polarity=Posznámé
Animacy=Inan|Case=Acc|Degree=Sup|Number=Plur|Polarity=Posnejznámější
Animacy=Inan|Case=Dat|Degree=Pos|Number=Plur|Polarity=Posznámým
Animacy=Inan|Case=Gen|Degree=Pos|Number=Sing|Polarity=Negneznámého
Animacy=Inan|Case=Gen|Degree=Pos|Number=Sing|Polarity=Posznámého
Animacy=Inan|Case=Gen|Degree=Pos|Number=Plur|Polarity=Negneznámých
Animacy=Inan|Case=Gen|Degree=Pos|Number=Plur|Polarity=Posznámých
Animacy=Inan|Case=Ins|Degree=Pos|Number=Sing|Polarity=Posznámým
Animacy=Inan|Case=Ins|Degree=Pos|Number=Plur|Polarity=Negneznámými
Animacy=Inan|Case=Ins|Degree=Sup|Number=Sing|Polarity=PosNejznámějším
Animacy=Inan|Case=Nom|Degree=Pos|Number=Sing|Polarity=Negneznámý
Animacy=Inan|Case=Nom|Degree=Pos|Number=Sing|Polarity=Posznámý
Animacy=Inan|Case=Nom|Degree=Pos|Number=Plur|Polarity=Posznámé
Animacy=Inan|Degree=Pos|Number=Plur|Polarity=Pos|Variant=Shortznámy
Case=Acc|Degree=Pos|Number=Sing|Polarity=Negneznámé
Case=Acc|Degree=Pos|Number=Sing|Polarity=Posznámouznámé
Case=Acc|Degree=Pos|Number=Plur|Polarity=Negneznámá
Case=Gen|Degree=Pos|Number=Sing|Polarity=Posznáméznámého
Case=Gen|Degree=Pos|Number=Plur|Polarity=Negneznámých
Case=Gen|Degree=Pos|Number=Plur|Polarity=Posznámýchznámých
Case=Ins|Degree=Pos|Number=Sing|Polarity=Negneznámým
Case=Ins|Degree=Pos|Number=Sing|Polarity=Posznámouznámým
Case=Loc|Degree=Pos|Number=Sing|Polarity=Posznámé
Case=Nom|Degree=Pos|Number=Sing|Polarity=Negneznámá
Case=Nom|Degree=Pos|Number=Sing|Polarity=Posznámáznámé
Case=Nom|Degree=Pos|Number=Plur|Polarity=Posznámé
Case=Voc|Degree=Pos|Number=Sing|Polarity=Negneznámá
Degree=Pos|Number=Sing|Polarity=Neg|Variant=Shortneznámo
Degree=Pos|Number=Sing|Polarity=Pos|Variant=Shortznámznámo
Degree=Pos|Number=Plur,Sing|Polarity=Pos|Variant=Shortznáma

PROPN

14282 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (13115; 92%), NameType=Giv (7723; 54%), Case=Nom (7700; 54%), Animacy=Anim (7316; 51%).

PROPN tokens may have the following values of Gender:

Paradigm RusMascFem
Animacy=Anim|Case=Acc|NameType=Nat|Number=SingRusa
Animacy=Anim|Case=Gen|NameType=Nat|Number=PlurRusů
Animacy=Anim|Case=Ins|NameType=Nat|Number=SingRusem
Animacy=Anim|Case=Loc|NameType=Nat|Number=SingRusu
Animacy=Anim|Case=Nom|NameType=Nat|Number=SingRus
Animacy=Anim|Case=Nom|NameType=Nat|Number=PlurRusové
Case=Gen|NameType=Geo|Number=SingRusi

Gender seems to be lexical feature of PROPN. 100% lemmas (4506) occur only with one value of Gender.

VERB

10879 VERB tokens (37% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (10879; 100%), Person=EMPTY (10879; 100%), Voice=Act (10879; 100%), Tense=Past (10862; 100%), VerbForm=Part (10862; 100%), Polarity=Pos (9997; 92%), Aspect=Perf (6766; 62%), Number=Sing (5916; 54%).

VERB tokens may have the following values of Gender:

Paradigm mítFem,MascFem,NeutMascNeut
Animacy=Anim|Number=Plur|Polarity=Negneměli
Animacy=Anim|Number=Plur|Polarity=Posměli
Animacy=Inan|Number=Plur|Polarity=Negneměly
Animacy=Inan|Number=Plur|Polarity=Posměly
Number=Sing|Polarity=Negnemělnemělo
Number=Sing|Polarity=Posmělmělo
Number=Plur,Sing|Polarity=Negneměla
Number=Plur,Sing|Polarity=Posměla

DET

10231 DET tokens (80% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (9390; 92%), Person=EMPTY (9390; 92%), Animacy=EMPTY (8662; 85%), Poss=EMPTY (8453; 83%), Number=Sing (8213; 80%).

DET tokens may have the following values of Gender:

Paradigm nášFem,NeutMascMasc,NeutFemNeut
Abbr=Yes|Case=Ins|Number=Singn
Animacy=Anim|Case=Acc|Number=Singnašeho
Animacy=Anim|Case=Nom|Number=Plurnaši
Animacy=Inan|Case=Acc|Number=Singnáš
Animacy=Inan|Case=Nom|Number=Plurnaše
Case=Acc|Number=Singnašinaše
Case=Dat|Number=Singnašemunaší
Case=Gen|Number=Singnašehonaší
Case=Ins|Number=Singnašímnaší
Case=Ins|Number=Sing|Style=Collnašim
Case=Loc|Number=Singnašemnaší
Case=Nom|Number=Singnašenáš
Case=Nom|Number=Plurnaše

AUX

1745 AUX tokens (16% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (1745; 100%), Mood=EMPTY (1745; 100%), Person=EMPTY (1745; 100%), Voice=Act (1745; 100%), Tense=Past (1744; 100%), VerbForm=Part (1744; 100%), Polarity=Pos (1555; 89%), Number=Sing (1007; 58%).

AUX tokens may have the following values of Gender:

Paradigm býtFem,MascFem,NeutMascNeut
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnebyli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partbyli
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Partnebyly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Partbyly
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Partnebylnebylo
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Partbylbylo
Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Convjsa
Number=Plur,Sing|Polarity=Neg|Tense=Past|VerbForm=Partnebyla
Number=Plur,Sing|Polarity=Pos|Tense=Past|VerbForm=Partbyla

NUM

1691 NUM tokens (18% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1685; 100%), NumForm=Word (965; 57%).

NUM tokens may have the following values of Gender:

Paradigm jedenMascMasc,NeutFemNeut
Animacy=Anim|Case=Accjednoho
Animacy=Inan|Case=Accjeden
Case=Accjednujedno
Case=Datjednomujedné
Case=Genjednohojedné
Case=Insjednímjednou
Case=Locjednomjedné
Case=Nomjedenjednajedno

PRON

1397 PRON tokens (14% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1397; 100%), Number=Sing (1346; 96%), Variant=EMPTY (1125; 81%), Person=3 (1042; 75%), PronType=Prs (1042; 75%).

PRON tokens may have the following values of Gender:

Paradigm onMascMasc,NeutFemNeut
Animacy=Anim|Case=Nom|Number=Pluroni
Case=Acc|Number=Sing|PrepCase=Nprjejjije
Case=Acc|Number=Sing|PrepCase=Preněj, něhoni
Case=Acc|Number=Sing|Variant=Shortho
Case=Dat|Number=Sing|PrepCase=Nprjemu
Case=Dat|Number=Sing|PrepCase=Preněmu
Case=Dat|Number=Sing|Variant=Shortmu
Case=Gen|Number=Sing|PrepCase=Nprjehojej
Case=Gen|Number=Sing|PrepCase=Preněj, něho
Case=Gen|Number=Sing|Variant=Shortho
Case=Ins|Number=Sing|PrepCase=Nprjím
Case=Ins|Number=Sing|PrepCase=Prením
Case=Loc|Number=Sing|PrepCase=Preněm
Case=Nom|Number=Singononaono
Case=Nom|Number=Plurony

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (32498; 98%), NOUN –[flat]–> PROPN (2253; 100%), PROPN –[flat]–> PROPN (1724; 99%), ADJ –[conj]–> ADJ (1265; 88%), ADJ –[nsubj]–> NOUN (1150; 84%), VERB –[nsubj]–> PROPN (944; 52%), VERB –[conj]–> VERB (878; 55%), PROPN –[conj]–> PROPN (738; 64%), PROPN –[amod]–> ADJ (734; 92%), ADJ –[aux:pass]–> AUX (712; 53%).