
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-FicTree: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].
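As an illustration of what a layered feature looks like in practice, the following minimal sketch parses a CoNLL-U FEATS string and picks out every layer of Gender. The helper names `parse_feats` and `split_layer` and the sample FEATS string are hypothetical; they are not taken from the treebank or from any UD tool.

```python
from typing import Dict, Optional, Tuple


def parse_feats(feats: str) -> Dict[str, str]:
    """Parse a CoNLL-U FEATS string such as 'Gender=Masc|Gender[psor]=Fem'."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def split_layer(name: str) -> Tuple[str, Optional[str]]:
    """Split 'Gender[psor]' into ('Gender', 'psor'); unlayered names get None."""
    if name.endswith("]") and "[" in name:
        base, layer = name[:-1].split("[", 1)
        return base, layer
    return name, None


# Hypothetical FEATS value of a possessive determiner (illustration only).
feats = parse_feats("Case=Nom|Gender=Masc|Gender[psor]=Fem|Number=Sing|Poss=Yes")

# Keep every feature whose base name is Gender, whatever its layer.
gender_layers = {n: v for n, v in feats.items() if split_layer(n)[0] == "Gender"}
print(gender_layers)
```

Keeping the layer in the key means the plain Gender value and the Gender[psor] value of the same token do not overwrite each other.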

68808 tokens (41%) have a non-empty value of Gender. 23541 types (87%) occur at least once with a non-empty value of Gender. 11560 lemmas (84%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (27596; 17% of instances), VERB (14516; 9% of instances), ADJ (10889; 7% of instances), DET (8094; 5% of instances), PRON (3609; 2% of instances), PROPN (2255; 1% of instances), AUX (1010; 1% of instances), NUM (839; 1% of instances).
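Counts of this kind can be approximated from a CoNLL-U file. The sketch below is not the script that generated this page; it assumes that a token counts as having Gender when the unlayered `Gender` key appears in its FEATS column, and it uses a tiny made-up sample instead of the real treebank. The function name `gender_coverage` and the sample rows are hypothetical.

```python
from collections import Counter

# Hypothetical four-token sample in CoNLL-U column order (not treebank data).
ROWS = [
    ("1", "Ta", "ten", "DET", "_", "Case=Nom|Gender=Fem|Number=Sing|PronType=Dem", "2", "det", "_", "_"),
    ("2", "žena", "žena", "NOUN", "_", "Case=Nom|Gender=Fem|Number=Sing|Polarity=Pos", "3", "nsubj", "_", "_"),
    ("3", "spala", "spát", "VERB", "_", "Gender=Fem|Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part|Voice=Act", "0", "root", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def gender_coverage(conllu_text):
    """Return (all tokens, tokens with Gender, Counter of UPOS among the latter)."""
    total = 0
    with_gender = 0
    per_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        total += 1
        feats = {} if cols[5] == "_" else dict(f.split("=", 1) for f in cols[5].split("|"))
        if "Gender" in feats:
            with_gender += 1
            per_upos[cols[3]] += 1
    return total, with_gender, per_upos


total, with_gender, per_upos = gender_coverage(SAMPLE)
print(f"{with_gender} tokens ({100 * with_gender / total:.0f}%) have a non-empty value of Gender")
for upos, count in per_upos.most_common():
    print(f"{upos} ({count}; {100 * count / total:.0f}% of instances)")
```

Type and lemma coverage could be computed the same way by collecting the FORM and LEMMA columns into sets rather than counting tokens.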

NOUN

27596 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Polarity=Pos (27558; 100%), Number=Sing (21353; 77%), Animacy=EMPTY (15698; 57%).
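The co-occurrence figures, including the `EMPTY` pseudo-value, can be sketched as follows. This is an assumption about how such numbers might be derived, not the generator's actual logic: for every token of the given part of speech that has Gender, each other feature seen on any such token is counted with its value, or with `EMPTY` when the token lacks it. The function name `cooccurrence` and the sample tokens are hypothetical.

```python
from collections import Counter


def cooccurrence(tokens, upos, feature="Gender"):
    """Count Feature=Value pairs on tokens of `upos` that carry `feature`.

    A feature that is absent from a token is counted as Feature=EMPTY.
    Returns the counter and the number of tokens it was computed over.
    """
    selected = [feats for tag, feats in tokens if tag == upos and feature in feats]
    names = {name for feats in selected for name in feats if name != feature}
    counts = Counter()
    for feats in selected:
        for name in names:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, len(selected)


# Hypothetical (UPOS, features) pairs for illustration only.
tokens = [
    ("NOUN", {"Gender": "Fem", "Number": "Sing", "Polarity": "Pos"}),
    ("NOUN", {"Gender": "Masc", "Animacy": "Anim", "Number": "Sing", "Polarity": "Pos"}),
    ("NOUN", {"Gender": "Neut", "Number": "Plur", "Polarity": "Pos"}),
    ("VERB", {"Gender": "Fem", "Number": "Sing"}),
]

counts, n = cooccurrence(tokens, "NOUN")
for pair, count in counts.most_common():
    print(f"{pair} ({count}; {100 * count / n:.0f}%)")
```

In this sample, `Animacy=EMPTY` is counted for the two nouns that are not masculine, which mirrors how an absent feature shows up in the list above.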

NOUN tokens may have the following values of Gender:

Paradigm dítě (Fem, Neut)
Case=Acc|Number=Sing: dítě
Case=Acc|Number=Plur: děti
Case=Dat|Number=Sing: dítěti
Case=Dat|Number=Plur: dětem
Case=Gen|Number=Sing: dítěte
Case=Gen|Number=Plur: dětí
Case=Ins|Number=Sing: dítětem
Case=Ins|Number=Plur: dětmi
Case=Loc|Number=Plur: dětech
Case=Nom|Number=Sing: dítě
Case=Nom|Number=Plur: děti
Case=Voc|Number=Plur: děti

Gender seems to be a lexical feature of NOUN. 100% of lemmas (5249) occur with only one value of Gender.
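One way to test whether a feature behaves lexically is to collect, for each lemma, the set of values it occurs with and count the lemmas whose set has exactly one member. The sketch below illustrates that idea under the assumption that this is how the percentage is defined. The function name `single_value_share` and the sample tokens are hypothetical; `dítě` is included because the paradigm above lists it with both Fem and Neut.

```python
from collections import defaultdict


def single_value_share(tokens, upos, feature="Gender"):
    """Return (lemmas with exactly one value of `feature`, all lemmas having it)."""
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical (lemma, UPOS, features) triples for illustration only.
tokens = [
    ("žena", "NOUN", {"Gender": "Fem"}),
    ("žena", "NOUN", {"Gender": "Fem"}),
    ("dítě", "NOUN", {"Gender": "Neut"}),
    ("dítě", "NOUN", {"Gender": "Fem"}),
    ("muž", "NOUN", {"Gender": "Masc"}),
]

single, total = single_value_share(tokens, "NOUN")
print(f"{100 * single / total:.0f}% of lemmas ({single}) occur with only one value of Gender")
```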

VERB

14516 VERB tokens (58% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (14516; 100%), Person=EMPTY (14516; 100%), Voice=Act (14516; 100%), Tense=Past (14467; 100%), VerbForm=Part (14464; 100%), Polarity=Pos (13013; 90%), Number=Sing (12521; 86%), Animacy=Anim (7753; 53%).

VERB tokens may have the following values of Gender:

Paradigm mít (Masc, Fem, Neut)
Animacy=Anim|Number=Sing|Polarity=Neg: neměl
Animacy=Anim|Number=Sing|Polarity=Pos: měl
Animacy=Anim|Number=Plur|Polarity=Neg: neměli
Animacy=Anim|Number=Plur|Polarity=Pos: měli
Animacy=Inan|Number=Sing|Polarity=Pos: měl
Animacy=Inan|Number=Plur|Polarity=Pos: měly
Number=Sing|Polarity=Neg: neměla; nemělo
Number=Sing|Polarity=Pos: měla; mělo
Number=Plur|Polarity=Neg: neměly
Number=Plur|Polarity=Pos: měly; měla

ADJ

10889 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (9991; 92%), Degree=Pos (9039; 83%), Number=Sing (8342; 77%), Animacy=EMPTY (6273; 58%).

ADJ tokens may have the following values of Gender:

Paradigm celý (Masc, Fem, Neut)
Animacy=Anim|Case=Acc|Number=Sing|Polarity=Pos: celého
Animacy=Anim|Case=Gen|Number=Sing|Polarity=Pos: celého
Animacy=Anim|Case=Nom|Number=Sing|Polarity=Pos: celý
Animacy=Anim|Case=Nom|Number=Plur|Polarity=Pos: celí
Animacy=Inan|Case=Acc|Number=Sing|Polarity=Neg: Necelý
Animacy=Inan|Case=Acc|Number=Sing|Polarity=Pos: celý
Animacy=Inan|Case=Acc|Number=Plur|Polarity=Pos: celé
Animacy=Inan|Case=Dat|Number=Sing|Polarity=Pos: celému
Animacy=Inan|Case=Gen|Number=Sing|Polarity=Pos: celého
Animacy=Inan|Case=Ins|Number=Sing|Polarity=Pos: celým
Animacy=Inan|Case=Loc|Number=Sing|Polarity=Pos: celém
Animacy=Inan|Case=Nom|Number=Sing|Polarity=Pos: celý
Animacy=Inan|Case=Nom|Number=Plur|Polarity=Neg: necelé
Case=Acc|Number=Sing|Polarity=Pos: celou; celé
Case=Acc|Number=Plur|Polarity=Pos: celé; celá
Case=Dat|Number=Sing|Polarity=Pos: celé; celému
Case=Gen|Number=Sing|Polarity=Pos: celé; celého
Case=Gen|Number=Plur|Polarity=Pos: celých
Case=Ins|Number=Sing|Polarity=Pos: celou; celým
Case=Loc|Number=Sing|Polarity=Pos: celé; celém
Case=Nom|Number=Sing|Polarity=Pos: celá; celé
Case=Nom|Number=Plur|Polarity=Pos: celé

DET

8094 DET tokens (96% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (6698; 83%), Person=EMPTY (6698; 83%), Number=Sing (6640; 82%), Poss=EMPTY (5950; 74%), Animacy=EMPTY (5406; 67%).

DET tokens may have the following values of Gender:

Paradigm ten (Masc, Fem, Neut)
Animacy=Anim|Case=Acc|Number=Sing: toho
Animacy=Anim|Case=Acc|Number=Plur: ty
Animacy=Anim|Case=Dat|Number=Sing: tomu
Animacy=Anim|Case=Dat|Number=Plur: těm
Animacy=Anim|Case=Gen|Number=Sing: toho
Animacy=Anim|Case=Gen|Number=Plur: těch
Animacy=Anim|Case=Ins|Number=Sing: tím
Animacy=Anim|Case=Ins|Number=Plur: těmi
Animacy=Anim|Case=Loc|Number=Sing: tom
Animacy=Anim|Case=Loc|Number=Plur: těch
Animacy=Anim|Case=Nom|Number=Sing: ten
Animacy=Anim|Case=Nom|Number=Plur: ti
Animacy=Inan|Case=Acc|Number=Sing: ten
Animacy=Inan|Case=Acc|Number=Plur: ty
Animacy=Inan|Case=Dat|Number=Sing: tomu
Animacy=Inan|Case=Dat|Number=Plur: těm
Animacy=Inan|Case=Gen|Number=Sing: toho
Animacy=Inan|Case=Gen|Number=Plur: těch
Animacy=Inan|Case=Ins|Number=Sing: tím
Animacy=Inan|Case=Ins|Number=Plur: těmi
Animacy=Inan|Case=Loc|Number=Sing: tom
Animacy=Inan|Case=Loc|Number=Plur: těch
Animacy=Inan|Case=Nom|Number=Sing: ten
Animacy=Inan|Case=Nom|Number=Plur: ty
Case=Acc|Number=Sing: tu; to
Case=Acc|Number=Plur: ty; ta
Case=Dat|Number=Sing: tomu
Case=Dat|Number=Plur: těm
Case=Gen|Number=Sing: toho
Case=Gen|Number=Sing|Style=Coll
Case=Gen|Number=Plur: těch; těch
Case=Ins|Number=Sing: tou; tím
Case=Ins|Number=Plur: těmi; těmi
Case=Loc|Number=Sing: tom
Case=Loc|Number=Plur: těch; těch
Case=Nom|Number=Sing: ta; to
Case=Nom|Number=Plur: ty; ta

PRON

3609 PRON tokens (26% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3609; 100%), Person=3 (2937; 81%), PronType=Prs (2937; 81%), Variant=EMPTY (2619; 73%), Number=Sing (2523; 70%), Animacy=Anim (2036; 56%).

PRON tokens may have the following values of Gender:

Paradigm on (Masc, Fem, Neut)
Animacy=Anim|Case=Acc|Number=Sing|PrepCase=Pre: něho, něj
Animacy=Anim|Case=Acc|Number=Sing: jeho
Animacy=Anim|Case=Acc|Number=Sing|Style=Arch: jej
Animacy=Anim|Case=Acc|Number=Sing|Variant=Short: ho
Animacy=Anim|Case=Acc|Number=Plur|PrepCase=Pre
Animacy=Anim|Case=Acc|Number=Plur: je
Animacy=Anim|Case=Dat|Number=Sing|PrepCase=Pre: němu
Animacy=Anim|Case=Dat|Number=Sing: jemu
Animacy=Anim|Case=Dat|Number=Sing|Variant=Short: mu
Animacy=Anim|Case=Dat|Number=Plur|PrepCase=Pre: nim
Animacy=Anim|Case=Dat|Number=Plur: jim
Animacy=Anim|Case=Gen|Number=Sing|PrepCase=Pre: něj, něho
Animacy=Anim|Case=Gen|Number=Sing: jeho
Animacy=Anim|Case=Gen|Number=Sing|Variant=Short: ho
Animacy=Anim|Case=Gen|Number=Plur|PrepCase=Pre: nich
Animacy=Anim|Case=Gen|Number=Plur: jich
Animacy=Anim|Case=Ins|Number=Sing|PrepCase=Pre: ním
Animacy=Anim|Case=Ins|Number=Sing: jím
Animacy=Anim|Case=Ins|Number=Plur|PrepCase=Pre: nimi
Animacy=Anim|Case=Ins|Number=Plur: jimi
Animacy=Anim|Case=Loc|Number=Sing|PrepCase=Pre: něm
Animacy=Anim|Case=Loc|Number=Plur|PrepCase=Pre: nich
Animacy=Anim|Case=Nom|Number=Sing: on
Animacy=Anim|Case=Nom|Number=Plur: oni
Animacy=Inan|Case=Acc|Number=Sing|PrepCase=Pre: něj
Animacy=Inan|Case=Acc|Number=Sing|Style=Arch: jej
Animacy=Inan|Case=Acc|Number=Sing|Variant=Short: ho
Animacy=Inan|Case=Acc|Number=Plur|PrepCase=Pre
Animacy=Inan|Case=Acc|Number=Plur: je
Animacy=Inan|Case=Dat|Number=Sing|PrepCase=Pre: němu
Animacy=Inan|Case=Dat|Number=Sing|Variant=Short: mu
Animacy=Inan|Case=Dat|Number=Plur: jim
Animacy=Inan|Case=Gen|Number=Sing|PrepCase=Pre: něj, něho
Animacy=Inan|Case=Gen|Number=Sing|Variant=Short: ho
Animacy=Inan|Case=Gen|Number=Plur|PrepCase=Pre: nich
Animacy=Inan|Case=Gen|Number=Plur: jich
Animacy=Inan|Case=Ins|Number=Sing|PrepCase=Pre: ním
Animacy=Inan|Case=Ins|Number=Plur|PrepCase=Pre: nimi
Animacy=Inan|Case=Ins|Number=Plur: jimi
Animacy=Inan|Case=Loc|Number=Sing|PrepCase=Pre: něm
Animacy=Inan|Case=Loc|Number=Plur|PrepCase=Pre: nich
Animacy=Inan|Case=Nom|Number=Plur: ony
Case=Acc|Number=Sing|PrepCase=Pre: ni; ně, něj, něho
Case=Acc|Number=Sing: ji; je
Case=Acc|Number=Sing|Style=Coll
Case=Acc|Number=Sing|Variant=Short: ho
Case=Acc|Number=Plur|PrepCase=Pre
Case=Acc|Number=Plur: je; je
Case=Dat|Number=Sing|PrepCase=Pre
Case=Dat|Number=Sing
Case=Dat|Number=Sing|Variant=Short: mu
Case=Dat|Number=Plur|PrepCase=Pre: nim; nim
Case=Dat|Number=Plur: jim
Case=Gen|Number=Sing|PrepCase=Pre: něho, něj
Case=Gen|Number=Sing
Case=Gen|Number=Sing|Variant=Short: ho
Case=Gen|Number=Plur|PrepCase=Pre: nich
Case=Gen|Number=Plur: jich; jich
Case=Ins|Number=Sing|PrepCase=Pre: ním
Case=Ins|Number=Sing: jím
Case=Ins|Number=Plur|PrepCase=Pre: nimi; nimi
Case=Ins|Number=Plur: jimi
Case=Loc|Number=Sing|PrepCase=Pre: něm
Case=Loc|Number=Plur|PrepCase=Pre: nich; nich
Case=Nom|Number=Sing: ona; ono
Case=Nom|Number=Plur: ony; Ona

PROPN

2255 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Polarity=Pos (2255; 100%), Number=Sing (2143; 95%), Animacy=Anim (1278; 57%), Case=Nom (1278; 57%).

PROPN tokens may have the following values of Gender:

Paradigm K (Masc, Fem)
Animacy=Anim: K
(no additional features): K

Gender seems to be a lexical feature of PROPN. 98% of lemmas (413) occur with only one value of Gender.

AUX

1010 AUX tokens (15% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (1010; 100%), Person=EMPTY (1010; 100%), Voice=Act (1010; 100%), Tense=Past (1009; 100%), VerbForm=Part (1009; 100%), Polarity=Pos (925; 92%), Number=Sing (865; 86%).

AUX tokens may have the following values of Gender:

Paradigm být (Masc, Fem, Neut)
Animacy=Anim|Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Part: nebyl
Animacy=Anim|Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byl
Animacy=Anim|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: nebyli
Animacy=Anim|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: byli
Animacy=Inan|Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Part: nebyl
Animacy=Inan|Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byl
Animacy=Inan|Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: nebyly
Animacy=Inan|Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: byly
Aspect=Imp|Number=Sing|Polarity=Pos|Tense=Pres|VerbForm=Conv: jsouc
Number=Sing|Polarity=Neg|Tense=Past|VerbForm=Part: nebyla; nebylo
Number=Sing|Polarity=Pos|Tense=Past|VerbForm=Part: byla; bylo
Number=Plur|Polarity=Neg|Tense=Past|VerbForm=Part: nebyla
Number=Plur|Polarity=Pos|Tense=Past|VerbForm=Part: byly; byla

NUM

839 NUM tokens (64% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (834; 99%), NumType=Card (834; 99%), NumValue=1,2,3 (834; 99%), Number=Plur (425; 51%).

NUM tokens may have the following values of Gender:

Paradigm jeden (Masc, Fem, Neut)
Animacy=Anim|Case=Acc: jednoho, jeden
Animacy=Anim|Case=Dat: jednomu
Animacy=Anim|Case=Gen: jednoho
Animacy=Anim|Case=Ins: jedním
Animacy=Anim|Case=Nom: jeden
Animacy=Anim|Case=Voc: jeden
Animacy=Inan|Case=Acc: jeden
Animacy=Inan|Case=Dat: jednomu
Animacy=Inan|Case=Gen: jednoho
Animacy=Inan|Case=Ins: jedním
Animacy=Inan|Case=Loc: jednom
Animacy=Inan|Case=Nom: jeden
Case=Acc: jednu; jedno
Case=Dat: jedné; jednomu
Case=Gen: jedné; jednoho
Case=Ins: jednou; jedním
Case=Loc: jedné; jednom
Case=Nom: jedna; jedno

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (7654; 100%), NOUN –[det]–> DET (4068; 99%), VERB –[nsubj]–> NOUN (3148; 69%), VERB –[conj]–> VERB (2881; 72%), VERB –[nsubj]–> DET (682; 60%), VERB –[nsubj]–> PROPN (675; 73%), NOUN –[nummod]–> NUM (534; 85%), ADJ –[conj]–> ADJ (518; 97%), ADJ –[nsubj]–> NOUN (348; 98%), PROPN –[nmod]–> NOUN (236; 95%).
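Agreement counts of this kind can be sketched by walking every dependency edge and comparing the Gender of parent and child. The sketch below makes an assumption that the page does not confirm: the percentage is taken over edges where both nodes have a Gender value, not over all edges of that type. The function name `gender_agreement`, the token dictionaries and the sample sentence are hypothetical.

```python
from collections import Counter


def gender_agreement(sentence):
    """Count edges whose parent and child both have Gender, and those that agree.

    Keys are (parent UPOS, relation, child UPOS) triples.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0, which matches no token
        parent_gender = parent["feats"].get("Gender")
        child_gender = child["feats"].get("Gender")
        if parent_gender is None or child_gender is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_gender == child_gender:
            agree[key] += 1
    return agree, both


# Hypothetical sentence "Muž viděl ženu" ('The man saw the woman'), illustration only.
sentence = [
    {"id": 1, "upos": "NOUN", "head": 2, "deprel": "nsubj", "feats": {"Gender": "Masc"}},
    {"id": 2, "upos": "VERB", "head": 0, "deprel": "root", "feats": {"Gender": "Masc"}},
    {"id": 3, "upos": "NOUN", "head": 2, "deprel": "obj", "feats": {"Gender": "Fem"}},
]

agree, both = gender_agreement(sentence)
for (parent, rel, child), total in both.items():
    hits = agree[(parent, rel, child)]
    print(f"{parent} -[{rel}]-> {child} ({hits}; {100 * hits / total:.0f}%)")
```

The subject agrees with the past participle while the object does not, which is why `nsubj` appears in the list above and `obj` does not.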