
This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-IU: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

50617 tokens (41%) have a non-empty value of Gender. 22990 types (73%) occur at least once with a non-empty value of Gender. 12963 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags: NOUN (28827; 24% instances), ADJ (8177; 7% instances), VERB (3592; 3% instances), PROPN (3390; 3% instances), DET (2981; 2% instances), PRON (2754; 2% instances), AUX (480; 0% instances), NUM (405; 0% instances), X (11; 0% instances).
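The per-tag counts above can be reproduced from a CoNLL-U file with a short script. The following is a minimal sketch, not the script that generated this page. `SAMPLE` is a hand-made sentence rather than treebank data, the helper names are illustrative, and taking the percentage over all tokens is an assumption about how the page computes its figures.

```python
from collections import Counter

# Hand-made CoNLL-U sample (an assumption for illustration, not treebank data).
SAMPLE = (
    "1\tУкраїнська\tукраїнський\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_\n"
    "2\tмова\tмова\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_\n"
    "3\tзвучала\tзвучати\tVERB\t_\tAspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin\t0\troot\t_\t_\n"
    "4\tтут\tтут\tADV\t_\tPronType=Dem\t3\tadvmod\t_\t_\n"
    "5\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_\n"
)

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def tokens(conllu_text):
    """Yield the 10 columns of every regular token line."""
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments, blank lines, multiword ranges (1-2) and empty nodes (1.1) are skipped.
        if len(cols) == 10 and cols[0].isdigit():
            yield cols

def gender_counts(conllu_text):
    """Count tokens per UPOS, and tokens per UPOS that carry Gender."""
    with_gender, per_upos = Counter(), Counter()
    for cols in tokens(conllu_text):
        upos = cols[3]
        per_upos[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, per_upos

with_gender, per_upos = gender_counts(SAMPLE)
total = sum(per_upos.values())
for upos, n in with_gender.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```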

NOUN

28827 NOUN tokens (98% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (24673; 86%), Number=Sing (20705; 72%).

NOUN tokens may have the following values of Gender:

Paradigm голова (Masc, Fem)
Animacy=Anim|Case=Acc|Number=Sing: Голову
Animacy=Anim|Case=Dat|Number=Sing: голові
Animacy=Anim|Case=Gen|Number=Sing: голови
Animacy=Anim|Case=Gen|Number=Plur: голів
Animacy=Anim|Case=Ins|Number=Sing: головою
Animacy=Anim|Case=Nom|Number=Sing: Masc голова; Fem Голова
Animacy=Inan|Case=Acc|Number=Sing: голову
Animacy=Inan|Case=Acc|Number=Plur: голови
Animacy=Inan|Case=Gen|Number=Sing: голови
Animacy=Inan|Case=Ins|Number=Sing: головою
Animacy=Inan|Case=Loc|Number=Sing: голові
Animacy=Inan|Case=Nom|Number=Sing: голова
Animacy=Inan|Case=Nom|Number=Plur: голови

Gender seems to be a lexical feature of NOUN: 100% of lemmas (6880) occur with only one value of Gender.
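The "lexical feature" test can be read as: collect the set of Gender values seen for each lemma, then count the lemmas whose set has exactly one member. Below is a minimal sketch under that reading. The `TOKENS` triples are hypothetical examples, not treebank data, and the function names are illustrative.

```python
from collections import defaultdict

# Hypothetical (lemma, UPOS, FEATS) triples, not taken from the treebank.
TOKENS = [
    ("мова", "NOUN", "Animacy=Inan|Case=Nom|Gender=Fem|Number=Sing"),
    ("мова", "NOUN", "Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing"),
    ("голова", "NOUN", "Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing"),
    ("голова", "NOUN", "Animacy=Anim|Case=Nom|Gender=Fem|Number=Sing"),
    ("місто", "NOUN", "Animacy=Inan|Case=Nom|Gender=Neut|Number=Sing"),
    ("люди", "NOUN", "Animacy=Anim|Case=Nom|Number=Plur"),
]

def gender_of(feats):
    """Return the Gender value in a FEATS string, or None if it is absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None

def single_value_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        gender = gender_of(feats)
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

single, total = single_value_share(TOKENS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

Lemmas that never carry Gender (here the plural-only token) are left out of the denominator, which is an assumption about how the page counts.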

ADJ

8177 ADJ tokens (68% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (8177; 100%), Animacy=EMPTY (7626; 93%), Aspect=EMPTY (7425; 91%), VerbForm=EMPTY (7425; 91%), Voice=EMPTY (7425; 91%), Degree=EMPTY (5840; 71%).
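The `EMPTY` entries above appear to mark features that are absent on a token that does have Gender. The sketch below counts co-occurring values under that assumption: every feature name seen on the part of speech is checked on each Gender-bearing token and recorded as `EMPTY` when missing. The `ADJ_FEATS` strings are hypothetical and the function names are illustrative.

```python
from collections import Counter

# Hypothetical FEATS strings for ADJ tokens, not taken from the treebank.
ADJ_FEATS = [
    "Case=Nom|Gender=Fem|Number=Sing",
    "Case=Gen|Degree=Pos|Gender=Masc|Number=Sing",
    "Animacy=Inan|Case=Acc|Gender=Masc|Number=Sing",
    "Case=Nom|Number=Plur",
]

def parse_feats(feats):
    """Turn a FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def cooccurrence(feats_list, feature="Gender"):
    """Count other feature=value pairs on tokens that carry `feature`."""
    parsed = [parse_feats(feats) for feats in feats_list]
    # Every feature name seen on this part of speech, except the one studied.
    names = {name for feats in parsed for name in feats} - {feature}
    counts = Counter()
    n_with = 0
    for feats in parsed:
        if feature not in feats:
            continue
        n_with += 1
        for name in names:
            # A feature that is missing on this token is counted as EMPTY.
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, n_with

counts, n_with = cooccurrence(ADJ_FEATS)
for pair, n in counts.most_common(4):
    print(f"{pair} ({n}; {100 * n / n_with:.0f}%)")
```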

ADJ tokens may have the following values of Gender:

Paradigm український (Masc, Fem, Neut)
Animacy=Inan|Case=Acc: Masc український
Case=Acc: Fem українську; Neut українське
Case=Dat: Masc українському; Fem українській; Neut Українському
Case=Gen: Masc українського; Fem української; Neut українського
Case=Ins: Masc українським; Fem українською
Case=Loc: Masc українському; Fem українській; Neut українському
Case=Nom: Masc український; Fem українська; Neut українське
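A paradigm table of this kind can be built by grouping the forms of one lemma by their remaining features (the rows) and by their Gender value (the columns). This is a minimal sketch with hypothetical `FORMS` data. Unlike the table above, it keeps constant features such as Number=Sing in the row label, and it prints "-" for an empty cell.

```python
from collections import defaultdict

# Hypothetical (form, FEATS) pairs for one lemma, not taken from the treebank.
FORMS = [
    ("український", "Case=Nom|Gender=Masc|Number=Sing"),
    ("українська", "Case=Nom|Gender=Fem|Number=Sing"),
    ("українське", "Case=Nom|Gender=Neut|Number=Sing"),
    ("українську", "Case=Acc|Gender=Fem|Number=Sing"),
]

def paradigm(forms, feature="Gender"):
    """Map row label (other features) -> feature value -> set of forms."""
    table = defaultdict(lambda: defaultdict(set))
    for form, feats in forms:
        if feats == "_":
            continue
        pairs = dict(pair.split("=", 1) for pair in feats.split("|"))
        value = pairs.pop(feature, None)
        if value is None:
            continue  # forms without the feature do not enter the table
        row = "|".join(f"{k}={v}" for k, v in sorted(pairs.items()))
        table[row][value].add(form)
    return table

table = paradigm(FORMS)
for row in sorted(table):
    cells = [", ".join(sorted(table[row].get(g, ()))) or "-"
             for g in ("Masc", "Fem", "Neut")]
    print(row, *cells, sep="\t")
```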

VERB

3592 VERB tokens (28% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (3592; 100%), Number=Sing (3592; 100%), Person=EMPTY (3592; 100%), Tense=Past (3592; 100%), VerbForm=Fin (3592; 100%), Aspect=Perf (1952; 54%).

VERB tokens may have the following values of Gender:

Paradigm бути (Masc, Fem, Neut)
Masc був; Fem була; Neut було

PROPN

3390 PROPN tokens (97% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3368; 99%), Uninflect=EMPTY (2889; 85%), Animacy=Anim (2004; 59%).

PROPN tokens may have the following values of Gender:

Paradigm І. (Masc, Fem)
Case=Acc|NameType=Giv: І
Case=Gen|NameType=Giv: І
Case=Gen|NameType=Pat: І
Case=Nom|NameType=Giv: Masc І; Fem І
Case=Nom|NameType=Pat: І

Gender seems to be a lexical feature of PROPN: 99% of lemmas (1512) occur with only one value of Gender.

DET

2981 DET tokens (64% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2981; 100%), Animacy=EMPTY (2658; 89%), Reflex=EMPTY (2590; 87%), Person=EMPTY (2421; 81%), Poss=EMPTY (2154; 72%).

DET tokens may have the following values of Gender:

Paradigm який (Masc, Fem, Neut)
Animacy=Anim|Case=Acc|PronType=Rel: Masc якого
Animacy=Inan|Case=Acc|PronType=Ind: Masc який
Animacy=Inan|Case=Acc|PronType=Int: Masc який
Animacy=Inan|Case=Acc|PronType=Rel: Masc який
Case=Acc|PronType=Rel: Fem яку; Neut яке
Case=Dat|PronType=Rel: Masc якому; Fem якій
Case=Gen|PronType=Ind: Fem будь-якої
Case=Gen|PronType=Int: Masc Якого; Fem якої
Case=Gen|PronType=Rel: Masc якого; Fem якої; Neut якого
Case=Ins|PronType=Ind: яким
Case=Ins|PronType=Rel: Masc яким; Fem якою; Neut яким
Case=Loc|PronType=Ind: Fem якій
Case=Loc|PronType=Rel: Masc якому; Fem якій; Neut якому
Case=Nom|PronType=Ind: Masc який; Fem яка
Case=Nom|PronType=Int: Fem яка
Case=Nom|PronType=Rel: Masc який; Fem яка; Neut яке

PRON

2754 PRON tokens (55% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2754; 100%), Animacy=EMPTY (1382; 50%), Person=3 (1382; 50%), PronType=Prs (1382; 50%).

PRON tokens may have the following values of Gender:

Gender seems to be a lexical feature of PRON: 100% of lemmas (21) occur with only one value of Gender.

AUX

480 AUX tokens (45% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (480; 100%), Mood=Ind (480; 100%), Number=Sing (480; 100%), Person=EMPTY (480; 100%), Tense=Past (480; 100%), VerbForm=Fin (480; 100%).

AUX tokens may have the following values of Gender:

Paradigm бути (Masc, Fem, Neut)
Masc був; Fem була; Neut було

NUM

405 NUM tokens (25% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (405; 100%), Case=Nom (210; 52%), Uninflect=EMPTY (208; 51%).

NUM tokens may have the following values of Gender:

Paradigm два (Masc, Fem, Neut)
Case=Acc: Masc два, двох; Fem дві, двох; Neut два
Case=Gen: Masc двох; Fem двох; Neut двох
Case=Ins: двома; двома
Case=Loc: двох; двох
Case=Nom: Masc два; Fem дві; Neut два

X

11 X tokens (2% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (11; 100%).

X tokens may have the following values of Gender:

Gender seems to be a lexical feature of X: 100% of lemmas (11) occur with only one value of Gender.

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (6464; 69%), NOUN –[det]–> DET (2061; 69%), PROPN –[flat:name]–> PROPN (528; 100%), VERB –[conj]–> VERB (479; 64%), ADJ –[conj]–> ADJ (375; 96%), NOUN –[flat:title]–> PROPN (364; 76%), ADJ –[nsubj]–> NOUN (317; 65%), VERB –[nsubj]–> PROPN (292; 65%), NOUN –[appos]–> NOUN (282; 59%), PROPN –[conj]–> PROPN (149; 76%).
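Agreement counts of this kind can be gathered by walking each dependency edge and comparing the Gender value of the child with that of its parent. The sketch below is an illustration only. `SAMPLE` is a hand-made sentence, the function names are hypothetical, and using all edges of a relation type as the percentage denominator is an assumption, since the page does not state what its percentages are taken over.

```python
from collections import Counter

# Hand-made CoNLL-U sample (an assumption for illustration, not treebank data).
SAMPLE = (
    "1\tУкраїнська\tукраїнський\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_\n"
    "2\tмова\tмова\tNOUN\t_\tAnimacy=Inan|Case=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_\n"
    "3\tзвучала\tзвучати\tVERB\t_\tAspect=Imp|Gender=Fem|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin\t0\troot\t_\t_\n"
    "4\tтут\tтут\tADV\t_\tPronType=Dem\t3\tadvmod\t_\t_\n"
    "5\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_\n"
)

def gender(feats):
    """Return the Gender value in a FEATS string, or None if it is absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None

def agreement(conllu_text):
    """Count edges (parent UPOS, deprel, child UPOS) and those agreeing in Gender."""
    agree, total = Counter(), Counter()
    # Sentences are separated by blank lines; token IDs restart in each one.
    for block in conllu_text.strip().split("\n\n"):
        rows = [cols for cols in (line.split("\t") for line in block.splitlines())
                if len(cols) == 10 and cols[0].isdigit()]
        by_id = {cols[0]: cols for cols in rows}
        for child in rows:
            parent = by_id.get(child[6])  # None for the root (HEAD = 0)
            if parent is None:
                continue
            key = (parent[3], child[7], child[3])
            total[key] += 1
            value = gender(child[5])
            if value is not None and value == gender(parent[5]):
                agree[key] += 1
    return agree, total

agree, total = agreement(SAMPLE)
for (parent, rel, child), n in agree.most_common(10):
    share = 100 * n / total[(parent, rel, child)]
    print(f"{parent} -[{rel}]-> {child} ({n}; {share:.0f}%)")
```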