home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

34724 tokens (41%) have a non-empty value of Gender. 8817 types (69%) occur at least once with a non-empty value of Gender. 4497 lemmas (67%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (21183; 25% instances), ADJ (5726; 7% instances), PROPN (3013; 4% instances), DET (2104; 2% instances), PRON (1556; 2% instances), VERB (784; 1% instances), NUM (226; 0% instances), AUX (132; 0% instances).

NOUN

21183 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=Inan (18721; 88%), Number=Sing (15496; 73%).

NOUN tokens may have the following values of Gender:

Paradigm колегаFem,MascMascFem
Case=Acc|Number=Singколегу
Case=Acc|Number=Plurколег
Case=Dat|Number=Singколезі
Case=Dat|Number=Plurколегам
Case=Gen|Number=Singколегиколеги
Case=Gen|Number=Plurколег
Case=Ins|Number=Singколегоюколегою
Case=Ins|Number=Plurколегами
Case=Nom|Number=Singколега
Case=Nom|Number=Plurколеги
Case=Voc|Number=Singколего
Case=Voc|Number=Plurколегиколеги

Gender seems to be lexical feature of NOUN. 99% lemmas (2342) occur only with one value of Gender.

ADJ

5726 ADJ tokens (65% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (5722; 100%), Degree=EMPTY (4632; 81%).

ADJ tokens may have the following values of Gender:

Paradigm шановнийMascFemNeut
Case=Gen|Degree=Pos|Number=Singшановної
Case=Nom|Degree=Pos|Number=SingшановнийШановна
Case=Nom|Number=SingШановний
Case=Voc|Degree=Pos|Number=Singшановнийшановне
Case=Voc|Degree=Pos|Number=PlurШановні
Case=Voc|Number=Singшановний

PROPN

3013 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (3011; 100%), Animacy=Anim (1567; 52%).

PROPN tokens may have the following values of Gender:

Paradigm РНБОFemNeut
Case=DatРНБО
Case=GenРНБО
Case=NomРНБО

Gender seems to be lexical feature of PROPN. 99% lemmas (519) occur only with one value of Gender.

DET

2104 DET tokens (60% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2103; 100%), Animacy=EMPTY (1888; 90%), Person=EMPTY (1752; 83%), Poss=EMPTY (1659; 79%).

DET tokens may have the following values of Gender:

Paradigm якийMascFemNeut
Animacy=Anim|Case=Acc|PronType=Relякого
Animacy=Inan|Case=Acc|PronType=Relякий
Case=Acc|PronType=Intяке
Case=Acc|PronType=Relякийякуяке
Case=Dat|PronType=Relякомуякій
Case=Gen|PronType=IntЯкого
Case=Gen|PronType=Relякогоякої
Case=Ins|PronType=Intяким
Case=Ins|PronType=Relякимякою
Case=Loc|PronType=Indякому
Case=Loc|PronType=Relякомуякій
Case=Nom|PronType=IndЯкий
Case=Nom|PronType=IntЯка
Case=Nom|PronType=Relякийякаяке

PRON

1556 PRON tokens (37% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1553; 100%), Person=EMPTY (1241; 80%), Animacy=Inan (1118; 72%), PronType=Dem (938; 60%).

PRON tokens may have the following values of Gender:

Paradigm якийMascFemNeut
Animacy=Inan|Case=Accякий
Case=Accяку
Case=Genякої
Case=Nomякийякаяке
Case=Nom|Typo=Yesяке

VERB

784 VERB tokens (9% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (784; 100%), Number=Sing (784; 100%), Person=EMPTY (784; 100%), Tense=Past (784; 100%), VerbForm=Fin (769; 98%), Reflex=EMPTY (596; 76%), Aspect=Perf (483; 62%).

VERB tokens may have the following values of Gender:

Paradigm бутиMascFemNeut
був
VerbForm=Finбувбулабуло

NUM

226 NUM tokens (26% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (225; 100%), Number=Sing (117; 52%).

NUM tokens may have the following values of Gender:

Paradigm одинMascFemNeut
Animacy=Inan|Case=Acc|Number=Singодин
Animacy=Inan|Case=Accодин
Case=Acc|Number=Singоднуодне
Case=Gen|Number=Singодногооднієї
Case=Ins|Number=Singоднимоднієюодним
Case=Loc|Number=Singодномуодній
Case=Nom|Number=Singодиноднаодне

AUX

132 AUX tokens (16% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (132; 100%), Mood=Ind (132; 100%), Number=Sing (132; 100%), Person=EMPTY (132; 100%), Tense=Past (132; 100%), VerbForm=Fin (131; 99%).

AUX tokens may have the following values of Gender:

Paradigm бутиMascFemNeut
було
VerbForm=Finбувбулабуло

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (4753; 66%), NOUN –[det]–> DET (1532; 65%), NOUN –[conj]–> NOUN (814; 50%), PROPN –[flat:name]–> PROPN (735; 100%), NOUN –[appos]–> PROPN (267; 68%), NOUN –[appos]–> NOUN (215; 54%), ADJ –[conj]–> ADJ (151; 96%), ADJ –[nsubj]–> NOUN (144; 67%), ADJ –[nsubj:pass]–> NOUN (87; 65%), PROPN –[amod]–> ADJ (79; 99%).