
This page pertains to UD version 2.

Treebank Statistics: UD_Polish-MPDT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

21464 tokens (45%) have a non-empty value of Gender. 11526 types (85%) occur at least once with a non-empty value of Gender. 6150 lemmas (81%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (9730; 21% instances), ADJ (4478; 9% instances), DET (2212; 5% instances), VERB (1805; 4% instances), PROPN (1434; 3% instances), PRON (1323; 3% instances), NUM (276; 1% instances), AUX (206; 0% instances).
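Counts of this kind can be derived from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. The function names `parse_feats` and `gender_counts` are hypothetical, the sample sentence is made up for illustration rather than taken from UD_Polish-MPDT, and the choice to count only syntactic words (skipping multiword-token ranges and empty nodes) is an assumption.

```python
from collections import Counter

# Made-up CoNLL-U fragment for illustration (not from UD_Polish-MPDT).
SAMPLE = """\
# text = wielki książę miał .
1\twielki\twielki\tADJ\t_\tCase=Nom|Degree=Pos|Gender=Masc|Number=Sing\t2\tamod\t_\t_
2\tksiążę\tksiążę\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t3\tnsubj\t_\t_
3\tmiał\tmieć\tVERB\t_\tGender=Masc|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts(conllu_text):
    """Return (tokens with Gender, total tokens, Counter of UPOS with Gender)."""
    total = 0
    with_gender = 0
    per_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences or a comment line
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges (1-2) and empty nodes (1.1).
        if not cols[0].isdigit():
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender += 1
            per_upos[cols[3]] += 1
    return with_gender, total, per_upos


with_gender, total, per_upos = gender_counts(SAMPLE)
print(f"{with_gender} of {total} tokens ({100 * with_gender / total:.0f}%) have Gender")
for upos, n in per_upos.most_common():
    print(upos, n)
```

Running the same function over the full treebank file would give per-tag counts comparable to the list above, provided the counting conventions match.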

NOUN

9730 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (6946; 71%).

NOUN tokens may have the following values of Gender:

Paradigm książęMascNeut
Animacy=Hum|Case=Acc|Number=PlurKsiążąt
Animacy=Hum|Case=Nom|Number=Plurksiążęta
Case=Acc|Number=Singksięcia
Case=Dat|Number=SingKsiążęciu
Case=Gen|Number=Singksiążęcia, księciaKsiążęcia
Case=Gen|Number=Plurksiążąt
Case=Ins|Number=Singksiążęciem
Case=Nom|Number=Singksiążęksiążę
Case=Voc|Number=SingKsiążę

Gender seems to be a lexical feature of NOUN. 99% of lemmas (3132) occur with only one value of Gender.
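The "lexical feature" test can be read as: group tokens by lemma and check how many lemmas are seen with exactly one Gender value. Below is a minimal sketch under that reading. The function name `single_gender_share` and the `(lemma, upos, gender)` tuple layout are hypothetical, and the token list is made up for illustration (only the fact that książę appears with both Masc and Neut comes from the paradigm above).

```python
from collections import defaultdict

# Made-up tokens as (lemma, UPOS, Gender value or None).
TOKENS = [
    ("książę", "NOUN", "Masc"),
    ("książę", "NOUN", "Neut"),
    ("koń", "NOUN", "Masc"),
    ("koń", "NOUN", "Masc"),
    ("ziemia", "NOUN", "Fem"),
    ("wielki", "ADJ", "Masc"),
]


def single_gender_share(tokens, upos):
    """Return (share, count) of lemmas of `upos` seen with exactly one Gender."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    if not values:
        return 0.0, 0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single / len(values), single


share, single = single_gender_share(TOKENS, "NOUN")
print(f"{share:.0%} of NOUN lemmas ({single}) occur with only one value of Gender")
```

A share close to 100% suggests the value belongs to the lemma itself; a low share suggests the feature is inflectional, as with adjectives that agree with their head noun.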

ADJ

4478 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4241; 95%), Aspect=EMPTY (3646; 81%), Polarity=EMPTY (3637; 81%), VerbForm=EMPTY (3637; 81%), Voice=EMPTY (3637; 81%), Number=Sing (3046; 68%).

ADJ tokens may have the following values of Gender:

Paradigm wielkiMascFemNeut
Animacy=Hum|Case=Acc|Degree=Pos|Number=PlurWielkich
Animacy=Hum|Case=Nom|Degree=Pos|Number=Plurwielcy
Animacy=Hum|Case=Nom|Degree=Sup|Number=Plurnajwięksi
Animacy=Nhum|Case=Acc|Degree=Pos|Number=SingWielkiego
Case=Acc|Degree=Pos|Number=Singwielkiwielkąwielkie
Case=Acc|Degree=Pos|Number=Plurwielkiewielkie
Case=Dat|Degree=Pos|Number=Singwielkiemuwielkiemu
Case=Dat|Degree=Pos|Number=Plurwielkim
Case=Gen|Degree=Pos|Number=Singwielkiegowielkiejwielkiego
Case=Gen|Degree=Pos|Number=Plurwielkichwielkichwielkich
Case=Gen|Degree=Sup|Number=Singnajwiększej
Case=Ins|Degree=Pos|Number=Singwielkimwielkąwielkim
Case=Ins|Degree=Pos|Number=Plurwielkiemi, wielkimiwielkiemi
Case=Ins|Degree=Sup|Number=Singnajwiększymnajwiększąnajwiększym
Case=Ins|Degree=Sup|Number=Plurnajwiększemi
Case=Loc|Degree=Pos|Number=Singwielkiejwielkim
Case=Loc|Degree=Cmp|Number=Plurwiększych
Case=Loc|Degree=Sup|Number=Singnajwiększej
Case=Loc|Degree=Sup|Number=Plurnawiętszych
Case=Nom|Degree=Pos|Number=Singwielkiwielkawielkie
Case=Nom|Degree=Pos|Number=Plurwielkiewielkie
Case=Nom|Degree=Cmp|Number=Singwiększe
Case=Nom|Degree=Sup|Number=Plurnajwiększe
Case=Voc|Degree=Pos|Number=Plurwielkie

DET

2212 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Reflex=EMPTY (1986; 90%), Number[psor]=EMPTY (1923; 87%), Person=EMPTY (1923; 87%), Poss=EMPTY (1697; 77%), Number=Sing (1416; 64%).

DET tokens may have the following values of Gender:

Paradigm tenMascFemNeut
Animacy=Hum|Case=Acc|Number=Singtego
Animacy=Hum|Case=Acc|Number=Plurtych
Animacy=Hum|Case=Nom|Number=Plurci
Animacy=Nhum|Case=Acc|Number=Singtego
Case=Acc|ExtPos=DET|Number=Singten
Case=Acc|ExtPos=DET|Number=Plurte
Case=Acc|Number=Singtento
Case=Acc|Number=Sing|Variant=Short
Case=Acc|Number=Plurtetete
Case=Dat|Number=Singtemutej
Case=Dat|Number=Plurtymtym
Case=Gen|Number=Singtegotejtego
Case=Gen|Number=Plurtychtychtych
Case=Ins|ExtPos=ADV|Number=SingTym
Case=Ins|ExtPos=DET|Number=Singtym
Case=Ins|Number=Singtym, temtym
Case=Ins|Number=Plurtemi, tymitemitemi
Case=Loc|ExtPos=DET|Number=Singtym
Case=Loc|Number=Singtym, temtejtym, tem
Case=Loc|Number=Plurtychtychtych
Case=Nom|ExtPos=DET|Number=Singten
Case=Nom|Number=Singtentato
Case=Nom|Number=Plurtetete

VERB

1805 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (1805; 100%), Person=EMPTY (1805; 100%), VerbForm=Fin (1805; 100%), Voice=Act (1805; 100%), Tense=Past (1763; 98%), Number=Sing (1341; 74%), Aspect=Perf (905; 50%).

VERB tokens may have the following values of Gender:

Paradigm mieć            Masc   Fem    Neut
Animacy=Hum|Number=Plur  mieli
Number=Sing              miał   miała  miało
Number=Plur              miały  miały  miały

PROPN

1434 PROPN tokens (98% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1266; 88%).

PROPN tokens may have the following values of Gender:

Paradigm SalemMascFem
Case=AccSalem
Case=NomSalem

Gender seems to be a lexical feature of PROPN. 99% of lemmas (874) occur with only one value of Gender.

PRON

1323 PRON tokens (51% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1323; 100%), Number=Sing (1058; 80%), Person=3 (730; 55%), PronType=Prs (730; 55%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Animacy=Hum|Case=Acc|Number=Sing|PrepCase=Pre|Variant=Shortń
Animacy=Hum|Case=Acc|Number=Plur|PrepCase=Npr|Variant=Shortich
Animacy=Hum|Case=Acc|Number=Plur|PrepCase=Pre|Variant=Shortnich
Animacy=Hum|Case=Dat|Number=Plur|PrepCase=Npr|Variant=Longim
Animacy=Hum|Case=Dat|Number=Plur|PrepCase=Npr|Variant=Shortim
Animacy=Hum|Case=Dat|Number=Plur|PrepCase=Pre|Variant=Shortnim
Animacy=Hum|Case=Gen|Number=Plur|PrepCase=Npr|Variant=Shortich
Animacy=Hum|Case=Gen|Number=Plur|PrepCase=Pre|Variant=Shortnich
Animacy=Hum|Case=Ins|Number=Plur|PrepCase=Pre|Variant=Shortniemi, nimi
Animacy=Hum|Case=Nom|Number=Plur|PrepCase=Npr|Variant=Shortoni
Animacy=Hum|Case=Nom|Number=Plur|PrepCase=Pre|Variant=Shortoni
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Shortgoje
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Longniego
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Shortńnią, nięnie
Case=Acc|Number=Plur|PrepCase=Npr|Variant=Shortjeje, nichje, ich
Case=Acc|Number=Plur|PrepCase=Pre|Variant=Shortnienienie
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Longjemu
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Shortmujejmu
Case=Dat|Number=Sing|PrepCase=Pre|Variant=Shortniemuniej
Case=Dat|Number=Plur|PrepCase=Npr|Variant=Shortim, Ichimim
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Longjegojejjego
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Shortgojejgo
Case=Gen|Number=Sing|PrepCase=Pre|Variant=Longniego
Case=Gen|Number=Sing|PrepCase=Pre|Variant=Shortniejniego, ń
Case=Gen|Number=Plur|PrepCase=Npr|Variant=Shortichichich
Case=Gen|Number=Plur|PrepCase=Pre|Variant=Shortnichnichnich
Case=Ins|Number=Sing|PrepCase=Npr|Variant=Shortnimnią
Case=Ins|Number=Sing|PrepCase=Pre|Variant=Shortnim, niemnią
Case=Ins|Number=Plur|PrepCase=Npr|Variant=Shortniemi, niminimi
Case=Ins|Number=Plur|PrepCase=Pre|Variant=Shortnimi
Case=Loc|Number=Sing|PrepCase=Npr|Variant=Shortnimniejnim
Case=Loc|Number=Sing|PrepCase=Pre|Variant=Shortnimniejnim
Case=Loc|Number=Plur|PrepCase=Npr|Variant=Shortnichnich
Case=Loc|Number=Plur|PrepCase=Pre|Variant=Shortnichnichnich
Case=Nom|Number=Sing|PrepCase=Npr|Variant=Longon
Case=Nom|Number=Sing|PrepCase=Npr|Variant=Shortonona
Case=Nom|Number=Sing|PrepCase=Pre|Variant=Shortona
Case=Nom|Number=Plur|PrepCase=Npr|Variant=Shortone

Gender seems to be a lexical feature of PRON. 92% of lemmas (12) occur with only one value of Gender.

NUM

276 NUM tokens (39% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (276; 100%), Number=Plur (244; 88%).

NUM tokens may have the following values of Gender:

Paradigm dwaMascFemNeut
Animacy=Hum|Case=Acc|Number=Plurdwóch, dwu, dwoje
Animacy=Hum|Case=Nom|Number=Plurdwaj, dwa, dwoje
Case=Acc|Number=Dualdwadwiedwie
Case=Acc|Number=Plurdwadwiedwoje, dwa
Case=Gen|Number=Dualdwóch
Case=Gen|Number=Plurdwóch, dwudwóch, dwu
Case=Ins|Number=Dualdwiema
Case=Ins|Number=Plurdwiema, dwojgiemdwiema, dwomadwiema
Case=Loc|Number=Plurdwu, dwóchdwudwóch
Case=Nom|Number=DualDwaDwiedwie
Case=Nom|Number=Plurdwa, Dwojedwiedwoje

AUX

206 AUX tokens (18% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (206; 100%), Person=EMPTY (206; 100%), Tense=Past (206; 100%), Variant=EMPTY (206; 100%), VerbForm=Fin (206; 100%), Voice=Act (206; 100%), Aspect=Imp (196; 95%), Number=Sing (150; 73%).

AUX tokens may have the following values of Gender:

Paradigm być             Masc      Fem   Neut
Animacy=Hum|Number=Plur  byli
Number=Sing              był, beł  była  było, beło
Number=Plur              były      były  były

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (2604; 99%), NOUN –[det]–> DET (937; 99%), NOUN –[conj]–> NOUN (507; 50%), NOUN –[det:poss]–> DET (485; 97%), NOUN –[acl]–> ADJ (414; 98%), ADJ –[conj]–> ADJ (338; 98%), VERB –[conj]–> VERB (280; 64%), NOUN –[nummod]–> NUM (198; 94%), NOUN –[appos]–> PROPN (150; 64%), VERB –[nsubj]–> PROPN (134; 65%).
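Agreement figures like these can be approximated by comparing the Gender value of each node with that of its parent. The sketch below is illustrative only: the function name `gender_agreement`, the tuple layout, and the sample sentence are hypothetical, and it assumes the percentage is taken over parent-child pairs in which both nodes carry a Gender value, which this page does not state explicitly.

```python
from collections import Counter

# Made-up sentence as (id, UPOS, Gender or None, head id, deprel).
SENTENCE = [
    (1, "ADJ", "Masc", 2, "amod"),
    (2, "NOUN", "Masc", 3, "nsubj"),
    (3, "VERB", "Masc", 0, "root"),
    (4, "NOUN", "Fem", 3, "obj"),
]


def gender_agreement(sentence):
    """Count agreeing and total parent-child pairs where both have Gender."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for _id, upos, gender, head, deprel in sentence:
        parent = by_id.get(head)  # None for the root (head id 0)
        if parent is None or gender is None or parent[2] is None:
            continue
        key = (parent[1], deprel, upos)  # parent UPOS, relation, child UPOS
        total[key] += 1
        if parent[2] == gender:
            agree[key] += 1
    return agree, total


agree, total = gender_agreement(SENTENCE)
for (parent_upos, deprel, child_upos), n in total.items():
    share = 100 * agree[(parent_upos, deprel, child_upos)] / n
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({n}; {share:.0f}% agree)")
```

Summing the two counters over all sentences of a treebank and sorting by the agreeing count would produce a ranking in the same form as the list above.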