
This page pertains to UD version 2.

Treebank Statistics: UD_Polish-LFG: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

58313 tokens (45%) have a non-empty value of Gender. 26293 types (88%) occur at least once with a non-empty value of Gender. 13954 lemmas (89%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (25323; 19% instances), VERB (9421; 7% instances), ADJ (8525; 7% instances), PRON (5742; 4% instances), PROPN (4575; 3% instances), DET (3201; 2% instances), NUM (833; 1% instances), AUX (693; 1% instances).
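Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (FORM, LEMMA, UPOS and FEATS in columns 2, 3, 4 and 6). The names `parse_feats`, `gender_coverage` and `row`, and the one-sentence sample, are hypothetical; the script that generated this page may count differently, for example in how it handles multiword tokens or letter case in types.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_coverage(conllu_text):
    """Count tokens, types and lemmas that carry a Gender value, plus a per-UPOS breakdown."""
    total = 0
    with_gender = 0
    types, lemmas = set(), set()
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender += 1
            types.add(cols[1])
            lemmas.add(cols[2])
            by_upos[cols[3]] += 1
    return {
        "total": total,
        "with_gender": with_gender,
        "types": len(types),
        "lemmas": len(lemmas),
        "by_upos": by_upos,
    }


def row(idx, form, lemma, upos, feats):
    """Build one 10-column CoNLL-U line; columns not needed here are left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, "_", "_", "_", "_"])


# Hypothetical sample sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Ona miała psa.",
    row(1, "Ona", "on", "PRON", "Case=Nom|Gender=Fem|Number=Sing"),
    row(2, "miała", "mieć", "VERB", "Gender=Fem|Number=Sing|Tense=Past"),
    row(3, "psa", "pies", "NOUN", "Case=Acc|Gender=Masc|Number=Sing"),
    row(4, ".", ".", "PUNCT", "_"),
])

stats = gender_coverage(SAMPLE)
share = 100 * stats["with_gender"] / stats["total"]
print(f'{stats["with_gender"]} tokens ({share:.0f}%) have a non-empty value of Gender')
```

On the sample, three of the four tokens carry Gender, so the printed share is 75%.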

NOUN

25323 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (18153; 72%), SubGender=EMPTY (13456; 53%).

NOUN tokens may have the following values of Gender:

Paradigm de (columns: Masc / Fem / Neut)
Case=Gen: de
Case=Loc: de
Case=Nom: de / De

Gender seems to be a lexical feature of NOUN. 100% of lemmas (6220) occur with only one value of Gender.
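The lexical-feature check can be sketched as follows: collect, for every lemma of a given part of speech, the set of Gender values it occurs with, then report the share of lemmas whose set has exactly one member. This is an illustration under assumptions, not the page's actual generator; `lexical_share` and the sample rows are hypothetical.

```python
from collections import defaultdict


def lexical_share(rows, upos, feature="Gender"):
    """rows: iterable of (lemma, upos, feats_dict).

    Returns (number of lemmas seen with the feature,
             share of those lemmas that occur with exactly one value).
    """
    values = defaultdict(set)
    for lemma, tag, feats in rows:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return len(values), single / len(values)


# Hypothetical sample rows, not taken from the treebank.
ROWS = [
    ("pies", "NOUN", {"Gender": "Masc"}),
    ("pies", "NOUN", {"Gender": "Masc"}),
    ("kobieta", "NOUN", {"Gender": "Fem"}),
    ("okno", "NOUN", {"Gender": "Neut"}),
    ("sam", "ADJ", {"Gender": "Masc"}),
    ("sam", "ADJ", {"Gender": "Fem"}),
]

noun_lemmas, noun_share = lexical_share(ROWS, "NOUN")
adj_lemmas, adj_share = lexical_share(ROWS, "ADJ")
print(f"NOUN: {noun_share:.0%} of lemmas ({noun_lemmas}) occur with only one value")
print(f"ADJ: {adj_share:.0%} of lemmas ({adj_lemmas}) occur with only one value")
```

A share close to 100% suggests the feature is fixed per lemma (lexical); a low share, as for the adjective in the sample, suggests it is inflectional.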

VERB

9421 VERB tokens (44% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (9421; 100%), Person=EMPTY (9421; 100%), VerbForm=Fin (9421; 100%), Voice=Act (9421; 100%), Tense=Past (9358; 99%), Number=Sing (7533; 80%), Aspect=Perf (5584; 59%), SubGender=Masc1 (5233; 56%).

VERB tokens may have the following values of Gender:

Paradigm mieć (columns: Masc / Fem / Neut)
Number=Sing: miał / miała / miało
Number=Plur: mieli, miały / miały / miały
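A paradigm table like the one above can be viewed as a nested mapping: row label (the other feature values) to Gender value to the set of attested word forms. The sketch below assumes tokens are already available as `(form, lemma, feats)` triples; `build_paradigm`, its `row_features` parameter and the sample tokens are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def build_paradigm(tokens, lemma, row_features=("Number",), feature="Gender"):
    """Group the attested forms of one lemma by row label and by feature value."""
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        # The row label joins the selected features, e.g. 'Number=Sing'.
        label = "|".join(f"{name}={feats[name]}" for name in row_features if name in feats)
        table[label][feats[feature]].add(form)
    return table


# Hypothetical sample tokens mirroring the forms listed for 'mieć'.
TOKENS = [
    ("miał", "mieć", {"Gender": "Masc", "Number": "Sing"}),
    ("miała", "mieć", {"Gender": "Fem", "Number": "Sing"}),
    ("miało", "mieć", {"Gender": "Neut", "Number": "Sing"}),
    ("mieli", "mieć", {"Gender": "Masc", "Number": "Plur"}),
    ("miały", "mieć", {"Gender": "Masc", "Number": "Plur"}),
    ("miały", "mieć", {"Gender": "Fem", "Number": "Plur"}),
    ("miały", "mieć", {"Gender": "Neut", "Number": "Plur"}),
    ("był", "być", {"Gender": "Masc", "Number": "Sing"}),
]

paradigm = build_paradigm(TOKENS, "mieć")
for label, cells in paradigm.items():
    line = " / ".join(", ".join(sorted(cells.get(g, ()))) for g in ("Masc", "Fem", "Neut"))
    print(f"{label}: {line}")
```

Using sets means a cell holds each distinct form once, and a cell with several forms (such as Masc plural here) simply contains more than one member.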

ADJ

8525 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Aspect=EMPTY (7332; 86%), Polarity=EMPTY (7332; 86%), VerbForm=EMPTY (7332; 86%), Voice=EMPTY (7332; 86%), Degree=Pos (6989; 82%), Number=Sing (5897; 69%), SubGender=EMPTY (4631; 54%).

ADJ tokens may have the following values of Gender:

Paradigm sam (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing: sam / samą / samo
Case=Acc|Number=Plur: same, samych / same / same
Case=Dat|Number=Sing: samej
Case=Gen|Number=Sing: samego / samej / samego
Case=Gen|Number=Plur: samych / samych
Case=Ins|Number=Sing: samym / samą / samym
Case=Loc|Number=Sing: samym / samej / samym
Case=Loc|Number=Plur: samych
Case=Nom|Number=Sing: sam / sama / samo
Case=Nom|Number=Plur: sami, same / same / same

PRON

5742 PRON tokens (62% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (5742; 100%), Number=Sing (4817; 84%), PrepCase=EMPTY (3601; 63%), PronType=Prs (3567; 62%), SubGender=EMPTY (2982; 52%), Variant=EMPTY (2872; 50%).

PRON tokens may have the following values of Gender:

Paradigm on (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Long: jego / je
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Short: go
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Long: niego / nią / nie
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Short: ń
Case=Acc|Number=Plur|PrepCase=Npr|Variant=Long: ich, je / je / je
Case=Acc|Number=Plur|PrepCase=Pre|Variant=Long: nich / nie / nie
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Long: jemu / jej
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Short: mu / mu
Case=Dat|Number=Sing|PrepCase=Pre|Variant=Long: niemu / niej
Case=Dat|Number=Plur|PrepCase=Npr|Variant=Long: im / im / im
Case=Dat|Number=Plur|PrepCase=Pre|Variant=Long: nim / nim
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Long: jego, iego / jej / jego
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Short: go / go
Case=Gen|Number=Sing|PrepCase=Pre|Variant=Long: niego / niej / niego
Case=Gen|Number=Plur|PrepCase=Npr|Variant=Long: ich / ich / ich
Case=Gen|Number=Plur|PrepCase=Pre|Variant=Long: nich / nich / nich
Case=Ins|Number=Sing|PrepCase=Npr|Variant=Long: nim / nią / nim
Case=Ins|Number=Sing|PrepCase=Pre|Variant=Long: nim / nią / nim
Case=Ins|Number=Plur|PrepCase=Npr|Variant=Long: nimi
Case=Ins|Number=Plur|PrepCase=Pre|Variant=Long: nimi / nimi / nimi
Case=Loc|Number=Sing|PrepCase=Pre|Variant=Long: nim / niej / nim
Case=Loc|Number=Plur|PrepCase=Pre|Variant=Long: nich / nich / nich
Case=Nom|Number=Sing|PrepCase=Npr|Variant=Long: on / ona / ono, one
Case=Nom|Number=Plur|PrepCase=Npr|Variant=Long: oni, one / one / one

PROPN

4575 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (4255; 93%), SubGender=Masc1 (2451; 54%), Case=Nom (2304; 50%).

PROPN tokens may have the following values of Gender:

Paradigm PiS (columns: Masc / Fem / Neut)
Case=Dat: PiS-owi
Case=Gen: PiS-u
Case=Nom: PiS / PiS / PiS

Gender seems to be a lexical feature of PROPN. 99% of lemmas (2682) occur with only one value of Gender.

DET

3201 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: NumType=EMPTY (2806; 88%), Number[psor]=EMPTY (2772; 87%), Person=EMPTY (2772; 87%), Poss=EMPTY (2455; 77%), Number=Sing (1962; 61%), SubGender=EMPTY (1625; 51%).

DET tokens may have the following values of Gender:

Paradigm ten (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing: ten, tego, tyn / tę, tą, ta, te / to, te
Case=Acc|Number=Plur: te, tych / te / te
Case=Dat|Number=Sing: tej / temu
Case=Dat|Number=Plur: tym / tym
Case=Gen|Number=Sing: tego / tej / tego
Case=Gen|Number=Plur: tych / tych / tych
Case=Ins|Number=Sing: tym / tym
Case=Ins|Number=Plur: tymi / tymi / tymi
Case=Loc|Number=Sing: tym / tej / tym
Case=Loc|Number=Plur: tych / tych / tych
Case=Nom|Number=Sing: ten / ta / to
Case=Nom|Number=Plur: te, ci / te / te

NUM

833 NUM tokens (100% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (833; 100%), NumType=Card (828; 99%), Case=Acc (561; 67%).

NUM tokens may have the following values of Gender:

Paradigm dwa (columns: Masc / Fem / Neut)
Case=Acc: dwa, dwóch / dwie / dwa
Case=Gen: dwóch, dwu / dwóch / dwu
Case=Ins: dwoma / dwiema, dwoma
Case=Loc: dwóch / dwóch / dwóch
Case=Nom: dwa, dwaj / dwie / dwa

AUX

693 AUX tokens (16% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (693; 100%), Person=EMPTY (693; 100%), Tense=Past (693; 100%), Variant=EMPTY (693; 100%), VerbForm=Fin (693; 100%), Aspect=Imp (577; 83%), Number=Sing (565; 82%), Voice=Act (540; 78%).

AUX tokens may have the following values of Gender:

Paradigm być (columns: Masc / Fem / Neut)
Number=Sing: był / była / było
Number=Sing|Voice=Act: był / była / było
Number=Plur: były, byli / były / były
Number=Plur|Voice=Act: byli, były / były / były

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (5634; 100%), NOUN –[det]–> DET (2591; 100%), VERB –[nsubj]–> PROPN (1029; 73%), NOUN –[acl]–> ADJ (725; 100%), NOUN –[nummod]–> NUM (716; 100%), VERB –[conj]–> VERB (670; 64%), NOUN –[flat]–> PROPN (583; 95%), PROPN –[flat]–> PROPN (325; 98%), ADJ –[conj]–> ADJ (265; 93%), ADJ –[nsubj]–> NOUN (224; 99%).
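Agreement counts of this kind can be derived by walking every dependency edge and comparing the Gender values of parent and child. The sketch below assumes that the percentage is the share of agreeing pairs among pairs in which both nodes carry Gender; the page does not state its denominator, so this is an assumption. The function `agreement_stats` and the sample sentence are hypothetical.

```python
from collections import Counter


def agreement_stats(sentence, feature="Gender"):
    """sentence: list of dicts with keys id, head, upos, deprel, feats.

    Returns {(parent UPOS, deprel, child UPOS): (agreeing pairs, share agreeing)},
    counting only pairs where both nodes have a value for the feature.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # head 0 marks the root, which has no parent node
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_value == child_value:
            agree[key] += 1
    return {key: (agree[key], agree[key] / both[key]) for key in both}


# Hypothetical sample: "Ta sama kobieta miała psa" (not taken from the treebank).
SENTENCE = [
    {"id": 1, "head": 3, "upos": "DET", "deprel": "det", "feats": {"Gender": "Fem"}},
    {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Fem"}},
    {"id": 3, "head": 4, "upos": "NOUN", "deprel": "nsubj", "feats": {"Gender": "Fem"}},
    {"id": 4, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"Gender": "Fem"}},
    {"id": 5, "head": 4, "upos": "NOUN", "deprel": "obj", "feats": {"Gender": "Masc"}},
]

result = agreement_stats(SENTENCE)
for (parent_upos, deprel, child_upos), (count, share) in sorted(result.items()):
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({count}; {share:.0%})")
```

In the sample, the determiner, adjective and subject agree with their parents, while the object does not, which mirrors why relations such as amod and det score near 100% and verb-argument relations score lower.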