home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Polish-PUD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

10147 tokens (55%) have a non-empty value of Gender. 6790 types (90%) occur at least once with a non-empty value of Gender. 4455 lemmas (89%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (4481; 24% instances), ADJ (2343; 13% instances), PROPN (1348; 7% instances), VERB (765; 4% instances), DET (510; 3% instances), PRON (357; 2% instances), NUM (182; 1% instances), AUX (161; 1% instances).

NOUN

4481 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (3210; 72%), Animacy=EMPTY (2467; 55%).

NOUN tokens may have the following values of Gender:

Paradigm państwoMascNeut
Animacy=Hum|Case=Nom|Number=Ptanpaństwo
Case=Acc|Number=Singpaństwo
Case=Dat|Number=Singpaństwu
Case=Gen|Number=Singpaństwa
Case=Nom|Number=Singpaństwo
Case=Nom|Number=Plurpaństwa

Gender seems to be lexical feature of NOUN. 100% lemmas (1920) occur only with one value of Gender.

ADJ

2343 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Aspect=EMPTY (1932; 82%), Polarity=EMPTY (1932; 82%), VerbForm=EMPTY (1932; 82%), Voice=EMPTY (1932; 82%), Degree=Pos (1852; 79%), Number=Sing (1641; 70%), Animacy=EMPTY (1187; 51%).

ADJ tokens may have the following values of Gender:

Paradigm dużyMascFemNeut
Animacy=Inan|Case=Acc|Degree=Pos|Number=Singduży
Animacy=Inan|Case=Acc|Degree=Sup|Number=Singnajwiększy
Animacy=Inan|Case=Gen|Degree=Pos|Number=Plurdużych
Animacy=Inan|Case=Gen|Degree=Cmp|Number=Singwiększego
Animacy=Inan|Case=Ins|Degree=Sup|Number=Singnajwiększym
Animacy=Inan|Case=Ins|Degree=Sup|Number=Plurnajwiększymi
Animacy=Inan|Case=Loc|Degree=Cmp|Number=Singwiększym
Animacy=Inan|Case=Loc|Degree=Sup|Number=Singnajwiększym
Animacy=Inan|Case=Nom|Degree=Pos|Number=Singduży
Animacy=Inan|Case=Nom|Degree=Cmp|Number=Singwiększy
Animacy=Inan|Case=Nom|Degree=Sup|Number=SingNajwiększy
Case=Acc|Degree=Pos|Number=Singdużąduże
Case=Acc|Degree=Pos|Number=Plurduże
Case=Acc|Degree=Cmp|Number=Singwiększą
Case=Acc|Degree=Cmp|Number=Plurwiększe
Case=Dat|Degree=Pos|Number=Singdużej
Case=Gen|Degree=Pos|Number=Singdużej
Case=Gen|Degree=Pos|Number=Plurdużych
Case=Gen|Degree=Cmp|Number=Singwiększej
Case=Gen|Degree=Cmp|Number=Plurwiększych
Case=Gen|Degree=Sup|Number=Singnajwiększej
Case=Ins|Degree=Pos|Number=Singdużą
Case=Ins|Degree=Cmp|Number=Singwiększą
Case=Ins|Degree=Sup|Number=Singnajwiększym
Case=Loc|Degree=Pos|Number=Singdużej
Case=Loc|Degree=Cmp|Number=Plurwiększych
Case=Nom|Degree=Pos|Number=Singduża
Case=Nom|Degree=Cmp|Number=Singwiększa

PROPN

1348 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1229; 91%).

PROPN tokens may have the following values of Gender:

Paradigm TrumpMascFem
Animacy=Hum|Case=AccTrumpa
Animacy=Hum|Case=DatTrumpowi
Animacy=Hum|Case=GenTrumpa
Animacy=Hum|Case=InsTrumpem
Animacy=Hum|Case=NomTrump
Case=NomTrump

Gender seems to be lexical feature of PROPN. 98% lemmas (948) occur only with one value of Gender.

VERB

765 VERB tokens (47% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=Ind (765; 100%), Person=EMPTY (765; 100%), VerbForm=Fin (765; 100%), Voice=Act (765; 100%), Tense=Past (761; 99%), Number=Sing (598; 78%), Aspect=Perf (494; 65%).

VERB tokens may have the following values of Gender:

Paradigm mócMascFemNeut
Animacy=Hum|Number=Singmógł
Animacy=Inan|Number=Plurmogły
Number=Singmogłamogło
Number=Plurmogły

DET

510 DET tokens (100% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Reflex=EMPTY (445; 87%), Poss=EMPTY (430; 84%), Number=Sing (282; 55%).

DET tokens may have the following values of Gender:

Paradigm któryMascFemNeut
Animacy=Hum|Case=Acc|Number=Sing|PronType=Relktórego
Animacy=Hum|Case=Acc|Number=Plur|PronType=Relktórych
Animacy=Hum|Case=Dat|Number=Plur|PronType=Relktórym
Animacy=Hum|Case=Gen|Number=Sing|PronType=Relktórego
Animacy=Hum|Case=Gen|Number=Plur|PronType=Relktórych
Animacy=Hum|Case=Nom|Number=Sing|PronType=Relktóry
Animacy=Hum|Case=Nom|Number=Plur|PronType=Relktórzy
Animacy=Inan|Case=Acc|Number=Sing|PronType=Intktóry
Animacy=Inan|Case=Acc|Number=Plur|PronType=Relktóre
Animacy=Inan|Case=Dat|Number=Sing|PronType=Relktóremu
Animacy=Inan|Case=Gen|Number=Sing|PronType=Relktórego
Animacy=Inan|Case=Gen|Number=Plur|PronType=Relktórych
Animacy=Inan|Case=Loc|Number=Sing|PronType=Relktórym
Animacy=Inan|Case=Nom|Number=Sing|PronType=Relktóry
Animacy=Inan|Case=Nom|Number=Plur|PronType=Relktóre
Animacy=Nhum|Case=Nom|Number=Sing|PronType=Relktóry
Case=Acc|Number=Sing|PronType=Relktóre
Case=Acc|Number=Plur|PronType=Intktóre
Case=Acc|Number=Plur|PronType=Relktóre
Case=Gen|Number=Sing|PronType=Relktórejktórego
Case=Gen|Number=Plur|PronType=Relktórychktórych
Case=Ins|Number=Sing|PronType=Relktórą
Case=Loc|Number=Sing|PronType=Relktórejktórym
Case=Loc|Number=Plur|PronType=Relktórych
Case=Nom|Number=Sing|PronType=Relktóraktóre
Case=Nom|Number=Plur|PronType=Relktórektóre

PRON

357 PRON tokens (56% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (357; 100%), Number=Sing (298; 83%), PronType=Prs (226; 63%), Person=3 (208; 58%), Animacy=EMPTY (196; 55%), Variant=Long (190; 53%), PrepCase=Npr (184; 52%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Animacy=Hum|Case=Acc|Number=Sing|PrepCase=Npr|Variant=Shortgo
Animacy=Hum|Case=Acc|Number=Plur|PrepCase=Pre|Variant=Longnich
Animacy=Hum|Case=Dat|Number=Sing|PrepCase=Npr|Variant=Shortmu
Animacy=Hum|Case=Dat|Number=Plur|PrepCase=Npr|Variant=Longim
Animacy=Hum|Case=Gen|Number=Sing|PrepCase=Npr|Variant=Longjego
Animacy=Hum|Case=Gen|Number=Sing|PrepCase=Pre|Variant=Longniego
Animacy=Hum|Case=Gen|Number=Plur|PrepCase=Npr|Variant=Longich
Animacy=Hum|Case=Gen|Number=Plur|PrepCase=Pre|Variant=Longnich
Animacy=Hum|Case=Ins|Number=Sing|PrepCase=Npr|Variant=Longnim
Animacy=Hum|Case=Ins|Number=Sing|PrepCase=Pre|Variant=Longnim
Animacy=Hum|Case=Ins|Number=Plur|PrepCase=Pre|Variant=Longnimi
Animacy=Hum|Case=Loc|Number=Sing|PrepCase=Pre|Variant=Longnim
Animacy=Hum|Case=Loc|Number=Plur|PrepCase=Pre|Variant=Longnich
Animacy=Hum|Case=Nom|Number=Sing|PrepCase=Npr|Variant=Longon
Animacy=Hum|Case=Nom|Number=Plur|PrepCase=Npr|Variant=Longoni
Animacy=Inan|Case=Acc|Number=Sing|PrepCase=Npr|Variant=Shortgo
Animacy=Inan|Case=Acc|Number=Sing|PrepCase=Pre|Variant=Longniego
Animacy=Inan|Case=Acc|Number=Plur|PrepCase=Npr|Variant=Longich, je
Animacy=Inan|Case=Gen|Number=Sing|PrepCase=Npr|Variant=Longjego
Animacy=Inan|Case=Gen|Number=Sing|PrepCase=Npr|Variant=Shortgo
Animacy=Inan|Case=Gen|Number=Sing|PrepCase=Pre|Variant=Longniego
Animacy=Inan|Case=Gen|Number=Sing|PrepCase=Pre|Variant=Shortń
Animacy=Inan|Case=Gen|Number=Plur|PrepCase=Npr|Variant=Longich
Animacy=Inan|Case=Nom|Number=Sing|PrepCase=Npr|Variant=Longon
Animacy=Inan|Case=Nom|Number=Plur|PrepCase=Npr|Variant=Longone
Case=Acc|Number=Sing|PrepCase=Npr|Variant=Longje
Case=Acc|Number=Sing|PrepCase=Pre|Variant=Longnią
Case=Acc|Number=Plur|PrepCase=Npr|Variant=Longjeje
Case=Acc|Number=Plur|PrepCase=Pre|Variant=Longnie
Case=Dat|Number=Sing|PrepCase=Npr|Variant=Longjej
Case=Dat|Number=Plur|PrepCase=Npr|Variant=Longim
Case=Gen|Number=Sing|PrepCase=Npr|Variant=Longjejjego
Case=Gen|Number=Sing|PrepCase=Pre|Variant=Longniejniego
Case=Gen|Number=Plur|PrepCase=Npr|Variant=Longichich
Case=Gen|Number=Plur|PrepCase=Pre|Variant=Longnich
Case=Loc|Number=Sing|PrepCase=Pre|Variant=Longniej
Case=Nom|Number=Sing|PrepCase=Npr|Variant=Longona
Case=Nom|Number=Plur|PrepCase=Npr|Variant=Longone

NUM

182 NUM tokens (100% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Plur (178; 98%), Animacy=Inan (112; 62%), NumForm=Digit (100; 55%), NumType=Card (100; 55%).

NUM tokens may have the following values of Gender:

Paradigm dwaMascFemNeut
Animacy=Hum|Case=Accdwóch
Animacy=Hum|Case=Gendwóch
Animacy=Hum|Case=Nomdwaj, Dwóch
Animacy=Inan|Case=Accdwa
Animacy=Inan|Case=Gendwóch
Animacy=Inan|Case=Insdwoma
Animacy=Inan|Case=Locdwóch
Animacy=Inan|Case=Nomdwa
Case=Accdwie
Case=Gendwóchdwóch
Case=Insdwiema
Case=Locdwóch
Case=Nomdwa

AUX

161 AUX tokens (35% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=Ind (161; 100%), Person=EMPTY (161; 100%), Tense=Past (161; 100%), VerbForm=Fin (161; 100%), Voice=Act (161; 100%), Number=Sing (122; 76%), Aspect=Imp (91; 57%).

AUX tokens may have the following values of Gender:

Paradigm byćMascFemNeut
Animacy=Hum|Number=Singbył
Animacy=Hum|Number=Plurbyli
Animacy=Inan|Number=Singbył
Animacy=Inan|Number=Plurbyły
Number=Singbyłabyło
Number=Plurbyłybyły

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (1333; 99%), VERB –[nsubj]–> NOUN (273; 53%), NOUN –[acl]–> ADJ (226; 98%), PROPN –[flat]–> PROPN (192; 93%), NOUN –[det]–> DET (188; 98%), VERB –[nsubj]–> PROPN (167; 76%), ADJ –[aux:pass]–> AUX (91; 65%), PROPN –[amod:flat]–> ADJ (89; 100%), NOUN –[det:poss]–> DET (80; 100%), ADJ –[nsubj:pass]–> NOUN (77; 97%).