Treebank Statistics: UD_Polish-LFG: Features: SubGender
This feature is language-specific.
It occurs with 3 different values: Masc1, Masc2, Masc3.
30073 tokens (23%) have a non-empty value of SubGender.
13875 types (47%) occur at least once with a non-empty value of SubGender.
8286 lemmas (53%) occur at least once with a non-empty value of SubGender.
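The token, type, and lemma counts above could be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is a made-up toy sentence, the function names `parse_feats` and `subgender_stats` are hypothetical, and it assumes that word forms are compared case-sensitively when counting types.

```python
from collections import Counter

# Hypothetical toy CoNLL-U fragment for illustration; not taken from the treebank.
SAMPLE = (
    "# text = Pan miał kota.\n"
    "1\tPan\tpan\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing|SubGender=Masc1\t2\tnsubj\t_\t_\n"
    "2\tmiał\tmieć\tVERB\t_\tGender=Masc|Number=Sing|SubGender=Masc1|Tense=Past\t0\troot\t_\t_\n"
    "3\tkota\tkot\tNOUN\t_\tCase=Acc|Gender=Masc|Number=Sing|SubGender=Masc2\t2\tobj\t_\t_\n"
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_\n"
)


def parse_feats(feats):
    """Turn a FEATS column such as 'Gender=Masc|SubGender=Masc1' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def subgender_stats(conllu_text, feature="SubGender"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    n_tokens = 0
    n_with_value = 0
    values = Counter()
    types_with, lemmas_with = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, feats = cols[1], cols[2], parse_feats(cols[5])
        n_tokens += 1
        value = feats.get(feature)
        if value is not None:
            n_with_value += 1
            values[value] += 1
            types_with.add(form)
            lemmas_with.add(lemma)
    return {
        "tokens": n_tokens,
        "tokens_with_value": n_with_value,
        "types_with_value": len(types_with),
        "lemmas_with_value": len(lemmas_with),
        "values": dict(values),
    }


stats = subgender_stats(SAMPLE)
share = stats["tokens_with_value"] / stats["tokens"]
print(f"{stats['tokens_with_value']} tokens ({share:.0%}) have a non-empty value")
```

To apply the same function to real data, the contents of a treebank file would be read into a string and passed in place of `SAMPLE`.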
The feature is used with 8 part-of-speech tags: NOUN (11867; 9% of instances), VERB (6036; 5% of instances), ADJ (3894; 3% of instances), PROPN (3028; 2% of instances), PRON (2760; 2% of instances), DET (1576; 1% of instances), NUM (565; 0% of instances), AUX (347; 0% of instances).
NOUN
11867 NOUN tokens (47% of all NOUN tokens) have a non-empty value of SubGender.
The most frequent other feature values with which NOUN and SubGender co-occurred: Gender=Masc (11867; 100%), Number=Sing (8150; 69%).
NOUN tokens may have the following values of SubGender:
Masc1 (3779; 32% of non-empty SubGender): pan, pana, panie, ludzi, ludzie, poseł, mężczyzna, panu, policjanci, człowiek
Masc2 (476; 4% of non-empty SubGender): złotych, zł, ptaki, koty, konie, kot, papierosy, psy, papierosa, psa
Masc3 (7612; 64% of non-empty SubGender): lat, domu, roku, raz, czas, dni, szpitala, dzień, pokoju, czasem
| Paradigm przewodnik | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Case=Acc\|Number=Sing | przewodnika | | |
| Case=Acc\|Number=Plur | przewodniki | | |
| Case=Gen\|Number=Plur | przewodników | | |
| Case=Ins\|Number=Sing | przewodnikiem | | |
| Case=Ins\|Number=Plur | przewodnikami | | |
SubGender seems to be a lexical feature of NOUN: 99% of lemmas (2769) occur with only one value of SubGender.
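The "lexical feature" test above amounts to checking how many lemmas are ever seen with more than one value. A minimal sketch of that check follows; the function name `single_value_share` and the `observations` list are hypothetical toy input, not figures from the treebank.

```python
from collections import defaultdict


def single_value_share(observations):
    """Return (single, total, share) for lemmas seen with exactly one value.

    `observations` is an iterable of (lemma, value) pairs, one per token
    that carries a non-empty value of the feature.
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    share = single / total if total else 0.0
    return single, total, share


# Hypothetical toy observations, invented for illustration only.
observations = [
    ("pan", "Masc1"),
    ("pan", "Masc1"),
    ("kot", "Masc2"),
    ("dom", "Masc3"),
    ("przewodnik", "Masc1"),
    ("przewodnik", "Masc3"),
]

single, total, share = single_value_share(observations)
print(f"{share:.0%} of lemmas ({single} of {total}) occur with only one value")
```

A share close to 100% suggests the value is a property of the lemma rather than of the individual word form.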
VERB
6036 VERB tokens (28% of all VERB tokens) have a non-empty value of SubGender.
The most frequent other feature values with which VERB and SubGender co-occurred: Gender=Masc (6036; 100%), Mood=Ind (6036; 100%), Person=EMPTY (6036; 100%), VerbForm=Fin (6036; 100%), Voice=Act (6036; 100%), Tense=Past (6009; 100%), Number=Sing (4543; 75%), Aspect=Perf (3612; 60%).
VERB tokens may have the following values of SubGender:
Masc1 (5233; 87% of non-empty SubGender): miał, chciał, mógł, widział, musiał, mieli, powiedział, zaczął, spojrzał, wiedział
Masc2 (147; 2% of non-empty SubGender): kantowały, miał, był, stał, chciał, grał, krążyły, miały, pojawiły, rozumiał
Masc3 (656; 11% of non-empty SubGender): mógł, zaczął, był, stał, były, miał, minął, nadszedł, panował, powstał
| Paradigm mieć | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Number=Sing | miał | miał | miał |
| Number=Plur | mieli | miały | miały |
ADJ
3894 ADJ tokens (45% of all ADJ tokens) have a non-empty value of SubGender.
The most frequent other feature values with which ADJ and SubGender co-occurred: Gender=Masc (3894; 100%), Aspect=EMPTY (3296; 85%), Polarity=EMPTY (3296; 85%), VerbForm=EMPTY (3296; 85%), Voice=EMPTY (3296; 85%), Degree=Pos (3123; 80%), Number=Sing (2616; 67%).
ADJ tokens may have the following values of SubGender:
Masc1 (1267; 33% of non-empty SubGender): sam, sami, inni, jeden, pierwszy, innych, stary, dobry, starszy, jednego
Masc2 (139; 4% of non-empty SubGender): jeden, małe, dzikie, jednego, pokrojone, Białego, Biały, Biedny, Inne, Luksusowego
Masc3 (2488; 64% of non-empty SubGender): cały, pierwszy, kolejny, jeden, drugi, inny, inne, wielki, nowy, ostatnie
| Paradigm sam | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Case=Acc\|Number=Sing | sam | | |
| Case=Acc\|Number=Plur | samych | same | |
| Case=Gen\|Number=Sing | samego | samego | |
| Case=Gen\|Number=Plur | samych | | |
| Case=Ins\|Number=Sing | samym | | |
| Case=Loc\|Number=Sing | samym | | |
| Case=Nom\|Number=Sing | sam | sam | |
| Case=Nom\|Number=Plur | sami | same | same |
PROPN
3028 PROPN tokens (66% of all PROPN tokens) have a non-empty value of SubGender.
The most frequent other feature values with which PROPN and SubGender co-occurred: Gender=Masc (3028; 100%), Number=Sing (2840; 94%), Case=Nom (1729; 57%).
PROPN tokens may have the following values of SubGender:
Masc1 (2451; 81% of non-empty SubGender): Polacy, Jerzy, Andrzej, Adam, Bóg, Michał, Krzysztof, Kwaśniewski, Niemcy, Aleksander
Masc2 (49; 2% of non-empty SubGender): Dior, Duduś, Puzon, stara, Bosmana, Bronek, Czerwonym, DigiPath, Dunaja, Dusiołku
Masc3 (528; 17% of non-empty SubGender): SLD, Izrael, Krakowa, Paryżu, Poznaniu, Wrocławia, Afganistanie, Gdańska, Izraelu, Kraków
| Paradigm Lech | Masc1 | Masc2 |
|---|---|---|
| Case=Acc | Lecha | |
| Case=Gen | Lecha | Lecha |
| Case=Ins | Lechem | |
| Case=Nom | Lech | |
SubGender seems to be a lexical feature of PROPN: 100% of lemmas (1842) occur with only one value of SubGender.
PRON
2760 PRON tokens (30% of all PRON tokens) have a non-empty value of SubGender.
The most frequent other feature values with which PRON and SubGender co-occurred: Gender=Masc (2760; 100%), Reflex=EMPTY (2760; 100%), PronType=Prs (2376; 86%), Number=Sing (2021; 73%), PrepCase=EMPTY (1404; 51%).
PRON tokens may have the following values of SubGender:
Masc1 (2505; 91% of non-empty SubGender): go, mnie, jego, mu, ja, mi, ich, nas, on, kto
Masc2 (55; 2% of non-empty SubGender): go, ich, mu, jego, nim, je, on, ci, mnie, niego
Masc3 (200; 7% of non-empty SubGender): go, je, nim, on, niego, one, ich, jego, nich, mu
| Paradigm on | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Case=Acc\|Number=Sing\|PrepCase=Npr\|Variant=Long | jego | | |
| Case=Acc\|Number=Sing\|PrepCase=Npr\|Variant=Short | go | go | go |
| Case=Acc\|Number=Sing\|PrepCase=Pre\|Variant=Long | niego | niego | niego |
| Case=Acc\|Number=Sing\|PrepCase=Pre\|Variant=Short | ń | ń | |
| Case=Acc\|Number=Plur\|PrepCase=Npr\|Variant=Long | ich | je, ich | je |
| Case=Acc\|Number=Plur\|PrepCase=Pre\|Variant=Long | nich | | |
| Case=Dat\|Number=Sing\|PrepCase=Npr\|Variant=Long | jemu | jemu | |
| Case=Dat\|Number=Sing\|PrepCase=Npr\|Variant=Short | mu | mu | mu |
| Case=Dat\|Number=Sing\|PrepCase=Pre\|Variant=Long | niemu | | |
| Case=Dat\|Number=Plur\|PrepCase=Npr\|Variant=Long | im | | |
| Case=Dat\|Number=Plur\|PrepCase=Pre\|Variant=Long | nim | | |
| Case=Gen\|Number=Sing\|PrepCase=Npr\|Variant=Long | jego, iego | jego | jego |
| Case=Gen\|Number=Sing\|PrepCase=Npr\|Variant=Short | go | go | go |
| Case=Gen\|Number=Sing\|PrepCase=Pre\|Variant=Long | niego | niego | |
| Case=Gen\|Number=Plur\|PrepCase=Npr\|Variant=Long | ich | ich | ich |
| Case=Gen\|Number=Plur\|PrepCase=Pre\|Variant=Long | nich | nich | nich |
| Case=Ins\|Number=Sing\|PrepCase=Npr\|Variant=Long | nim | nim | nim |
| Case=Ins\|Number=Sing\|PrepCase=Pre\|Variant=Long | nim | nim | nim |
| Case=Ins\|Number=Plur\|PrepCase=Pre\|Variant=Long | nimi | nimi | |
| Case=Loc\|Number=Sing\|PrepCase=Pre\|Variant=Long | nim | nim | nim |
| Case=Loc\|Number=Plur\|PrepCase=Pre\|Variant=Long | nich | nich | |
| Case=Nom\|Number=Sing\|PrepCase=Npr\|Variant=Long | on | on | on |
| Case=Nom\|Number=Plur\|PrepCase=Npr\|Variant=Long | oni | one | |
DET
1576 DET tokens (49% of all DET tokens) have a non-empty value of SubGender.
The most frequent other feature values with which DET and SubGender co-occurred: Gender=Masc (1576; 100%), Number[psor]=EMPTY (1396; 89%), Person=EMPTY (1396; 89%), NumType=EMPTY (1353; 86%), Poss=EMPTY (1261; 80%), Number=Sing (906; 57%).
DET tokens may have the following values of SubGender:
Masc1 (491; 31% of non-empty SubGender): ten, wielu, wszyscy, każdy, mój, ci, wszystkich, tego, który, niektórzy
Masc2 (55; 3% of non-empty SubGender): ten, te, który, nasze, takiego, tego, Twój, Wszystkie, ile, któryś
Masc3 (1030; 65% of non-empty SubGender): ten, tym, tego, kilka, te, swój, jakiś, taki, takie, tych
| Paradigm ten | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Case=Acc\|Number=Sing | tego | tego | ten, tyn |
| Case=Acc\|Number=Plur | tych | te | te |
| Case=Dat\|Number=Plur | tym | | |
| Case=Gen\|Number=Sing | tego | tego | tego |
| Case=Gen\|Number=Plur | tych | tych | |
| Case=Ins\|Number=Sing | tym | tym | tym |
| Case=Ins\|Number=Plur | tymi | | |
| Case=Loc\|Number=Sing | tym | tym | |
| Case=Loc\|Number=Plur | tych | | |
| Case=Nom\|Number=Sing | ten | ten | ten |
| Case=Nom\|Number=Plur | ci | te | te |
NUM
565 NUM tokens (68% of all NUM tokens) have a non-empty value of SubGender.
The most frequent other feature values with which NUM and SubGender co-occurred: Gender=Masc (565; 100%), Number=Plur (565; 100%), NumType=Card (563; 100%), Case=Acc (399; 71%).
NUM tokens may have the following values of SubGender:
Masc1 (176; 31% of non-empty SubGender): dwóch, trzech, czterech, dwaj, obaj, pięciu, trzej, obu, sześciu, stu
Masc2 (49; 9% of non-empty SubGender): 1500, 20, 4, 600, dwa, pięć, siedem, sto, trzy, 1.000
Masc3 (340; 60% of non-empty SubGender): dwa, trzy, cztery, dwóch, dwadzieścia, sto, trzech, 10, 4, 80
| Paradigm dwa | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Case=Acc | dwóch | dwa | |
| Case=Gen | dwóch | dwóch | dwóch, dwu |
| Case=Ins | dwoma | dwoma | |
| Case=Loc | dwóch | | |
| Case=Nom | dwaj | dwa | dwa |
AUX
347 AUX tokens (8% of all AUX tokens) have a non-empty value of SubGender.
The most frequent other feature values with which AUX and SubGender co-occurred: Gender=Masc (347; 100%), Mood=Ind (347; 100%), Person=EMPTY (347; 100%), Tense=Past (347; 100%), Variant=EMPTY (347; 100%), VerbForm=Fin (347; 100%), Voice=Act (291; 84%), Aspect=Imp (283; 82%), Number=Sing (266; 77%).
AUX tokens may have the following values of SubGender:
Masc1 (194; 56% of non-empty SubGender): był, byli, został, zostali, bywał
Masc2 (7; 2% of non-empty SubGender): był, został, były
Masc3 (146; 42% of non-empty SubGender): był, został, były, zostały
| Paradigm być | Masc1 | Masc2 | Masc3 |
|---|---|---|---|
| Number=Sing | był | był | |
| Number=Sing\|Voice=Act | był | był | był |
| Number=Plur | byli | były | |
| Number=Plur\|Voice=Act | byli | były | były |
Relations with Agreement in SubGender
The 10 most frequent relations where the parent and child nodes agree in SubGender:
NOUN –[amod]–> ADJ (2435; 100%),
VERB –[nsubj]–> NOUN (1562; 51%),
NOUN –[det]–> DET (1216; 100%),
VERB –[nsubj]–> PROPN (798; 75%),
VERB –[conj]–> VERB (487; 62%),
NOUN –[nummod]–> NUM (476; 100%),
NOUN –[flat]–> PROPN (466; 96%),
NOUN –[acl]–> ADJ (357; 100%),
PROPN –[flat]–> PROPN (247; 97%),
NOUN –[flat]–> NOUN (131; 93%).
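Agreement counts of this kind could be gathered by walking each dependency edge and comparing the feature on both ends. The sketch below is one possible way to do it, not the procedure used to build this page. The page does not state what the percentages are relative to, so the code assumes the denominator is the number of edges of that type where both nodes carry a value. The function name `agreement_counts`, the token dictionary layout, and the sample sentence are all hypothetical.

```python
from collections import Counter


def agreement_counts(sentence, feature="SubGender"):
    """Count parent-child edges that agree in `feature`.

    `sentence` is a list of token dicts with keys id, head, upos, deprel, feats.
    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    edges that agree, and edges where both nodes carry a value (assumed denominator).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, comparable = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # head 0 is the artificial root
        parent_value = parent["feats"].get(feature)
        child_value = child["feats"].get(feature)
        if parent_value is None or child_value is None:
            continue  # at least one node has an empty value
        key = (parent["upos"], child["deprel"], child["upos"])
        comparable[key] += 1
        if parent_value == child_value:
            agree[key] += 1
    return agree, comparable


# Hypothetical toy sentence ("Stary pan miał kota"), invented for illustration.
sentence = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": {"SubGender": "Masc1"}},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "feats": {"SubGender": "Masc1"}},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": {"SubGender": "Masc1"}},
    {"id": 4, "head": 3, "upos": "NOUN", "deprel": "obj", "feats": {"SubGender": "Masc2"}},
]

agree, comparable = agreement_counts(sentence)
for (parent_upos, deprel, child_upos), n in agree.most_common(10):
    pct = n / comparable[(parent_upos, deprel, child_upos)]
    print(f"{parent_upos} –[{deprel}]–> {child_upos} ({n}; {pct:.0%})")
```

Summing the two counters over all sentences of a treebank and sorting by the agreement count would give a ranking in the same format as the list above.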