Treebank Statistics: UD_Czech-Poetry: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
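A layered feature name carries its layer in square brackets after the base name. The following is a minimal sketch of how such names could be split apart; the helper `split_layer` and its regular expression are illustrative assumptions, not part of any UD tool.

```python
import re

# Base feature name, optionally followed by a layer in square brackets.
# The pattern is an assumption made for this illustration.
LAYERED = re.compile(r"^(?P<name>[A-Za-z0-9]+)(?:\[(?P<layer>[a-z0-9]+)\])?$")

def split_layer(feature):
    """Split 'Gender[psor]' into ('Gender', 'psor'); a plain name gets layer None."""
    match = LAYERED.match(feature)
    if match is None:
        raise ValueError(f"not a feature name: {feature!r}")
    return match.group("name"), match.group("layer")

for name in ("Gender", "Gender[psor]"):
    print(name, "->", split_layer(name))
```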
2747 tokens (44%) have a non-empty value of Gender.
2009 types (75%) occur at least once with a non-empty value of Gender.
1380 lemmas (72%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags (count; share of all tokens): NOUN (1466; 23%), ADJ (595; 9%), VERB (240; 4%), DET (223; 4%), PRON (111; 2%), PROPN (84; 1%), AUX (18; 0%), NUM (10; 0%).
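Counts of this kind can be reproduced from a CoNLL-U file, where each token line has ten tab-separated columns with UPOS in the fourth and FEATS in the sixth. The sketch below is one possible way to tally them, not the script that generated this page; the function names and the three-token sample are hypothetical.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_by_upos(lines):
    """Count tokens per UPOS tag and how many of them carry a Gender value."""
    total, with_gender = Counter(), Counter()
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in cols[0] or "." in cols[0]:
            continue
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:
            with_gender[upos] += 1
    return total, with_gender

# Hypothetical sample sentence, not taken from the treebank.
SAMPLE = [
    "# text = tichá duše zůstala",
    "1\ttichá\ttichý\tADJ\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tamod\t_\t_",
    "2\tduše\tduše\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_",
    "3\tzůstala\tzůstat\tVERB\t_\tGender=Fem|Number=Sing|VerbForm=Part\t0\troot\t_\t_",
]

total, with_gender = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(f"{upos}: {with_gender[upos]} of {total[upos]} tokens have Gender")
```

To run it on a real file, pass an open file handle instead of `SAMPLE`, since the function only iterates over lines.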
NOUN
1466 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (1047; 71%), Animacy=EMPTY (830; 57%).
NOUN tokens may have the following values of Gender:
Fem (598; 41% of non-empty Gender): duše, duši, chvíli, země, duší, lásky, ruka, tvář, vůní, zář
Masc (636; 43% of non-empty Gender): den, svět, boha, bože, bůh, květ, člověk, bohy, oheň, sen
Neut (232; 16% of non-empty Gender): žití, oči, srdce, štěstí, nebes, těla, dítě, jaro, moře, očima
| Paradigm oko | Fem | Neut |
|---|---|---|
| Case=Acc\|Number=Plur | oči | |
| Case=Gen\|Number=Plur | očí | očí |
| Case=Ins\|Number=Sing | okem | |
| Case=Ins\|Number=Dual | očima | |
| Case=Loc\|Number=Sing | oku | |
| Case=Nom\|Number=Sing | oko | |
| Case=Nom\|Number=Plur | oči | |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (727) occur with only one value of Gender.
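The lexical-feature check above amounts to asking how many lemmas are ever seen with more than one Gender value. A minimal sketch of that calculation follows, assuming the input is a list of (lemma, Gender) observations; the function name and the sample data are hypothetical, with "oko" mirroring the mixed paradigm shown above.

```python
from collections import defaultdict

def single_value_share(pairs):
    """Return (n_single, n_lemmas): lemmas seen with exactly one Gender value, and all lemmas."""
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    n_single = sum(1 for seen in values.values() if len(seen) == 1)
    return n_single, len(values)

# Hypothetical observations, not counts from the treebank.
OBSERVED = [
    ("duše", "Fem"),
    ("duše", "Fem"),
    ("den", "Masc"),
    ("oko", "Fem"),
    ("oko", "Neut"),
]

n_single, n_lemmas = single_value_share(OBSERVED)
print(f"{n_single} of {n_lemmas} lemmas occur with only one value of Gender")
```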
ADJ
595 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Polarity=Pos (572; 96%), Degree=Pos (560; 94%), Aspect=EMPTY (533; 90%), Voice=EMPTY (493; 83%), VerbForm=EMPTY (492; 83%), Number=Sing (425; 71%), Animacy=EMPTY (365; 61%).
ADJ tokens may have the following values of Gender:
Fem (260; 44% of non-empty Gender): bílé, jiné, nové, tmavou, věčné, černá, Vyloupena, Zsinalá, bledá, kalná
Masc (248; 42% of non-empty Gender): celý, plný, bílý, kamenném, tvrdém, věrni, Mnohý, Pozdní, divoké, jiní
Neut (87; 15% of non-empty Gender): nesmírném, pozlátkové, smilnící, tichém, umdlená, věčné, Astartiných, Lepší, Oněmlé, bledé
EMPTY (2): aj, marně
| Paradigm tichý | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim\|Case=Gen\|Number=Plur | tichých | | |
| Animacy=Inan\|Case=Acc\|Number=Sing | tichý | | |
| Animacy=Inan\|Case=Nom\|Number=Sing | tichý | | |
| Case=Acc\|Number=Sing | tichou | | |
| Case=Ins\|Number=Sing | tichou | | |
| Case=Loc\|Number=Sing | tiché | tichém | |
| Case=Nom\|Number=Sing | tiché | | |
| Case=Nom\|Number=Plur | tiché | | |
VERB
240 VERB tokens (32% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (240; 100%), Person=EMPTY (240; 100%), Voice=Act (236; 98%), Tense=Past (233; 97%), VerbForm=Part (233; 97%), Polarity=Pos (226; 94%), Number=Sing (191; 80%), Aspect=Imp (129; 54%).
VERB tokens may have the following values of Gender:
Fem (62; 26% of non-empty Gender): Rozhučela, povylétla, rozlívala, vzrostla, Krákorala, Spadaly, Zavedla, Zřídila, bila, blyskla
Masc (168; 70% of non-empty Gender): měl, chtěl, poznali, viděl, ctil, šel, cítil, dal, klečel, kmital
Neut (10; 4% of non-empty Gender): nezaneslo, nezřely, přešlo, přihodilo, sila, svitlo, vyschla, zklamalo, zrálo, zůstaly
EMPTY (503): jdou, letí, chce, jde, zdá, chcem, hledá, mám, stojí, vím
| Paradigm zůstat | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim | zůstali | | |
| | zůstaly | zůstaly | |
Gender seems to be a lexical feature of VERB: 93% of lemmas (165) occur with only one value of Gender.
DET
223 DET tokens (77% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (177; 79%), Number[psor]=EMPTY (177; 79%), Person=EMPTY (176; 79%), Animacy=EMPTY (172; 77%), Reflex=EMPTY (172; 77%), Poss=EMPTY (128; 57%).
DET tokens may have the following values of Gender:
Fem (76; 34% of non-empty Gender): tvé, své, té, svou, ta, moje, která, mou, naší, tvou
Masc (78; 35% of non-empty Gender): náš, ten, ti, každý, svůj, sám, které, všechny, žádné, její
Neut (69; 31% of non-empty Gender): to, tom, ty, své, vše, vším, svá, svého, tvé, tím
EMPTY (66): jeho, jejich, svých, jich, tolika, těm, Ti, svoje, tolik, tvých
| Paradigm ten | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim\|Case=Dat\|Number=Plur | těm | | |
| Animacy=Anim\|Case=Gen\|Number=Sing | toho | | |
| Animacy=Anim\|Case=Nom\|Number=Plur | ti | | |
| Animacy=Inan\|Case=Acc\|Number=Sing | ten, sěn | | |
| Case=Acc\|Number=Sing | tu | to | |
| Case=Acc\|Number=Plur | ty | ta | |
| Case=Dat\|Number=Sing | té | tomu | |
| Case=Gen\|Number=Sing | té | | |
| Case=Ins\|Number=Sing | tím | | |
| Case=Loc\|Number=Sing | té | tom | |
| Case=Nom\|Number=Sing | ten | ta | to |
| Case=Nom\|Number=Plur | Ty | ty | |
PRON
111 PRON tokens (29% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (111; 100%), Variant=EMPTY (101; 91%), Number=Sing (87; 78%), Animacy=EMPTY (63; 57%), Person=EMPTY (56; 50%), PronType=Prs (56; 50%).
PRON tokens may have the following values of Gender:
Fem (37; 33% of non-empty Gender): jež, jí, ji, níž, ni, Vlasti, je, již, nich, ní
Masc (67; 60% of non-empty Gender): jenž, mu, jež, ho, jej, kdo, ním, Němu, jemu, nikdo
Neut (7; 6% of non-empty Gender): jež, všecko, ním, něco, všecka
EMPTY (270): se, co, mi, si, já, ty, tě, tobě, nás, ti
PROPN
84 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (73; 87%), Animacy=Anim (44; 52%).
PROPN tokens may have the following values of Gender:
Fem (27; 32% of non-empty Gender): Magdaleno, Maria, Svitava, Vltavy, Evy, Francii, Golgatu, Lůnou, Madonny, Panno
Masc (55; 65% of non-empty Gender): Armand, Sion, Angelico, Armandovi, Azték, Bajušáku, Baudelaira, Chodováci, Diderota, Dudákovi
Neut (2; 2% of non-empty Gender): Labe
Gender seems to be a lexical feature of PROPN: 100% of lemmas (69) occur with only one value of Gender.
AUX
18 AUX tokens (13% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=Imp (18; 100%), Mood=EMPTY (18; 100%), Person=EMPTY (18; 100%), Tense=Past (18; 100%), VerbForm=Part (18; 100%), Voice=Act (18; 100%), Polarity=Pos (17; 94%), Number=Sing (15; 83%).
AUX tokens may have the following values of Gender:
Fem (4; 22% of non-empty Gender): Nebyla, byla, byly, bývaly
Masc (9; 50% of non-empty Gender): byl, byli, jsi
Neut (5; 28% of non-empty Gender): bylo
EMPTY (118): je, by, jsem, jste, jest, jsi, jsou, budeš, bude, bych
| Paradigm být | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim\|Number=Sing\|Polarity=Pos | byl | | |
| Animacy=Anim\|Number=Plur\|Polarity=Pos | byli | | |
| Number=Sing\|Polarity=Neg | Nebyla | | |
| Number=Sing\|Polarity=Pos | byl, jsi | byla | bylo |
| Number=Plur\|Polarity=Pos | byly | | |
NUM
10 NUM tokens (56% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (10; 100%), NumType=Card (10; 100%), Number=Sing (9; 90%), Case=Nom (6; 60%).
NUM tokens may have the following values of Gender:
Fem (3; 30% of non-empty Gender): jedna, jednou, jedné
Masc (6; 60% of non-empty Gender): jeden, dva
Neut (1; 10% of non-empty Gender): jednom
EMPTY (8): dvé, 80, Deset, dvacet, obou, tisícem, šesti
| Paradigm jeden | Masc | Fem | Neut |
|---|---|---|---|
| Animacy=Anim\|Case=Nom | jeden | | |
| Case=Gen | jedné | | |
| Case=Ins | jednou | | |
| Case=Loc | jednom | | |
| Case=Nom | jeden | jedna | |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (411; 96%),
NOUN –[det]–> DET (147; 74%),
ADJ –[conj]–> ADJ (41; 95%),
VERB –[conj]–> VERB (34; 59%),
VERB –[nsubj]–> PROPN (14; 74%),
ADJ –[nsubj]–> NOUN (9; 100%),
PROPN –[amod]–> ADJ (9; 100%),
PROPN –[flat]–> PROPN (9; 90%),
ADJ –[nsubj:pass]–> NOUN (8; 100%),
NOUN –[dep]–> ADJ (8; 100%).
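
The agreement figures above pair each parent node with a child node and compare their Gender values. The sketch below shows one way such a tally could be computed. It assumes that the percentage is taken over pairs in which both nodes have a Gender value, which this page does not state; the function name, the tuple layout, and the sample edges are hypothetical.

```python
from collections import Counter

def agreement_by_relation(edges):
    """Tally Gender agreement per (parent UPOS, deprel, child UPOS).

    edges: tuples (parent_upos, deprel, child_upos, parent_gender, child_gender),
    with None for a missing Gender. Returns {relation: (agreeing, comparable)},
    counting only pairs where both nodes have a Gender value (an assumption).
    """
    agree, comparable = Counter(), Counter()
    for p_upos, deprel, c_upos, p_gender, c_gender in edges:
        if p_gender is None or c_gender is None:
            continue
        key = (p_upos, deprel, c_upos)
        comparable[key] += 1
        if p_gender == c_gender:
            agree[key] += 1
    return {key: (agree[key], comparable[key]) for key in comparable}

# Hypothetical edges, not taken from the treebank.
EDGES = [
    ("NOUN", "amod", "ADJ", "Fem", "Fem"),
    ("NOUN", "amod", "ADJ", "Masc", "Neut"),
    ("NOUN", "det", "DET", "Neut", "Neut"),
    ("VERB", "obj", "NOUN", None, "Fem"),  # parent has no Gender: not comparable
]

stats = agreement_by_relation(EDGES)
for (parent, rel, child), (n_agree, n_comparable) in sorted(stats.items()):
    print(f"{parent} -[{rel}]-> {child} ({n_agree}; {100 * n_agree // n_comparable}%)")
```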