Treebank Statistics: UD_Slovenian-SST: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
28078 tokens (29%) have a non-empty value of Gender.
10263 types (77%) occur at least once with a non-empty value of Gender.
5817 lemmas (76%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (11395; 12% instances), ADJ (5272; 5% instances), DET (4585; 5% instances), VERB (3048; 3% instances), PRON (1677; 2% instances), PROPN (1271; 1% instances), NUM (496; 1% instances), AUX (334; 0% instances).
NOUN
11395 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8242; 72%).
NOUN tokens may have the following values of Gender:
Fem(4670; 41% of non-emptyGender): strani, stvari, hvala, stvar, pot, šole, šoli, bolezni, šolo, državaMasc(4913; 43% of non-emptyGender): dan, čas, način, otrok, ljudi, primer, redu, koncu, ljudje, evrovNeut(1812; 16% of non-emptyGender): bistvu, leta, leto, let, delo, letih, mesto, vprašanje, dela, mestu
| Paradigm del | Masc | Neut |
|---|---|---|
| Animacy=Inan|Case=Acc|Number=Sing | del | |
| Case=Acc|Number=Plur | dele | |
| Case=Dat|Number=Sing | delu | |
| Case=Gen|Number=Plur | delov | |
| Case=Loc|Number=Sing | delu | Delu |
| Case=Loc|Number=Plur | delih | |
| Case=Nom|Number=Sing | del |
Gender seems to be lexical feature of NOUN. 100% lemmas (2935) occur only with one value of Gender.
ADJ
5272 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4663; 88%), VerbForm=EMPTY (4609; 87%), Definite=EMPTY (4425; 84%), Number=Sing (3776; 72%).
ADJ tokens may have the following values of Gender:
Fem(2087; 40% of non-emptyGender): lepa, drugo, druga, sama, drugi, velika, dobra, prvi, določene, prveMasc(2046; 39% of non-emptyGender): drugi, dober, sam, prvi, sami, lep, pozdravljeni, velik, cel, drugihNeut(1139; 22% of non-emptyGender): dobro, zanimivo, pomembno, glavnem, drugo, fajn, drugega, potrebno, mogoče, super
| Paradigm drug | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Def|Number=Sing | drugi | ||
| Case=Acc|Definite=Ind|Number=Sing | drug | ||
| Case=Acc|Number=Sing | drugega | drugo | drugo |
| Case=Acc|Number=Plur | druge | druge | druga |
| Case=Dat|Number=Sing | drugemu | ||
| Case=Dat|Number=Plur | drugim | drugim | |
| Case=Gen|Number=Sing | drugega | druge | drugega |
| Case=Gen|Number=Plur | drugih | drugih | |
| Case=Ins|Number=Sing | drugo | drugim | |
| Case=Ins|Number=Plur | drugimi | drugimi | drugimi |
| Case=Loc|Number=Sing | drugem | drugi | drugem |
| Case=Loc|Number=Dual | drugih | ||
| Case=Loc|Number=Plur | drugih | drugih | |
| Case=Nom|Definite=Def|Number=Sing | drugi | ||
| Case=Nom|Definite=Ind|Number=Sing | drug | ||
| Case=Nom|Number=Sing | druga | drugo | |
| Case=Nom|Number=Plur | drugi | druge | druga |
DET
4585 DET tokens (83% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3587; 78%), PronType=Dem (2802; 61%).
DET tokens may have the following values of Gender:
Fem(1123; 24% of non-emptyGender): te, ta, to, tej, teh, neko, eno, tiste, vse, nekeMasc(1384; 30% of non-emptyGender): ta, tisti, vsi, tem, tega, en, neki, ti, teh, vsakNeut(2078; 45% of non-emptyGender): to, vse, tega, tem, tisto, nič, temu, tole, nekaj, svojeEMPTY(942): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
| Paradigm ta | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | ta, tega | to | to |
| Case=Acc|Number=Dual | ta | ||
| Case=Acc|Number=Plur | te | te | ta |
| Case=Dat|Number=Sing | temu | tej | temu |
| Case=Dat|Number=Plur | tem | tem | tem |
| Case=Gen|Number=Sing | tega | te | tega |
| Case=Gen|Number=Plur | teh | teh | teh |
| Case=Ins|Number=Sing | tem | to | tem |
| Case=Ins|Number=Plur | temi | temi | temi |
| Case=Loc|Number=Sing | tem | tej | tem |
| Case=Loc|Number=Plur | teh | teh | teh |
| Case=Nom|Number=Sing | ta | ta | to |
| Case=Nom|Number=Dual | ta | ti | |
| Case=Nom|Number=Plur | ti | te | ta |
| Case=Nom|Number=Plur|Typo=Yes | ta |
VERB
3048 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3048; 100%), Person=EMPTY (3048; 100%), Polarity=EMPTY (3048; 100%), Tense=EMPTY (3048; 100%), VerbForm=Part (3048; 100%), Number=Sing (2027; 67%).
VERB tokens may have the following values of Gender:
Fem(802; 26% of non-emptyGender): rekla, bila, imela, šla, prišla, delala, videla, dala, naredila, moglaMasc(1881; 62% of non-emptyGender): rekel, bil, imeli, imel, rekli, šli, šel, bili, mogel, videlNeut(365; 12% of non-emptyGender): bilo, šlo, prišlo, zgodilo, uspelo, dalo, trajalo, spremenilo, dogajalo, imeloEMPTY(6990): je, vem, veš, mislim, recimo, so, ni, ima, pravi, imamo
| Paradigm biti | Masc | Fem | Neut |
|---|---|---|---|
| Aspect=Imp|Number=Sing | bil | bilo | |
| Number=Sing | bil | bila | bilo |
| Number=Dual | bila | bili | |
| Number=Plur | bili | bile |
PRON
1677 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1677; 100%), Number=Sing (1222; 73%), Variant=EMPTY (1114; 66%), PronType=Prs (941; 56%).
PRON tokens may have the following values of Gender:
Fem(301; 18% of non-emptyGender): jo, jih, ona, ji, je, njo, njej, midve, nje, njimiMasc(726; 43% of non-emptyGender): ga, mi, jih, kdo, on, vi, mu, jim, oni, nekdoNeut(650; 39% of non-emptyGender): kaj, kar, nekaj, nič, ga, jih, česa, isto, karkoli, čemerEMPTY(2707): se, jaz, mi, ti, si, nas, nam, me, meni, vam
| Paradigm on | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | njega | njo | |
| Case=Acc|Number=Sing|Variant=Short | ga | jo | ga |
| Case=Acc|Number=Dual|Variant=Short | ju, jih | ||
| Case=Acc|Number=Plur | njih | ||
| Case=Acc|Number=Plur|Variant=Short | jih | jih | jih |
| Case=Dat|Number=Sing | njemu | njej | |
| Case=Dat|Number=Sing|Variant=Short | mu | ji | |
| Case=Dat|Number=Dual|Variant=Short | jima | ||
| Case=Dat|Number=Plur | njim | njim | |
| Case=Dat|Number=Plur|Variant=Short | jim | jim | |
| Case=Gen|Number=Sing | njega | nje | |
| Case=Gen|Number=Sing|Variant=Short | ga | je | |
| Case=Gen|Number=Plur | njih | ||
| Case=Gen|Number=Plur|Variant=Short | jih | jih | jih |
| Case=Ins|Number=Sing | njim | njo | |
| Case=Ins|Number=Dual | njima | ||
| Case=Ins|Number=Plur | njimi | njimi | njimi |
| Case=Loc|Number=Sing | njem | njej | |
| Case=Loc|Number=Plur | njih | njih | |
| Case=Nom|Number=Sing | on | ona | |
| Case=Nom|Number=Dual | onadva | ||
| Case=Nom|Number=Plur | oni | one |
PROPN
1271 PROPN tokens (73% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1165; 92%).
PROPN tokens may have the following values of Gender:
Fem(528; 42% of non-emptyGender): Sloveniji, Slovenija, Slovenije, Ljubljani, Ljubljane, Ljubljana, rtv, Evropi, Nemčiji, NemčijoMasc(693; 55% of non-emptyGender): Mariboru, Agropop, Jones, Maribor, Tom, Triglav, David, Healy, Netflixu, RomovNeut(50; 4% of non-emptyGender): Celja, Celje, Celju, Pohorja, Slovenskem, Ivanovo, Šmarja, Štajerskem, Švedskem, CeljskegaEMPTY(467): [name:personal], [name:surname], [name:organisation], [name:address], si, ngl, [name:place], al, kk
| Paradigm RTV | Masc | Fem |
|---|---|---|
| Case=Gen | RTV-ja | RTV |
| Case=Loc | RTV-ju | RTV |
| Case=Nom | rtv |
Gender seems to be lexical feature of PROPN. 100% lemmas (644) occur only with one value of Gender.
NUM
496 NUM tokens (47% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (495; 100%), NumType=Card (494; 100%).
NUM tokens may have the following values of Gender:
Fem(196; 40% of non-emptyGender): ena, eno, dve, tri, ene, eni, štiri, dveh, štirih, trehMasc(248; 50% of non-emptyGender): dva, en, eden, enega, tri, trije, eni, štiri, štirje, dvehNeut(52; 10% of non-emptyGender): tri, eno, dve, enem, štiri, dveh, ena, tremi, drugem, enegaEMPTY(552): tisoč, pet, dvajset, trideset, deset, petnajst, sto, petdeset, sedem, šest
| Paradigm en | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | en, enega, een | eno | eno |
| Case=Acc|Number=Plur | ene | ||
| Case=Dat|Number=Sing | eni | ||
| Case=Gen|Number=Sing | enega | ene | enega |
| Case=Gen|Number=Plur | enih | enih | |
| Case=Ins|Number=Sing | enim | eno | enim |
| Case=Loc|Number=Sing | enem | eni | enem |
| Case=Nom|Number=Sing | en | ena | eno |
| Case=Nom|Number=Plur | eni | ene | ena |
AUX
334 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (334; 100%), Person=EMPTY (334; 100%), Polarity=EMPTY (334; 100%), Tense=EMPTY (334; 100%), VerbForm=Part (334; 100%), Number=Sing (273; 82%).
AUX tokens may have the following values of Gender:
Fem(98; 29% of non-emptyGender): bila, bileMasc(132; 40% of non-emptyGender): bil, bili, bilaNeut(104; 31% of non-emptyGender): bilo, bilaEMPTY(4903): je, so, sem, bi, smo, ni, bo, si, ste, bom
| Paradigm biti | Masc | Fem | Neut |
|---|---|---|---|
| Aspect=Imp|Number=Sing | bil | bilo | |
| Number=Sing | bil | bila | bilo |
| Number=Dual | bila | ||
| Number=Plur | bili | bile | bila |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (3215; 99%),
NOUN –[det]–> DET (2043; 89%),
NOUN –[conj]–> NOUN (417; 53%),
ADJ –[nsubj]–> NOUN (274; 97%),
ADJ –[conj]–> ADJ (198; 94%),
NOUN –[nmod]–> PROPN (186; 52%),
PROPN –[flat:name]–> PROPN (134; 100%),
NOUN –[appos]–> NOUN (127; 59%),
ADJ –[nsubj]–> DET (105; 95%),
ADJ –[det]–> DET (70; 89%).