Treebank Statistics: UD_Slovenian-SST: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
28078 tokens (29%) have a non-empty value of Gender.
10263 types (77%) occur at least once with a non-empty value of Gender.
5817 lemmas (76%) occur at least once with a non-empty value of Gender.
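Counts of this kind can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `read_tokens`, `coverage`) are hypothetical, the sample rows are invented toy data, and it assumes that a "type" is a case-sensitive word form and that multiword-token ranges and empty nodes are not counted.

```python
def parse_feats(feats):
    """Convert a FEATS string such as 'Case=Nom|Gender=Fem' to a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, feats) for ordinary syntactic words."""
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments, blank lines, multiword ranges (1-2) and empty nodes (1.1)
        # do not have a purely numeric ID in the first of ten columns.
        if len(cols) == 10 and cols[0].isdigit():
            yield cols[1], cols[2], parse_feats(cols[5])


def coverage(conllu_text, feature="Gender"):
    """Return {unit: (count with the feature, total count)}."""
    n_tokens = n_with = 0
    types, types_with = set(), set()
    lemmas, lemmas_with = set(), set()
    for form, lemma, feats in read_tokens(conllu_text):
        n_tokens += 1
        types.add(form)
        lemmas.add(lemma)
        if feature in feats:
            n_with += 1
            types_with.add(form)
            lemmas_with.add(lemma)
    return {
        "tokens": (n_with, n_tokens),
        "types": (len(types_with), len(types)),
        "lemmas": (len(lemmas_with), len(lemmas)),
    }


# Toy rows invented for this example; they are not taken from the treebank.
ROWS = [
    ("1", "šola", "šola", "NOUN", "_", "Case=Nom|Gender=Fem|Number=Sing", "4", "nsubj", "_", "_"),
    ("2", "je", "biti", "AUX", "_", "Number=Sing|Person=3", "4", "aux", "_", "_"),
    ("3", "bila", "biti", "AUX", "_", "Gender=Fem|Number=Sing|VerbForm=Part", "4", "cop", "_", "_"),
    ("4", "lepa", "lep", "ADJ", "_", "Case=Nom|Gender=Fem|Number=Sing", "0", "root", "_", "_"),
]
SAMPLE = "# text = šola je bila lepa\n" + "\n".join("\t".join(row) for row in ROWS)

print(coverage(SAMPLE))
# {'tokens': (3, 4), 'types': (3, 4), 'lemmas': (3, 3)}
```

The lemma `biti` counts as having Gender because one of its two forms carries the feature, which mirrors the "occur at least once" wording above.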
The feature is used with 8 part-of-speech tags: NOUN (11395; 12% instances), ADJ (5272; 5% instances), DET (4585; 5% instances), VERB (3048; 3% instances), PRON (1677; 2% instances), PROPN (1271; 1% instances), NUM (496; 1% instances), AUX (334; 0% instances).
NOUN
11395 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8242; 72%).
NOUN tokens may have the following values of Gender:
Fem (4670; 41% of non-empty Gender): strani, stvari, hvala, stvar, pot, šole, šoli, bolezni, šolo, država
Masc (4913; 43% of non-empty Gender): dan, čas, način, otrok, ljudi, primer, redu, koncu, ljudje, evrov
Neut (1812; 16% of non-empty Gender): bistvu, leta, leto, let, delo, letih, mesto, vprašanje, dela, mestu
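The "% of non-empty Gender" figures are each value's share of the tokens that carry the feature. A short check using the NOUN counts above (the variable names are illustrative, and rounding to the nearest integer is an assumption that happens to reproduce the listed percentages):

```python
# Counts copied from the NOUN figures above.
noun_gender_counts = {"Fem": 4670, "Masc": 4913, "Neut": 1812}

# The three counts add up to the number of NOUN tokens with Gender.
total = sum(noun_gender_counts.values())
shares = {value: round(100 * count / total)
          for value, count in noun_gender_counts.items()}

print(total)   # 11395
print(shares)  # {'Fem': 41, 'Masc': 43, 'Neut': 16}
```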
Paradigm del | Masc | Neut |
---|---|---|
Animacy=Inan|Case=Acc|Number=Sing | del | |
Case=Acc|Number=Plur | dele | |
Case=Dat|Number=Sing | delu | |
Case=Gen|Number=Plur | delov | |
Case=Loc|Number=Sing | delu | Delu |
Case=Loc|Number=Plur | delih | |
Case=Nom|Number=Sing | del | |
Gender seems to be a lexical feature of NOUN. 100% of lemmas (2935) occur with only one value of Gender.
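A test of this kind can be sketched as follows: collect the set of Gender values seen with each lemma of a given part of speech and count the lemmas whose set has exactly one member. The function name and the toy tokens are hypothetical; the input is assumed to be already parsed into `(lemma, upos, feats)` tuples.

```python
from collections import defaultdict


def single_value_lemmas(tokens, upos, feature="Gender"):
    """Return (lemmas with exactly one value, lemmas with any value)."""
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy tokens invented for this example.
TOKENS = [
    ("šola", "NOUN", {"Gender": "Fem"}),
    ("šola", "NOUN", {"Gender": "Fem"}),
    ("dan", "NOUN", {"Gender": "Masc"}),
    ("del", "NOUN", {"Gender": "Masc"}),
    ("del", "NOUN", {"Gender": "Neut"}),   # a lemma seen with two values
    ("lep", "ADJ", {"Gender": "Fem"}),     # ignored: different UPOS
]

single, total = single_value_lemmas(TOKENS, "NOUN")
print(single, total)  # 2 3
```

A share that rounds to 100% can still hide a few lemmas with more than one value, so the unrounded counts are worth inspecting.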
ADJ
5272 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4663; 88%), VerbForm=EMPTY (4609; 87%), Definite=EMPTY (4425; 84%), Number=Sing (3776; 72%).
ADJ tokens may have the following values of Gender:
Fem (2087; 40% of non-empty Gender): lepa, drugo, druga, sama, drugi, velika, dobra, prvi, določene, prve
Masc (2046; 39% of non-empty Gender): drugi, dober, sam, prvi, sami, lep, pozdravljeni, velik, cel, drugih
Neut (1139; 22% of non-empty Gender): dobro, zanimivo, pomembno, glavnem, drugo, fajn, drugega, potrebno, mogoče, super
Paradigm drug | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Definite=Def|Number=Sing | drugi | ||
Case=Acc|Definite=Ind|Number=Sing | drug | ||
Case=Acc|Number=Sing | drugega | drugo | drugo |
Case=Acc|Number=Plur | druge | druge | druga |
Case=Dat|Number=Sing | drugemu | ||
Case=Dat|Number=Plur | drugim | drugim | |
Case=Gen|Number=Sing | drugega | druge | drugega |
Case=Gen|Number=Plur | drugih | drugih | |
Case=Ins|Number=Sing | | drugo | drugim |
Case=Ins|Number=Plur | drugimi | drugimi | drugimi |
Case=Loc|Number=Sing | drugem | drugi | drugem |
Case=Loc|Number=Dual | drugih | ||
Case=Loc|Number=Plur | drugih | drugih | |
Case=Nom|Definite=Def|Number=Sing | drugi | ||
Case=Nom|Definite=Ind|Number=Sing | drug | ||
Case=Nom|Number=Sing | | druga | drugo |
Case=Nom|Number=Plur | drugi | druge | druga |
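A paradigm table like the one above can be approximated by grouping the forms of one lemma by their remaining features (rows) and by Gender (columns). This is only a sketch under stated assumptions: `paradigm`, `render` and the `row_features` parameter are hypothetical names, the choice of Case and Number as row features is a guess at what the page displays, and the tokens are toy data modelled on the rows above.

```python
from collections import defaultdict


def paradigm(tokens, lemma, feature="Gender", row_features=("Case", "Number")):
    """Map row label -> feature value -> set of attested forms."""
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or feature not in feats:
            continue
        row = "|".join(f"{name}={feats[name]}" for name in row_features if name in feats)
        table[row][feats[feature]].add(form)
    return table


def render(table, values=("Masc", "Fem", "Neut")):
    """Format each row as 'features | Masc forms | Fem forms | Neut forms |'."""
    lines = []
    for row in sorted(table):
        cells = [", ".join(sorted(table[row].get(value, ()))) for value in values]
        lines.append(" | ".join([row] + cells) + " |")
    return lines


# Toy tokens invented for this example: (form, lemma, feats).
TOKENS = [
    ("drugega", "drug", {"Case": "Gen", "Gender": "Masc", "Number": "Sing", "Degree": "Pos"}),
    ("druge", "drug", {"Case": "Gen", "Gender": "Fem", "Number": "Sing", "Degree": "Pos"}),
    ("drugega", "drug", {"Case": "Gen", "Gender": "Neut", "Number": "Sing", "Degree": "Pos"}),
    ("druga", "drug", {"Case": "Nom", "Gender": "Fem", "Number": "Sing", "Degree": "Pos"}),
    ("drugo", "drug", {"Case": "Nom", "Gender": "Neut", "Number": "Sing", "Degree": "Pos"}),
]

for line in render(paradigm(TOKENS, "drug")):
    print(line)
```

Keeping an explicit empty cell for every unattested value, as `render` does, avoids the ambiguity that arises when empty cells are collapsed.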
DET
4585 DET tokens (83% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3587; 78%), PronType=Dem (2802; 61%).
DET tokens may have the following values of Gender:
Fem (1123; 24% of non-empty Gender): te, ta, to, tej, teh, neko, eno, tiste, vse, neke
Masc (1384; 30% of non-empty Gender): ta, tisti, vsi, tem, tega, en, neki, ti, teh, vsak
Neut (2078; 45% of non-empty Gender): to, vse, tega, tem, tisto, nič, temu, tole, nekaj, svoje
EMPTY (942): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
Paradigm ta | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | ta, tega | to | to |
Case=Acc|Number=Dual | ta | ||
Case=Acc|Number=Plur | te | te | ta |
Case=Dat|Number=Sing | temu | tej | temu |
Case=Dat|Number=Plur | tem | tem | tem |
Case=Gen|Number=Sing | tega | te | tega |
Case=Gen|Number=Plur | teh | teh | teh |
Case=Ins|Number=Sing | tem | to | tem |
Case=Ins|Number=Plur | temi | temi | temi |
Case=Loc|Number=Sing | tem | tej | tem |
Case=Loc|Number=Plur | teh | teh | teh |
Case=Nom|Number=Sing | ta | ta | to |
Case=Nom|Number=Dual | ta | ti | |
Case=Nom|Number=Plur | ti | te | ta |
Case=Nom|Number=Plur|Typo=Yes | ta |
VERB
3048 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3048; 100%), Person=EMPTY (3048; 100%), Polarity=EMPTY (3048; 100%), Tense=EMPTY (3048; 100%), VerbForm=Part (3048; 100%), Number=Sing (2027; 67%).
VERB tokens may have the following values of Gender:
Fem (802; 26% of non-empty Gender): rekla, bila, imela, šla, prišla, delala, videla, dala, naredila, mogla
Masc (1881; 62% of non-empty Gender): rekel, bil, imeli, imel, rekli, šli, šel, bili, mogel, videl
Neut (365; 12% of non-empty Gender): bilo, šlo, prišlo, zgodilo, uspelo, dalo, trajalo, spremenilo, dogajalo, imelo
EMPTY (6990): je, vem, veš, mislim, recimo, so, ni, ima, pravi, imamo
Paradigm biti | Masc | Fem | Neut |
---|---|---|---|
Aspect=Imp|Number=Sing | bil | | bilo |
Number=Sing | bil | bila | bilo |
Number=Dual | bila | bili | |
Number=Plur | bili | bile | |
PRON
1677 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1677; 100%), Number=Sing (1222; 73%), Variant=EMPTY (1114; 66%), PronType=Prs (941; 56%).
PRON tokens may have the following values of Gender:
Fem (301; 18% of non-empty Gender): jo, jih, ona, ji, je, njo, njej, midve, nje, njimi
Masc (726; 43% of non-empty Gender): ga, mi, jih, kdo, on, vi, mu, jim, oni, nekdo
Neut (650; 39% of non-empty Gender): kaj, kar, nekaj, nič, ga, jih, česa, isto, karkoli, čemer
EMPTY (2707): se, jaz, mi, ti, si, nas, nam, me, meni, vam
Paradigm on | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | njega | njo | |
Case=Acc|Number=Sing|Variant=Short | ga | jo | ga |
Case=Acc|Number=Dual|Variant=Short | ju, jih | ||
Case=Acc|Number=Plur | njih | ||
Case=Acc|Number=Plur|Variant=Short | jih | jih | jih |
Case=Dat|Number=Sing | njemu | njej | |
Case=Dat|Number=Sing|Variant=Short | mu | ji | |
Case=Dat|Number=Dual|Variant=Short | jima | ||
Case=Dat|Number=Plur | njim | njim | |
Case=Dat|Number=Plur|Variant=Short | jim | jim | |
Case=Gen|Number=Sing | njega | nje | |
Case=Gen|Number=Sing|Variant=Short | ga | je | |
Case=Gen|Number=Plur | njih | ||
Case=Gen|Number=Plur|Variant=Short | jih | jih | jih |
Case=Ins|Number=Sing | njim | njo | |
Case=Ins|Number=Dual | njima | ||
Case=Ins|Number=Plur | njimi | njimi | njimi |
Case=Loc|Number=Sing | njem | njej | |
Case=Loc|Number=Plur | njih | njih | |
Case=Nom|Number=Sing | on | ona | |
Case=Nom|Number=Dual | onadva | ||
Case=Nom|Number=Plur | oni | one | |
PROPN
1271 PROPN tokens (73% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1165; 92%).
PROPN tokens may have the following values of Gender:
Fem (528; 42% of non-empty Gender): Sloveniji, Slovenija, Slovenije, Ljubljani, Ljubljane, Ljubljana, rtv, Evropi, Nemčiji, Nemčijo
Masc (693; 55% of non-empty Gender): Mariboru, Agropop, Jones, Maribor, Tom, Triglav, David, Healy, Netflixu, Romov
Neut (50; 4% of non-empty Gender): Celja, Celje, Celju, Pohorja, Slovenskem, Ivanovo, Šmarja, Štajerskem, Švedskem, Celjskega
EMPTY (467): [name:personal], [name:surname], [name:organisation], [name:address], si, ngl, [name:place], al, kk
Paradigm RTV | Masc | Fem |
---|---|---|
Case=Gen | RTV-ja | RTV |
Case=Loc | RTV-ju | RTV |
Case=Nom | rtv |
Gender seems to be a lexical feature of PROPN. 100% of lemmas (644) occur with only one value of Gender.
NUM
496 NUM tokens (47% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (495; 100%), NumType=Card (494; 100%).
NUM tokens may have the following values of Gender:
Fem (196; 40% of non-empty Gender): ena, eno, dve, tri, ene, eni, štiri, dveh, štirih, treh
Masc (248; 50% of non-empty Gender): dva, en, eden, enega, tri, trije, eni, štiri, štirje, dveh
Neut (52; 10% of non-empty Gender): tri, eno, dve, enem, štiri, dveh, ena, tremi, drugem, enega
EMPTY (552): tisoč, pet, dvajset, trideset, deset, petnajst, sto, petdeset, sedem, šest
Paradigm en | Masc | Fem | Neut |
---|---|---|---|
Case=Acc|Number=Sing | en, enega, een | eno | eno |
Case=Acc|Number=Plur | ene | ||
Case=Dat|Number=Sing | | eni | |
Case=Gen|Number=Sing | enega | ene | enega |
Case=Gen|Number=Plur | enih | enih | |
Case=Ins|Number=Sing | enim | eno | enim |
Case=Loc|Number=Sing | enem | eni | enem |
Case=Nom|Number=Sing | en | ena | eno |
Case=Nom|Number=Plur | eni | ene | ena |
AUX
334 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (334; 100%), Person=EMPTY (334; 100%), Polarity=EMPTY (334; 100%), Tense=EMPTY (334; 100%), VerbForm=Part (334; 100%), Number=Sing (273; 82%).
AUX tokens may have the following values of Gender:
Fem (98; 29% of non-empty Gender): bila, bile
Masc (132; 40% of non-empty Gender): bil, bili, bila
Neut (104; 31% of non-empty Gender): bilo, bila
EMPTY (4903): je, so, sem, bi, smo, ni, bo, si, ste, bom
Paradigm biti | Masc | Fem | Neut |
---|---|---|---|
Aspect=Imp|Number=Sing | bil | | bilo |
Number=Sing | bil | bila | bilo |
Number=Dual | bila | ||
Number=Plur | bili | bile | bila |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (3215; 99%),
NOUN –[det]–> DET (2043; 89%),
NOUN –[conj]–> NOUN (417; 53%),
ADJ –[nsubj]–> NOUN (274; 97%),
ADJ –[conj]–> ADJ (198; 94%),
NOUN –[nmod]–> PROPN (186; 52%),
PROPN –[flat:name]–> PROPN (134; 100%),
NOUN –[appos]–> NOUN (127; 59%),
ADJ –[nsubj]–> DET (105; 95%),
ADJ –[det]–> DET (70; 89%).
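Agreement figures of this shape can be sketched by walking every dependency edge and comparing the Gender of parent and child. The code below is an illustration, not the script behind this page: the function name and toy sentences are hypothetical, and it assumes that the percentage is taken over edges where both nodes have a non-empty Gender.

```python
from collections import Counter


def gender_agreement(sentences, feature="Gender"):
    """Return {(parent UPOS, deprel, child UPOS): (agreeing edges, percent)}.

    Each sentence is a list of (id, upos, feats, head, deprel) tuples.
    """
    agree, both = Counter(), Counter()
    for sentence in sentences:
        by_id = {token[0]: token for token in sentence}
        for _tid, upos, feats, head, deprel in sentence:
            parent = by_id.get(head)
            if parent is None:  # the root (head 0) has no parent token
                continue
            if feature in feats and feature in parent[2]:
                key = (parent[1], deprel, upos)
                both[key] += 1
                if feats[feature] == parent[2][feature]:
                    agree[key] += 1
    return {key: (agree[key], round(100 * agree[key] / both[key])) for key in both}


# Toy sentences invented for this example.
SENTENCES = [
    [(1, "ADJ", {"Gender": "Fem"}, 2, "amod"), (2, "NOUN", {"Gender": "Fem"}, 0, "root")],
    [(1, "DET", {"Gender": "Neut"}, 2, "det"), (2, "NOUN", {"Gender": "Neut"}, 0, "root")],
    [(1, "DET", {"Gender": "Masc"}, 2, "det"), (2, "NOUN", {"Gender": "Fem"}, 0, "root")],
]

for (parent, rel, child), (count, pct) in gender_agreement(SENTENCES).items():
    print(f"{parent} –[{rel}]–> {child} ({count}; {pct}%)")
# NOUN –[amod]–> ADJ (1; 100%)
# NOUN –[det]–> DET (1; 50%)
```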