Treebank Statistics: UD_Slovenian-SST: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
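A layered feature carries its layer name in square brackets inside the CoNLL-U FEATS column, so `Gender` and `Gender[psor]` can appear on the same token. The following minimal sketch shows one way to separate the layers; the function name `parse_feats` and the example FEATS string are hypothetical and not taken from the treebank.

```python
def parse_feats(feats: str) -> dict[str, dict[str, str]]:
    """Split a CoNLL-U FEATS string into {base_feature: {layer: value}}.

    The plain (unlayered) feature is stored under the empty layer name "".
    """
    layered: dict[str, dict[str, str]] = {}
    if feats == "_":  # "_" marks an empty FEATS column
        return layered
    for pair in feats.split("|"):
        name, value = pair.split("=", 1)
        # "Gender[psor]" -> base "Gender", layer "psor"
        if name.endswith("]") and "[" in name:
            base, layer = name[:-1].split("[", 1)
        else:
            base, layer = name, ""
        layered.setdefault(base, {})[layer] = value
    return layered


# Hypothetical FEATS value for a possessive form (illustration only)
example = parse_feats("Gender=Fem|Gender[psor]=Masc|Number=Sing|Poss=Yes")
print(example["Gender"])
```

Here the empty layer holds the gender of the token itself, while `psor` holds the gender of the possessor.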
28106 tokens (37%) have a non-empty value of Gender.
10295 types (78%) occur at least once with a non-empty value of Gender.
5848 lemmas (77%) occur at least once with a non-empty value of Gender.
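Counts of this kind can be recomputed from a CoNLL-U file. The sketch below is one possible way to do it, under several assumptions: word forms are treated as case-sensitive types, only the plain `Gender` layer is counted (not `Gender[psor]`), and the four-token sample is made up for illustration rather than copied from the treebank.

```python
def gender_coverage(conllu: str) -> dict[str, tuple[int, int]]:
    """Return (with Gender, total) counts for tokens, types and lemmas."""
    tokens_total = tokens_gender = 0
    types_total, types_gender = set(), set()
    lemmas_total, lemmas_gender = set(), set()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        form, lemma, feats = cols[1], cols[2], cols[5]
        # Match only the plain layer, not Gender[psor]
        has_gender = any(f.startswith("Gender=") for f in feats.split("|"))
        tokens_total += 1
        types_total.add(form)
        lemmas_total.add(lemma)
        if has_gender:
            tokens_gender += 1
            types_gender.add(form)
            lemmas_gender.add(lemma)
    return {
        "tokens": (tokens_gender, tokens_total),
        "types": (len(types_gender), len(types_total)),
        "lemmas": (len(lemmas_gender), len(lemmas_total)),
    }


# Made-up sample sentence (illustration only)
SAMPLE = "\n".join([
    "# text = to je bila stvar",
    "1\tto\tta\tDET\t_\tCase=Nom|Gender=Neut|Number=Sing|PronType=Dem\t4\tnsubj\t_\t_",
    "2\tje\tbiti\tAUX\t_\tNumber=Sing|Person=3|Tense=Pres|VerbForm=Fin\t4\taux\t_\t_",
    "3\tbila\tbiti\tAUX\t_\tGender=Fem|Number=Sing|VerbForm=Part\t4\tcop\t_\t_",
    "4\tstvar\tstvar\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t0\troot\t_\t_",
])
coverage = gender_coverage(SAMPLE)
print(coverage)
```

In the sample, the lemma `biti` counts as having Gender because one of its two forms (`bila`) carries the feature, which mirrors the "at least once" wording used for types and lemmas.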
The feature is used with 8 part-of-speech tags: NOUN (11411; 15% instances), ADJ (5271; 7% instances), DET (4438; 6% instances), VERB (3049; 4% instances), PRON (1678; 2% instances), PROPN (1290; 2% instances), NUM (635; 1% instances), AUX (334; 0% instances).
NOUN
11411 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8256; 72%).
NOUN tokens may have the following values of Gender:
Fem (4676; 41% of non-empty Gender): strani, stvari, hvala, stvar, pot, šole, šoli, bolezni, šolo, država
Masc (4924; 43% of non-empty Gender): dan, čas, način, otrok, ljudi, primer, redu, koncu, ljudje, evrov
Neut (1811; 16% of non-empty Gender): bistvu, leta, leto, let, delo, letih, mesto, vprašanje, dela, mestu
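The percentages in the value list are shares of the NOUN tokens that have a non-empty Gender. As a quick arithmetic check, the following snippet recomputes them from the three counts given above; the variable names are illustrative.

```python
# Counts for NOUN as reported in the value list
noun_gender_counts = {"Fem": 4676, "Masc": 4924, "Neut": 1811}

# The three values add up to the number of NOUN tokens with Gender
total = sum(noun_gender_counts.values())

# Share of each value among NOUN tokens with a non-empty Gender, in whole percent
shares = {value: round(100 * n / total) for value, n in noun_gender_counts.items()}
print(total, shares)
```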
Paradigm del | Masc | Neut |
---|---|---|
Animacy=Inan\|Case=Acc\|Number=Sing | del | |
Case=Acc\|Number=Plur | dele | |
Case=Dat\|Number=Sing | delu | |
Case=Gen\|Number=Plur | delov | |
Case=Loc\|Number=Sing | delu | Delu |
Case=Loc\|Number=Plur | delih | |
Case=Nom\|Number=Sing | del | |
Gender seems to be a lexical feature of NOUN: 100% of lemmas (2949) occur with only one value of Gender.
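The "lexical feature" test can be read as: for a given part of speech, collect the set of Gender values seen with each lemma and count how many lemmas have exactly one. The sketch below follows that reading; the function name and the small `rows` list are hypothetical, and the way the statistics page itself computes the figure is an assumption.

```python
from collections import defaultdict


def single_valued_lemmas(rows, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value).

    rows: iterable of (lemma, upos, gender) tuples; gender is None when absent.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, gender in rows:
        if tag == upos and gender is not None:
            values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Made-up observations (illustration only)
rows = [
    ("stvar", "NOUN", "Fem"),
    ("stvar", "NOUN", "Fem"),
    ("dan", "NOUN", "Masc"),
    ("delo", "NOUN", "Neut"),
    ("drug", "ADJ", "Masc"),
    ("drug", "ADJ", "Fem"),
    ("je", "AUX", None),
]
noun_result = single_valued_lemmas(rows, "NOUN")
adj_result = single_valued_lemmas(rows, "ADJ")
print(noun_result, adj_result)
```

A noun lemma keeps one gender across its forms, whereas an adjective lemma such as `drug` inflects for gender, so it would not pass this test. The reported 100% is presumably rounded, since the paradigm table lists a form of `del` under two genders.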
ADJ
5271 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4661; 88%), VerbForm=EMPTY (4608; 87%), Definite=EMPTY (4424; 84%), Number=Sing (3777; 72%).
ADJ tokens may have the following values of Gender:
Fem (2087; 40% of non-empty Gender): lepa, drugo, druga, sama, drugi, velika, dobra, prvi, določene, glavna
Masc (2044; 39% of non-empty Gender): drugi, sam, dober, prvi, sami, lep, pozdravljeni, velik, cel, drugih
Neut (1140; 22% of non-empty Gender): dobro, zanimivo, pomembno, glavnem, drugo, fajn, drugega, potrebno, mogoče, super
Paradigm drug | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Definite=Def\|Number=Sing | drugi | | |
Case=Acc\|Definite=Ind\|Number=Sing | drug | | |
Case=Acc\|Number=Sing | drugega | drugo | drugo |
Case=Acc\|Number=Plur | druge | druge | druga |
Case=Dat\|Number=Sing | drugemu | | |
Case=Dat\|Number=Plur | drugim | drugim | |
Case=Gen\|Number=Sing | drugega | druge | drugega |
Case=Gen\|Number=Plur | drugih | drugih | |
Case=Ins\|Number=Sing | drugo | drugim | |
Case=Ins\|Number=Plur | drugimi | drugimi | drugimi |
Case=Loc\|Number=Sing | drugem | drugi | drugem |
Case=Loc\|Number=Dual | drugih | | |
Case=Loc\|Number=Plur | drugih | drugih | |
Case=Nom\|Definite=Def\|Number=Sing | drugi | | |
Case=Nom\|Definite=Ind\|Number=Sing | drug | | |
Case=Nom\|Number=Sing | druga | drugo | |
Case=Nom\|Number=Plur | drugi | druge | druga |
DET
4438 DET tokens (82% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3450; 78%), PronType=Dem (2799; 63%).
DET tokens may have the following values of Gender:
Fem (1060; 24% of non-empty Gender): te, ta, to, tej, teh, neko, tiste, vse, neke, take
Masc (1314; 30% of non-empty Gender): ta, tisti, vsi, tem, tega, neki, ti, teh, vsak, kakšen
Neut (2064; 47% of non-empty Gender): to, vse, tega, tem, tisto, nič, temu, tole, nekaj, svoje
EMPTY (945): pol, malo, več, veliko, nekaj, koliko, dosti, toliko, manj, preveč
Paradigm ta | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Number=Sing | ta, tega | to | to |
Case=Acc\|Number=Dual | ta | | |
Case=Acc\|Number=Plur | te | te | ta |
Case=Dat\|Number=Sing | temu | tej | temu |
Case=Dat\|Number=Plur | tem | tem | tem |
Case=Gen\|Number=Sing | tega | te | tega |
Case=Gen\|Number=Plur | teh | teh | teh |
Case=Ins\|Number=Sing | tem | to | tem |
Case=Ins\|Number=Plur | temi | temi | temi |
Case=Loc\|Number=Sing | tem | tej | tem |
Case=Loc\|Number=Plur | teh | teh | teh |
Case=Nom\|Number=Sing | ta | ta | to |
Case=Nom\|Number=Dual | ta | ti | |
Case=Nom\|Number=Plur | ti | te | ta |
Case=Nom\|Number=Plur\|Typo=Yes | ta | | |
VERB
3049 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3049; 100%), Person=EMPTY (3049; 100%), Polarity=EMPTY (3049; 100%), Tense=EMPTY (3049; 100%), VerbForm=Part (3049; 100%), Number=Sing (2026; 66%).
VERB tokens may have the following values of Gender:
Fem (802; 26% of non-empty Gender): rekla, bila, imela, šla, prišla, delala, videla, dala, naredila, mogla
Masc (1881; 62% of non-empty Gender): rekel, bil, imeli, imel, rekli, šli, šel, bili, videl, mogel
Neut (366; 12% of non-empty Gender): bilo, šlo, prišlo, zgodilo, uspelo, dalo, trajalo, spremenilo, dogajalo, imelo
EMPTY (6999): je, vem, veš, mislim, recimo, so, ni, ima, pravi, imamo
Paradigm biti | Masc | Fem | Neut |
---|---|---|---|
Aspect=Imp\|Number=Sing | bil | bilo | |
Number=Sing | bil | bila | bilo |
Number=Dual | bila | bili | |
Number=Plur | bili | bile | |
PRON
1678 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1678; 100%), Number=Sing (1223; 73%), Variant=EMPTY (1115; 66%), PronType=Prs (941; 56%).
PRON tokens may have the following values of Gender:
Fem (301; 18% of non-empty Gender): jo, jih, ona, ji, je, njo, njej, midve, nje, njimi
Masc (726; 43% of non-empty Gender): ga, mi, jih, kdo, on, vi, mu, jim, oni, nekdo
Neut (651; 39% of non-empty Gender): kaj, kar, nekaj, nič, ga, jih, česa, isto, karkoli, čemer
EMPTY (2709): se, jaz, mi, ti, si, nas, nam, me, meni, vam
Paradigm on | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Number=Sing | njega | njo | |
Case=Acc\|Number=Sing\|Variant=Short | ga | jo | ga |
Case=Acc\|Number=Dual\|Variant=Short | ju, jih | | |
Case=Acc\|Number=Plur | njih | | |
Case=Acc\|Number=Plur\|Variant=Short | jih | jih | jih |
Case=Dat\|Number=Sing | njemu | njej | |
Case=Dat\|Number=Sing\|Variant=Short | mu | ji | |
Case=Dat\|Number=Dual\|Variant=Short | jima | | |
Case=Dat\|Number=Plur | njim | njim | |
Case=Dat\|Number=Plur\|Variant=Short | jim | jim | |
Case=Gen\|Number=Sing | njega | nje | |
Case=Gen\|Number=Sing\|Variant=Short | ga | je | |
Case=Gen\|Number=Plur | njih | | |
Case=Gen\|Number=Plur\|Variant=Short | jih | jih | jih |
Case=Ins\|Number=Sing | njim | njo | |
Case=Ins\|Number=Dual | njima | | |
Case=Ins\|Number=Plur | njimi | njimi | njimi |
Case=Loc\|Number=Sing | njem | njej | |
Case=Loc\|Number=Plur | njih | njih | |
Case=Nom\|Number=Sing | on | ona | |
Case=Nom\|Number=Dual | onadva | | |
Case=Nom\|Number=Plur | oni | one | |
PROPN
1290 PROPN tokens (74% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1187; 92%).
PROPN tokens may have the following values of Gender:
Fem (529; 41% of non-empty Gender): Sloveniji, Slovenija, Slovenije, Ljubljani, Ljubljane, Ljubljana, rtv, Evropi, Nemčiji, Nemčijo
Masc (711; 55% of non-empty Gender): Mariboru, Agropop, Jones, Maribor, Tom, Triglav, David, Healy, Netflixu, Romov
Neut (50; 4% of non-empty Gender): Celja, Celje, Celju, Pohorja, Slovenskem, Ivanovo, Šmarja, Štajerskem, Švedskem, Celjskega
EMPTY (459): [name:personal], [name:surname], [name:organisation], [name:address], [name:place]
Paradigm RTV | Masc | Fem |
---|---|---|
Case=Gen | RTV-ja | RTV |
Case=Loc | RTV-ju | RTV |
Case=Nom | rtv | |
Gender seems to be a lexical feature of PROPN: 100% of lemmas (662) occur with only one value of Gender.
NUM
635 NUM tokens (53% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (634; 100%), NumType=Card (633; 100%), Number=Sing (358; 56%).
NUM tokens may have the following values of Gender:
Fem (260; 41% of non-empty Gender): ena, eno, dve, tri, ene, eni, štiri, dveh, štirih, treh
Masc (315; 50% of non-empty Gender): en, dva, enega, eden, tri, eni, trije, štiri, enim, štirje
Neut (60; 9% of non-empty Gender): eno, tri, dve, enem, štiri, dveh, ena, tremi, enega, enih
EMPTY (552): tisoč, pet, dvajset, trideset, deset, petnajst, sto, petdeset, sedem, šest
Paradigm en | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Number=Sing | en, enega, een | eno | eno |
Case=Acc\|Number=Plur | ene | | |
Case=Dat\|Number=Sing | enemu | eni | |
Case=Gen\|Number=Sing | enega | ene | enega |
Case=Gen\|Number=Plur | enih | enih | enih |
Case=Ins\|Number=Sing | enim | eno | enim |
Case=Loc\|Number=Sing | enem | eni | enem |
Case=Nom\|Number=Sing | en | ena | eno |
Case=Nom\|Number=Dual | ena | | |
Case=Nom\|Number=Plur | eni | ene | ena |
AUX
334 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (334; 100%), Person=EMPTY (334; 100%), Polarity=EMPTY (334; 100%), Tense=EMPTY (334; 100%), VerbForm=Part (334; 100%), Number=Sing (273; 82%).
AUX tokens may have the following values of Gender:
Fem (98; 29% of non-empty Gender): bila, bile
Masc (132; 40% of non-empty Gender): bil, bili, bila
Neut (104; 31% of non-empty Gender): bilo, bila
EMPTY (4895): je, so, sem, bi, smo, ni, bo, si, ste, bom
Paradigm biti | Masc | Fem | Neut |
---|---|---|---|
Aspect=Imp\|Number=Sing | bil | bilo | |
Number=Sing | bil | bila | bilo |
Number=Dual | bila | | |
Number=Plur | bili | bile | bila |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (3211; 99%),
NOUN –[det]–> DET (1908; 89%),
NOUN –[conj]–> NOUN (418; 53%),
NOUN –[nummod]–> NUM (365; 54%),
ADJ –[nsubj]–> NOUN (273; 97%),
ADJ –[conj]–> ADJ (191; 94%),
NOUN –[nmod]–> PROPN (184; 51%),
PROPN –[flat:name]–> PROPN (132; 100%),
NOUN –[appos]–> NOUN (125; 59%),
ADJ –[nsubj]–> DET (104; 95%).
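One way to arrive at figures like these is to visit every dependency edge whose parent and child both carry Gender and to count how often the two values match. The sketch below assumes that the percentage is taken over exactly those edges, which the page does not state explicitly; the function name and the three-token sample (including the deliberately mismatched determiner) are hypothetical.

```python
from collections import Counter


def gender_agreement(tokens):
    """Count agreeing and comparable edges per (parent UPOS, relation, child UPOS).

    tokens: list of dicts with keys id, head, upos, deprel, gender
    (gender is None when the token has no Gender feature).
    """
    by_id = {t["id"]: t for t in tokens}
    agree, total = Counter(), Counter()
    for child in tokens:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["gender"] is None or parent["gender"] is None:
            continue  # only edges where both nodes have Gender are comparable
        key = (parent["upos"], child["deprel"], child["upos"])
        total[key] += 1
        if parent["gender"] == child["gender"]:
            agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Made-up sentence fragment; the DET gender is intentionally mismatched
sentence = [
    {"id": 1, "head": 3, "upos": "DET", "deprel": "det", "gender": "Masc"},
    {"id": 2, "head": 3, "upos": "ADJ", "deprel": "amod", "gender": "Fem"},
    {"id": 3, "head": 0, "upos": "NOUN", "deprel": "root", "gender": "Fem"},
]
stats = gender_agreement(sentence)
print(stats)
```

Each result pair is (agreeing edges, comparable edges), so dividing the first by the second gives the agreement rate for that relation.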