home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

28106 tokens (37%) have a non-empty value of Gender. 10295 types (78%) occur at least once with a non-empty value of Gender. 5848 lemmas (77%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (11411; 15% instances), ADJ (5271; 7% instances), DET (4438; 6% instances), VERB (3049; 4% instances), PRON (1678; 2% instances), PROPN (1290; 2% instances), NUM (635; 1% instances), AUX (334; 0% instances).

NOUN

11411 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8256; 72%).

NOUN tokens may have the following values of Gender:

Paradigm delMascNeut
Animacy=Inan|Case=Acc|Number=Singdel
Case=Acc|Number=Plurdele
Case=Dat|Number=Singdelu
Case=Gen|Number=Plurdelov
Case=Loc|Number=SingdeluDelu
Case=Loc|Number=Plurdelih
Case=Nom|Number=Singdel

Gender seems to be lexical feature of NOUN. 100% lemmas (2949) occur only with one value of Gender.

ADJ

5271 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4661; 88%), VerbForm=EMPTY (4608; 87%), Definite=EMPTY (4424; 84%), Number=Sing (3777; 72%).

ADJ tokens may have the following values of Gender:

Paradigm drugMascFemNeut
Case=Acc|Definite=Def|Number=Singdrugi
Case=Acc|Definite=Ind|Number=Singdrug
Case=Acc|Number=Singdrugegadrugodrugo
Case=Acc|Number=Plurdrugedrugedruga
Case=Dat|Number=Singdrugemu
Case=Dat|Number=Plurdrugimdrugim
Case=Gen|Number=Singdrugegadrugedrugega
Case=Gen|Number=Plurdrugihdrugih
Case=Ins|Number=Singdrugodrugim
Case=Ins|Number=Plurdrugimidrugimidrugimi
Case=Loc|Number=Singdrugemdrugidrugem
Case=Loc|Number=Dualdrugih
Case=Loc|Number=Plurdrugihdrugih
Case=Nom|Definite=Def|Number=Singdrugi
Case=Nom|Definite=Ind|Number=Singdrug
Case=Nom|Number=Singdrugadrugo
Case=Nom|Number=Plurdrugidrugedruga

DET

4438 DET tokens (82% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3450; 78%), PronType=Dem (2799; 63%).

DET tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singta, tegatoto
Case=Acc|Number=Dualta
Case=Acc|Number=Plurteteta
Case=Dat|Number=Singtemutejtemu
Case=Dat|Number=Plurtemtemtem
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Plurtehtehteh
Case=Ins|Number=Singtemtotem
Case=Ins|Number=Plurtemitemitemi
Case=Loc|Number=Singtemtejtem
Case=Loc|Number=Plurtehtehteh
Case=Nom|Number=Singtatato
Case=Nom|Number=Dualtati
Case=Nom|Number=Plurtiteta
Case=Nom|Number=Plur|Typo=Yesta

VERB

3049 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3049; 100%), Person=EMPTY (3049; 100%), Polarity=EMPTY (3049; 100%), Tense=EMPTY (3049; 100%), VerbForm=Part (3049; 100%), Number=Sing (2026; 66%).

VERB tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Aspect=Imp|Number=Singbilbilo
Number=Singbilbilabilo
Number=Dualbilabili
Number=Plurbilibile

PRON

1678 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1678; 100%), Number=Sing (1223; 73%), Variant=EMPTY (1115; 66%), PronType=Prs (941; 56%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Acc|Number=Singnjeganjo
Case=Acc|Number=Sing|Variant=Shortgajoga
Case=Acc|Number=Dual|Variant=Shortju, jih
Case=Acc|Number=Plurnjih
Case=Acc|Number=Plur|Variant=Shortjihjihjih
Case=Dat|Number=Singnjemunjej
Case=Dat|Number=Sing|Variant=Shortmuji
Case=Dat|Number=Dual|Variant=Shortjima
Case=Dat|Number=Plurnjimnjim
Case=Dat|Number=Plur|Variant=Shortjimjim
Case=Gen|Number=Singnjeganje
Case=Gen|Number=Sing|Variant=Shortgaje
Case=Gen|Number=Plurnjih
Case=Gen|Number=Plur|Variant=Shortjihjihjih
Case=Ins|Number=Singnjimnjo
Case=Ins|Number=Dualnjima
Case=Ins|Number=Plurnjiminjiminjimi
Case=Loc|Number=Singnjemnjej
Case=Loc|Number=Plurnjihnjih
Case=Nom|Number=Singonona
Case=Nom|Number=Dualonadva
Case=Nom|Number=Pluronione

PROPN

1290 PROPN tokens (74% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1187; 92%).

PROPN tokens may have the following values of Gender:

Paradigm RTVMascFem
Case=GenRTV-jaRTV
Case=LocRTV-juRTV
Case=Nomrtv

Gender seems to be lexical feature of PROPN. 100% lemmas (662) occur only with one value of Gender.

NUM

635 NUM tokens (53% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (634; 100%), NumType=Card (633; 100%), Number=Sing (358; 56%).

NUM tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Acc|Number=Singen, enega, eenenoeno
Case=Acc|Number=Plurene
Case=Dat|Number=Singenemueni
Case=Gen|Number=Singenegaeneenega
Case=Gen|Number=Plurenihenihenih
Case=Ins|Number=Singenimenoenim
Case=Loc|Number=Singenemenienem
Case=Nom|Number=Singenenaeno
Case=Nom|Number=Dualena
Case=Nom|Number=Plurenieneena

AUX

334 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (334; 100%), Person=EMPTY (334; 100%), Polarity=EMPTY (334; 100%), Tense=EMPTY (334; 100%), VerbForm=Part (334; 100%), Number=Sing (273; 82%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Aspect=Imp|Number=Singbilbilo
Number=Singbilbilabilo
Number=Dualbila
Number=Plurbilibilebila

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (3211; 99%), NOUN –[det]–> DET (1908; 89%), NOUN –[conj]–> NOUN (418; 53%), NOUN –[nummod]–> NUM (365; 54%), ADJ –[nsubj]–> NOUN (273; 97%), ADJ –[conj]–> ADJ (191; 94%), NOUN –[nmod]–> PROPN (184; 51%), PROPN –[flat:name]–> PROPN (132; 100%), NOUN –[appos]–> NOUN (125; 59%), ADJ –[nsubj]–> DET (104; 95%).