home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

28078 tokens (29%) have a non-empty value of Gender. 10263 types (77%) occur at least once with a non-empty value of Gender. 5817 lemmas (76%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (11395; 12% instances), ADJ (5272; 5% instances), DET (4585; 5% instances), VERB (3048; 3% instances), PRON (1677; 2% instances), PROPN (1271; 1% instances), NUM (496; 1% instances), AUX (334; 0% instances).

NOUN

11395 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (8242; 72%).

NOUN tokens may have the following values of Gender:

Paradigm delMascNeut
Animacy=Inan|Case=Acc|Number=Singdel
Case=Acc|Number=Plurdele
Case=Dat|Number=Singdelu
Case=Gen|Number=Plurdelov
Case=Loc|Number=SingdeluDelu
Case=Loc|Number=Plurdelih
Case=Nom|Number=Singdel

Gender seems to be lexical feature of NOUN. 100% lemmas (2935) occur only with one value of Gender.

ADJ

5272 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (4663; 88%), VerbForm=EMPTY (4609; 87%), Definite=EMPTY (4425; 84%), Number=Sing (3776; 72%).

ADJ tokens may have the following values of Gender:

Paradigm drugMascFemNeut
Case=Acc|Definite=Def|Number=Singdrugi
Case=Acc|Definite=Ind|Number=Singdrug
Case=Acc|Number=Singdrugegadrugodrugo
Case=Acc|Number=Plurdrugedrugedruga
Case=Dat|Number=Singdrugemu
Case=Dat|Number=Plurdrugimdrugim
Case=Gen|Number=Singdrugegadrugedrugega
Case=Gen|Number=Plurdrugihdrugih
Case=Ins|Number=Singdrugodrugim
Case=Ins|Number=Plurdrugimidrugimidrugimi
Case=Loc|Number=Singdrugemdrugidrugem
Case=Loc|Number=Dualdrugih
Case=Loc|Number=Plurdrugihdrugih
Case=Nom|Definite=Def|Number=Singdrugi
Case=Nom|Definite=Ind|Number=Singdrug
Case=Nom|Number=Singdrugadrugo
Case=Nom|Number=Plurdrugidrugedruga

DET

4585 DET tokens (83% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (3587; 78%), PronType=Dem (2802; 61%).

DET tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singta, tegatoto
Case=Acc|Number=Dualta
Case=Acc|Number=Plurteteta
Case=Dat|Number=Singtemutejtemu
Case=Dat|Number=Plurtemtemtem
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Plurtehtehteh
Case=Ins|Number=Singtemtotem
Case=Ins|Number=Plurtemitemitemi
Case=Loc|Number=Singtemtejtem
Case=Loc|Number=Plurtehtehteh
Case=Nom|Number=Singtatato
Case=Nom|Number=Dualtati
Case=Nom|Number=Plurtiteta
Case=Nom|Number=Plur|Typo=Yesta

VERB

3048 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (3048; 100%), Person=EMPTY (3048; 100%), Polarity=EMPTY (3048; 100%), Tense=EMPTY (3048; 100%), VerbForm=Part (3048; 100%), Number=Sing (2027; 67%).

VERB tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Aspect=Imp|Number=Singbilbilo
Number=Singbilbilabilo
Number=Dualbilabili
Number=Plurbilibile

PRON

1677 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (1677; 100%), Number=Sing (1222; 73%), Variant=EMPTY (1114; 66%), PronType=Prs (941; 56%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Acc|Number=Singnjeganjo
Case=Acc|Number=Sing|Variant=Shortgajoga
Case=Acc|Number=Dual|Variant=Shortju, jih
Case=Acc|Number=Plurnjih
Case=Acc|Number=Plur|Variant=Shortjihjihjih
Case=Dat|Number=Singnjemunjej
Case=Dat|Number=Sing|Variant=Shortmuji
Case=Dat|Number=Dual|Variant=Shortjima
Case=Dat|Number=Plurnjimnjim
Case=Dat|Number=Plur|Variant=Shortjimjim
Case=Gen|Number=Singnjeganje
Case=Gen|Number=Sing|Variant=Shortgaje
Case=Gen|Number=Plurnjih
Case=Gen|Number=Plur|Variant=Shortjihjihjih
Case=Ins|Number=Singnjimnjo
Case=Ins|Number=Dualnjima
Case=Ins|Number=Plurnjiminjiminjimi
Case=Loc|Number=Singnjemnjej
Case=Loc|Number=Plurnjihnjih
Case=Nom|Number=Singonona
Case=Nom|Number=Dualonadva
Case=Nom|Number=Pluronione

PROPN

1271 PROPN tokens (73% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1165; 92%).

PROPN tokens may have the following values of Gender:

Paradigm RTVMascFem
Case=GenRTV-jaRTV
Case=LocRTV-juRTV
Case=Nomrtv

Gender seems to be lexical feature of PROPN. 100% lemmas (644) occur only with one value of Gender.

NUM

496 NUM tokens (47% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (495; 100%), NumType=Card (494; 100%).

NUM tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Acc|Number=Singen, enega, eenenoeno
Case=Acc|Number=Plurene
Case=Dat|Number=Singeni
Case=Gen|Number=Singenegaeneenega
Case=Gen|Number=Plurenihenih
Case=Ins|Number=Singenimenoenim
Case=Loc|Number=Singenemenienem
Case=Nom|Number=Singenenaeno
Case=Nom|Number=Plurenieneena

AUX

334 AUX tokens (6% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (334; 100%), Person=EMPTY (334; 100%), Polarity=EMPTY (334; 100%), Tense=EMPTY (334; 100%), VerbForm=Part (334; 100%), Number=Sing (273; 82%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Aspect=Imp|Number=Singbilbilo
Number=Singbilbilabilo
Number=Dualbila
Number=Plurbilibilebila

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (3215; 99%), NOUN –[det]–> DET (2043; 89%), NOUN –[conj]–> NOUN (417; 53%), ADJ –[nsubj]–> NOUN (274; 97%), ADJ –[conj]–> ADJ (198; 94%), NOUN –[nmod]–> PROPN (186; 52%), PROPN –[flat:name]–> PROPN (134; 100%), NOUN –[appos]–> NOUN (127; 59%), ADJ –[nsubj]–> DET (105; 95%), ADJ –[det]–> DET (70; 89%).