
This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

9589 tokens (33%) have a non-empty value of Gender. 4433 types (72%) occur at least once with a non-empty value of Gender. 2967 lemmas (75%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (3626; 12% instances), ADJ (1664; 6% instances), DET (1611; 5% instances), VERB (1164; 4% instances), PRON (682; 2% instances), PROPN (444; 2% instances), NUM (270; 1% instances), AUX (128; 0% instances).
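Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout (UPOS in column 4, FEATS in column 6); the function names and the three-token sample are invented for illustration and are not taken from UD_Slovenian-SST or from the script that generated this page.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

# Invented sample rows (hypothetical, for illustration only).
ROWS = [
    ["1", "ta", "ta", "DET", "_", "Case=Nom|Gender=Fem|Number=Sing|PronType=Dem", "2", "det", "_", "_"],
    ["2", "hiša", "hiša", "NOUN", "_", "Case=Nom|Gender=Fem|Number=Sing", "0", "root", "_", "_"],
    ["3", "in", "in", "CCONJ", "_", "_", "2", "cc", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos, count in with_gender.most_common():
    print(upos, count, f"{100 * count / total[upos]:.0f}% of all {upos} tokens")
```

Dividing by `total[upos]` gives the per-tag share reported in the sections below (for example "87% of all DET tokens").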

NOUN

3626 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Animacy=EMPTY (3245; 89%), Number=Sing (2736; 75%).
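The co-occurrence figures can be sketched as follows. This assumes that `EMPTY` stands for a feature that is absent on the token, and that the features tracked for a tag are all those attested on at least one token with that tag; both are assumptions about how the page was generated, and the function name and sample tokens are invented.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def cooccurrence(tokens, upos, feature="Gender"):
    """tokens: list of (upos, feats_string) pairs.

    Returns the number of `upos` tokens carrying `feature` and a Counter
    of the other feature=value pairs seen on those tokens.
    """
    feats_for_tag = [parse_feats(f) for u, f in tokens if u == upos]
    # Assumption: track every feature name attested with this tag.
    names = {name for feats in feats_for_tag for name in feats} - {feature}
    with_feature = [feats for feats in feats_for_tag if feature in feats]
    counts = Counter()
    for feats in with_feature:
        for name in names:
            # Assumption: a missing feature is reported as EMPTY.
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return len(with_feature), counts

# Invented tokens (hypothetical, not from the treebank).
TOKENS = [
    ("NOUN", "Case=Nom|Gender=Fem|Number=Sing"),
    ("NOUN", "Animacy=Anim|Case=Acc|Gender=Masc|Number=Sing"),
    ("NOUN", "Case=Gen|Gender=Neut|Number=Plur"),
    ("ADJ", "Case=Nom|Degree=Pos|Gender=Fem|Number=Sing"),
]
n, counts = cooccurrence(TOKENS, "NOUN")
for pair, k in counts.most_common():
    print(f"{pair} ({k}; {100 * k / n:.0f}%)")
```

Under these assumptions, a high share for `Animacy=EMPTY` simply means that most gendered tokens of the tag carry no Animacy value.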

NOUN tokens may have the following values of Gender:

Paradigm oči (Masc; Fem)
Case=Gen|Number=Plur: oči
Case=Nom|Number=Sing: oči

Gender seems to be a lexical feature of NOUN. 100% of lemmas (1523) occur with only one value of Gender.
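One way to test whether a feature behaves lexically is to collect the set of values seen per lemma and measure how many lemmas have exactly one. The sketch below assumes the input has already been reduced to (lemma, Gender) pairs for a single part of speech; the function name and the sample pairs are invented for illustration.

```python
from collections import defaultdict

def single_value_share(pairs):
    """pairs: iterable of (lemma, gender) for one part of speech.

    Returns (number of lemmas, share of lemmas seen with exactly one value).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return len(values), single / len(values)

# Invented pairs (hypothetical annotation, not from the treebank).
PAIRS = [
    ("hiša", "Fem"),
    ("hiša", "Fem"),
    ("dan", "Masc"),
    ("lemmaX", "Masc"),
    ("lemmaX", "Fem"),
]
n_lemmas, share = single_value_share(PAIRS)
print(n_lemmas, f"{100 * share:.0f}%")
```

Because the page reports whole percentages, a rounded 100% may still leave room for a small number of lemmas that occur with more than one value.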

ADJ

1664 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (1478; 89%), Degree=Pos (1442; 87%), Definite=EMPTY (1350; 81%), Number=Sing (1266; 76%), Case=Nom (880; 53%).

ADJ tokens may have the following values of Gender:

Paradigm drug (Masc; Fem; Neut)
Case=Acc|Definite=Def|Number=Sing: drugi
Case=Acc|Number=Sing: drugo; drugo
Case=Acc|Number=Plur: druge; druge
Case=Dat|Number=Sing: drugemu
Case=Gen|Number=Sing: drugega; druge; drugega
Case=Gen|Number=Plur: drugih; drugih
Case=Ins|Number=Sing: drugo; drugim
Case=Ins|Number=Plur: drugimi
Case=Loc|Number=Sing: drugi; drugem
Case=Loc|Number=Dual: drugih
Case=Nom|Definite=Def|Number=Sing: drugi
Case=Nom|Definite=Ind|Number=Sing: drug
Case=Nom|Number=Sing: druga; drugo
Case=Nom|Number=Plur: drugi; druge

DET

1611 DET tokens (87% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1332; 83%), PronType=Dem (1055; 65%).

DET tokens may have the following values of Gender:

Paradigm ta (Masc; Fem; Neut)
Case=Acc|Number=Sing: ta, tega; to; to
Case=Acc|Number=Plur: te; te; ta
Case=Dat|Number=Sing: temu; tej; temu
Case=Dat|Number=Plur: tem; tem; tem
Case=Gen|Number=Sing: tega; te; tega
Case=Gen|Number=Plur: teh; teh; teh
Case=Ins|Number=Sing: tem; to; tem
Case=Ins|Number=Plur: temi; temi
Case=Loc|Number=Sing: tem; tej; tem
Case=Loc|Number=Plur: teh; teh
Case=Nom|Number=Sing: ta; ta; to
Case=Nom|Number=Dual: ti
Case=Nom|Number=Plur: ti; te; ta

VERB

1164 VERB tokens (30% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (1164; 100%), Person=EMPTY (1164; 100%), Polarity=EMPTY (1164; 100%), Tense=EMPTY (1164; 100%), VerbForm=Part (1164; 100%), Number=Sing (781; 67%).

VERB tokens may have the following values of Gender:

Paradigm biti (Masc; Fem; Neut)
Aspect=Imp|Number=Sing: bil; bilo
Number=Sing: bil; bila; bilo
Number=Dual: bila
Number=Plur: bili; bile

PRON

682 PRON tokens (42% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (682; 100%), Number=Sing (493; 72%), Variant=EMPTY (473; 69%), PronType=Prs (397; 58%).

PRON tokens may have the following values of Gender:

Paradigm on (Masc; Fem; Neut)
Case=Acc|Number=Sing: njega; njo
Case=Acc|Number=Sing|Variant=Short: ga; jo; ga
Case=Acc|Number=Plur: njih
Case=Acc|Number=Plur|Variant=Short: jih; jih; jih
Case=Dat|Number=Sing: njemu; njej
Case=Dat|Number=Sing|Variant=Short: mu; ji
Case=Dat|Number=Plur: njim
Case=Dat|Number=Plur|Variant=Short: jim; jim
Case=Gen|Number=Sing: njega; nje
Case=Gen|Number=Sing|Variant=Short: ga; je
Case=Gen|Number=Plur|Variant=Short: jih; jih
Case=Ins|Number=Sing: njim; njo
Case=Ins|Number=Dual: njima
Case=Ins|Number=Plur: njimi; njimi
Case=Loc|Number=Sing: njem; njej
Case=Loc|Number=Plur: njih; njih
Case=Nom|Number=Sing: on; ona
Case=Nom|Number=Dual: onadva
Case=Nom|Number=Plur: oni; one

PROPN

444 PROPN tokens (59% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (403; 91%), Case=Nom (233; 52%).

PROPN tokens may have the following values of Gender:

Gender seems to be a lexical feature of PROPN. 100% of lemmas (306) occur with only one value of Gender.

NUM

270 NUM tokens (54% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (270; 100%), NumType=Card (269; 100%), Number=Sing (153; 57%).

NUM tokens may have the following values of Gender:

Paradigm en (Masc; Fem; Neut)
Case=Acc|Number=Sing: en, enega; eno; eno
Case=Acc|Number=Plur: ene
Case=Dat|Number=Sing: enemu
Case=Gen|Number=Sing: enega; ene
Case=Gen|Number=Plur: enih
Case=Ins|Number=Sing: enim; eno; enim
Case=Loc|Number=Sing: eni
Case=Nom|Number=Sing: en; ena; eno
Case=Nom|Number=Plur: eni; ena

AUX

128 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (128; 100%), Person=EMPTY (128; 100%), Polarity=EMPTY (128; 100%), Tense=EMPTY (128; 100%), VerbForm=Part (128; 100%), Number=Sing (105; 82%).

AUX tokens may have the following values of Gender:

Paradigm biti (Masc; Fem; Neut)
Aspect=Imp|Number=Sing: bil; bilo
Number=Sing: bil; bila; bilo
Number=Dual: bila
Number=Plur: bili; bile; bila

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (942; 99%), NOUN –[det]–> DET (581; 90%), NOUN –[nummod]–> NUM (141; 54%), NOUN –[conj]–> NOUN (106; 60%), ADJ –[nsubj]–> NOUN (75; 96%), PROPN –[flat:name]–> PROPN (75; 100%), ADJ –[conj]–> ADJ (50; 93%), ADJ –[nsubj]–> DET (39; 93%), NOUN –[appos]–> NOUN (30; 64%), ADJ –[det]–> DET (27; 93%).
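Agreement counts of this kind can be sketched by pairing each node with its head and comparing their Gender values. The example below assumes that the percentage is taken over parent-child pairs in which both nodes have a non-empty Gender; that denominator is an assumption, and the function names and the three-token sentence (with a deliberately mismatched adjective) are invented.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_agreement(sentence_lines):
    """Count, per (parent UPOS, deprel, child UPOS), the pairs where both
    nodes have Gender (`both`) and those where the values match (`agree`)."""
    nodes = {}
    for line in sentence_lines:
        cols = line.split("\t")
        if cols[0].isdigit():  # regular syntactic words only
            gender = parse_feats(cols[5]).get("Gender")
            nodes[cols[0]] = (cols[3], gender, cols[6], cols[7])
    agree, both = Counter(), Counter()
    for upos, gender, head, deprel in nodes.values():
        parent = nodes.get(head)  # None for the root (HEAD = 0)
        if parent is None or gender is None or parent[1] is None:
            continue
        key = (parent[0], deprel, upos)
        both[key] += 1
        if gender == parent[1]:
            agree[key] += 1
    return agree, both

# Invented sentence (hypothetical; the ADJ disagrees on purpose).
ROWS = [
    ["1", "ta", "ta", "DET", "_", "Gender=Fem|Number=Sing", "3", "det", "_", "_"],
    ["2", "velik", "velik", "ADJ", "_", "Gender=Masc|Number=Sing", "3", "amod", "_", "_"],
    ["3", "hiša", "hiša", "NOUN", "_", "Gender=Fem|Number=Sing", "0", "root", "_", "_"],
]
agree, both = gender_agreement(["\t".join(row) for row in ROWS])
for (parent, deprel, child), n in both.items():
    print(f"{parent} -[{deprel}]-> {child}", agree[(parent, deprel, child)], "of", n)
```

For a whole treebank, the two counters would be accumulated over all sentences before computing the shares.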