home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SSJ: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

64842 tokens (46%) have a non-empty value of Gender. 29189 types (92%) occur at least once with a non-empty value of Gender. 14634 lemmas (87%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (30194; 21% instances), ADJ (15062; 11% instances), VERB (6927; 5% instances), PROPN (4690; 3% instances), DET (4470; 3% instances), PRON (2272; 2% instances), AUX (743; 1% instances), NUM (484; 0% instances).

NOUN

30194 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21376; 71%).

NOUN tokens may have the following values of Gender:

Paradigm potMascFem
Case=Acc|Number=Singpot
Case=Acc|Number=Plurpoti
Case=Dat|Number=Singpoti
Case=Gen|Number=Singpotapoti
Case=Gen|Number=Plurpoti
Case=Ins|Number=Singpotjo
Case=Ins|Number=Plurpotmi
Case=Loc|Number=Singpoti
Case=Loc|Number=Plurpoteh
Case=Nom|Number=Singpot
Case=Nom|Number=Plurpoti

Gender seems to be lexical feature of NOUN. 100% lemmas (6412) occur only with one value of Gender.

ADJ

15062 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (13796; 92%), VerbForm=EMPTY (13131; 87%), Definite=EMPTY (12988; 86%), Number=Sing (10150; 67%).

ADJ tokens may have the following values of Gender:

Paradigm drugMascFemNeut
Case=Acc|Definite=Def|Number=Singdrugi
Case=Acc|Definite=Ind|Number=Singdrug
Case=Acc|Number=Singdrugegadrugodrugo
Case=Acc|Number=Plurdrugedrugedruga
Case=Dat|Number=Singdrugemudrugi
Case=Dat|Number=Plurdrugim
Case=Gen|Number=Singdrugegadrugedrugega
Case=Gen|Number=Plurdrugihdrugih
Case=Ins|Number=Singdrugimdrugodrugim
Case=Ins|Number=Plurdrugimidrugimi
Case=Loc|Number=Singdrugemdrugidrugem
Case=Loc|Number=Dualdrugih
Case=Loc|Number=Plurdrugihdrugihdrugih
Case=Nom|Definite=Def|Number=Singdrugi
Case=Nom|Definite=Ind|Number=Singdrug
Case=Nom|Number=Singdrugadrugo
Case=Nom|Number=Plurdrugidrugedruga

VERB

6927 VERB tokens (48% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6927; 100%), Person=EMPTY (6927; 100%), Tense=EMPTY (6927; 100%), VerbForm=Part (6927; 100%), Number=Sing (4538; 66%), Aspect=Perf (4182; 60%).

VERB tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbilbilabilo, blo
Number=Dualbila, blabili
Number=Plurbilibile

PROPN

4690 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (4416; 94%), Case=Nom (2433; 52%).

PROPN tokens may have the following values of Gender:

Paradigm EUMascFem
Case=AccEU
Case=GenEU
Case=LocEU
Case=NomEUEU

Gender seems to be lexical feature of PROPN. 99% lemmas (2581) occur only with one value of Gender.

DET

4470 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number[psor]=EMPTY (3644; 82%), Person=EMPTY (3644; 82%), Number=Sing (3187; 71%), Poss=EMPTY (3164; 71%).

DET tokens may have the following values of Gender:

Paradigm taMascFemNeut
Case=Acc|Number=Singta, tegatoto
Case=Acc|Number=Dualti
Case=Acc|Number=Plurteteta
Case=Dat|Number=Singtemutejtemu
Case=Dat|Number=Plurtemtem
Case=Gen|Number=Singtegatetega
Case=Gen|Number=Dualteh
Case=Gen|Number=Plurtehtehteh
Case=Ins|Number=Singtemtotem
Case=Ins|Number=Plurtemitemi
Case=Loc|Number=Singtemtejtem
Case=Loc|Number=Plurtehtehteh
Case=Nom|Number=Singtatato
Case=Nom|Number=Dualta
Case=Nom|Number=Plurtiteta

PRON

2272 PRON tokens (42% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (2272; 100%), Number=Sing (1733; 76%), PronType=Prs (1668; 73%), Person=3 (1644; 72%), Variant=Short (1259; 55%), Case=Acc (1162; 51%).

PRON tokens may have the following values of Gender:

Paradigm onMascFemNeut
Case=Acc|Number=Singnjeganjo
Case=Acc|Number=Sing|Variant=Shortgajoga
Case=Acc|Number=Dualnjiju
Case=Acc|Number=Dual|Variant=Shortjujuju
Case=Acc|Number=Plurnjih, nje
Case=Acc|Number=Plur|Variant=Shortjihjihjih
Case=Dat|Number=Singnjemunjej
Case=Dat|Number=Sing|Variant=Shortmujimu
Case=Dat|Number=Dualnjima
Case=Dat|Number=Dual|Variant=Shortjimajima
Case=Dat|Number=Plurnjimnjim
Case=Dat|Number=Plur|Variant=Shortjimjimjim
Case=Gen|Number=Singnjeganjenjega
Case=Gen|Number=Sing|Variant=Shortgajega
Case=Gen|Number=Dualnjiju
Case=Gen|Number=Dual|Variant=Shortju
Case=Gen|Number=Plurnjihnjihnjih
Case=Gen|Number=Plur|Variant=Shortjihjihjih
Case=Ins|Number=Singnjimnjonjim
Case=Ins|Number=Dualnjimanjima
Case=Ins|Number=Plurnjiminjiminjimi
Case=Loc|Number=Singnjemnjejnjem
Case=Loc|Number=Dualnjiju
Case=Loc|Number=Plurnjihnjihnjih
Case=Nom|Number=Singonona
Case=Nom|Number=Dualonadva
Case=Nom|Number=Pluroni

AUX

743 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (743; 100%), Person=EMPTY (743; 100%), Polarity=EMPTY (743; 100%), Tense=EMPTY (743; 100%), VerbForm=Part (743; 100%), Number=Sing (568; 76%).

AUX tokens may have the following values of Gender:

Paradigm bitiMascFemNeut
Number=Singbilbilabilo
Number=Dualbilabili
Number=Plurbilibilebila

NUM

484 NUM tokens (25% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (484; 100%), NumType=Card (479; 99%).

NUM tokens may have the following values of Gender:

Paradigm enMascFemNeut
Case=Acc|Number=Singen, enegaenoeno
Case=Dat|Number=Singenemueni
Case=Gen|Number=Singenegaeneenega
Case=Ins|Number=Singenimenoenim
Case=Loc|Number=Singenemenienem
Case=Loc|Number=Plurenih
Case=Nom|Number=Singenenaeno
Case=Nom|Number=Plureni

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[amod]–> ADJ (11328; 99%), NOUN –[det]–> DET (2859; 87%), ADJ –[nsubj]–> NOUN (883; 98%), NOUN –[nmod]–> PROPN (828; 55%), PROPN –[flat:name]–> PROPN (685; 100%), ADJ –[conj]–> ADJ (633; 93%), VERB –[nsubj]–> PROPN (580; 73%), VERB –[conj]–> VERB (557; 69%), PROPN –[amod]–> ADJ (245; 99%), PROPN –[conj]–> PROPN (226; 71%).