This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home got/feat issue tracker

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Gothic)

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 3 combinations have been observed: Fem|Masc, Fem|Neut, Masc|Neut.

28082 tokens (50%) have a non-empty value of Gender. 5449 types (62%) occur at least once with a non-empty value of Gender. 2504 lemmas (75%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: got-pos/NOUN (10320; 18% instances), got-pos/PRON (8020; 14% instances), got-pos/ADJ (3235; 6% instances), got-pos/VERB (2672; 5% instances), got-pos/DET (1805; 3% instances), got-pos/PROPN (1739; 3% instances), got-pos/NUM (291; 1% instances).

NOUN

10320 got-pos/NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (7794; 76%).

NOUN tokens may have the following values of Gender:

Paradigm sunnoFem,NeutFemNeut
Case=Accsunnon
Case=Datsunnin
Case=Nomsunno

Gender seems to be lexical feature of NOUN. 99% lemmas (1196) occur only with one value of Gender.

PRON

8020 got-pos/PRON tokens (98% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (6901; 86%), PronType=Prs (5838; 73%), Number=Sing (5425; 68%).

PRON tokens may have the following values of Gender:

Paradigm allsFem,MascMascMasc,NeutFemNeut
Case=Acc|Number=Singallanaallaall, allata
Case=Acc|Number=Plurallansallosalla
Case=Dat|Number=Singallammaallammaallaiallamma
Case=Dat|Number=Plurallaimallaimallaimallaim
Case=Gen|Number=Singallaizosallis
Case=Gen|Number=Plurallaizeallaizeallaizoallaize
Case=Nom|Number=Singallsallaall, allata
Case=Nom|Number=Plurallaiallosalla
Case=Voc|Number=Plurallos

ADJ

3235 got-pos/ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2160; 67%), Degree=Pos (2074; 64%).

ADJ tokens may have the following values of Gender:

Paradigm silbaFem,NeutMascMasc,NeutFemNeut
Case=Acc|Number=Sing|Strength=Strongsilbo
Case=Acc|Number=Sing|Strength=Weaksilbansilbo
Case=Acc|Number=Plur|Strength=Weaksilbans
Case=Dat|Number=Sing|Strength=Strongsilbinsilbin
Case=Dat|Number=Sing|Strength=Weaksilbinsilbin
Case=Dat|Number=Plur|Strength=Weaksilbamsilbam
Case=Gen|Number=Sing|Strength=Weaksilbinssilbons
Case=Nom|Number=Sing|Strength=Strongsilbosilbo
Case=Nom|Number=Sing|Strength=Weaksilbasilbosilbo
Case=Nom|Number=Plur|Strength=Weaksilbans

VERB

2672 got-pos/VERB tokens (21% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=Part (2672; 100%), Person=EMPTY (2672; 100%), Mood=EMPTY (2672; 100%), Case=Nom (2024; 76%), Voice=Act (1952; 73%), Tense=Pres (1950; 73%), Number=Sing (1654; 62%), Strength=Strong (1384; 52%).

VERB tokens may have the following values of Gender:

Paradigm wisan#1MascMasc,NeutFemNeut
Aspect=Perf|Case=Acc|Number=Sing|Strength=Strong|Tense=Past|Voice=Passwisan
Case=Acc|Number=Sing|Strength=Weak|Tense=Pres|Voice=Actwisandanwisando
Case=Acc|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actwisandans
Case=Dat|Number=Sing|Strength=Weak|Tense=Pres|Voice=Actwisandinwisandein, wisandinwisandin
Case=Dat|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actwisandamwisandamwisandeim
Case=Gen|Number=Sing|Strength=Weak|Tense=Pres|Voice=Actwisandinswisandins
Case=Nom|Number=Sing|Strength=Strong|Tense=Pres|Voice=Actwisands
Case=Nom|Number=Sing|Strength=Weak|Tense=Pres|Voice=Actwisandei
Case=Nom|Number=Plur|Strength=Weak|Tense=Pres|Voice=Actwisandanswisandona

DET

1805 got-pos/DET tokens (96% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (1186; 66%).

DET tokens may have the following values of Gender:

Paradigm saMascMasc,NeutFemNeut
Case=Acc|Number=Singþanaþoþata, þat
Case=Acc|Number=Plurþansþosþo
Case=Dat|Number=Singþammaþammaþizaiþamma
Case=Dat|Number=Plurþaimþaimþaimþaim
Case=Gen|Number=Singþisþisþizosþis
Case=Gen|Number=Plurþizeþize, þizeiþizoþize
Case=Nom|Number=Singsasoþata, þat
Case=Nom|Number=Plurþaiþosþo

PROPN

1739 got-pos/PROPN tokens (94% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1709; 98%).

PROPN tokens may have the following values of Gender:

Paradigm IairusaulwmaMascFem
Case=AccIairusaulwma
Case=DatIairusaulwmaiIairausaulwmai
Case=GenIairusaulwmos

Gender seems to be lexical feature of PROPN. 97% lemmas (231) occur only with one value of Gender.

NUM

291 got-pos/NUM tokens (73% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: Number=Sing (179; 62%).

NUM tokens may have the following values of Gender:

Paradigm ainsMascMasc,NeutFemNeut
Case=Acc|Number=Singainanaainaain
Case=Acc|Number=Plurainans
Case=Dat|Number=Singainammaainammaainaiainamma
Case=Dat|Number=Plurainaim
Case=Gen|Number=Singainisainisainaizosainis
Case=Nom|Number=Singains, ainzainaain, ainata
Case=Nom|Number=Plurainai

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> PRON (1285; 70%), NOUN –[det]–> DET (938; 90%), NOUN –[amod]–> ADJ (567; 81%), VERB –[det]–> DET (350; 84%), NOUN –[nmod]–> ADJ (266; 77%), ADJ –[det]–> DET (219; 82%), NOUN –[nmod]–> PROPN (199; 56%), PROPN –[appos]–> NOUN (188; 93%), VERB –[conj]–> VERB (174; 92%), PROPN –[name]–> PROPN (136; 100%).


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]