home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

137764 tokens (47%) have a non-empty value of Gender. 40699 types (80%) occur at least once with a non-empty value of Gender. 35038 lemmas (83%) occur at least once with a non-empty value of Gender. The feature is used with 10 part-of-speech tags: NOUN (50901; 17% instances), DET (36729; 13% instances), PROPN (26384; 9% instances), ADJ (14161; 5% instances), PRON (9218; 3% instances), NUM (204; 0% instances), X (86; 0% instances), ADV (64; 0% instances), SYM (9; 0% instances), PART (8; 0% instances).

NOUN

50901 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (36841; 72%).

NOUN tokens may have the following values of Gender:

Paradigm TagMascFemNeut
Case=Acc|Number=SingTag
Case=Acc|Number=PlurTage
Case=Dat|Number=SingTag, Tage
Case=Dat|Number=PlurTagen
Case=Gen|Number=SingTages, Tags
Case=Gen|Number=PlurTageTages
Case=Nom|Number=SingTagTage

Gender seems to be lexical feature of NOUN. 94% lemmas (17032) occur only with one value of Gender.

DET

36729 DET tokens (98% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (33765; 92%), Number=Sing (31863; 87%), Definite=Def (28577; 78%).

DET tokens may have the following values of Gender:

Paradigm derMascFemNeut
Case=Acc|Definite=Def|Number=Sing|PronType=Artdendie, derdas
Case=Acc|Definite=Def|Number=Plur|PronType=Artdie, dendiedie
Case=Acc|Number=Sing|PronType=Demdendas
Case=Acc|Number=Sing|PronType=Reldendiedas
Case=Acc|Number=Plur|PronType=Reldiedie
Case=Dat|Definite=Def|Number=Sing|PronType=Artdemder, diedem, den
Case=Dat|Definite=Def|Number=Plur|PronType=Artdenden, derden, der
Case=Dat|Number=Singdemdem
Case=Dat|Number=Sing|PronType=Demderdem
Case=Dat|Number=Sing|PronType=Reldemderdem
Case=Gen|Definite=Def|Number=Sing|PronType=Artdesderdes, der
Case=Gen|Definite=Def|Number=Plur|PronType=Artderderder
Case=Gen|Number=Sing|PronType=Demdessenderendessen
Case=Gen|Number=Sing|PronType=Reldessenderendessen
Case=Nom|Definite=Def|Number=Sing|PronType=Artderdiedas
Case=Nom|Definite=Def|Number=Plur|PronType=Artdiediedie
Case=Nom|Number=Sing|PronType=Demderdiedas
Case=Nom|Number=Sing|PronType=Relderdiedas
Case=Nom|Number=Plur|PronType=Reldiedie
Definite=Def|Number=Sing|PronType=Artder

PROPN

26384 PROPN tokens (86% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (25236; 96%).

PROPN tokens may have the following values of Gender:

Paradigm DeutschlandMascFemNeut
Case=AccDeutschland
Case=DatDeutschlandDeutschland
Case=GenDeutschlands, Deutschland
Case=NomDeutschlandDeutschland

Gender seems to be lexical feature of PROPN. 91% lemmas (13258) occur only with one value of Gender.

ADJ

14161 ADJ tokens (69% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (9844; 70%).

ADJ tokens may have the following values of Gender:

Paradigm großMascFemNeut
Case=Acc|Number=Singgroßen, größten, größeren, grossengroße, größere, grosse, größtegroßes, große, größeres
Case=Acc|Number=Plurgroße, größeregroße, größten, großen, grössere, größeregroße, größere
Case=Dat|Number=Singgroßen, großem, größten, größeremgroßen, großer, größten, größter, größerergroßen, großem, größerem, größeren, größten
Case=Dat|Number=Plurgroßen, größerengroßen, größeren, größten, grossengroßen
Case=Gen|Number=Singgroßen, größtengroßen, größeren, großer, größerer, größtengroßen, größten
Case=Gen|Number=Plurgroßer, großen, größerer, größtengrößten, großen, größerer, größerengrößerer, großen, größten
Case=Nom|Number=Singgroßer, größte, große, größter, größerer, grosse, grossergroße, größte, größeregroßes, große, grösste, größeres, größte
Case=Nom|Number=Plurgroße, größere, großen, größtengroße, großen, größten, größeregroße, grosse, großen

PRON

9218 PRON tokens (64% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (9217; 100%), Number=Sing (8141; 88%), Definite=EMPTY (7195; 78%), Case=Nom (6078; 66%), Person=EMPTY (5010; 54%).

PRON tokens may have the following values of Gender:

Paradigm derMascFemNeut
Case=Acc|Definite=Def|Number=Sing|PronType=Artdendiedas
Case=Acc|Definite=Def|Number=Plur|PronType=Artdiediedie
Case=Acc|Number=Singder
Case=Acc|Number=Sing|PronType=Demdendiedas
Case=Acc|Number=Sing|PronType=Reldendiedas
Case=Acc|Number=Plur|PronType=Demdie
Case=Acc|Number=Plur|PronType=Reldiedie
Case=Dat|Definite=Def|Number=Sing|PronType=Artdemderdem
Case=Dat|Definite=Def|Number=Plur|PronType=Artden
Case=Dat|Number=Sing|PronType=Demdemderdem
Case=Dat|Number=Sing|PronType=Reldemderdem
Case=Dat|Number=Plur|Poss=Yesderen
Case=Dat|Number=Plur|PronType=Demdenen
Case=Dat|Number=Plur|PronType=Reldenendenendenen
Case=Gen|Definite=Def|Number=Sing|PronType=Artderdie
Case=Gen|Definite=Def|Number=Plur|PronType=Artder
Case=Gen|Number=Sing|PronType=Demdessenderendessen
Case=Gen|Number=Sing|PronType=Reldessenderen, derer
Case=Gen|Number=Plur|PronType=Demderenderen
Case=Gen|Number=Plur|PronType=Relderen
Case=Nom|Definite=Def|Number=Sing|PronType=Artderdiedas
Case=Nom|Definite=Def|Number=Plur|PronType=Artdiediedie
Case=Nom|Number=Singder, die
Case=Nom|Number=Sing|PronType=Demderdiedas, des, die
Case=Nom|Number=Sing|PronType=Relderdiedas
Case=Nom|Number=Plur|PronType=Reldiediedie

NUM

204 NUM tokens (3% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (202; 99%).

NUM tokens may have the following values of Gender:

Paradigm einMascFemNeut
Case=Acceinen, eineineein
Case=Dateinem, eineinereinem
Case=Geneiner
Case=Nomeineineein

X

86 X tokens (25% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (86; 100%), Number=Sing (66; 77%).

X tokens may have the following values of Gender:

Paradigm B.MascFemNeut
Case=DatB.
Case=NomB.B.

Gender seems to be lexical feature of X. 92% lemmas (48) occur only with one value of Gender.

ADV

64 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

Paradigm superMascFem
Case=Accsuper
Case=NomSupersuper

Gender seems to be lexical feature of ADV. 92% lemmas (45) occur only with one value of Gender.

SYM

9 SYM tokens (10% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Paradigm °MascFem
°°

PART

8 PART tokens (0% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=Neg (5; 63%).

PART tokens may have the following values of Gender:

Paradigm nichtFemNeut
Case=Acc|Number=Singnicht
Case=Dat|Number=Singnicht
Case=Dat|Number=Plurnicht

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (26980; 95%), NOUN –[amod]–> ADJ (12057; 91%), PROPN –[flat]–> PROPN (4782; 82%), PROPN –[det]–> DET (4680; 87%), NOUN –[det:poss]–> DET (2159; 95%), NOUN –[appos]–> PROPN (1765; 55%), NOUN –[det]–> PRON (1615; 88%), PROPN –[conj]–> PROPN (1308; 63%), PROPN –[amod]–> PROPN (1160; 77%), NOUN –[compound]–> NOUN (669; 78%).