
This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].
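As a minimal sketch of what "layered" means in practice: in the CoNLL-U FEATS column, a layer is written as a bracketed suffix on the feature name, so `Gender` and `Gender[psor]` are separate keys on the same token. The helper names `parse_feats` and `layers_of` and the FEATS string are hypothetical and chosen for illustration; they are not taken from the treebank.

```python
import re

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def layers_of(feature, feats):
    """Return {name: value} for a base feature and its bracketed layers."""
    pattern = re.compile(r"^" + re.escape(feature) + r"(\[[a-z]+\])?$")
    return {name: value for name, value in feats.items() if pattern.match(name)}

# Hypothetical FEATS value for a possessive determiner (illustration only).
feats = parse_feats("Case=Nom|Gender=Fem|Gender[psor]=Masc,Neut|Number=Sing|Poss=Yes")
gender_layers = layers_of("Gender", feats)
print(gender_layers)
```

Here the plain `Gender` layer describes the agreement gender of the word itself, while `Gender[psor]` describes the gender of the possessor.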

137715 tokens (47%) have a non-empty value of Gender. 40675 types (80%) occur at least once with a non-empty value of Gender. 34990 lemmas (83%) occur at least once with a non-empty value of Gender. The feature is used with 9 part-of-speech tags; each entry gives the number of tokens and their share of all tokens in the treebank: NOUN (50882; 17%), DET (36989; 13%), PROPN (26292; 9%), ADJ (14256; 5%), PRON (8962; 3%), NUM (175; 0%), X (86; 0%), ADV (64; 0%), SYM (9; 0%).
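Counts of this kind could be reproduced along the following lines. This is a sketch under stated assumptions, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout, counts syntactic words only, and reports each tag's share of all tokens, which is consistent with the per-tag percentages summing to the overall 47%. The sample sentence and the names `iter_tokens` and `gender_by_upos` are made up for illustration.

```python
from collections import Counter

# Made-up CoNLL-U sample (not taken from UD_German-GSD); columns are tab-separated.
SAMPLE = "\n".join([
    "# text = Der erste Tag beginnt",
    "1\tDer\tder\tDET\tART\tCase=Nom|Definite=Def|Gender=Masc|Number=Sing|PronType=Art\t3\tdet\t_\t_",
    "2\terste\terst\tADJ\tADJA\tCase=Nom|Gender=Masc|Number=Sing\t3\tamod\t_\t_",
    "3\tTag\tTag\tNOUN\tNN\tCase=Nom|Gender=Masc|Number=Sing\t4\tnsubj\t_\t_",
    "4\tbeginnt\tbeginnen\tVERB\tVVFIN\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_",
])

def iter_tokens(conllu_text):
    """Yield (upos, feats_dict) per syntactic word, skipping comments,
    multiword-token ranges (IDs with '-') and empty nodes (IDs with '.')."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue
        feats = {} if cols[5] == "_" else dict(p.split("=", 1) for p in cols[5].split("|"))
        yield cols[3], feats

def gender_by_upos(conllu_text):
    """Return (total token count, Counter of tokens with Gender per UPOS tag)."""
    total = 0
    with_gender = Counter()
    for upos, feats in iter_tokens(conllu_text):
        total += 1
        if "Gender" in feats:  # the plain layer only, not Gender[psor]
            with_gender[upos] += 1
    return total, with_gender

total, counts = gender_by_upos(SAMPLE)
for upos, n in counts.most_common():
    print(f"{upos} ({n}; {100 * n / total:.0f}% of all tokens)")
```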

NOUN

50882 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (36825; 72%).

NOUN tokens may have the following values of Gender:

Paradigm Tag (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing: Tag
Case=Acc|Number=Plur: Tage
Case=Dat|Number=Sing: Tag, Tage
Case=Dat|Number=Plur: Tagen
Case=Gen|Number=Sing: Tages, Tags
Case=Gen|Number=Plur: Tage / Tages
Case=Nom|Number=Sing: Tag / Tage

Gender seems to be a lexical feature of NOUN: 94% of lemmas (17031) occur with only one value of Gender.
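The "lexical feature" judgment rests on how many lemmas are seen with exactly one value of the feature. A minimal sketch of that computation follows; the function name `single_gender_share` and the (lemma, Gender) pairs are made up for illustration, and any threshold the page generator applies before calling a feature lexical is not stated in the source.

```python
from collections import defaultdict

def single_gender_share(observations):
    """observations: iterable of (lemma, gender) pairs for one part-of-speech tag.
    Returns (lemmas seen with exactly one Gender value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for genders in values.values() if len(genders) == 1)
    return single, len(values)

# Made-up observations; "See" appears with two genders, the others with one.
observations = [
    ("Tag", "Masc"), ("Tag", "Masc"), ("Jahr", "Neut"),
    ("Stadt", "Fem"), ("See", "Masc"), ("See", "Fem"),
]
single, total = single_gender_share(observations)
print(f"{100 * single / total:.0f}% of lemmas ({single}) occur with only one value of Gender")
```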

DET

36989 DET tokens (98% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (34028; 92%), Number=Sing (32082; 87%), Definite=Def (28840; 78%).

DET tokens may have the following values of Gender:

Paradigm der (columns: Masc / Fem / Neut)
Case=Acc|Definite=Def|Number=Sing|PronType=Art: den / die, der / das, 's
Case=Acc|Definite=Def|Number=Plur|PronType=Art: die, den / die / die
Case=Acc|Number=Sing|PronType=Dem: den / das
Case=Acc|Number=Sing|PronType=Rel: den / die / das
Case=Acc|Number=Plur|PronType=Rel: die / die
Case=Dat|Definite=Def|Number=Sing|PronType=Art: dem / der, die / dem, den
Case=Dat|Definite=Def|Number=Plur|PronType=Art: den / den, der / den, der
Case=Dat|Number=Sing|PronType=Dem: der / dem
Case=Dat|Number=Sing|PronType=Rel: dem / der / dem
Case=Gen|Definite=Def|Number=Sing|PronType=Art: des / der, Die / des, der
Case=Gen|Definite=Def|Number=Plur|PronType=Art: der / der / der
Case=Gen|Number=Sing|PronType=Dem: dessen / deren / dessen
Case=Gen|Number=Sing|PronType=Rel: dessen / deren / dessen
Case=Nom|Definite=Def|Number=Sing|PronType=Art: der / die / das
Case=Nom|Definite=Def|Number=Plur|PronType=Art: die / die / die
Case=Nom|Number=Sing|PronType=Dem: der / die / das
Case=Nom|Number=Sing|PronType=Rel: der / die / das
Case=Nom|Number=Plur|PronType=Rel: die / die
Definite=Def|Number=Sing|PronType=Art: der

PROPN

26292 PROPN tokens (86% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (25145; 96%).

PROPN tokens may have the following values of Gender:

Paradigm Deutschland (columns: Masc / Fem / Neut)
Case=Acc: Deutschland
Case=Dat: Deutschland / Deutschland
Case=Gen: Deutschlands, Deutschland
Case=Nom: Deutschland / Deutschland

Gender seems to be a lexical feature of PROPN: 91% of lemmas (13253) occur with only one value of Gender.

ADJ

14256 ADJ tokens (69% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (13335; 94%), Number=Sing (9936; 70%).

ADJ tokens may have the following values of Gender:

Paradigm erst (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing: ersten / erste / erste, erstes
Case=Acc|Number=Plur: ersten, erste / erste, ersten / erste, ersten
Case=Dat|Number=Sing: ersten / ersten, erster / ersten
Case=Dat|Number=Plur: ersten / ersten / ersten
Case=Gen|Number=Sing: ersten / ersten, erster / ersten
Case=Gen|Number=Plur: ersten / ersten / ersten
Case=Nom|Number=Sing: erste, erster / erste / erste, erstes
Case=Nom|Number=Plur: ersten, erste / ersten, erste / ersten, Erste

PRON

8962 PRON tokens (63% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (8961; 100%), Number=Sing (7925; 88%), Case=Nom (5983; 67%), Person=EMPTY (4753; 53%).

PRON tokens may have the following values of Gender:

Paradigm der (columns: Masc / Fem / Neut)
Case=Acc|Number=Sing|PronType=Dem: den / die / das
Case=Acc|Number=Sing|PronType=Rel: den, der / die / das
Case=Acc|Number=Plur|PronType=Dem: die
Case=Acc|Number=Plur|PronType=Rel: die / die / die
Case=Dat|Number=Sing|PronType=Dem: dem / der / dem
Case=Dat|Number=Sing|PronType=Rel: dem / der / dem
Case=Dat|Number=Plur|Poss=Yes|PronType=Rel: deren
Case=Dat|Number=Plur|PronType=Dem: denen
Case=Dat|Number=Plur|PronType=Rel: denen / denen, den / denen
Case=Gen|Number=Sing|PronType=Dem: dessen / deren / dessen
Case=Gen|Number=Sing|PronType=Rel: dessen / der, deren, derer / die
Case=Gen|Number=Plur|PronType=Dem: deren / deren
Case=Gen|Number=Plur|PronType=Rel: deren
Case=Nom|Number=Sing|PronType=Dem: der / die / das, des, die
Case=Nom|Number=Sing|PronType=Rel: der, die / die / das
Case=Nom|Number=Plur|PronType=Rel: die / die / die

NUM

175 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (175; 100%).

NUM tokens may have the following values of Gender:

Paradigm ein (columns: Masc / Fem / Neut)
Case=Acc: einen, ein / eine / ein
Case=Dat: einem, ein / einer / einem
Case=Gen: einer
Case=Nom: ein / eine / ein

X

86 X tokens (25% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (86; 100%), Number=Sing (66; 77%).

X tokens may have the following values of Gender:

Paradigm B. (columns: Masc / Fem / Neut)
Case=Dat: B.
Case=Nom: B. / B.

Gender seems to be a lexical feature of X: 92% of lemmas (48) occur with only one value of Gender.

ADV

64 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

Paradigm super (columns: Masc / Fem)
Case=Acc: super
Case=Nom: Super / super

Gender seems to be a lexical feature of ADV: 92% of lemmas (45) occur with only one value of Gender.

SYM

9 SYM tokens (10% of all SYM tokens) have a non-empty value of Gender.

SYM tokens may have the following values of Gender:

Paradigm ° (columns: Masc / Fem)
° / °

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (27006; 95%), NOUN –[amod]–> ADJ (12061; 91%), PROPN –[det]–> DET (4882; 87%), PROPN –[flat]–> PROPN (4779; 82%), NOUN –[det:poss]–> DET (2159; 95%), NOUN –[appos]–> PROPN (1765; 55%), NOUN –[det]–> PRON (1584; 88%), PROPN –[conj]–> PROPN (1307; 63%), PROPN –[amod]–> PROPN (1075; 75%), NOUN –[compound]–> NOUN (668; 78%).
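Agreement counts of this kind could be gathered roughly as follows. This is a sketch under assumptions, not the page's own generator: it assumes the percentage is taken over parent-child pairs in which both nodes carry a Gender value, and it compares values by exact string equality, so a multi-valued entry such as `Masc,Neut` would only match an identical string. The sample sentences and the names `gender_of` and `agreement_counts` are made up for illustration.

```python
from collections import Counter

# Two made-up sentences; the second one contains a deliberate gender mismatch.
SAMPLE = "\n".join([
    "1\tDer\tder\tDET\tART\tCase=Nom|Gender=Masc|Number=Sing\t3\tdet\t_\t_",
    "2\terste\terst\tADJ\tADJA\tCase=Nom|Gender=Masc|Number=Sing\t3\tamod\t_\t_",
    "3\tTag\tTag\tNOUN\tNN\tCase=Nom|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "",
    "1\tdie\tder\tDET\tART\tGender=Fem\t2\tdet\t_\t_",
    "2\tJahr\tJahr\tNOUN\tNN\tGender=Neut\t0\troot\t_\t_",
])

def gender_of(feats):
    """Return the value of the plain Gender layer, or None if absent."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None

def agreement_counts(conllu_text):
    """Count, per (parent UPOS, relation, child UPOS), the pairs where both
    nodes have Gender (both) and the pairs where the values are equal (agree)."""
    agree, both = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        nodes = {}
        for line in block.splitlines():
            if line.startswith("#"):
                continue
            cols = line.split("\t")
            if cols[0].isdigit():  # skips multiword ranges and empty nodes
                nodes[cols[0]] = (cols[3], cols[6], cols[7], gender_of(cols[5]))
        for upos, head, deprel, gender in nodes.values():
            parent = nodes.get(head)  # None for the root, whose head is 0
            if gender is None or parent is None or parent[3] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if gender == parent[3]:
                agree[key] += 1
    return agree, both

agree, both = agreement_counts(SAMPLE)
for (parent, rel, child), n in agree.most_common():
    share = 100 * n / both[(parent, rel, child)]
    print(f"{parent} -[{rel}]-> {child} ({n}; {share:.0f}%)")
```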