home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

65616 tokens (22%) have a non-empty value of Gender. 16530 types (33%) occur at least once with a non-empty value of Gender. 14421 lemmas (34%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: DET (24385; 8% instances), NOUN (24295; 8% instances), ADJ (6577; 2% instances), PRON (5571; 2% instances), PROPN (4750; 2% instances), NUM (29; 0% instances), VERB (4; 0% instances), ADP (2; 0% instances), ADV (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances).

DET

24385 DET tokens (67% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (24238; 99%), Number=Sing (23888; 98%), Definite=Def (20522; 84%).

DET tokens may have the following values of Gender:

Paradigm derMascMasc,NeutFemNeut
Case=Acc|Definite=Def|Number=Sing|PronType=Artdendemdie, derdas, dem
Case=Acc|Definite=Def|Number=Plur|PronType=Artdie, dendie, dendie, den
Case=Acc|Number=Sing|PronType=Demdessen, dendiedas
Case=Acc|Number=Sing|PronType=Reldessen, dendiedas
Case=Acc|Number=Plur|PronType=Demdas
Case=Dat|Definite=Def|Number=Sing|PronType=Artdemder, dem, den, die
Case=Dat|Definite=Def|Number=Plur|PronType=Artdendenden
Case=Dat|Number=Sing|PronType=Demdemderer
Case=Dat|Number=Sing|PronType=Reldemder
Case=Dat|Number=Plur|PronType=Demdessendessen
Case=Dat|Number=Plur|PronType=Reldessen
Case=Gen|Definite=Def|Number=Sing|PronType=Artdesder
Case=Gen|Definite=Def|Number=Plur|PronType=Artderder
Case=Nom|Definite=Def|Number=Sing|PronType=Artder, dendes, demdiedas
Case=Nom|Definite=Def|Number=Plur|PronType=Artdie, der, Dasdie, dendie, das
Case=Nom|Number=Sing|PronType=Demdessen, derdiedas
Case=Nom|Number=Sing|PronType=Reldessen, derdiedas
Case=Nom|Number=Plur|PronType=Demdie
Case=Nom|Number=Plur|PronType=ReldessenDie

NOUN

24295 NOUN tokens (47% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (22704; 93%).

NOUN tokens may have the following values of Gender:

Paradigm WappenMascMasc,NeutFemNeut
Case=AccWappenWappen
Case=DatWappenWappen
Case=GenWappens
Case=NomWappenWappen

ADJ

6577 ADJ tokens (31% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (5954; 91%), Degree=Pos (5811; 88%).

ADJ tokens may have the following values of Gender:

Paradigm erstMascMasc,NeutFemNeut
Case=Acc|Degree=Pos|Number=Singersten, ersteserste, erstenerste, erstes
Case=Acc|Degree=Pos|Number=Plurerste
Case=Dat|Degree=Pos|Number=Singerstenersten
Case=Dat|Degree=Pos|Number=Plurersten
Case=Gen|Degree=Pos|Number=Singersten
Case=Nom|Degree=Pos|Number=Singersteersteserste, erstenerste, erstes
Case=Nom|Degree=Pos|Number=Plurersten, ersteersten
Case=Nom|Degree=Cmp,Pos|Number=Singerster

PRON

5571 PRON tokens (38% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (5571; 100%), Number=Sing (5391; 97%), PronType=Prs (4464; 80%), Person=3 (4234; 76%), Case=Nom (3631; 65%).

PRON tokens may have the following values of Gender:

Paradigm derMascMasc,NeutFemNeut
Case=Acc|Number=Sing|PronType=Demderen, dessendiedas
Case=Acc|Number=Plur|PronType=Demdie
Case=Dat|Number=Sing|PronType=Demdemder
Case=Dat|Number=Plur|PronType=Relderen
Case=Nom|Number=Sing|PronType=Demderen, derdiedas
Case=Nom|Number=Sing|PronType=Reldessen, deren
Case=Nom|Number=Plur|PronType=Relderen

PROPN

4750 PROPN tokens (15% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (4717; 99%).

PROPN tokens may have the following values of Gender:

Paradigm deutschMascMasc,NeutFemNeut
Case=AccDeutsche
Case=DatDeutschDeutschenDeutschen
Case=GenDeutschen
Case=NomDeutscheDeutscheDeutsche

Gender seems to be lexical feature of PROPN. 92% lemmas (2614) occur only with one value of Gender.

NUM

29 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (29; 100%).

NUM tokens may have the following values of Gender:

Paradigm 2Masc,NeutFem
Case=Dat2
Case=Nom2

Gender seems to be lexical feature of NUM. 95% lemmas (21) occur only with one value of Gender.

VERB

4 VERB tokens (0% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Number=Sing (4; 100%), Person=EMPTY (4; 100%), VerbForm=EMPTY (3; 75%).

VERB tokens may have the following values of Gender:

ADP

2 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.

ADP tokens may have the following values of Gender:

Paradigm alsMasc,NeutFem
alsals

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

ADV tokens may have the following values of Gender:

SCONJ

1 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Gender.

SCONJ tokens may have the following values of Gender:

X

1 X tokens (0% of all X tokens) have a non-empty value of Gender.

X tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (20406; 100%), NOUN –[amod]–> ADJ (6138; 100%), PROPN –[det]–> DET (2777; 70%), NOUN –[det]–> PRON (1065; 99%), PROPN –[amod]–> PROPN (668; 91%), NOUN –[det:poss]–> PRON (412; 100%), PROPN –[amod]–> ADJ (248; 62%), NOUN –[amod]–> PROPN (119; 88%), NOUN –[amod]–> PRON (25; 100%), NOUN –[det]–> ADJ (17; 100%).