home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-PUD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

10195 tokens (48%) have a non-empty value of Gender. 4441 types (68%) occur at least once with a non-empty value of Gender. 3793 lemmas (71%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (4133; 19% instances), DET (2784; 13% instances), ADJ (1176; 6% instances), PROPN (1118; 5% instances), PRON (983; 5% instances), NUM (1; 0% instances).

NOUN

4133 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (4133; 100%), Number=Sing (2946; 71%).

NOUN tokens may have the following values of Gender:

Paradigm einMascFemNeut
Case=Acceineeines
Case=Dateinemeiner
Case=Nomeinereineeines

Gender seems to be lexical feature of NOUN. 99% lemmas (2323) occur only with one value of Gender.

DET

2784 DET tokens (98% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Person=3 (2780; 100%), Number=Sing (2262; 81%), Definite=Def (2195; 79%).

DET tokens may have the following values of Gender:

Paradigm derMascMasc,NeutFemNeut
Case=Acc|Number=Sing|Person=3dendiedas
Case=Acc|Number=Plurdie
Case=Acc|Number=Plur|Person=3diediedie
Case=Dat|Number=Sing|Person=3demderdem
Case=Dat|Number=Sing|PronType=Artdem
Case=Dat|Number=Plur|Person=3dendenden
Case=Gen|Number=Sing|Person=3des, dessenderdes
Case=Gen|Number=Plur|Person=3der, derenderder
Case=Nom|Number=Sing|Person=3derdiedas
Case=Nom|Number=Plur|Person=3diediedie

ADJ

1176 ADJ tokens (84% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Person=3 (1161; 99%), Degree=Pos (1112; 95%), Number=Sing (811; 69%).

ADJ tokens may have the following values of Gender:

Paradigm neuMascFemNeut
Case=Acc|Degree=Pos|Number=Singneuenneueneues
Case=Acc|Degree=Pos|Number=Plurneue, neuenneue, neuenneue
Case=Dat|Degree=Pos|Number=Singneuen
Case=Dat|Degree=Pos|Number=Plurneuen
Case=Gen|Degree=Pos|Number=Singneuenneuen
Case=Gen|Degree=Pos|Number=Plurneuerneuer
Case=Nom|Degree=Pos|Number=Singneue
Case=Nom|Degree=Pos|Number=Plurneueneue, neuenneue
Case=Nom|Degree=Sup|Number=Singneuestes
Case=Nom|Degree=Sup|Number=Plurneuesten

PROPN

1118 PROPN tokens (92% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (1118; 100%), Number=Sing (1086; 97%).

PROPN tokens may have the following values of Gender:

Paradigm TrumpMascFem
Case=AccTrump
Case=DatTrump
Case=NomTrumpTrump

Gender seems to be lexical feature of PROPN. 97% lemmas (764) occur only with one value of Gender.

PRON

983 PRON tokens (83% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (981; 100%), Gender[psor]=EMPTY (808; 82%), Number=Sing (789; 80%), Number[psor]=EMPTY (772; 79%), Person[psor]=EMPTY (760; 77%), PronType=EMPTY (639; 65%), Case=Nom (567; 58%).

PRON tokens may have the following values of Gender:

Paradigm derMascFemNeut
Case=Acc|Number=Singdendiedas
Case=Acc|Number=Plurdiediedie
Case=Dat|Number=Singdemderdem
Case=Dat|Number=Plurdenendenendenen
Case=Gen|Number=Singdessen
Case=Gen|Number=Plurderen
Case=Nom|Number=Singderdiedas
Case=Nom|Number=Plurdiediedie

NUM

1 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (1; 100%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2581; 100%), NOUN –[amod]–> ADJ (1040; 100%), NOUN –[nmod:poss]–> PRON (222; 99%), PROPN –[flat:name]–> PROPN (158; 100%), PROPN –[det]–> DET (105; 100%), ADJ –[conj]–> ADJ (93; 100%), NOUN –[appos]–> PROPN (90; 67%), NOUN –[compound]–> NOUN (84; 91%), NOUN –[compound]–> PROPN (73; 100%), PROPN –[conj]–> PROPN (41; 77%).