This page pertains to UD version 2.

Treebank Statistics: UD_German-PUD: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut.

This is a layered feature with the following layers: Gender, Gender[psor].

9608 tokens (45%) have a non-empty value of Gender. 4430 types (68%) occur at least once with a non-empty value of Gender. 3775 lemmas (71%) occur at least once with a non-empty value of Gender. The feature is used with 6 part-of-speech tags: NOUN (4080; 19% of instances), DET (2642; 12% of instances), ADJ (1187; 6% of instances), PROPN (1116; 5% of instances), PRON (579; 3% of instances), NUM (4; 0% of instances).
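
Counts of this kind can be recomputed from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten-column CoNLL-U layout, and the function names and the four-token sample sentence are illustrative only. It checks for the plain Gender key, so the Gender[psor] layer is not counted.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        upos, feats = cols[3], parse_feats(cols[5])
        total[upos] += 1
        if "Gender" in feats:  # the layered key Gender[psor] does not match
            with_gender[upos] += 1
    return with_gender, total

# Hypothetical sample sentence, not taken from UD_German-PUD.
SAMPLE = "\n".join([
    "# text = Die neue Regierung arbeitet",
    "1\tDie\tder\tDET\tART\tCase=Nom|Definite=Def|Gender=Fem|Number=Sing|PronType=Art\t3\tdet\t_\t_",
    "2\tneue\tneu\tADJ\tADJA\tCase=Nom|Degree=Pos|Gender=Fem|Number=Sing\t3\tamod\t_\t_",
    "3\tRegierung\tRegierung\tNOUN\tNN\tCase=Nom|Gender=Fem|Number=Sing\t4\tnsubj\t_\t_",
    "4\tarbeitet\tarbeiten\tVERB\tVVFIN\tMood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_",
])

with_gender, total = gender_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing each per-tag count by the total number of tokens would give percentages comparable to the ones listed above, assuming that is how the page defines them.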

NOUN

4080 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (2896; 71%).

NOUN tokens may have the following values of Gender:

Paradigm ander: Masc; Fem; Neut
Case=Acc|Number=Plur: andere
Case=Dat|Number=Sing: anderem
Case=Nom|Number=Sing: andere
Case=Nom|Number=Plur: andere

Gender seems to be a lexical feature of NOUN. 99% of lemmas (2316) occur with only one value of Gender.
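
One way to test whether a feature behaves lexically is to collect the set of Gender values seen for each lemma and count the lemmas whose set has exactly one member. The sketch below assumes that reading of the statistic; the function name and the sample rows are hypothetical and do not come from the treebank.

```python
from collections import defaultdict

def single_gender_share(rows, upos="NOUN"):
    """rows: iterable of (lemma, upos, gender); gender is None when unset.

    Returns (lemmas with exactly one Gender value, lemmas with any Gender value).
    """
    values = defaultdict(set)
    for lemma, tag, gender in rows:
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical rows for illustration only.
rows = [
    ("Haus", "NOUN", "Neut"),
    ("Haus", "NOUN", "Neut"),
    ("See", "NOUN", "Masc"),
    ("See", "NOUN", "Fem"),
    ("Frau", "NOUN", "Fem"),
    ("gehen", "VERB", None),
]

single, total = single_gender_share(rows)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A share close to 100% suggests that the value is fixed per lemma rather than chosen by inflection.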

DET

2642 DET tokens (85% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2452; 93%), NumType=EMPTY (2199; 83%), PronType=Art (2132; 81%), Definite=Def (1700; 64%).
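
Co-occurrence counts like these can be sketched as a tally over the tokens that already carry Gender. The code below assumes that EMPTY stands for "this feature is not set on the token", which is an interpretation of the listing rather than something the page states; the function name and sample tokens are hypothetical.

```python
from collections import Counter

def cooccurring_values(feature_dicts, other_features):
    """Tally Feature=Value pairs over tokens that have Gender set.

    A feature missing from a token is recorded as Feature=EMPTY (assumed meaning).
    """
    counts = Counter()
    for feats in feature_dicts:
        for name in other_features:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts

# Hypothetical tokens for illustration only.
tokens = [
    {"Gender": "Masc", "Number": "Sing", "PronType": "Art"},
    {"Gender": "Fem", "Number": "Sing", "PronType": "Art"},
    {"Gender": "Neut", "Number": "Plur", "NumType": "Card"},
]

counts = cooccurring_values(tokens, ["Number", "NumType", "PronType"])
for pair, n in counts.most_common():
    print(pair, n, f"{100 * n // len(tokens)}%")
```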

DET tokens may have the following values of Gender:

Paradigm der: Masc; Fem; Neut
Case=Acc: den; die; das
Case=Dat: dem; der; dem
Case=Gen: des; der; des
Case=Nom: der; die; das

ADJ

1187 ADJ tokens (84% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (1112; 94%), Number=Sing (815; 69%).

ADJ tokens may have the following values of Gender:

Paradigm neu: Masc; Fem; Neut
Case=Acc|Degree=Pos|Number=Sing: neuen; neue; neues
Case=Acc|Degree=Pos|Number=Plur: neue, neuen; neue, neuen; neue
Case=Dat|Degree=Pos|Number=Sing: neuen
Case=Dat|Degree=Pos|Number=Plur: neuen
Case=Gen|Degree=Pos|Number=Sing: neuen; neuen
Case=Gen|Degree=Pos|Number=Plur: neuer; neuer
Case=Nom|Degree=Pos|Number=Sing: neue
Case=Nom|Degree=Pos|Number=Plur: neue; neue, neuen; neue
Case=Nom|Degree=Sup|Number=Sing: neuestes
Case=Nom|Degree=Sup|Number=Plur: neuesten

PROPN

1116 PROPN tokens (92% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (1084; 97%).

PROPN tokens may have the following values of Gender:

Paradigm Trump: Masc; Fem
Case=Acc: Trump
Case=Dat: Trump
Case=Nom: Trump; Trump

Gender seems to be a lexical feature of PROPN. 97% of lemmas (762) occur with only one value of Gender.

PRON

579 PRON tokens (59% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (579; 100%), Number=Sing (550; 95%), Case=Nom (459; 79%), Person=3 (403; 70%), PronType=Prs (403; 70%).

PRON tokens may have the following values of Gender:

Paradigm der: Masc; Fem; Neut
Case=Acc: den; die; das
Case=Dat: dem; der; dem
Case=Gen: dessen
Case=Nom: der; die; das

NUM

4 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (3; 75%).

NUM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (2207; 86%), NOUN –[amod]–> ADJ (1051; 100%), NOUN –[det:poss]–> DET (220; 100%), PROPN –[flat:name]–> PROPN (158; 100%), ADJ –[conj]–> ADJ (93; 100%), NOUN –[appos]–> PROPN (90; 67%), PROPN –[det]–> DET (87; 83%), NOUN –[compound]–> NOUN (82; 91%), NOUN –[compound]–> PROPN (73; 100%), PROPN –[conj]–> PROPN (41; 77%).
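
Agreement figures of this shape can be approximated by walking each dependency edge and comparing the Gender of the child with that of its head. The sketch below assumes that the percentage is taken over edges where both nodes have a Gender value, which the page does not spell out; the function names and the sample sentence are hypothetical.

```python
from collections import Counter

def gender_of(feats):
    """Return the plain Gender value from a FEATS string, or None."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):  # Gender[psor]=... is ignored
            return pair.split("=", 1)[1]
    return None

def agreement_counts(sentence_lines):
    """Count edges per (parent UPOS, deprel, child UPOS) for one sentence.

    Returns (edges where Gender agrees, edges where both nodes have Gender).
    """
    tokens = {}
    for line in sentence_lines:
        cols = line.split("\t")
        if cols[0].isdigit():  # skips comments, ranges and empty nodes
            tokens[cols[0]] = (cols[3], gender_of(cols[5]), cols[6], cols[7])
    agree, both = Counter(), Counter()
    for upos, gender, head, deprel in tokens.values():
        parent = tokens.get(head)  # None for the root (HEAD = 0)
        if parent is None or gender is None or parent[1] is None:
            continue
        key = (parent[0], deprel, upos)
        both[key] += 1
        if gender == parent[1]:
            agree[key] += 1
    return agree, both

# Hypothetical sample sentence, not taken from UD_German-PUD.
SENTENCE = [
    "# text = Die neue Regierung arbeitet",
    "1\tDie\tder\tDET\tART\tCase=Nom|Gender=Fem|Number=Sing|PronType=Art\t3\tdet\t_\t_",
    "2\tneue\tneu\tADJ\tADJA\tCase=Nom|Degree=Pos|Gender=Fem|Number=Sing\t3\tamod\t_\t_",
    "3\tRegierung\tRegierung\tNOUN\tNN\tCase=Nom|Gender=Fem|Number=Sing\t4\tnsubj\t_\t_",
    "4\tarbeitet\tarbeiten\tVERB\tVVFIN\tMood=Ind|Number=Sing|Person=3\t0\troot\t_\t_",
]

agree, both = agreement_counts(SENTENCE)
for (parent, deprel, child), n in agree.items():
    print(f"{parent} -[{deprel}]-> {child}: {n} of {both[(parent, deprel, child)]}")
```

Summing the two counters over all sentences of a treebank would give totals and ratios in the same format as the list above.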