
This page pertains to UD version 2.

Treebank Statistics: UD_German-HDT: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combination has been observed: Masc|Neut.

This is a layered feature with the following layers: Gender, Gender[psor].
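As a rough illustration of what layers and combined values look like in annotated data, the sketch below splits one feature from a CoNLL-U FEATS column into its base name, optional layer, and list of values. It assumes the usual CoNLL-U conventions (layer in square brackets, multiple values joined by commas, so the combination listed above would be written `Gender=Masc,Neut`). The function name `split_feature` and the example strings are hypothetical and are not taken from the treebank.

```python
import re

# Feature name with an optional layer in square brackets, e.g. "Gender[psor]".
_NAME_RE = re.compile(r"([A-Za-z0-9]+)(?:\[([a-z0-9]+)\])?")


def split_feature(pair):
    """Split 'Name[layer]=V1,V2' into (name, layer_or_None, [values])."""
    name_part, sep, value_part = pair.partition("=")
    match = _NAME_RE.fullmatch(name_part)
    if not sep or match is None:
        raise ValueError(f"not a feature=value pair: {pair!r}")
    return match.group(1), match.group(2), value_part.split(",")


if __name__ == "__main__":
    # Illustrative inputs only; they are not attested examples from UD_German-HDT.
    for example in ("Gender=Masc,Neut", "Gender[psor]=Fem"):
        print(example, "->", split_feature(example))
```

Keeping the layer separate from the base name makes it easy to count `Gender` and `Gender[psor]` independently, which is how the two layers are listed on this page.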

1392292 tokens (40%) have a non-empty value of Gender. 125497 types (67%) occur at least once with a non-empty value of Gender. 99515 lemmas (69%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (687983; 20% instances), DET (456716; 13% instances), ADJ (175354; 5% instances), PRON (44113; 1% instances), PROPN (27734; 1% instances), ADV (188; 0% instances), X (178; 0% instances), NUM (26; 0% instances).
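Counts of this kind can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten-column CoNLL-U layout, skips multiword-token ranges and empty nodes, and counts only the plain `Gender` layer. The names `parse_feats`, `gender_counts_by_upos`, and the four-token `SAMPLE` sentence are made up for illustration.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Masc' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Keep only regular tokens: IDs like "1-2" or "1.1" are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Hypothetical sample sentence, invented for this sketch.
SAMPLE = "\n".join([
    "# text = Der neue Rechner läuft",
    "1\tDer\tder\tDET\tART\tCase=Nom|Gender=Masc|Number=Sing|PronType=Art\t3\tdet\t_\t_",
    "2\tneue\tneu\tADJ\tADJA\tCase=Nom|Degree=Pos|Gender=Masc|Number=Sing\t3\tamod\t_\t_",
    "3\tRechner\tRechner\tNOUN\tNN\tCase=Nom|Gender=Masc|Number=Sing\t4\tnsubj\t_\t_",
    "4\tläuft\tlaufen\tVERB\tVVFIN\tNumber=Sing|Person=3|VerbForm=Fin\t0\troot\t_\t_",
])

if __name__ == "__main__":
    with_gender, total = gender_counts_by_upos(SAMPLE)
    for upos in sorted(total):
        print(upos, with_gender[upos], "of", total[upos])
```

Dividing each per-tag count by the per-tag total would give a share comparable to the per-tag percentages reported in the sections below, under the assumptions stated above.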

NOUN

687983 NOUN tokens (94% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (453591; 66%).

NOUN tokens may have the following values of Gender:

Paradigm unknown, by Gender value (Masc, Fem, Neut):
Case=Acc|Number=Sing: 256bittigen (Masc); Milliardstel (Fem); 48bittige, COmputergestütztes (Neut)
Case=Dat|Number=Sing: Wirtschaftswissenschaftlichen
Case=Dat|Number=Plur: Milliaren
Case=Gen|Number=Sing: Internationbalen
Case=Gen|Number=Plur: Rekonfigurierbaren; 128bittigen, Zellularen
Number=Sing: Miliarden, Milliardenn; Amyotrophe
Number=Plur: Kostenpflichtige; Regenerative

Gender seems to be a lexical feature of NOUN. 100% of lemmas (86791) occur with only one value of Gender.
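The "lexical feature" observation rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows; it assumes the input is a sequence of (lemma, Gender value) pairs already extracted for one part-of-speech tag, and it treats a combined value such as `Masc,Neut` as one distinct value. Both choices are assumptions, since the page does not state how its own figure is computed. The name `single_value_share` and the sample pairs are hypothetical.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Return (share of lemmas with exactly one Gender value, lemma count)."""
    values_by_lemma = defaultdict(set)
    for lemma, gender in pairs:
        values_by_lemma[lemma].add(gender)
    if not values_by_lemma:
        return 0.0, 0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single / len(values_by_lemma), len(values_by_lemma)


if __name__ == "__main__":
    # Invented observations: "Teil" is seen with two different values.
    observed = [
        ("Rechner", "Masc"), ("Rechner", "Masc"),
        ("Firma", "Fem"),
        ("Teil", "Masc"), ("Teil", "Neut"),
    ]
    share, lemma_count = single_value_share(observed)
    print(f"{share:.0%} of {lemma_count} lemmas have a single Gender value")
```

A share close to 100% suggests the value is fixed per lemma (as reported for NOUN), while a low share suggests the value is inflectional, as with determiners and adjectives.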

DET

456716 DET tokens (92% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (417891; 91%), Number=Sing (394985; 86%), NumType=EMPTY (387796; 85%), Definite=Def (348969; 76%).

DET tokens may have the following values of Gender:

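Paradigm der, by Gender value (Masc; Masc,Neut; Fem; Neut):
Case=Acc|Number=Sing: den, der (Masc); die (Fem); das, 's (Neut)
Case=Acc|Number=Plur: die, den (Masc); die (Fem); die (Neut)
Case=Dat|Number=Sing: dem, des, den (Masc); dem (Masc,Neut); der, die (Fem); dem, das, des (Neut)
Case=Dat|Number=Plur: den, die, der (Masc); den, der (Fem); den, der, die (Neut)
Case=Gen|Number=Sing: des, der (Masc); der (Fem); des (Neut)
Case=Gen|Number=Plur: der (Masc); der (Fem); der (Neut)
Case=Nom|Number=Sing: der (Masc); die, der (Fem); das (Neut)
Case=Nom|Number=Plur: die, der (Masc); die (Fem); die (Neut)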

ADJ

175354 ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Variant=EMPTY (175352; 100%), Degree=Pos (147844; 84%), Number=Sing (115545; 66%).

ADJ tokens may have the following values of Gender:

Paradigm neu, by Gender value (Masc, Fem, Neut):
Case=Acc|Degree=Pos|Number=Sing: neuen (Masc); neue (Fem); neues, neue (Neut)
Case=Acc|Degree=Pos|Number=Plur: neue, neuen (Masc); neue, neuen (Fem); neue, neuen (Neut)
Case=Acc|Degree=Cmp|Number=Sing: neueren (Masc); neuere (Fem); neueres (Neut)
Case=Acc|Degree=Cmp|Number=Plur: neuere, neueren
Case=Acc|Degree=Sup|Number=Sing: neuesten (Masc); neueste (Fem); neueste, neuestes (Neut)
Case=Acc|Degree=Sup|Number=Plur: neuesten (Masc); neuesten, neueste, neusten (Fem); neuesten, neueste, neusten (Neut)
Case=Dat|Degree=Pos|Number=Sing: neuen, neuem (Masc); neuen, neuer, neue (Fem); neuen, neuem (Neut)
Case=Dat|Degree=Pos|Number=Plur: neuen (Masc); neuen, neue (Fem); neuen, neue (Neut)
Case=Dat|Degree=Cmp|Number=Sing: neueren; neueren, neuerer
Case=Dat|Degree=Cmp|Number=Plur: neueren (Masc); neueren (Fem); neueren (Neut)
Case=Dat|Degree=Sup|Number=Sing: neuesten, neuestem, neusten (Masc); neuesten, neuester, neusten (Fem); neuesten, neuestem, neusten (Neut)
Case=Dat|Degree=Sup|Number=Plur: neuesten (Masc); neuesten (Fem); neuesten (Neut)
Case=Gen|Degree=Pos|Number=Sing: neuen (Masc); neuen, neue (Fem); neuen, neues (Neut)
Case=Gen|Degree=Pos|Number=Plur: neuer, neuen (Masc); neuer, neuen (Fem); neuer, neuen, neue (Neut)
Case=Gen|Degree=Cmp|Number=Sing: neueren (Masc); neueren (Fem); neueren (Neut)
Case=Gen|Degree=Cmp|Number=Plur: neueren
Case=Gen|Degree=Sup|Number=Sing: neuesten (Masc); neuesten (Fem); neuesten (Neut)
Case=Gen|Degree=Sup|Number=Plur: neuesten, neuester; neuesten
Case=Nom|Degree=Pos|Number=Sing: neue, neuer (Masc); neue (Fem); neue, neues (Neut)
Case=Nom|Degree=Pos|Number=Plur: neuen, neue (Masc); neuen, neue (Fem); neuen, neue (Neut)
Case=Nom|Degree=Cmp|Number=Sing: neuere, neuerer; neuere
Case=Nom|Degree=Cmp|Number=Plur: neueren (Masc); Neuere (Fem); Neuere, neueren (Neut)
Case=Nom|Degree=Sup|Number=Sing: neueste, neuester, neuste (Masc); neueste (Fem); neueste (Neut)
Case=Nom|Degree=Sup|Number=Plur: neuesten (Masc); neuesten (Fem); neuesten (Neut)
Degree=Pos|Number=Sing: neuen (Masc); neue, neuer, neuen (Fem); neues, neue, neuen (Neut)
Degree=Pos|Number=Plur: neue, neuen (Masc); neue, neuen, Internet/Neue (Fem); neue, neuen (Neut)
Degree=Cmp|Number=Sing: neuere, neueres
Degree=Cmp|Number=Plur: neuere; neuere
Degree=Sup|Number=Sing: neueste, neuester; neuestes, neueste, neuesten
Degree=Sup|Number=Plur: neueste, neuesten; neueste, neuesten

PRON

44113 PRON tokens (47% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (44113; 100%), Reflex=EMPTY (44113; 100%), Case=Nom (34207; 78%), Person=3 (22812; 52%), PronType=Prs (22810; 52%).

PRON tokens may have the following values of Gender:

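Paradigm der, by Gender value (Masc, Fem, Neut):
Abbr=Yes|Case=Nom: d.
Case=Acc: den (Masc); die (Fem); das (Neut)
Case=Dat: dem (Masc); der (Fem); dem (Neut)
Case=Gen: dessen (Masc); derer, Deren (Fem); dessen (Neut)
Case=Nom: der (Masc); die (Fem); das (Neut)
Case=Nom|Typo=Yes: da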

Gender seems to be a lexical feature of PRON. 93% of lemmas (13) occur with only one value of Gender.

PROPN

27734 PROPN tokens (14% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (27723; 100%), Case=EMPTY (25062; 90%).

PROPN tokens may have the following values of Gender:

Paradigm Nylis, by Gender value (Masc, Fem):
Nylis (Masc); Nylis (Fem)

Gender seems to be a lexical feature of PROPN. 100% of lemmas (1583) occur with only one value of Gender.

ADV

188 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: PronType=Ind (187; 99%).

ADV tokens may have the following values of Gender:

Paradigm meist, by Gender value (Masc, Fem, Neut):
Case=Acc: meisten
meiste; meiste

X

178 X tokens (0% of all X tokens) have a non-empty value of Gender.

The most frequent other feature values with which X and Gender co-occurred: Foreign=Yes (178; 100%).

X tokens may have the following values of Gender:

NUM

26 NUM tokens (0% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (26; 100%), Number=Sing (26; 100%).

NUM tokens may have the following values of Gender:

Paradigm ein, by Gender value (Masc, Fem, Neut):
Case=Acc: einen (Masc); eine (Fem); ein (Neut)
Case=Dat: einem (Masc); einer (Fem); einem (Neut)
Case=Nom: eine; ein

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (406003; 92%), NOUN –[amod]–> ADJ (166389; 97%), ADJ –[conj]–> ADJ (2043; 97%), DET –[nmod]–> NOUN (1262; 65%), NOUN –[expl]–> PRON (250; 61%), NOUN –[nmod]–> ADJ (204; 51%), NOUN –[appos]–> ADJ (67; 63%), DET –[conj]–> NOUN (50; 52%), DET –[conj]–> DET (48; 59%), DET –[nsubj]–> PRON (44; 54%).
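Agreement figures like these can be approximated by comparing the Gender value of each token with that of its head. The sketch below is one possible reading, not the page's actual method: it assumes agreement means identical `Gender` strings, considers only parent and child pairs where both carry a Gender value, and ignores the `Gender[psor]` layer. The names `gender_of`, `agreement_counts`, and the sample rows are hypothetical.

```python
from collections import Counter


def gender_of(feats):
    """Return the plain Gender value from a FEATS string, or None if absent."""
    if feats == "_":
        return None
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":  # "Gender[psor]" has a different name and is skipped
            return value
    return None


def agreement_counts(rows):
    """rows: ten-column CoNLL-U token rows (lists of str) of one sentence.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    pairs that agree in Gender, and pairs where both sides have Gender.
    """
    by_id = {row[0]: row for row in rows}
    agree, comparable = Counter(), Counter()
    for child in rows:
        parent = by_id.get(child[6])  # HEAD column; "0" (root) has no row
        if parent is None:
            continue
        g_child, g_parent = gender_of(child[5]), gender_of(parent[5])
        if g_child is None or g_parent is None:
            continue
        key = (parent[3], child[7], child[3])
        comparable[key] += 1
        if g_child == g_parent:  # assumption: exact string match counts as agreement
            agree[key] += 1
    return agree, comparable


# Hypothetical sentence "Der neue Rechner läuft", invented for this sketch.
SAMPLE_ROWS = [line.split("\t") for line in [
    "1\tDer\tder\tDET\tART\tCase=Nom|Gender=Masc|Number=Sing\t3\tdet\t_\t_",
    "2\tneue\tneu\tADJ\tADJA\tCase=Nom|Gender=Masc|Number=Sing\t3\tamod\t_\t_",
    "3\tRechner\tRechner\tNOUN\tNN\tCase=Nom|Gender=Masc|Number=Sing\t4\tnsubj\t_\t_",
    "4\tläuft\tlaufen\tVERB\tVVFIN\tNumber=Sing|Person=3|VerbForm=Fin\t0\troot\t_\t_",
]]

if __name__ == "__main__":
    agree, comparable = agreement_counts(SAMPLE_ROWS)
    for key in comparable:
        print(key, agree[key], "of", comparable[key])
```

Whether the percentages on this page use the same denominator is not stated, so results from this sketch may not match them exactly.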