Treebank Statistics: UD_German-GSD: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
This is a layered feature with the following layers: Gender, Gender[psor].
133600 tokens (46%) have a non-empty value of Gender.
40697 types (80%) occur at least once with a non-empty value of Gender.
34924 lemmas (83%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: NOUN (50961; 17% instances), DET (35813; 12% instances), PROPN (26203; 9% instances), ADJ (14124; 5% instances), PRON (6249; 2% instances), NUM (102; 0% instances), X (79; 0% instances), ADV (59; 0% instances), SYM (10; 0% instances).
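Counts like the ones above can be approximated from a CoNLL-U file along the following lines. This is a minimal sketch, assuming the standard ten-column CoNLL-U layout; `SAMPLE`, `parse_feats`, and `gender_stats` are illustrative names, and the sample sentence is made up rather than taken from UD_German-GSD. Because the lookup uses the exact key `Gender`, the layered feature `Gender[psor]` is not counted.

```python
from collections import Counter

# Tiny hand-made CoNLL-U sample (illustrative, not taken from the treebank).
SAMPLE = """\
1\tDie\tder\tDET\tART\tCase=Nom|Gender=Fem|Number=Sing\t2\tdet\t_\t_
2\tStadt\tStadt\tNOUN\tNN\tCase=Nom|Gender=Fem|Number=Sing\t3\tnsubj\t_\t_
3\twächst\twachsen\tVERB\tVVFIN\tNumber=Sing|Person=3\t0\troot\t_\t_
4\tschnell\tschnell\tADV\tADJD\t_\t3\tadvmod\t_\t_
"""


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_stats(conllu_text):
    """Count tokens with a non-empty Gender, overall and per UPOS tag."""
    total = 0
    with_gender = 0
    per_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (1-2) and empty nodes (1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        total += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender += 1
            per_upos[cols[3]] += 1
    return total, with_gender, per_upos


total, with_gender, per_upos = gender_stats(SAMPLE)
print(f"{with_gender} of {total} tokens have a non-empty value of Gender")
print(dict(per_upos))
```

To run this on a real treebank, the text of a `.conllu` file would be passed in place of `SAMPLE`.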
NOUN
50961 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (36862; 72%).
NOUN tokens may have the following values of Gender:
Fem (21204; 42% of non-empty Gender): Zeit, Stadt, Familie, Gemeinde, Saison, Frau, Gruppe, Region, Geschichte, Kirche
Masc (18421; 36% of non-empty Gender): Teil, Ort, Menschen, Platz, Sohn, km, Namen, Anfang, Titel, Meter
Neut (11336; 22% of non-empty Gender): jahr, Jahre, Jahren, Prozent, Ende, %, Unternehmen, Kinder, Leben, Mitglied
EMPTY (1336): mm, Eltern, Jahrhundert, Leute, Kosten, °, m, mal, Deutschen, Beschäftigten
Paradigm Tag | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Number=Sing | Tag | | |
Case=Acc\|Number=Plur | Tage | | |
Case=Dat\|Number=Sing | Tag, Tage | | |
Case=Dat\|Number=Plur | Tagen | | |
Case=Gen\|Number=Sing | Tages, Tags | | |
Case=Gen\|Number=Plur | Tage | Tages | |
Case=Nom\|Number=Sing | Tag | Tage | |
Case=Nom\|Number=Plur | Tage | | |
Gender seems to be a lexical feature of NOUN. 94% of lemmas (16986) occur with only one value of Gender.
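The "lexical feature" check can be sketched as follows: group the observed Gender values by lemma and count the lemmas that were seen with exactly one value. This is only an illustration; the function name `single_gender_share` and the input pairs are hypothetical, and it assumes the percentage is taken over lemmas that have at least one non-empty Gender, which the page does not state explicitly.

```python
from collections import defaultdict


def single_gender_share(observations):
    """Return (count, fraction) of lemmas seen with exactly one Gender value.

    `observations` is an iterable of (lemma, gender) pairs for tokens of one
    part of speech whose Gender is non-empty.
    """
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, (single / len(values) if values else 0.0)


# Illustrative pairs, not real treebank counts.
pairs = [
    ("Tag", "Masc"), ("Tag", "Masc"), ("Tag", "Fem"),
    ("Stadt", "Fem"), ("Jahr", "Neut"), ("Zeit", "Fem"),
]
count, share = single_gender_share(pairs)
print(f"{share:.0%} of lemmas ({count}) occur with only one value of Gender")
```

A lemma such as Tag, which appears under more than one Gender column in its paradigm table, is the kind of case that lowers this share.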
DET
35813 DET tokens (87% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (33995; 95%), NumType=EMPTY (30232; 84%), PronType=Art (30045; 84%), Definite=Def (24595; 69%).
DET tokens may have the following values of Gender:
Fem (15479; 43% of non-empty Gender): der, die, eine, einer, seine, diese, seiner, dieser, ihre, keine
Masc (12195; 34% of non-empty Gender): dem, der, den, des, ein, einen, einem, eines, seinen, diesem
Neut (8139; 23% of non-empty Gender): dem, das, ein, des, einem, dies, sein, eines, dieses, allem
EMPTY (5393): die, den, der, the, diese, alle, mehr, viel, viele, beiden
Paradigm der | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | den | die | das, 's |
Case=Dat | dem, der, des | der, die | dem, das, des |
Case=Gen | des | der | des, der |
Case=Nom | der | die | das |
PROPN
26203 PROPN tokens (86% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (25072; 96%).
PROPN tokens may have the following values of Gender:
Fem (5726; 22% of non-empty Gender): SPD, Mark, Universität, Schweiz, US, Maria, DDR, Deutschen, CDU, Straße
Masc (12777; 49% of non-empty Gender): Oktober, US, August, Mai, November, September, Juli, Peter, Weltkrieg, Johann
Neut (7700; 29% of non-empty Gender): Deutschland, Berlin, Frankreich, München, Wien, London, New, Paris, St., Italien
EMPTY (4216): of, de, la, a, University, II, Wiener, Berliner, 1, B
Paradigm Deutschland | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | Deutschland | | |
Case=Dat | Deutschland | Deutschland | |
Case=Gen | Deutschlands, Deutschland | | |
Case=Nom | Deutschland | Deutschland | |
Gender seems to be a lexical feature of PROPN. 91% of lemmas (13219) occur with only one value of Gender.
ADJ
14124 ADJ tokens (65% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (13050; 92%), Number=Sing (9925; 70%).
ADJ tokens may have the following values of Gender:
Fem (6407; 45% of non-empty Gender): erste, ersten, neue, weitere, große, gute, deutschen, verschiedenen, deutsche, großen
Masc (4726; 33% of non-empty Gender): ersten, zweiten, neuen, großen, erste, weiteren, weitere, heutigen, amerikanischen, neue
Neut (2991; 21% of non-empty Gender): ersten, erste, letzten, weitere, neuen, gleichen, neues, gutes, neue, folgenden
EMPTY (7615): später, gut, bekannt, kurz, freundlich, schnell, lang, neu, direkt, super
Paradigm erst | Masc | Fem | Neut |
---|---|---|---|
Case=Acc\|Number=Sing | ersten | erste | erste, erstes |
Case=Acc\|Number=Plur | ersten, erste | erste, ersten | erste, ersten |
Case=Dat\|Number=Sing | ersten | ersten, erster | ersten |
Case=Dat\|Number=Plur | ersten | ersten | ersten |
Case=Gen\|Number=Sing | ersten | ersten | ersten |
Case=Gen\|Number=Plur | ersten | ersten | ersten |
Case=Nom\|Number=Sing | erste, erster | erste | erste, erstes |
Case=Nom\|Number=Plur | ersten, erste | ersten, erste | ersten, Erste |
PRON
6249 PRON tokens (58% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (6249; 100%), Number=Sing (6221; 100%), Case=Nom (4875; 78%), PronType=Prs (4293; 69%), Person=3 (4269; 68%).
PRON tokens may have the following values of Gender:
Fem (1278; 20% of non-empty Gender): sie, die, der, ihr, deren, ich, mich, wir, She, derer
Masc (3085; 49% of non-empty Gender): er, der, ihm, ihn, dem, dessen, den, ich, wer, sie
Neut (1886; 30% of non-empty Gender): es, das, was, dem, nichts, etwas, it, dessen, ‘s, nix
EMPTY (4596): sich, ich, die, sie, man, wir, uns, mir, mich, denen
Paradigm der | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | den, der | die | das |
Case=Dat | dem, der | der | dem, Das |
Case=Gen | dessen | deren, der, derer | dessen |
Case=Nom | der, die | die | das, die |
NUM
102 NUM tokens (1% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (102; 100%).
NUM tokens may have the following values of Gender:
Fem (39; 38% of non-empty Gender): Millionen, zweier, 15, Million, 30, 35, 6, 1.681.469, 132,5-165, 1834-1911
Masc (34; 33% of non-empty Gender): 50, 10, 28, 7, -10, -2288,9, -60, 0:2, 0:3, 1
Neut (29; 28% of non-empty Gender): 10, 3, 1:1, ², +7,6, 100, 1000, 17, 1846-1925, 1882-1953
EMPTY (7235): zwei, drei, vier, 2007, 2006, fünf, 2009, 2010, sechs, 2008
Paradigm 2 | Masc | Fem | Neut |
---|---|---|---|
Case=Acc | 2 | | |
Case=Dat | 2 | | |
Case=Nom | 2 | | |
X
79 X tokens (25% of all X tokens) have a non-empty value of Gender.
The most frequent other feature values with which X and Gender co-occurred: Foreign=EMPTY (79; 100%), Number=Sing (60; 76%).
X tokens may have the following values of Gender:
Fem (25; 32% of non-empty Gender): Chr, B., E, S., €, #, B, C, La, MEZ
Masc (20; 25% of non-empty Gender): :-), B., :), ???a?, ??µ?????, A, Fr, Hauswurde, Hl, Min
Neut (34; 43% of non-empty Gender): %, B., Abs, 4Jahren, ???????, Aufl, Az., C., Chr, Gr
EMPTY (234): ’s, u.a., etc., z.B., z., a, †, u, z, *
Paradigm B. | Masc | Fem | Neut |
---|---|---|---|
Case=Dat | B. | | |
Case=Nom | B. | B. | |
Gender seems to be a lexical feature of X. 92% of lemmas (46) occur with only one value of Gender.
ADV
59 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Fem (17; 29% of non-empty Gender): lange, super, Allzeit, Kehrt, Nahe, Wenige, Zügig, absolute, aka, ca
Masc (21; 36% of non-empty Gender): Abends, Anfangs, ECHT, EINFACH, Ex, Gottlob, Katzelmacher, Křižanov, NIE, NIEMALS
Neut (21; 36% of non-empty Gender): was, ca, anderem, Dort, How, Mal, PMMA, Rääts, SEHR, Weitere
EMPTY (13825): auch, nur, noch, sehr, so, dort, wieder, hier, mehr, heute
Paradigm ca | Fem | Neut |
---|---|---|
Case=Acc | ca | ca |
Case=Dat | ca | |
Gender seems to be a lexical feature of ADV. 92% of lemmas (44) occur with only one value of Gender.
SYM
10 SYM tokens (10% of all SYM tokens) have a non-empty value of Gender.
SYM tokens may have the following values of Gender:
Fem (1; 10% of non-empty Gender): °
Masc (4; 40% of non-empty Gender): :-), o, °, ·
Neut (5; 50% of non-empty Gender): %, ×
EMPTY (90): &, =, /, +, ×, *, €, “, -, :-)
Paradigm ° | Masc | Fem |
---|---|---|
° | ° |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (26061; 84%),
NOUN –[amod]–> ADJ (11918; 91%),
PROPN –[flat]–> PROPN (4766; 82%),
PROPN –[det]–> DET (4540; 82%),
NOUN –[det:poss]–> DET (2175; 95%),
NOUN –[appos]–> PROPN (1763; 55%),
PROPN –[conj]–> PROPN (1313; 63%),
PROPN –[amod]–> PROPN (1060; 75%),
NOUN –[compound]–> NOUN (667; 78%),
PROPN –[flat]–> NOUN (660; 84%).
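
An agreement tally of this kind could be computed roughly as shown below. This is a hedged sketch, not the script that produced the page: `SAMPLE`, `gender`, and `agreement_counts` are illustrative names, the two sentences are made up (the second contains a deliberate mismatch), and the percentage is assumed to be taken over parent-child pairs in which both nodes have a non-empty Gender.

```python
from collections import Counter

# Two hand-made CoNLL-U sentences; the second has a deliberate Gender mismatch.
SAMPLE = """\
1\tDie\tder\tDET\tART\tCase=Nom|Gender=Fem|Number=Sing\t3\tdet\t_\t_
2\talte\talt\tADJ\tADJA\tCase=Nom|Gender=Fem|Number=Sing\t3\tamod\t_\t_
3\tStadt\tStadt\tNOUN\tNN\tCase=Nom|Gender=Fem|Number=Sing\t4\tnsubj\t_\t_
4\twächst\twachsen\tVERB\tVVFIN\tNumber=Sing|Person=3\t0\troot\t_\t_

1\tDer\tder\tDET\tART\tCase=Dat|Gender=Masc|Number=Sing\t2\tdet\t_\t_
2\tStadt\tStadt\tNOUN\tNN\tCase=Dat|Gender=Fem|Number=Sing\t0\troot\t_\t_
"""


def gender(feats):
    """Return the value of the plain Gender feature, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_counts(conllu_text):
    """Count parent-child pairs that agree in Gender, keyed by
    (parent UPOS, relation, child UPOS). Also return how many pairs per key
    had a non-empty Gender on both nodes."""
    agree, both = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):
        tokens = {}
        for line in block.splitlines():
            cols = line.split("\t")
            # Keep only regular word lines (skip comments, ranges, empty nodes).
            if not cols[0].isdigit():
                continue
            tokens[cols[0]] = (cols[3], gender(cols[5]), cols[6], cols[7])
        for upos, value, head, deprel in tokens.values():
            parent = tokens.get(head)  # None for the root (head "0")
            if parent is None or value is None or parent[1] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if value == parent[1]:
                agree[key] += 1
    return agree, both


agree, both = agreement_counts(SAMPLE)
for key, n in both.items():
    parent_upos, rel, child_upos = key
    print(f"{parent_upos} -[{rel}]-> {child_upos} ({agree[key]}; {100 * agree[key] // n}%)")
```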