Treebank Statistics: UD_Turkish_German-SAGT: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
5211 tokens (14%) have a non-empty value of Gender.
1523 types (23%) occur at least once with a non-empty value of Gender.
1262 lemmas (34%) occur at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: NOUN (2106; 6% instances), PRON (1371; 4% instances), DET (1116; 3% instances), PROPN (346; 1% instances), ADJ (272; 1% instances).
NOUN
2106 NOUN tokens (40% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number[psor]=EMPTY (2106; 100%), Person[psor]=EMPTY (2106; 100%), Number=Sing (1660; 79%).
NOUN tokens may have the following values of Gender:
Fem(870; 41% of non-emptyGender): Zeit, Ahnung, Sachen, Prüfungen, Uni, Pause, Sprache, Wochen, Linguistik, StadtMasc(589; 28% of non-emptyGender): Tag, Kunden, Spaß, Fall, Typ, Abend, Filme, Freunde, Garten, BachelorNeut(647; 31% of non-emptyGender): Beispiel, Buch, Dings, Leute, Semester, Hause, Praktikum, Ding, Jahr, EndeEMPTY(3177): şey, şimdi, zaman, sene, şeyi, şeyler, hafta, saat, kitap, tane
| Paradigm Türkisch | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | Türkisch | Türkisch | |
| Case=Dat | Türkisch | ||
| Case=Nom | Türkisch | Türkisch |
Gender seems to be lexical feature of NOUN. 98% lemmas (936) occur only with one value of Gender.
PRON
1371 PRON tokens (33% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1227; 89%), Case=Nom (990; 72%), Person=EMPTY (700; 51%).
PRON tokens may have the following values of Gender:
Fem(163; 12% of non-emptyGender): die, sie, meine, alle, meiner, deine, diese, eine, der, irgendwelcheMasc(176; 13% of non-emptyGender): er, der, die, jeder, den, ihn, meinem, dein, dem, ihmNeut(1032; 75% of non-emptyGender): das, es, ich, alles, was, die, allem, du, mein, irgendetwasEMPTY(2783): ich, du, ben, o, wir, ondan, man, mir, was, orada
| Paradigm der | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing|Person=3 | das | ||
| Case=Acc|Number=Sing|PronType=Dem | den | die | das, den |
| Case=Acc|Number=Sing|PronType=Rel | den | die | das |
| Case=Acc|Number=Plur|PronType=Dem | die | die | |
| Case=Acc|Number=Plur|PronType=Rel | die | die | |
| Case=Dat|Number=Sing|PronType=Dem | dem | der | dem |
| Case=Dat|Number=Plur|PronType=Dem | denen | ||
| Case=Gen|Number=Sing|PronType=Dem | dessen | ||
| Case=Gen|Number=Plur|PronType=Dem | deren | ||
| Case=Nom|Number=Sing | Das | ||
| Case=Nom|Number=Sing|PronType=Dem | der, die | die | das, die |
| Case=Nom|Number=Sing|PronType=Rel | der | die, der | das, der, die |
| Case=Nom|Number=Plur | Das | ||
| Case=Nom|Number=Plur|PronType=Dem | die | die | die, das |
| Case=Nom|Number=Plur|PronType=Rel | die | die | die |
DET
1116 DET tokens (64% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (967; 87%), Number=Sing (944; 85%), Definite=Def (721; 65%).
DET tokens may have the following values of Gender:
Fem(426; 38% of non-emptyGender): die, der, eine, keine, den, viele, einer, alle, manche, mehrMasc(312; 28% of non-emptyGender): dem, ein, einen, der, den, die, einem, jeden, viele, keinNeut(378; 34% of non-emptyGender): dem, das, ein, die, dieses, des, alles, viele, kein, denEMPTY(622): bir, o, her, bu, böyle, çok, öyle, hangi, şu, birkaç
| Paradigm der | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | den | die, den | das |
| Case=Acc|Number=Sing|Typo=Yes | die | des | |
| Case=Acc|Number=Plur | die, den | die, den | die |
| Case=Dat|Number=Sing | dem, der | der | dem |
| Case=Dat|Number=Sing|Typo=Yes | der | ||
| Case=Dat|Number=Plur | den | den, der | den |
| Case=Gen|Number=Sing | der | der | des |
| Case=Gen|Number=Sing|Typo=Yes | des | ||
| Case=Gen|Number=Plur | der | der | |
| Case=Nom|Number=Sing | der | die | das, der, des |
| Case=Nom|Number=Sing|Typo=Yes | der | ||
| Case=Nom|Number=Plur | die | die | die |
PROPN
346 PROPN tokens (39% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (329; 95%).
PROPN tokens may have the following values of Gender:
Fem(73; 21% of non-emptyGender): Türkei, Konstanz, BK, Henna, Informatik, Alka, BWL, Insel, Kanarische, MSVMasc(99; 29% of non-emptyGender): Rap, Schatz, Bosch, Erasmus, Freitag, Jonny, August, Euro, Februar, HutchinsonNeut(174; 50% of non-emptyGender): Englisch, Deutschland, Stuttgart, Bondorf, Deutsch, Dortmund, Düsseldorf, Wuppertal, Herrenberg, IstanbulEMPTY(537): Türkçe, İngilizce, Türkiye’de, Alman, Türk, Almanya’da, Netflix, Türkler, İstanbul’a, Allah
| Paradigm Deutschland | Fem | Neut |
|---|---|---|
| Case=Acc | Deutschland | |
| Case=Dat | Deutschland | Deutschland |
| Case=Nom | Deutschland |
Gender seems to be lexical feature of PROPN. 95% lemmas (190) occur only with one value of Gender.
ADJ
272 ADJ tokens (15% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (205; 75%).
ADJ tokens may have the following values of Gender:
Fem(127; 47% of non-emptyGender): anderen, ganze, türkisch, türkische, erste, große, kleine, andere, englische, ganzenMasc(76; 28% of non-emptyGender): großen, ganzen, einfach, letzten, türkische, anderen, eigenen, großer, guten, scheißNeut(69; 25% of non-emptyGender): nächstes, anderes, eigenes, ganzen, letztes, normales, schönes, vollkommen, Emotionales, allgemeinemEMPTY(1514): var, güzel, einfach, gut, iyi, lazım, yok, bayağı, başka, zor
| Paradigm gut | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | guten | ||
| Case=Nom|Number=Sing | guter, gutes | beste | |
| Case=Nom|Number=Plur | gute | gute |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (869; 86%),
NOUN –[amod]–> ADJ (200; 76%),
NOUN –[nmod]–> PRON (93; 52%),
PROPN –[det]–> DET (37; 73%),
NOUN –[reparandum]–> NOUN (23; 51%),
PROPN –[amod]–> ADJ (10; 71%),
PRON –[det]–> DET (8; 53%),
DET –[reparandum]–> DET (7; 88%),
PROPN –[appos]–> PROPN (7; 100%),
PROPN –[nmod]–> PRON (4; 67%).