
This page pertains to UD version 2.

Treebank Statistics: UD_German-HDT: Features: Case

This feature is universal. It occurs with 4 different values: Acc, Dat, Gen, Nom.

1668242 tokens (48%) have a non-empty value of Case. 109398 types (58%) occur at least once with a non-empty value of Case. 87458 lemmas (60%) occur at least once with a non-empty value of Case. The feature is used with 9 part-of-speech tags; each count is followed by its share of all tokens in the treebank: NOUN (529106; 15%), DET (489419; 14%), ADP (345545; 10%), ADJ (147865; 4%), PRON (93577; 3%), PROPN (61547; 2%), ADV (1074; 0%), X (61; 0%), NUM (48; 0%).
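The per-tag counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `iter_tokens`, `case_counts_by_upos`) and the three-token `SAMPLE` sentence are illustrative assumptions, and the sketch assumes that only regular token lines (integer IDs) are counted.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def case_counts_by_upos(conllu_text):
    """Return (all tokens per UPOS, tokens with a non-empty Case per UPOS)."""
    total = Counter()
    with_case = Counter()
    for _form, _lemma, upos, feats in iter_tokens(conllu_text):
        total[upos] += 1
        if "Case" in feats:
            with_case[upos] += 1
    return total, with_case


# Hypothetical sample sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = Der Hacker liest",
    "1\tDer\tder\tDET\tART\tCase=Nom|Gender=Masc|Number=Sing\t2\tdet\t_\t_",
    "2\tHacker\tHacker\tNOUN\tNN\tCase=Nom|Gender=Masc|Number=Sing\t3\tnsubj\t_\t_",
    "3\tliest\tlesen\tVERB\tVVFIN\tNumber=Sing|Person=3\t0\troot\t_\t_",
])

total, with_case = case_counts_by_upos(SAMPLE)
for upos in sorted(total):
    share = 100 * with_case[upos] / total[upos]
    print(f"{upos}: {with_case[upos]} of {total[upos]} tokens have Case ({share:.0f}%)")
```

Dividing `with_case[upos]` by the total number of tokens instead of `total[upos]` gives the treebank-wide shares listed in the paragraph above.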

NOUN

529106 NOUN tokens (73% of all NOUN tokens) have a non-empty value of Case.

The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (379181; 72%).

NOUN tokens may have the following values of Case:

Paradigm unknownNomAccDatGen
_Hacker, Initialisieren, Massachusetts, Met@box-Infonet, Online-Store, Portierung, StatementG3-Minitower, IE, Java-VM, Nachfrager, Newsgroups, Pf./Min., Pf/Min., TLD, URL, ZersplitterungNavigator, Kriminellen, Dot.Com-Companies, 1133-MHz-Pentium, 3.10a, Backups, Beiteiligten, Beschenkten, Bonnern, Boygroups, Breathe, Browser-Release, CCDs, Celera, Cern, Chattern, Clients, Colorama, Comdirect, Connectivity, Crusoe, Cyber-Entrepeneuren, Dauer-Rennerin, Desktop-Coppermines, Deut, E-Commercelern, ECMA, ECommerce, EFF, Einzelnen, Entrechtung, Excite.de, Firmenehe, GFlops, GMD, Gebaren, Gehetzten, Gehörlosen, Guides, Heavyweights, Heise-Ticker, Herstellter, Hightech, ICANN, ISPs, Internet-Appliances, Internetties, Iridium, Itanium, Jüngsten, Krawattis, Kritischeres, LANL, Lingubot, Lyan, Lüfter, MRJ-Plugin, Matloff, NCs, NSA, Nahost, Nicht-Windows-Clients, Nneben, Object, PC-Videobegeisterten, Password-Sniffern, Paybox, Pentium, Philanthropie, Plugin, Rechteverletzer, Registries, Resellern, S/390-Mainframes, Salinger, Schnellebigkeit, Schwerhörigen, Startup-Companies, Symbian, Taiwanern, Text-Mining, Threads, Tumorart, Tux, USB, VRML, Verleiher, Vice-President, W3C, Wang, What's, iCast-Clients, Ältestenrat, Öffentlich-Rechtlichen
Gender=Masc|Number=Sing256bittigenInternationbalen
Gender=Masc|Number=PlurRekonfigurierbaren
Gender=Fem|Number=SingMilliardstel
Gender=Fem|Number=PlurMilliaren128bittigen, Zellularen
Gender=Neut|Number=Sing48bittige, COmputergestütztesWirtschaftswissenschaftlichen

DET

489419 DET tokens (99% of all DET tokens) have a non-empty value of Case.

The most frequent other feature values with which DET and Case co-occurred: PronType=Art (428875; 88%), NumType=EMPTY (419513; 86%), Number=Sing (393614; 80%), Definite=Def (359943; 74%).

DET tokens may have the following values of Case:

Paradigm derNomAccDatGen
Gender=Masc,Neut|Number=Singdem
Gender=Masc|Number=Singderden, derdem, des, dendes, der
Gender=Masc|Number=Plurdie, derdie, denden, die, derder
Gender=Fem|Number=Singdie, derdieder, dieder
Gender=Fem|Number=Plurdiedieden, derder
Gender=Neut|Number=Singdasdas, 'sdem, das, desdes
Gender=Neut|Number=Plurdiedieden, der, dieder
Number=Singder, Diedie, dender
Number=Plurdie, derdieden, der, dieder

ADP

345545 ADP tokens (90% of all ADP tokens) have a non-empty value of Case.

The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (344347; 100%).

ADP tokens may have the following values of Case:

Paradigm inAccDatGen
_in
AdpType=Prepininin

ADJ

147865 ADJ tokens (56% of all ADJ tokens) have a non-empty value of Case.

The most frequent other feature values with which ADJ and Case co-occurred: Variant=EMPTY (147861; 100%), Degree=Pos (124376; 84%), Number=Sing (103820; 70%).

ADJ tokens may have the following values of Case:

Paradigm neuNomAccDatGen
Degree=Pos|Gender=Masc|Number=Singneue, neuerneuenneuen, neuemneuen
Degree=Pos|Gender=Masc|Number=Plurneuen, neueneue, neuenneuenneuer, neuen
Degree=Pos|Gender=Fem|Number=Singneueneueneuen, neuer, neueneuen, neue
Degree=Pos|Gender=Fem|Number=Plurneuen, neueneue, neuenneuen, neueneuer, neuen
Degree=Pos|Gender=Neut|Number=Singneue, neuesneues, neueneuen, neuemneuen, neues
Degree=Pos|Gender=Neut|Number=Plurneuen, neueneue, neuenneuen, neueneuer, neuen, neue
Degree=Pos|Number=Singneuen, neuemneuen
Degree=Pos|Number=Plurneuen, neueneue, neuenneuenneuer, neuen
Degree=Cmp|Gender=Masc|Number=Singneuere, neuererneuerenneuerenneueren
Degree=Cmp|Gender=Masc|Number=Plurneuerenneueren
Degree=Cmp|Gender=Fem|Number=Singneuereneuereneueren, neuererneueren
Degree=Cmp|Gender=Fem|Number=PlurNeuereneuere, neuerenneuerenneueren
Degree=Cmp|Gender=Neut|Number=Singneueresneueren
Degree=Cmp|Gender=Neut|Number=PlurNeuere, neuerenneueren
Degree=Cmp|Number=Plurneueren
Degree=Sup|Gender=Masc|Number=Singneueste, neuester, neusteneuestenneuesten, neuestem, neustenneuesten
Degree=Sup|Gender=Masc|Number=Plurneuestenneuestenneuesten
Degree=Sup|Gender=Fem|Number=Singneuesteneuesteneuesten, neuester, neustenneuesten
Degree=Sup|Gender=Fem|Number=Plurneuestenneuesten, neueste, neustenneuestenneuesten, neuester
Degree=Sup|Gender=Neut|Number=Singneuesteneueste, neuestesneuesten, neuestem, neustenneuesten
Degree=Sup|Gender=Neut|Number=Plurneuestenneuesten, neueste, neustenneuestenneuesten
Degree=Sup|Number=Singneuestem, neuesten

PRON

93577 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.

The most frequent other feature values with which PRON and Case co-occurred: Reflex=EMPTY (72457; 77%), PronType=Prs (54176; 58%), Number=Sing (53222; 57%), Gender=EMPTY (50701; 54%), Person=3 (48743; 52%).

PRON tokens may have the following values of Case:

Paradigm derNomAccDatGen
Abbr=Yes|Gender=Neut|Number=Singd.
Gender=Masc|Number=Singderdendemdessen
Gender=Fem|Number=Singdiediederderer, Deren
Gender=Neut|Number=Singdasdasdemdessen
Gender=Neut|Number=Sing|Typo=Yesda
Number=Singdasdessen
Number=Plurdiediedenenderer, der
deren

PROPN

61547 PROPN tokens (32% of all PROPN tokens) have a non-empty value of Case.

The most frequent other feature values with which PROPN and Case co-occurred: Gender=EMPTY (58875; 96%), Number=Sing (58722; 95%).

PROPN tokens may have the following values of Case:

| Paradigm Telekom | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| | Telekom | Telekom | Telekom | Telekom |

ADV

1074 ADV tokens (1% of all ADV tokens) have a non-empty value of Case.

The most frequent other feature values with which ADV and Case co-occurred: PronType=Ind (1074; 100%).

ADV tokens may have the following values of Case:

Paradigm mehrNomAccDatGen
Number=Plurmehreremehreremehrerenmehrerer
mehr

X

61 X tokens (0% of all X tokens) have a non-empty value of Case.

The most frequent other feature values with which X and Case co-occurred: Foreign=Yes (61; 100%).

X tokens may have the following values of Case:

| Paradigm Digital | Nom | Dat |
|---|---|---|
| | Digital | Digital |

Case seems to be a lexical feature of X: 98% of lemmas (49) occur with only one value of Case.

NUM

48 NUM tokens (0% of all NUM tokens) have a non-empty value of Case.

The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (48; 100%), Number=Sing (26; 54%).

NUM tokens may have the following values of Case:

| Paradigm ein | Nom | Acc | Dat |
|---|---|---|---|
| Gender=Masc | | einen | einem |
| Gender=Fem | eine | eine | einer |
| Gender=Neut | ein | ein | einem |

Case seems to be a lexical feature of NUM: 93% of lemmas (14) occur with only one value of Case.
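The share of lemmas that occur with only one value of Case, as reported for X and NUM, can be approximated as sketched below. The page does not state the exact criterion its generator uses, so this sketch assumes that the denominator is the set of lemmas of the given tag that occur at least once with a non-empty Case. The function name and the `TOKENS` list are hypothetical.

```python
from collections import defaultdict


def single_case_lemma_share(tokens, upos):
    """Count lemmas of `upos` seen with exactly one Case value.

    tokens: iterable of (lemma, upos, case) triples, where case is None
    when the token has no Case feature.
    Returns (number of single-valued lemmas, their share among all
    lemmas of that tag that carry Case at least once).
    """
    values = defaultdict(set)
    for lemma, tag, case in tokens:
        if tag == upos and case is not None:
            values[lemma].add(case)
    if not values:
        return 0, 0.0
    single = sum(1 for cases in values.values() if len(cases) == 1)
    return single, single / len(values)


# Hypothetical data: "ein" varies in Case, "zwei" and "drei" do not.
TOKENS = [
    ("ein", "NUM", "Nom"),
    ("ein", "NUM", "Acc"),
    ("zwei", "NUM", "Nom"),
    ("drei", "NUM", "Dat"),
    ("drei", "NUM", "Dat"),
    ("Haus", "NOUN", "Nom"),
]

count, share = single_case_lemma_share(TOKENS, "NUM")
print(f"{share:.0%} of lemmas ({count}) occur with only one value of Case")
```

A high share suggests that Case is fixed per lemma for that tag rather than inflected, although with as few tokens as X (61) and NUM (48) have here, the figure mostly reflects how rarely each lemma occurs.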

Relations with Agreement in Case

The 10 most frequent relations where parent and child node agree in Case: NOUN –[det]–> DET (402114; 90%), NOUN –[case]–> ADP (264553; 94%), NOUN –[amod]–> ADJ (141937; 99%), NOUN –[conj]–> NOUN (20650; 78%), PRON –[case]–> ADP (5703; 96%), DET –[case]–> ADP (4567; 97%), ADJ –[conj]–> ADJ (1594; 98%), DET –[det]–> DET (357; 58%), NOUN –[nsubj]–> DET (286; 70%), NOUN –[nmod]–> ADJ (166; 61%).
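Agreement counts of this kind can be computed by comparing the Case of each node with the Case of its parent. This is a minimal sketch under stated assumptions: the token dictionaries, the function name `case_agreement`, and the sample phrase are hypothetical, and the percentage is taken over parent-child pairs in which both nodes have a Case value, which may differ from the denominator used to generate this page.

```python
from collections import Counter


def case_agreement(sentence):
    """Count parent-child pairs with Case, and how many of them agree.

    sentence: list of dicts with keys id, head, upos, deprel, case
    (case is None when the token has no Case feature).
    Keys of the returned counters are (parent UPOS, deprel, child UPOS).
    """
    by_id = {tok["id"]: tok for tok in sentence}
    both = Counter()   # pairs where parent and child both have Case
    agree = Counter()  # subset of those pairs with identical Case
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["case"] is None or parent["case"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent["case"] == child["case"]:
            agree[key] += 1
    return both, agree


# Hypothetical phrase "mit dem neuen Browser"; the ADJ is deliberately
# given a mismatching Case to show a non-agreeing pair.
SENTENCE = [
    {"id": 1, "head": 4, "upos": "ADP", "deprel": "case", "case": "Dat"},
    {"id": 2, "head": 4, "upos": "DET", "deprel": "det", "case": "Dat"},
    {"id": 3, "head": 4, "upos": "ADJ", "deprel": "amod", "case": "Acc"},
    {"id": 4, "head": 0, "upos": "NOUN", "deprel": "root", "case": "Dat"},
]

both, agree = case_agreement(SENTENCE)
for (parent, rel, child), n in both.items():
    print(f"{parent} -[{rel}]-> {child}: {agree[(parent, rel, child)]} of {n} agree")
```

Summing the two counters over all sentences of a treebank and sorting by the agreeing count would give a ranking comparable to the list above.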