Treebank Statistics: UD_German-HDT: Features: Case
This feature is universal.
It occurs with 4 different values: Acc, Dat, Gen, Nom.
1668242 tokens (48%) have a non-empty value of Case.
109398 types (58%) occur at least once with a non-empty value of Case.
87458 lemmas (60%) occur at least once with a non-empty value of Case.
The feature is used with 9 part-of-speech tags: NOUN (529106; 15% of all tokens), DET (489419; 14% of all tokens), ADP (345545; 10% of all tokens), ADJ (147865; 4% of all tokens), PRON (93577; 3% of all tokens), PROPN (61547; 2% of all tokens), ADV (1074; 0% of all tokens), X (61; 0% of all tokens), NUM (48; 0% of all tokens).
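The counts above could be reproduced from the treebank's CoNLL-U files along the following lines. This is a minimal sketch, not the script that generated this page: the function names and the four-word sample sentence are hypothetical, "types" are assumed to be case-sensitive word forms, and the per-tag percentage is assumed to be relative to all tokens in the treebank.

```python
from collections import Counter


def has_case(feats):
    """True if a CoNLL-U FEATS string such as 'Case=Nom|Number=Sing' sets Case."""
    return any(pair.startswith("Case=") for pair in feats.split("|"))


def case_statistics(conllu_text):
    """Count tokens, types, lemmas and UPOS tags with a non-empty Case value."""
    n_tokens = 0
    forms, lemmas = set(), set()
    case_forms, case_lemmas = set(), set()
    case_by_upos = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos, feats = cols[1], cols[2], cols[3], cols[5]
        n_tokens += 1
        forms.add(form)
        lemmas.add(lemma)
        if has_case(feats):
            case_by_upos[upos] += 1
            case_forms.add(form)
            case_lemmas.add(lemma)
    return {
        "tokens": (sum(case_by_upos.values()), n_tokens),
        "types": (len(case_forms), len(forms)),
        "lemmas": (len(case_lemmas), len(lemmas)),
        # Assumption: percentages are shares of all tokens, rounded to integers.
        "by_upos": [(upos, n, round(100 * n / n_tokens))
                    for upos, n in case_by_upos.most_common()],
    }


# Hypothetical toy sentence, not taken from the treebank.
ROWS = [
    ("1", "Der", "der", "DET", "ART", "Case=Nom|Gender=Masc|Number=Sing", "2", "det", "_", "_"),
    ("2", "Konzern", "Konzern", "NOUN", "NN", "Case=Nom|Gender=Masc|Number=Sing", "3", "nsubj", "_", "_"),
    ("3", "kauft", "kaufen", "VERB", "VVFIN", "Number=Sing|Person=3", "0", "root", "_", "_"),
    ("4", "Aktien", "Aktie", "NOUN", "NN", "Case=Acc|Gender=Fem|Number=Plur", "3", "obj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)
STATS = case_statistics(SAMPLE)
```

Each pair in the result is (count with non-empty Case, total count), so the percentages reported on this page correspond to the first element divided by the second.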
NOUN
529106 NOUN tokens (73% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (379181; 72%).
NOUN tokens may have the following values of Case:
Acc (141087; 27% of non-empty Case): Prozent, Internet, Markt, US-Dollar, Mark, Unternehmen, Euro, Jahr, Netz, Entwicklung
Dat (211096; 40% of non-empty Case): Internet, Jahr, Prozent, Angaben, US-Dollar, Jahren, Quartal, Mark, Euro, Zeit
Gen (64103; 12% of non-empty Case): Jahres, Unternehmens, Firma, Internet, Welt, Konzerns, Unternehmen, Kunden, Regulierungsbehörde, Branche
Nom (112820; 21% of non-empty Case): Unternehmen, Firma, Internet, Konzern, Hersteller, Umsatz, Zahl, Sprecher, Version, Software
EMPTY (199994): Millionen, Prozent, Milliarden, Mark, US-Dollar, Ende, Kunden, Unternehmen, AG, anfang
| Paradigm unknown | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| _ | Hacker, Initialisieren, Massachusetts, Met@box-Infonet, Online-Store, Portierung, Statement | G3-Minitower, IE, Java-VM, Nachfrager, Newsgroups, Pf./Min., Pf/Min., TLD, URL, Zersplitterung | Navigator, Kriminellen, Dot.Com-Companies, 1133-MHz-Pentium, 3.10a, Backups, Beiteiligten, Beschenkten, Bonnern, Boygroups, Breathe, Browser-Release, CCDs, Celera, Cern, Chattern, Clients, Colorama, Comdirect, Connectivity, Crusoe, Cyber-Entrepeneuren, Dauer-Rennerin, Desktop-Coppermines, Deut, E-Commercelern, ECMA, ECommerce, EFF, Einzelnen, Entrechtung, Excite.de, Firmenehe, GFlops, GMD, Gebaren, Gehetzten, Gehörlosen, Guides, Heavyweights, Heise-Ticker, Herstellter, Hightech, ICANN, ISPs, Internet-Appliances, Internetties, Iridium, Itanium, Jüngsten, Krawattis, Kritischeres, LANL, Lingubot, Lyan, Lüfter, MRJ-Plugin, Matloff, NCs, NSA, Nahost, Nicht-Windows-Clients, Nneben, Object, PC-Videobegeisterten, Password-Sniffern, Paybox, Pentium, Philanthropie, Plugin, Rechteverletzer, Registries, Resellern, S/390-Mainframes, Salinger, Schnellebigkeit, Schwerhörigen, Startup-Companies, Symbian, Taiwanern, Text-Mining, Threads, Tumorart, Tux, USB, VRML, Verleiher, Vice-President, W3C, Wang, What's, iCast-Clients, Ältestenrat, Öffentlich-Rechtlichen | |
| Gender=Masc\|Number=Sing | 256bittigen | Internationbalen | | |
| Gender=Masc\|Number=Plur | Rekonfigurierbaren | | | |
| Gender=Fem\|Number=Sing | Milliardstel | | | |
| Gender=Fem\|Number=Plur | Milliaren | 128bittigen, Zellularen | | |
| Gender=Neut\|Number=Sing | 48bittige, COmputergestütztes | Wirtschaftswissenschaftlichen |
DET
489419 DET tokens (99% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: PronType=Art (428875; 88%), NumType=EMPTY (419513; 86%), Number=Sing (393614; 80%), Definite=Def (359943; 74%).
DET tokens may have the following values of Case:
Acc (129760; 27% of non-empty Case): die, den, das, eine, einen, ein, ihre, seine, keine, diese
Dat (152321; 31% of non-empty Case): dem, der, den, einem, einer, diesem, allem, anderem, seiner, anderen
Gen (71958; 15% of non-empty Case): der, des, eines, einer, dieser, seiner, dieses, aller, ihrer, seines
Nom (135380; 28% of non-empty Case): die, der, das, ein, eine, diese, dies, alle, viele, keine
EMPTY (4948): andere, mehr, anderen, viel, all, keinerlei, einig, wenig, meisten, anderes
| Paradigm der | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Gender=Masc,Neut\|Number=Sing | dem | | | |
| Gender=Masc\|Number=Sing | der | den, der | dem, des, den | des, der |
| Gender=Masc\|Number=Plur | die, der | die, den | den, die, der | der |
| Gender=Fem\|Number=Sing | die, der | die | der, die | der |
| Gender=Fem\|Number=Plur | die | die | den, der | der |
| Gender=Neut\|Number=Sing | das | das, 's | dem, das, des | des |
| Gender=Neut\|Number=Plur | die | die | den, der, die | der |
| Number=Sing | der, Die | die, den | der | |
| Number=Plur | die, der | die | den, der, die | der |
ADP
345545 ADP tokens (90% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: AdpType=Prep (344347; 100%).
ADP tokens may have the following values of Case:
Acc (85305; 25% of non-empty Case): für, auf, in, über, um, durch, an, gegen, ohne, unter
Dat (257485; 75% of non-empty Case): in, von, mit, zu, bei, an, nach, auf, aus, vor
Gen (2424; 1% of non-empty Case): angesichts, aufgrund, wegen, außerhalb, anhand, innerhalb, trotz, hinsichtlich, während, zugunsten
Nom (331; 0% of non-empty Case): namens, voller
EMPTY (40180): für, bis, an, aus, vor, per, ab, ein, auf, wegen
| Paradigm in | Acc | Dat | Gen |
|---|---|---|---|
| _ | in | ||
| AdpType=Prep | in | in | in |
ADJ
147865 ADJ tokens (56% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Variant=EMPTY (147861; 100%), Degree=Pos (124376; 84%), Number=Sing (103820; 70%).
ADJ tokens may have the following values of Case:
Acc (41623; 28% of non-empty Case): neue, neuen, weitere, ersten, eigene, erste, eigenen, deutsche, große, großen
Dat (52355; 35% of non-empty Case): neuen, ersten, vergangenen, eigenen, letzten, deutschen, nächsten, heutigen, zweiten, 1.
Gen (20585; 14% of non-empty Case): neuen, deutschen, nächsten, letzten, neuer, europäischen, ersten, amerikanischen, vergangenen, beiden
Nom (33302; 23% of non-empty Case): neue, deutsche, erste, beiden, größte, amerikanische, neuer, neuen, große, europäische
EMPTY (114746): neue, weitere, möglich, gut, ganz, deutsche, weltweit, deutlich, knapp, künftig
| Paradigm neu | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Degree=Pos\|Gender=Masc\|Number=Sing | neue, neuer | neuen | neuen, neuem | neuen |
| Degree=Pos\|Gender=Masc\|Number=Plur | neuen, neue | neue, neuen | neuen | neuer, neuen |
| Degree=Pos\|Gender=Fem\|Number=Sing | neue | neue | neuen, neuer, neue | neuen, neue |
| Degree=Pos\|Gender=Fem\|Number=Plur | neuen, neue | neue, neuen | neuen, neue | neuer, neuen |
| Degree=Pos\|Gender=Neut\|Number=Sing | neue, neues | neues, neue | neuen, neuem | neuen, neues |
| Degree=Pos\|Gender=Neut\|Number=Plur | neuen, neue | neue, neuen | neuen, neue | neuer, neuen, neue |
| Degree=Pos\|Number=Sing | neuen, neuem | neuen | | |
| Degree=Pos\|Number=Plur | neuen, neue | neue, neuen | neuen | neuer, neuen |
| Degree=Cmp\|Gender=Masc\|Number=Sing | neuere, neuerer | neueren | neueren | neueren |
| Degree=Cmp\|Gender=Masc\|Number=Plur | neueren | neueren | | |
| Degree=Cmp\|Gender=Fem\|Number=Sing | neuere | neuere | neueren, neuerer | neueren |
| Degree=Cmp\|Gender=Fem\|Number=Plur | Neuere | neuere, neueren | neueren | neueren |
| Degree=Cmp\|Gender=Neut\|Number=Sing | neueres | neueren | | |
| Degree=Cmp\|Gender=Neut\|Number=Plur | Neuere, neueren | neueren | | |
| Degree=Cmp\|Number=Plur | neueren | | | |
| Degree=Sup\|Gender=Masc\|Number=Sing | neueste, neuester, neuste | neuesten | neuesten, neuestem, neusten | neuesten |
| Degree=Sup\|Gender=Masc\|Number=Plur | neuesten | neuesten | neuesten | |
| Degree=Sup\|Gender=Fem\|Number=Sing | neueste | neueste | neuesten, neuester, neusten | neuesten |
| Degree=Sup\|Gender=Fem\|Number=Plur | neuesten | neuesten, neueste, neusten | neuesten | neuesten, neuester |
| Degree=Sup\|Gender=Neut\|Number=Sing | neueste | neueste, neuestes | neuesten, neuestem, neusten | neuesten |
| Degree=Sup\|Gender=Neut\|Number=Plur | neuesten | neuesten, neueste, neusten | neuesten | neuesten |
| Degree=Sup\|Number=Sing | neuestem, neuesten |
PRON
93577 PRON tokens (99% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Reflex=EMPTY (72457; 77%), PronType=Prs (54176; 58%), Number=Sing (53222; 57%), Gender=EMPTY (50701; 54%), Person=3 (48743; 52%).
PRON tokens may have the following values of Case:
Acc (25215; 27% of non-empty Case): sich, die, das, sie, es, den, was, ihn, uns, mich
Dat (8854; 9% of non-empty Case): sich, dem, denen, der, ihm, ihnen, uns, mir, ihr, wem
Gen (1624; 2% of non-empty Case): deren, dessen, derer, der, jedermanns
Nom (57884; 62% of non-empty Case): es, die, man, sie, er, das, der, wir, was, wer
EMPTY (1270): nichts, etwas, sich, nix, irgendetwas, irgendjemand, was, irgendwas, E-irgendwas, einander
| Paradigm der | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Abbr=Yes\|Gender=Neut\|Number=Sing | d. | | | |
| Gender=Masc\|Number=Sing | der | den | dem | dessen |
| Gender=Fem\|Number=Sing | die | die | der | derer, Deren |
| Gender=Neut\|Number=Sing | das | das | dem | dessen |
| Gender=Neut\|Number=Sing\|Typo=Yes | da | | | |
| Number=Sing | das | dessen | | |
| Number=Plur | die | die | denen | derer, der |
| deren |
PROPN
61547 PROPN tokens (32% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Gender=EMPTY (58875; 96%), Number=Sing (58722; 95%).
PROPN tokens may have the following values of Case:
Acc (5246; 9% of non-empty Case): Microsoft, AOL, Intel, Napster, Java, Palm, IBM, Apple, OS/2, Mac
Dat (15114; 25% of non-empty Case): Microsoft, heise, AOL, Intel, IBM, Napster, Apple, Frankreich, Telepolis, Netscape
Gen (6888; 11% of non-empty Case): Microsofts, Intels, Apples, AMDs, Deutschlands, Europas, ICANNs, Suns, IBMs, Sonys
Nom (34299; 56% of non-empty Case): Microsoft, Intel, AOL, IBM, Apple, Napster, Compaq, Siemens, Sony, Gates
EMPTY (132392): Telekom, Deutschland, USA, c’t, Europa, Linux, Windows, telepolis, online, Sun
| Paradigm Telekom | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Telekom | Telekom | Telekom | Telekom |
ADV
1074 ADV tokens (1% of all ADV tokens) have a non-empty value of Case.
The most frequent other feature values with which ADV and Case co-occurred: PronType=Ind (1074; 100%).
ADV tokens may have the following values of Case:
Acc (376; 35% of non-empty Case): mehrere, meisten
Dat (356; 33% of non-empty Case): mehreren, meisten, mehr, weniger
Gen (69; 6% of non-empty Case): mehrerer, weniger
Nom (273; 25% of non-empty Case): mehrere, meisten
EMPTY (195519): auch, noch, nur, so, aber, mehr, bereits, allerdings, damit, schon
| Paradigm mehr | Nom | Acc | Dat | Gen |
|---|---|---|---|---|
| Number=Plur | mehrere | mehrere | mehreren | mehrerer |
| mehr |
X
61 X tokens (0% of all X tokens) have a non-empty value of Case.
The most frequent other feature values with which X and Case co-occurred: Foreign=Yes (61; 100%).
X tokens may have the following values of Case:
Acc (1; 2% of non-empty Case): Internetbanking
Dat (54; 89% of non-empty Case): Internet, World, Baby, France, Instant, Open, Vice, endlich, .web-Domain, Abstract
Nom (6; 10% of non-empty Case): AID, Anti-Spam-Petition, Digital, Push, Telekom-Mitarbeiter, dmmv
EMPTY (53630): of, internet, the, and, digital, mobile, media, for, OS, network
| Paradigm Digital | Nom | Dat |
|---|---|---|
| Digital | Digital |
Case seems to be a lexical feature of X. 98% of lemmas (49) occur with only one value of Case.
NUM
48 NUM tokens (0% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (48; 100%), Number=Sing (26; 54%).
NUM tokens may have the following values of Case:
Acc (11; 23% of non-empty Case): eine, ein, einen
Dat (25; 52% of non-empty Case): einer, einem, drei, dreien, 1394, 15.000, 300, 4, 86a, Acht
Gen (8; 17% of non-empty Case): zweier, Tausender
Nom (4; 8% of non-empty Case): eine, ein, eins
EMPTY (71260): zwei, 2000, drei, 2001, 1999, vier, fünf, 20, 100, 30
| Paradigm ein | Nom | Acc | Dat |
|---|---|---|---|
| Gender=Masc | einen | einem | |
| Gender=Fem | eine | eine | einer |
| Gender=Neut | ein | ein | einem |
Case seems to be a lexical feature of NUM. 93% of lemmas (14) occur with only one value of Case.
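The "lexical feature" remark rests on the share of lemmas that only ever appear with a single value of Case. One way such a share might be computed is sketched below. The function name and the toy token list are hypothetical, and the denominator is assumed to be the lemmas of the given tag that occur at least once with a non-empty Case; the script behind this page may count differently.

```python
from collections import defaultdict


def single_case_lemma_share(tokens, upos):
    """Return (count, percent) of lemmas seen with exactly one Case value.

    tokens: iterable of (lemma, upos, case), where case is None when empty.
    Only lemmas with at least one non-empty Case are counted (an assumption).
    """
    values = defaultdict(set)
    for lemma, tag, case in tokens:
        if tag == upos and case is not None:
            values[lemma].add(case)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, 100 * single / len(values)


# Hypothetical toy data, not taken from the treebank.
TOY_TOKENS = [
    ("ein", "NUM", "Nom"),
    ("ein", "NUM", "Acc"),   # "ein" shows two Case values
    ("zwei", "NUM", "Gen"),
    ("drei", "NUM", "Dat"),
    ("drei", "NUM", None),   # tokens without Case are ignored
    ("neu", "ADJ", "Nom"),   # other tags are ignored
]
COUNT, SHARE = single_case_lemma_share(TOY_TOKENS, "NUM")
```

In the toy data two of the three NUM lemmas have a single Case value. A high share suggests that Case is tied to the lemma rather than inflected across a paradigm.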
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[det]–> DET (402114; 90%),
NOUN –[case]–> ADP (264553; 94%),
NOUN –[amod]–> ADJ (141937; 99%),
NOUN –[conj]–> NOUN (20650; 78%),
PRON –[case]–> ADP (5703; 96%),
DET –[case]–> ADP (4567; 97%),
ADJ –[conj]–> ADJ (1594; 98%),
DET –[det]–> DET (357; 58%),
NOUN –[nsubj]–> DET (286; 70%),
NOUN –[nmod]–> ADJ (166; 61%).
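Agreement counts of this kind can be derived by following each token's HEAD column and comparing the Case values of parent and child. The sketch below is one possible reading, not the generator of this page: the function names and toy sentences are hypothetical, only pairs where both nodes carry Case are counted, and values are compared as plain strings.

```python
from collections import Counter


def case_of(feats):
    """Return the Case value from a CoNLL-U FEATS string, or None."""
    for pair in feats.split("|"):
        if pair.startswith("Case="):
            return pair[len("Case="):]
    return None


def case_agreement(conllu_text):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, pairs with Case)."""
    agree, both = Counter(), Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        words = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if len(cols) == 10 and cols[0].isdigit():
                words[cols[0]] = cols
        for cols in words.values():
            parent = words.get(cols[6])  # HEAD "0" (root) has no entry
            if parent is None:
                continue
            child_case, parent_case = case_of(cols[5]), case_of(parent[5])
            if child_case is None or parent_case is None:
                continue
            key = (parent[3], cols[7], cols[3])
            both[key] += 1
            if child_case == parent_case:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Hypothetical toy sentences, not taken from the treebank.
SENT1 = [
    ("1", "Der", "der", "DET", "ART", "Case=Nom", "2", "det", "_", "_"),
    ("2", "Konzern", "Konzern", "NOUN", "NN", "Case=Nom", "3", "nsubj", "_", "_"),
    ("3", "kauft", "kaufen", "VERB", "VVFIN", "_", "0", "root", "_", "_"),
    ("4", "die", "der", "DET", "ART", "Case=Acc", "5", "det", "_", "_"),
    ("5", "Firma", "Firma", "NOUN", "NN", "Case=Acc", "3", "obj", "_", "_"),
]
SENT2 = [
    ("1", "mit", "mit", "ADP", "APPR", "Case=Dat", "3", "case", "_", "_"),
    ("2", "dem", "der", "DET", "ART", "Case=Dat", "3", "det", "_", "_"),
    ("3", "Konzern", "Konzern", "NOUN", "NN", "Case=Dat", "0", "root", "_", "_"),
]
SAMPLE = "\n\n".join("\n".join("\t".join(r) for r in s) for s in (SENT1, SENT2))
AGREEMENT = case_agreement(SAMPLE)
```

Dividing the first element of each pair by the second gives an agreement rate comparable in spirit to the percentages listed above, under the stated assumption about the denominator.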