home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: POS Tags: DET

There are 67 DET lemmas (0%), 124 DET types (0%) and 37391 DET tokens (13%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 10 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: der, ein, sein, ihr, mein, viel, unser, Ihr|ihr, wenig, meist

The 10 most frequent DET types: der, die, dem, den, das, des, eine, ein, einer, einem

The 10 most frequent ambiguous lemmas: der (DET 29328, PRON 2097, PROPN 1), ein (DET 5217, PRON 182, ADV 140, NUM 74, ADP 1), sein (AUX 4643, DET 1387, VERB 353, PROPN 11, NOUN 5), ihr (DET 611, PRON 85), viel (DET 139, PRON 126, ADV 16, VERB 2, ADJ 1), Ihr|ihr (DET 60, PRON 5), wenig (DET 50, PRON 34, ADV 15, ADJ 4), meist (ADV 68, DET 38, PRON 18), selb (DET 31, PRON 1), dies (PRON 1253, DET 21)

The 10 most frequent ambiguous types: der (DET 8314, PRON 565, PROPN 1), die (DET 5047, PRON 945, X 1), dem (DET 5860, PRON 164), den (DET 2682, PRON 55, SCONJ 1, X 1), das (DET 1636, PRON 287, SCONJ 22), des (DET 2039, PRON 34, PROPN 7, ADP 1), eine (DET 1491, PRON 35, NUM 15), ein (DET 1355, ADV 140, NUM 29, PRON 2, ADP 1), einer (DET 677, PRON 53, NUM 12), einem (DET 563, PRON 22, NUM 10)

Morphology

The form / lemma ratio of DET is 1.850746 (the average of all parts of speech is 1.185816).

The 1st highest number of forms (9) was observed with the lemma “der”: das, dem, den, der, deren, derer, des, dessen, die.

The 2nd highest number of forms (8) was observed with the lemma “ein”: ein, eine, einem, einen, einer, eines, eins, ne.

The 3rd highest number of forms (6) was observed with the lemma “ihr”: ihr, ihre, ihrem, ihren, ihrer, ihres.

DET occurs with 15 features: Number (37332; 100% instances), Case (37323; 100% instances), PronType (37290; 100% instances), Gender (36729; 98% instances), Definite (34235; 92% instances), Poss (2379; 6% instances), Person (2301; 6% instances), Number[psor] (1710; 5% instances), Gender[psor] (1387; 4% instances), NumType (8; 0% instances), Polarity (7; 0% instances), Foreign (4; 0% instances), VerbForm (3; 0% instances), Mood (1; 0% instances), Tense (1; 0% instances)

DET occurs with 32 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Masc,Neut, Mood=Ind, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, Tense=Pres, VerbForm=Fin, VerbForm=Inf

DET occurs with 259 feature combinations. The most frequent feature combination is Case=Dat|Definite=Def|Gender=Fem|Number=Sing|PronType=Art (3707 tokens). Examples: der, die

Relations

DET nodes are attached to their parents using 18 different relations: det (34343; 92% instances), det:poss (2342; 6% instances), amod (339; 1% instances), dep (186; 0% instances), nmod:poss (74; 0% instances), nsubj (39; 0% instances), nmod (21; 0% instances), obl (17; 0% instances), conj (10; 0% instances), obj (6; 0% instances), compound (4; 0% instances), iobj (4; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), flat (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (31371; 84% instances), PROPN (5576; 15% instances), ADJ (133; 0% instances), VERB (105; 0% instances), PRON (92; 0% instances), NUM (78; 0% instances), ADV (9; 0% instances), DET (9; 0% instances), ADP (5; 0% instances), AUX (4; 0% instances), X (4; 0% instances), CCONJ (2; 0% instances), PART (2; 0% instances), (1; 0% instances)

37150 (99%) DET nodes are leaves.

217 (1%) DET nodes have one child.

19 (0%) DET nodes have two children.

5 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 5.

Children of DET nodes are attached using 16 different relations: punct (130; 47% instances), advmod (43; 16% instances), case (34; 12% instances), nmod (19; 7% instances), conj (14; 5% instances), dep (8; 3% instances), cc (6; 2% instances), det (6; 2% instances), appos (3; 1% instances), flat (3; 1% instances), amod (2; 1% instances), fixed (2; 1% instances), nsubj (2; 1% instances), compound (1; 0% instances), cop (1; 0% instances), parataxis (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: PUNCT (130; 47% instances), ADV (49; 18% instances), ADP (33; 12% instances), NOUN (17; 6% instances), DET (9; 3% instances), ADJ (8; 3% instances), PRON (8; 3% instances), PROPN (7; 3% instances), CCONJ (6; 2% instances), NUM (3; 1% instances), X (2; 1% instances), AUX (1; 0% instances), PART (1; 0% instances), VERB (1; 0% instances)