home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: POS Tags: DET

There are 68 DET lemmas (0%), 128 DET types (0%) and 37661 DET tokens (13%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: der, ein, sein, ihr, mein, viel, unser, Ihr|ihr, wenig, meist

The 10 most frequent DET types: der, die, dem, den, das, des, eine, ein, einer, einem

The 10 most frequent ambiguous lemmas: der (DET 29594, PRON 2065, PROPN 1), ein (DET 5217, PRON 182, ADV 140, NUM 74, ADP 1), sein (AUX 4644, DET 1388, VERB 353, PROPN 10, NOUN 5), ihr (DET 611, PRON 85), viel (DET 139, PRON 126, ADV 16, VERB 2, ADJ 1), Ihr|ihr (DET 60, PRON 5), wenig (DET 50, PRON 34, ADV 15, ADJ 5), meist (ADV 68, DET 38, PRON 18), selb (DET 31, PRON 1), dies (PRON 1253, DET 21)

The 10 most frequent ambiguous types: der (DET 8415, PRON 464, PROPN 1), die (DET 5073, PRON 919, X 1), dem (DET 5867, PRON 157), den (DET 2691, PRON 46, SCONJ 1, X 1), das (DET 1644, PRON 279, SCONJ 22), des (DET 2072, PROPN 7, ADP 1, PRON 1), eine (DET 1491, PRON 35, NUM 15), ein (DET 1355, ADV 140, NUM 29, PRON 2, ADP 1), einer (DET 677, PRON 53, NUM 12), einem (DET 563, PRON 22, NUM 10)

Morphology

The form / lemma ratio of DET is 1.882353 (the average of all parts of speech is 1.187208).

The 1st highest number of forms (10) was observed with the lemma “der”: ’s, das, dem, den, der, deren, derer, des, dessen, die.

The 2nd highest number of forms (8) was observed with the lemma “ein”: ein, eine, einem, einen, einer, eines, eins, ne.

The 3rd highest number of forms (6) was observed with the lemma “ihr”: ihr, ihre, ihrem, ihren, ihrer, ihres.

DET occurs with 16 features: Number (37601; 100% instances), Case (37592; 100% instances), PronType (37566; 100% instances), Gender (36989; 98% instances), Definite (34501; 92% instances), Poss (2380; 6% instances), Person (2302; 6% instances), Number[psor] (1711; 5% instances), Gender[psor] (1388; 4% instances), NumType (8; 0% instances), Polarity (7; 0% instances), Foreign (4; 0% instances), VerbForm (3; 0% instances), Degree (1; 0% instances), Mood (1; 0% instances), Tense (1; 0% instances)

DET occurs with 34 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Gender[psor]=Masc,Neut, Mood=Ind, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Tense=Pres, VerbForm=Fin, VerbForm=Inf

DET occurs with 256 feature combinations. The most frequent feature combination is Case=Dat|Definite=Def|Gender=Fem|Number=Sing|PronType=Art (3731 tokens). Examples: der, die

Relations

DET nodes are attached to their parents using 18 different relations: det (34614; 92% instances), det:poss (2343; 6% instances), amod (339; 1% instances), dep (186; 0% instances), nmod:poss (74; 0% instances), nsubj (39; 0% instances), nmod (17; 0% instances), obl (17; 0% instances), conj (10; 0% instances), obj (6; 0% instances), compound (4; 0% instances), iobj (4; 0% instances), flat (2; 0% instances), nsubj:pass (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), root (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (31400; 83% instances), PROPN (5799; 15% instances), ADJ (151; 0% instances), VERB (106; 0% instances), PRON (91; 0% instances), NUM (78; 0% instances), DET (10; 0% instances), ADV (9; 0% instances), ADP (5; 0% instances), AUX (4; 0% instances), X (4; 0% instances), CCONJ (2; 0% instances), PART (1; 0% instances), (1; 0% instances)

37418 (99%) DET nodes are leaves.

218 (1%) DET nodes have one child.

19 (0%) DET nodes have two children.

6 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 5.

Children of DET nodes are attached using 16 different relations: punct (130; 47% instances), advmod (44; 16% instances), case (34; 12% instances), nmod (19; 7% instances), conj (14; 5% instances), dep (8; 3% instances), det (7; 3% instances), cc (6; 2% instances), flat (4; 1% instances), amod (3; 1% instances), appos (3; 1% instances), fixed (2; 1% instances), nsubj (2; 1% instances), compound (1; 0% instances), cop (1; 0% instances), parataxis (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: PUNCT (130; 47% instances), ADV (44; 16% instances), ADP (33; 12% instances), NOUN (18; 6% instances), ADJ (15; 5% instances), DET (10; 4% instances), PRON (8; 3% instances), PROPN (7; 3% instances), CCONJ (6; 2% instances), NUM (3; 1% instances), X (2; 1% instances), AUX (1; 0% instances), PART (1; 0% instances), VERB (1; 0% instances)