home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-HDT: POS Tags: PRON

There are 28 PRON lemmas (0%), 65 PRON types (0%) and 94853 PRON tokens (3%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 11 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: der, sich, es, sie, man, er, wir, was, wer, ich

The 10 most frequent PRON types: sich, es, die, sie, man, das, er, der, wir, was

The 10 most frequent ambiguous lemmas: der (DET 359942, PRON 28690, X 2), es (PRON 13851, PROPN 1), sie (PRON 8059, X 16), man (PRON 6680, X 2), wir (PRON 4106, X 1), was (PRON 2181, X 5), nichts (PRON 795, ADV 2), etwas (ADV 568, PRON 423), du (PRON 54, X 5), ihr (DET 7652, PRON 22)

The 10 most frequent ambiguous types: es (PRON 11325, PROPN 1), die (DET 77836, PRON 12604, X 2), sie (PRON 5415, X 16), man (PRON 5900, X 2), das (DET 25405, PRON 4240, X 1), der (DET 91438, PRON 4857, X 2), wir (PRON 1680, X 1), was (PRON 1450, X 5), dem (DET 66367, PRON 1681, X 1), nichts (PRON 750, ADV 2)

Morphology

The form / lemma ratio of PRON is 2.321429 (the average of all parts of speech is 2.529657).

The 1st highest number of forms (11) was observed with the lemma “der”: d., da, das, dem, den, denen, der, deren, derer, dessen, die.

The 2nd highest number of forms (4) was observed with the lemma “es”: ’s, es, ihm, s.

The 3rd highest number of forms (4) was observed with the lemma “wer”: wem, wen, wer, wessen.

PRON occurs with 10 features: PronType (94853; 100% instances), Case (93583; 99% instances), Number (73114; 77% instances), Person (54207; 57% instances), Gender (44119; 47% instances), Reflex (21144; 22% instances), Polite (70; 0% instances), Abbr (3; 0% instances), Foreign (1; 0% instances), Typo (1; 0% instances)

PRON occurs with 26 feature-value pairs: Abbr=Yes, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, PronType=Dem, PronType=Dem,Rel, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 87 feature combinations. The most frequent feature combination is Case=Acc|Person=3|PronType=Prs|Reflex=Yes (17728 tokens). Examples: sich

Relations

PRON nodes are attached to their parents using 21 different relations: nsubj (49879; 53% instances), obj (16415; 17% instances), expl:pv (10468; 11% instances), obl (5345; 6% instances), nsubj:pass (4698; 5% instances), expl (3114; 3% instances), nmod (1919; 2% instances), obl:arg (1891; 2% instances), xcomp (332; 0% instances), conj (229; 0% instances), det (200; 0% instances), root (163; 0% instances), appos (77; 0% instances), acl (58; 0% instances), ccomp (32; 0% instances), parataxis (14; 0% instances), advcl (6; 0% instances), csubj (6; 0% instances), reparandum (4; 0% instances), amod (2; 0% instances), vocative (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (82389; 87% instances), NOUN (4475; 5% instances), ADJ (3741; 4% instances), AUX (3262; 3% instances), DET (277; 0% instances), ADV (198; 0% instances), (163; 0% instances), PRON (108; 0% instances), NUM (90; 0% instances), PROPN (88; 0% instances), X (58; 0% instances), SCONJ (3; 0% instances), PART (1; 0% instances)

86979 (92%) PRON nodes are leaves.

6731 (7%) PRON nodes have one child.

632 (1%) PRON nodes have two children.

511 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 9.

Children of PRON nodes are attached using 25 different relations: case (6105; 61% instances), advmod (1068; 11% instances), nmod (795; 8% instances), punct (603; 6% instances), acl (364; 4% instances), cc (231; 2% instances), cop (229; 2% instances), nsubj (222; 2% instances), conj (151; 1% instances), appos (139; 1% instances), obl (44; 0% instances), det (30; 0% instances), aux (26; 0% instances), parataxis (20; 0% instances), ccomp (19; 0% instances), mark (14; 0% instances), advcl (4; 0% instances), flat (4; 0% instances), reparandum (4; 0% instances), xcomp (4; 0% instances), csubj (3; 0% instances), flat:name (3; 0% instances), amod (1; 0% instances), expl (1; 0% instances), nmod:poss (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (5964; 59% instances), ADV (931; 9% instances), NOUN (919; 9% instances), PUNCT (603; 6% instances), CCONJ (376; 4% instances), VERB (358; 4% instances), AUX (284; 3% instances), PROPN (162; 2% instances), ADJ (133; 1% instances), DET (133; 1% instances), PRON (108; 1% instances), PART (51; 1% instances), X (35; 0% instances), NUM (21; 0% instances), SCONJ (7; 0% instances)