home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: POS Tags: VERB

There are 2621 VERB lemmas (6%), 4989 VERB types (9%) and 20686 VERB tokens (7%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent VERB lemmas: haben, werden, geben, kommen, sein, finden, liegen, gehen, machen, gehören

The 10 most frequent VERB types: wurde, gibt, hat, liegt, kam, hatte, gab, war, erhielt, befindet

The 10 most frequent ambiguous lemmas: haben (AUX 1019, VERB 476, PROPN 1), werden (AUX 3256, VERB 382, PROPN 1), geben (VERB 367, PROPN 2, ADJ 1, NOUN 1), kommen (VERB 357, NOUN 2, ADJ 1, PROPN 1), sein (AUX 4653, DET 1388, VERB 351, NOUN 5, PROPN 5), finden (VERB 241, NOUN 1), liegen (VERB 240, ADJ 1, NOUN 1), machen (VERB 209, PROPN 1), gehören (VERB 207, NOUN 1), lassen (VERB 194, NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: wurde (AUX 1273, VERB 243), hat (AUX 314, VERB 158), hatte (AUX 188, VERB 143), war (AUX 1200, VERB 111, PROPN 1), ist (AUX 2008, VERB 94, PROPN 2), haben (AUX 196, VERB 92), machen (VERB 72, PROPN 1), kommen (VERB 67, PROPN 1), finden (VERB 64, NOUN 1), gehörte (VERB 65, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.903472 (the average of all parts of speech is 1.187855).

The 1st highest number of forms (13) was observed with the lemma “sein”: bin, bist, gewesen, ist, sei, seien, sein, seyn, sind, war, waren, wart, wären.

The 2nd highest number of forms (10) was observed with the lemma “haben”: Hast, gehabt, hab, habe, haben, hat, hatte, hatten, hätte, hätten.

The 3rd highest number of forms (10) was observed with the lemma “lassen”: Laß, gelassen, lasse, lassen, laßt, liess, ließ, ließen, lässt, läßt.

VERB occurs with 10 features: VerbForm (20506; 99% instances), Mood (12500; 60% instances), Number (12496; 60% instances), Person (12469; 60% instances), Tense (12461; 60% instances), Voice (168; 1% instances), Foreign (13; 0% instances), Typo (11; 0% instances), Polite (3; 0% instances), Abbr (2; 0% instances)

VERB occurs with 18 feature-value pairs: Abbr=Yes, Foreign=Yes, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Form, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 42 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin (5079 tokens). Examples: kam, hatte, gab, erhielt, wurde, war, begann, führte, ging, nahm

Relations

VERB nodes are attached to their parents using 17 different relations: root (11871; 57% instances), conj (2856; 14% instances), acl (2435; 12% instances), advcl (1341; 6% instances), ccomp (711; 3% instances), xcomp (706; 3% instances), parataxis (414; 2% instances), csubj (178; 1% instances), appos (55; 0% instances), csubj:pass (40; 0% instances), dep (31; 0% instances), acl:relcl (25; 0% instances), obl (17; 0% instances), nmod (2; 0% instances), obj (2; 0% instances), compound (1; 0% instances), flat (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: (11871; 57% instances), VERB (5293; 26% instances), NOUN (2236; 11% instances), ADJ (603; 3% instances), PROPN (441; 2% instances), ADV (84; 0% instances), PRON (54; 0% instances), DET (47; 0% instances), NUM (14; 0% instances), ADP (12; 0% instances), AUX (12; 0% instances), PART (6; 0% instances), CCONJ (5; 0% instances), X (5; 0% instances), INTJ (2; 0% instances), PUNCT (1; 0% instances)

141 (1%) VERB nodes are leaves.

344 (2%) VERB nodes have one child.

963 (5%) VERB nodes have two children.

19238 (93%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 38 different relations: punct (18602; 20% instances), obl (17009; 18% instances), nsubj (13428; 14% instances), advmod (10023; 11% instances), obj (8268; 9% instances), aux:pass (3412; 4% instances), aux (3177; 3% instances), nsubj:pass (3156; 3% instances), mark (2904; 3% instances), conj (2766; 3% instances), cc (2440; 3% instances), xcomp (1785; 2% instances), compound:prt (1494; 2% instances), advcl (1186; 1% instances), obl:arg (1157; 1% instances), ccomp (730; 1% instances), obl:agent (479; 1% instances), parataxis (363; 0% instances), expl (355; 0% instances), dep (288; 0% instances), acl (227; 0% instances), expl:pv (214; 0% instances), appos (108; 0% instances), csubj (108; 0% instances), amod (75; 0% instances), det (65; 0% instances), csubj:pass (45; 0% instances), case (40; 0% instances), cop (19; 0% instances), obl:tmod (18; 0% instances), compound (17; 0% instances), discourse (8; 0% instances), nmod (7; 0% instances), nsubj:outer (3; 0% instances), flat (2; 0% instances), vocative (2; 0% instances), nmod:poss (1; 0% instances), reparandum (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (27020; 29% instances), PUNCT (18602; 20% instances), PRON (9229; 10% instances), ADV (8550; 9% instances), AUX (6617; 7% instances), PROPN (6560; 7% instances), VERB (5293; 6% instances), ADJ (2566; 3% instances), CCONJ (2451; 3% instances), PART (1650; 2% instances), NUM (1551; 2% instances), SCONJ (1516; 2% instances), ADP (1366; 1% instances), DET (958; 1% instances), X (45; 0% instances), INTJ (4; 0% instances), SYM (4; 0% instances)