home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German: POS Tags: VERB

There are 2673 VERB lemmas (6%), 5007 VERB types (9%) and 19921 VERB tokens (7%). Out of 15 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent VERB lemmas: haben, geben, sein, kommen, finden, liegen, gehen, gehören, machen, stehen

The 10 most frequent VERB types: gibt, hat, kam, liegt, hatte, gab, war, erhielt, befindet, ist

The 10 most frequent ambiguous lemmas: haben (AUX 1013, VERB 475, CCONJ 1, PROPN 1), geben (VERB 367, PROPN 2, ADJ 1, NOUN 1), sein (AUX 4628, DET 1229, VERB 363, PRON 152, PROPN 14, NOUN 7), kommen (VERB 357, NOUN 2, ADJ 1, PROPN 1), finden (VERB 240, AUX 1, NOUN 1), liegen (VERB 239, ADJ 1, AUX 1, NOUN 1), gehen (VERB 235, AUX 1), gehören (VERB 206, NOUN 1), machen (VERB 195, AUX 14, PROPN 1), stehen (VERB 188, AUX 3, ADV 1, NOUN 1)

The 10 most frequent ambiguous types: hat (AUX 314, VERB 158), liegt (VERB 147, AUX 1), hatte (AUX 188, VERB 142, CCONJ 1), war (AUX 1199, VERB 112, PROPN 1), ist (AUX 1999, VERB 100, PROPN 5), haben (AUX 195, VERB 93), lassen (VERB 78, AUX 11), besteht (VERB 72, AUX 3), steht (VERB 72, AUX 1), machen (VERB 70, AUX 2, PROPN 1)

Morphology

The form / lemma ratio of VERB is 1.873176 (the average of all parts of speech is 1.186689).

The 1st highest number of forms (13) was observed with the lemma “sein”: bin, bist, gewesen, ist, sei, seien, sein, sind, war, waren, wart, wäre, wären.

The 2nd highest number of forms (10) was observed with the lemma “haben”: Hast, gehabt, hab, habe, haben, hat, hatte, hatten, hätte, hätten.

The 3rd highest number of forms (10) was observed with the lemma “lassen”: Laß, gelassen, lasse, lassen, laßt, liess, ließ, ließen, lässt, läßt.

VERB occurs with 7 features: VerbForm (19312; 97% instances), Person (10245; 51% instances), Number (9418; 47% instances), Mood (843; 4% instances), Tense (843; 4% instances), Case (4; 0% instances), Gender (4; 0% instances)

VERB occurs with 16 feature-value pairs: Case=Acc, Case=Gen, Gender=Masc, Gender=Masc,Neut, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part

VERB occurs with 37 feature combinations. The most frequent feature combination is Number=Sing|Person=3|VerbForm=Fin (6391 tokens). Examples: gibt, kam, liegt, erhielt, befindet, gab, ging, sagte, führte, begann

Relations

VERB nodes are attached to their parents using 23 different relations: root (11419; 57% instances), conj (2713; 14% instances), acl (2426; 12% instances), advcl (1292; 6% instances), ccomp (692; 3% instances), xcomp (676; 3% instances), parataxis (397; 2% instances), csubj (177; 1% instances), csubj:pass (39; 0% instances), dep (30; 0% instances), appos (27; 0% instances), nsubj (7; 0% instances), obj (7; 0% instances), aux:pass (4; 0% instances), amod (3; 0% instances), fixed (3; 0% instances), aux (2; 0% instances), flat (2; 0% instances), advmod (1; 0% instances), cc (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: (11419; 57% instances), VERB (5073; 25% instances), NOUN (2176; 11% instances), ADJ (597; 3% instances), PROPN (420; 2% instances), PRON (134; 1% instances), ADV (34; 0% instances), ADP (28; 0% instances), AUX (13; 0% instances), NUM (13; 0% instances), CCONJ (5; 0% instances), X (5; 0% instances), PART (4; 0% instances)

173 (1%) VERB nodes are leaves.

448 (2%) VERB nodes have one child.

1317 (7%) VERB nodes have two children.

17983 (90%) VERB nodes have three or more children.

The highest child degree of a VERB node is 19.

Children of VERB nodes are attached using 32 different relations: punct (17283; 19% instances), obl (17228; 19% instances), nsubj (12953; 14% instances), advmod (9422; 10% instances), obj (8230; 9% instances), aux:pass (3271; 4% instances), aux (3141; 3% instances), nsubj:pass (3021; 3% instances), mark (2865; 3% instances), conj (2741; 3% instances), cc (2344; 3% instances), compound:prt (1488; 2% instances), advcl (1182; 1% instances), iobj (1156; 1% instances), xcomp (963; 1% instances), ccomp (726; 1% instances), dep (401; 0% instances), parataxis (357; 0% instances), expl (354; 0% instances), case (320; 0% instances), acl (232; 0% instances), expl:pv (208; 0% instances), appos (171; 0% instances), csubj (97; 0% instances), amod (88; 0% instances), det (70; 0% instances), csubj:pass (45; 0% instances), cop (22; 0% instances), compound (5; 0% instances), fixed (4; 0% instances), flat (2; 0% instances), det:poss (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (26217; 29% instances), PUNCT (17292; 19% instances), PRON (9895; 11% instances), ADV (7988; 9% instances), AUX (6418; 7% instances), PROPN (6402; 7% instances), VERB (5073; 6% instances), CCONJ (2361; 3% instances), ADJ (2341; 3% instances), ADP (1622; 2% instances), PART (1587; 2% instances), SCONJ (1525; 2% instances), NUM (1518; 2% instances), DET (96; 0% instances), X (56; 0% instances)