home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_German-GSD: POS Tags: VERB

There are 2615 VERB lemmas (6%), 4977 VERB types (9%) and 20656 VERB tokens (7%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent VERB lemmas: haben, werden, geben, kommen, sein, finden, liegen, gehen, machen, gehören

The 10 most frequent VERB types: wurde, gibt, hat, liegt, kam, hatte, gab, war, erhielt, befindet

The 10 most frequent ambiguous lemmas: haben (AUX 1019, VERB 475, CCONJ 1, PROPN 1), werden (AUX 3247, VERB 380, X 9, PROPN 2, PUNCT 1), geben (VERB 367, PROPN 2, ADJ 1, NOUN 1), kommen (VERB 357, NOUN 2, ADJ 1, PROPN 1), sein (AUX 4644, DET 1388, VERB 353, PROPN 10, NOUN 5), finden (VERB 241, NOUN 1), liegen (VERB 240, ADJ 1, NOUN 1), machen (VERB 209, PROPN 1), gehören (VERB 207, NOUN 1), lassen (VERB 193, ADP 1, NOUN 1, PROPN 1)

The 10 most frequent ambiguous types: wurde (AUX 1268, VERB 242, X 5, PUNCT 1), hat (AUX 314, VERB 158), hatte (AUX 188, VERB 142, CCONJ 1), war (AUX 1200, VERB 111, PROPN 1), ist (AUX 2004, VERB 95, PROPN 5), haben (AUX 196, VERB 92), machen (VERB 72, PROPN 1), kommen (VERB 67, PROPN 1), finden (VERB 64, NOUN 1), gehörte (VERB 65, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.903250 (the average of all parts of speech is 1.187208).

The 1st highest number of forms (13) was observed with the lemma “sein”: bin, bist, gewesen, ist, sei, seien, sein, seyn, sind, war, waren, wart, wären.

The 2nd highest number of forms (10) was observed with the lemma “haben”: Hast, gehabt, hab, habe, haben, hat, hatte, hatten, hätte, hätten.

The 3rd highest number of forms (10) was observed with the lemma “lassen”: Laß, gelassen, lasse, lassen, laßt, liess, ließ, ließen, lässt, läßt.

VERB occurs with 9 features: VerbForm (20477; 99% instances), Mood (12477; 60% instances), Number (12477; 60% instances), Person (12447; 60% instances), Tense (12440; 60% instances), Voice (168; 1% instances), Typo (9; 0% instances), Foreign (8; 0% instances), Abbr (1; 0% instances)

VERB occurs with 17 feature-value pairs: Abbr=Yes, Foreign=Yes, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 35 feature combinations. The most frequent feature combination is VerbForm=Part (5069 tokens). Examples: gegründet, genannt, verwendet, eingesetzt, genutzt, bezeichnet, gebaut, aufgenommen, gewählt, gemacht

Relations

VERB nodes are attached to their parents using 18 different relations: root (11865; 57% instances), conj (2847; 14% instances), acl (2448; 12% instances), advcl (1336; 6% instances), ccomp (711; 3% instances), xcomp (702; 3% instances), parataxis (413; 2% instances), csubj (178; 1% instances), appos (54; 0% instances), csubj:pass (40; 0% instances), dep (30; 0% instances), obl (17; 0% instances), acl:relcl (8; 0% instances), nmod (2; 0% instances), obj (2; 0% instances), compound (1; 0% instances), flat (1; 0% instances), nsubj (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: (11865; 57% instances), VERB (5281; 26% instances), NOUN (2227; 11% instances), ADJ (603; 3% instances), PROPN (446; 2% instances), PRON (142; 1% instances), ADV (36; 0% instances), NUM (14; 0% instances), ADP (11; 0% instances), AUX (10; 0% instances), PART (7; 0% instances), CCONJ (6; 0% instances), X (4; 0% instances), INTJ (2; 0% instances), DET (1; 0% instances), PUNCT (1; 0% instances)

138 (1%) VERB nodes are leaves.

340 (2%) VERB nodes have one child.

960 (5%) VERB nodes have two children.

19218 (93%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 34 different relations: punct (18587; 20% instances), obl (17533; 19% instances), nsubj (13453; 14% instances), advmod (9878; 11% instances), obj (8235; 9% instances), aux:pass (3414; 4% instances), aux (3178; 3% instances), nsubj:pass (3157; 3% instances), mark (2902; 3% instances), conj (2752; 3% instances), cc (2435; 3% instances), xcomp (1777; 2% instances), compound:prt (1491; 2% instances), advcl (1186; 1% instances), iobj (1156; 1% instances), ccomp (729; 1% instances), dep (395; 0% instances), parataxis (361; 0% instances), expl (355; 0% instances), acl (229; 0% instances), expl:pv (211; 0% instances), csubj (109; 0% instances), appos (108; 0% instances), amod (77; 0% instances), det (54; 0% instances), csubj:pass (45; 0% instances), case (35; 0% instances), compound (18; 0% instances), cop (18; 0% instances), discourse (4; 0% instances), nmod (4; 0% instances), flat (2; 0% instances), obl:arg (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (26993; 29% instances), PUNCT (18587; 20% instances), PRON (10198; 11% instances), ADV (8434; 9% instances), AUX (6615; 7% instances), PROPN (6556; 7% instances), VERB (5281; 6% instances), ADJ (2548; 3% instances), CCONJ (2447; 3% instances), PART (1646; 2% instances), NUM (1548; 2% instances), SCONJ (1510; 2% instances), ADP (1365; 1% instances), DET (106; 0% instances), X (52; 0% instances), INTJ (2; 0% instances), SYM (2; 0% instances)