
This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: VERB

There are 1288 VERB lemmas (10%), 2281 VERB types (14%) and 6496 VERB tokens (7%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 7 in number of tokens.
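As a rough illustration of how counts like these could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. The function name `pos_counts` and the sample sentence are hypothetical, and comparing forms case-sensitively is an assumption; the script that produced this page may normalize differently.

```python
def pos_counts(conllu_text, upos="VERB"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)  # assumption: forms are compared as written
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical two-clause sample, not taken from the treebank.
sample = "\n".join([
    "# text = Hij komt en zij kwam .",
    "1\tHij\thij\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tkomt\tkomen\tVERB\t_\t_\t0\troot\t_\t_",
    "3\ten\ten\tCCONJ\t_\t_\t5\tcc\t_\t_",
    "4\tzij\tzij\tPRON\t_\t_\t5\tnsubj\t_\t_",
    "5\tkwam\tkomen\tVERB\t_\t_\t2\tconj\t_\t_",
    "6\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

n_lemmas, n_types, n_tokens = pos_counts(sample)
print(n_lemmas, n_types, n_tokens)
```

In the sample, both verb tokens share the lemma "komen" but have different forms, so the lemma count is lower than the type count.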

The 10 most frequent VERB lemmas: zijn, hebben, komen, zien, gaan, maken, staan, nemen, krijgen, bestaan

The 10 most frequent VERB types: zie, heeft, kwam, is, zijn, genoemd, komt, hebben, telt, staat

The 10 most frequent ambiguous lemmas: zijn (AUX 1325, PRON 445, VERB 180), hebben (VERB 173, AUX 138), bestaan (VERB 75, NOUN 6), blijven (VERB 47, AUX 25), leven (NOUN 28, VERB 22), geboren (ADJ 17, VERB 14), kunnen (AUX 118, VERB 13), gebeuren (VERB 11, NOUN 1), blijken (AUX 9, VERB 9), worden (AUX 945, VERB 9)

The 10 most frequent ambiguous types: heeft (VERB 73, AUX 41), is (AUX 687, VERB 59), zijn (PRON 399, AUX 219, VERB 57), hebben (VERB 44, AUX 35), staat (VERB 41, NOUN 22), was (AUX 332, VERB 38), had (AUX 46, VERB 36), vormen (VERB 23, NOUN 2), bestaan (VERB 22, NOUN 6), bleef (VERB 22, AUX 7)
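The two lists above appear to be ranked by how often each lemma or type is tagged VERB. A minimal sketch of that kind of tally follows, assuming the input is a stream of (item, UPOS) pairs; `ambiguous_items` and the toy data are hypothetical and not the code behind this page.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, focus="VERB"):
    """Return items seen with `focus` and at least one other tag,
    ordered by their `focus` count (highest first)."""
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    ambiguous = [
        (item, tags) for item, tags in by_item.items()
        if focus in tags and len(tags) > 1
    ]
    # Assumption: ranking follows the count under the focus tag.
    ambiguous.sort(key=lambda pair: pair[1][focus], reverse=True)
    return ambiguous


# Toy (lemma, UPOS) pairs, not real treebank counts.
toy = [
    ("zijn", "AUX"), ("zijn", "AUX"), ("zijn", "VERB"),
    ("hebben", "VERB"), ("hebben", "VERB"), ("hebben", "AUX"),
    ("komen", "VERB"), ("leven", "NOUN"),
]

for lemma, tags in ambiguous_items(toy):
    print(lemma, dict(tags))
```

Passing (form, UPOS) pairs instead of (lemma, UPOS) pairs gives the type-level variant.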

Morphology

The form / lemma ratio of VERB is 1.770963 (the average of all parts of speech is 1.174887).
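The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given at the top of the page. A quick check, assuming that is how the figure is defined:

```python
verb_types = 2281   # distinct VERB word forms reported on this page
verb_lemmas = 1288  # distinct VERB lemmas reported on this page

form_lemma_ratio = verb_types / verb_lemmas
print(f"{form_lemma_ratio:.6f}")
```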

The highest number of forms (9) was observed with the lemma “zijn”: ben, geweest, gewezen, is, waren, was, zij, zijn, zijnde.

The second highest number of forms (8) was observed with the lemma “komen”: gekomen, komen, komend, komende, komt, kwam, kwamen, kwamer.

The third highest number of forms (7) was observed with the lemma “doen”: deden, deed, doe, doen, doet, gedaan, gedane.

VERB occurs with 3 features: VerbForm (6496; 100% instances), Number (3503; 54% instances), Tense (3503; 54% instances)

VERB occurs with 7 feature-value pairs: Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part

VERB occurs with 6 feature combinations. The most frequent feature combination is VerbForm=Part (2006 tokens). Examples: genoemd, volgende, opgericht, gemaakt, gebruikt, gekozen, gelegen, verkozen, bestaande, geboren

Relations

VERB nodes are attached to their parents using 23 different relations: root (3217; 50% instances), conj (659; 10% instances), acl:relcl (555; 9% instances), advcl (407; 6% instances), amod (352; 5% instances), acl (316; 5% instances), xcomp (254; 4% instances), parataxis (210; 3% instances), ccomp (155; 2% instances), fixed (76; 1% instances), nmod (63; 1% instances), obl (61; 1% instances), csubj (43; 1% instances), nsubj (33; 1% instances), flat:name (20; 0% instances), advmod (19; 0% instances), obj (17; 0% instances), appos (15; 0% instances), nsubj:pass (9; 0% instances), compound:prt (7; 0% instances), orphan (5; 0% instances), obl:agent (2; 0% instances), mark (1; 0% instances)

Parents of VERB nodes fall into 13 categories (12 parts of speech, plus root nodes, which have no parent): no parent (3217; 50% instances), VERB (1571; 24% instances), NOUN (1251; 19% instances), PROPN (169; 3% instances), ADJ (142; 2% instances), ADV (34; 1% instances), PRON (31; 0% instances), DET (24; 0% instances), NUM (22; 0% instances), X (12; 0% instances), ADP (10; 0% instances), SCONJ (9; 0% instances), SYM (4; 0% instances)

498 (8%) VERB nodes are leaves.

313 (5%) VERB nodes have one child.

509 (8%) VERB nodes have two children.

5176 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.
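The child-degree breakdown above (leaves, one child, two children, three or more) can be sketched as follows. The input format, a list of sentences where each token is an `(id, head, upos)` tuple, and the name `child_degree_buckets` are assumptions made for illustration.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="VERB"):
    """Count nodes of one UPOS tag by number of children: 0, 1, 2, or 3+.

    The key 3 stands for "three or more children".
    """
    buckets = Counter()
    for sent in sentences:
        # Number of dependents per head id within this sentence.
        n_children = Counter(head for _tok_id, head, _tag in sent)
        for tok_id, _head, tag in sent:
            if tag == upos:
                buckets[min(n_children[tok_id], 3)] += 1
    return buckets


# Hypothetical sentence: "Hij komt en zij kwam ."
toy_sentences = [[
    (1, 2, "PRON"), (2, 0, "VERB"), (3, 5, "CCONJ"),
    (4, 5, "PRON"), (5, 2, "VERB"), (6, 2, "PUNCT"),
]]

degree_counts = child_degree_buckets(toy_sentences)
print(dict(degree_counts))

# Consistency check on the figures reported on this page:
# the four buckets add up to the total number of VERB tokens.
reported_total = 498 + 313 + 509 + 5176
print(reported_total)
```

The final sum equals the 6496 VERB tokens reported at the top of the page, so the four buckets cover every VERB node exactly once.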

Children of VERB nodes are attached using 34 different relations: obl (4534; 18% instances), punct (4295; 17% instances), nsubj (3612; 14% instances), advmod (2440; 10% instances), obj (2072; 8% instances), mark (1008; 4% instances), aux:pass (955; 4% instances), nsubj:pass (872; 3% instances), conj (691; 3% instances), compound:prt (681; 3% instances), cc (620; 2% instances), advcl (586; 2% instances), aux (547; 2% instances), xcomp (513; 2% instances), parataxis (274; 1% instances), obl:agent (184; 1% instances), amod (178; 1% instances), ccomp (178; 1% instances), expl:pv (148; 1% instances), case (141; 1% instances), det (117; 0% instances), nmod (102; 0% instances), cop (74; 0% instances), iobj (66; 0% instances), fixed (49; 0% instances), expl (29; 0% instances), csubj (22; 0% instances), orphan (14; 0% instances), nmod:poss (13; 0% instances), flat:name (9; 0% instances), nummod (9; 0% instances), acl (4; 0% instances), appos (4; 0% instances), acl:relcl (2; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (7145; 29% instances), PUNCT (4295; 17% instances), ADV (2198; 9% instances), PROPN (2019; 8% instances), PRON (1941; 8% instances), AUX (1576; 6% instances), VERB (1571; 6% instances), ADP (1308; 5% instances), ADJ (1011; 4% instances), CCONJ (646; 3% instances), NUM (636; 3% instances), SCONJ (382; 2% instances), DET (183; 1% instances), SYM (76; 0% instances), X (52; 0% instances), INTJ (4; 0% instances)