home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: VERB

There are 2011 VERB lemmas (13%), 3771 VERB types (20%) and 19574 VERB tokens (10%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: have, go, know, make, do, get, say, take, be, see

The 10 most frequent VERB types: have, know, get, do, make, said, see, think, go, had

The 10 most frequent ambiguous lemmas: have (AUX 944, VERB 776), go (VERB 449, NOUN 6), do (AUX 709, VERB 400, NOUN 4, PROPN 3), get (VERB 390, AUX 23), take (VERB 326, NOUN 3), be (AUX 6070, VERB 325), use (VERB 269, NOUN 38), look (VERB 183, NOUN 9), call (VERB 147, NOUN 10), try (VERB 145, NOUN 3, INTJ 1)

The 10 most frequent ambiguous types: have (VERB 460, AUX 328), get (VERB 197, AUX 15), do (AUX 333, VERB 211, NOUN 3, PROPN 3, PART 1), said (VERB 198, ADJ 1), go (VERB 159, NOUN 3), had (AUX 180, VERB 149), take (VERB 136, NOUN 3), has (AUX 244, VERB 127), got (VERB 95, AUX 5), are (AUX 770, VERB 99)

Morphology

The form / lemma ratio of VERB is 1.875186 (the average of all parts of speech is 1.229167).

The 1st highest number of forms (14) was observed with the lemma “be”: ‘m, ‘re, ‘s, ai, are, be, been, being, is, s, was, were, where, ’s.

The 2nd highest number of forms (6) was observed with the lemma “do”: did, do, does, doing, done, to.

The 3rd highest number of forms (6) was observed with the lemma “go”: go, goes, going, gon, gone, went.

VERB occurs with 9 features: VerbForm (19572; 100% instances), Tense (11531; 59% instances), Person (8201; 42% instances), Mood (8109; 41% instances), Number (7217; 37% instances), Voice (2726; 14% instances), Typo (52; 0% instances), Polarity (13; 0% instances), Abbr (9; 0% instances)

VERB occurs with 18 feature-value pairs: Abbr=Yes, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 43 feature combinations. The most frequent feature combination is VerbForm=Inf (4808 tokens). Examples: have, do, make, get, know, see, go, take, say, find

Relations

VERB nodes are attached to their parents using 27 different relations: root (6902; 35% instances), advcl (2614; 13% instances), conj (2276; 12% instances), xcomp (1756; 9% instances), acl:relcl (1509; 8% instances), acl (1440; 7% instances), amod (831; 4% instances), ccomp (812; 4% instances), parataxis (729; 4% instances), csubj (235; 1% instances), advcl:relcl (174; 1% instances), case (98; 1% instances), reparandum (64; 0% instances), appos (35; 0% instances), dep (22; 0% instances), compound (15; 0% instances), csubj:pass (14; 0% instances), discourse (13; 0% instances), obl (9; 0% instances), orphan (6; 0% instances), csubj:outer (5; 0% instances), nmod (5; 0% instances), dislocated (4; 0% instances), fixed (2; 0% instances), mark (2; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances)

Parents of VERB nodes belong to 15 different parts of speech: VERB (7173; 37% instances), (6902; 35% instances), NOUN (3535; 18% instances), ADJ (818; 4% instances), PROPN (386; 2% instances), PRON (344; 2% instances), ADV (282; 1% instances), NUM (43; 0% instances), DET (39; 0% instances), AUX (32; 0% instances), INTJ (9; 0% instances), X (5; 0% instances), ADP (4; 0% instances), PART (1; 0% instances), SYM (1; 0% instances)

888 (5%) VERB nodes are leaves.

1754 (9%) VERB nodes have one child.

3483 (18%) VERB nodes have two children.

13449 (69%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 45 different relations: punct (10861; 16% instances), nsubj (9933; 14% instances), obj (8336; 12% instances), obl (7225; 11% instances), advmod (5515; 8% instances), mark (4600; 7% instances), aux (3925; 6% instances), advcl (2569; 4% instances), cc (2462; 4% instances), xcomp (2401; 3% instances), conj (2264; 3% instances), aux:pass (1395; 2% instances), nsubj:pass (1263; 2% instances), ccomp (1178; 2% instances), compound:prt (719; 1% instances), parataxis (696; 1% instances), discourse (544; 1% instances), expl (377; 1% instances), dep (361; 1% instances), obl:tmod (348; 1% instances), obl:agent (341; 0% instances), iobj (326; 0% instances), reparandum (183; 0% instances), cop (129; 0% instances), nsubj:outer (127; 0% instances), obl:npmod (124; 0% instances), compound (122; 0% instances), csubj (85; 0% instances), vocative (65; 0% instances), fixed (41; 0% instances), advcl:relcl (39; 0% instances), dislocated (25; 0% instances), case (19; 0% instances), csubj:pass (17; 0% instances), cc:preconj (14; 0% instances), det (10; 0% instances), amod (9; 0% instances), appos (7; 0% instances), csubj:outer (6; 0% instances), acl (3; 0% instances), acl:relcl (2; 0% instances), list (2; 0% instances), nmod (2; 0% instances), orphan (2; 0% instances), goeswith (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (15867; 23% instances), PUNCT (10861; 16% instances), PRON (9721; 14% instances), VERB (7173; 10% instances), AUX (5565; 8% instances), ADV (4956; 7% instances), PART (3229; 5% instances), PROPN (2716; 4% instances), CCONJ (2472; 4% instances), SCONJ (2237; 3% instances), ADJ (1277; 2% instances), ADP (915; 1% instances), NUM (801; 1% instances), INTJ (579; 1% instances), DET (215; 0% instances), X (66; 0% instances), SYM (23; 0% instances)