home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Naija-NSC: POS Tags: VERB

There are 867 VERB lemmas (19%), 1048 VERB types (20%) and 17758 VERB tokens (13%). Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: go, dey, do, get, say, come, want, know, see, tell

The 10 most frequent VERB types: go, dey, do, get, say, come, know, see, tell, wan

The 10 most frequent ambiguous lemmas: go (AUX 2210, VERB 948), dey (AUX 3124, VERB 936), do (VERB 840, AUX 34), say (VERB 715, SCONJ 1), talk (VERB 356, NOUN 12), take (VERB 316, SCONJ 26), be (AUX 1030, VERB 305), make (AUX 680, VERB 260, SCONJ 162, NOUN 1), call (VERB 200, NOUN 10), use (VERB 195, ADJ 2)

The 10 most frequent ambiguous types: go (AUX 2210, VERB 938), dey (AUX 3125, VERB 936), do (VERB 825, AUX 30), say (VERB 706, SCONJ 1), come (VERB 508, AUX 221), wan (VERB 381, NOUN 1), talk (VERB 351, NOUN 12), take (VERB 312, SCONJ 26), be (AUX 812, VERB 267), make (AUX 392, VERB 250, SCONJ 114, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.208766 (the average of all parts of speech is 1.162960).

The 1st highest number of forms (7) was observed with the lemma “be”: ‘re, am, are, be, been, is, was.

The 2nd highest number of forms (5) was observed with the lemma “tink”: think, thinking, tink, tinking, tought.

The 3rd highest number of forms (4) was observed with the lemma “come”: came, come, comes, coming.

VERB occurs with 9 features: VerbType (912; 5% instances), VerbForm (406; 2% instances), Tense (394; 2% instances), PartType (276; 2% instances), Mood (109; 1% instances), Number (68; 0% instances), Person (68; 0% instances), Voice (8; 0% instances), Aspect (5; 0% instances)

VERB occurs with 13 feature-value pairs: Aspect=Imp, Mood=Ind, Mood=Opt, Number=Sing, PartType=Cop, Person=1, Person=3, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Part, VerbType=Cop, Voice=Pass

VERB occurs with 21 feature combinations. The most frequent feature combination is _ (16167 tokens). Examples: go, do, get, say, come, know, see, tell, wan, talk

Relations

VERB nodes are attached to their parents using 35 different relations: root (6196; 35% instances), acl:relcl (1909; 11% instances), advcl (1555; 9% instances), compound:svc (1262; 7% instances), advcl:cleft (1249; 7% instances), ccomp (1096; 6% instances), xcomp (1077; 6% instances), conj (890; 5% instances), parataxis:conj (785; 4% instances), reparandum (545; 3% instances), parataxis (442; 2% instances), parataxis:discourse (144; 1% instances), acl (141; 1% instances), compound:redup (87; 0% instances), parataxis:parenth (66; 0% instances), csubj (55; 0% instances), fixed (48; 0% instances), dislocated (31; 0% instances), compound (30; 0% instances), parataxis:dislocated (30; 0% instances), parataxis:mod (27; 0% instances), case (22; 0% instances), discourse (21; 0% instances), obj (9; 0% instances), dep (8; 0% instances), appos (6; 0% instances), flat:foreign (5; 0% instances), nsubj (5; 0% instances), mark (4; 0% instances), flat (3; 0% instances), iobj (3; 0% instances), obj:lvc (3; 0% instances), advmod (2; 0% instances), compound:prt (1; 0% instances), obl:arg (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: VERB (6871; 39% instances), (6196; 35% instances), NOUN (1834; 10% instances), ADV (969; 5% instances), PRON (936; 5% instances), AUX (331; 2% instances), ADJ (201; 1% instances), SCONJ (128; 1% instances), PROPN (109; 1% instances), X (87; 0% instances), NUM (32; 0% instances), ADP (25; 0% instances), PART (17; 0% instances), INTJ (11; 0% instances), DET (9; 0% instances), CCONJ (2; 0% instances)

601 (3%) VERB nodes are leaves.

1539 (9%) VERB nodes have one child.

2140 (12%) VERB nodes have two children.

13478 (76%) VERB nodes have three or more children.

The highest child degree of a VERB node is 18.

Children of VERB nodes are attached using 45 different relations: dep (20504; 27% instances), nsubj (12105; 16% instances), aux (9770; 13% instances), obj (7247; 10% instances), mark (4451; 6% instances), advmod (3444; 5% instances), discourse (2322; 3% instances), xcomp (1967; 3% instances), obl:arg (1771; 2% instances), advcl (1721; 2% instances), compound:svc (1251; 2% instances), iobj (1198; 2% instances), obl:mod (1161; 2% instances), ccomp (1071; 1% instances), dislocated (993; 1% instances), conj (904; 1% instances), parataxis:conj (794; 1% instances), parataxis (725; 1% instances), compound:prt (364; 0% instances), cc (341; 0% instances), cop (264; 0% instances), expl:subj (215; 0% instances), reparandum (174; 0% instances), vocative (134; 0% instances), parataxis:discourse (112; 0% instances), compound:redup (86; 0% instances), obj:lvc (53; 0% instances), fixed (32; 0% instances), parataxis:parenth (32; 0% instances), csubj (31; 0% instances), det (29; 0% instances), compound (24; 0% instances), parataxis:dislocated (24; 0% instances), advcl:cleft (22; 0% instances), obl:agent (18; 0% instances), parataxis:mod (17; 0% instances), acl:relcl (7; 0% instances), amod (6; 0% instances), appos (6; 0% instances), iobj:agent (4; 0% instances), nmod (4; 0% instances), flat:foreign (3; 0% instances), nmod:poss (3; 0% instances), nummod (3; 0% instances), case (2; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: X (20538; 27% instances), PRON (14129; 19% instances), AUX (10205; 14% instances), NOUN (9670; 13% instances), VERB (6871; 9% instances), ADV (3849; 5% instances), SCONJ (3566; 5% instances), ADP (2011; 3% instances), PROPN (1034; 1% instances), ADJ (935; 1% instances), INTJ (886; 1% instances), CCONJ (856; 1% instances), PART (610; 1% instances), NUM (169; 0% instances), DET (80; 0% instances)