home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Vietnamese-VTB: POS Tags: VERB

There are 2122 VERB lemmas (28%), 2122 VERB types (28%) and 10868 VERB tokens (19%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: có, đi, biết, làm, nói, ra, về, cho, đến, thấy

The 10 most frequent VERB types: có, đi, biết, làm, nói, ra, về, cho, đến, thấy

The 10 most frequent ambiguous lemmas: (VERB 547, ADV 35, PART 5, NOUN 1), đi (VERB 222, ADV 20, PART 3), ra (VERB 172, ADV 70, ADP 9), về (VERB 159, ADP 121, ADV 16), cho (ADP 263, VERB 148, ADV 11), đến (ADP 177, VERB 116, ADV 27, PART 12), thấy (VERB 98, ADV 1), vào (ADP 145, VERB 98, ADV 35), lên (VERB 92, ADV 36, ADP 9), học (VERB 90, NOUN 2)

The 10 most frequent ambiguous types: (VERB 515, ADV 33, PART 5, NOUN 1), đi (VERB 217, ADV 20, PART 3), ra (VERB 169, ADV 70, ADP 9), về (VERB 155, ADP 116, ADV 16), cho (ADP 262, VERB 146, ADV 11), đến (ADP 157, VERB 111, ADV 27, PART 12), thấy (VERB 95, ADV 1), vào (ADP 145, VERB 94, ADV 35), lên (VERB 90, ADV 36, ADP 9), học (VERB 87, NOUN 2)

Morphology

The form / lemma ratio of VERB is 1.000000 (the average of all parts of speech is 1.001997).

The 1st highest number of forms (1) was observed with the lemma “am hiểu”: am hiểu.

The 2nd highest number of forms (1) was observed with the lemma “an nghỉ”: an nghỉ.

The 3rd highest number of forms (1) was observed with the lemma “an toàn”: an toàn.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 53 different relations: root (2682; 25% instances), xcomp (1443; 13% instances), conj (1405; 13% instances), acl:subj (1155; 11% instances), compound:vmod (710; 7% instances), advcl (627; 6% instances), compound:svc (456; 4% instances), ccomp (431; 4% instances), parataxis (425; 4% instances), advcl:objective (295; 3% instances), acl (223; 2% instances), acl:tmod (213; 2% instances), compound:dir (177; 2% instances), xcomp:adj (126; 1% instances), csubj (91; 1% instances), acl:tonp (76; 1% instances), csubj:vsubj (55; 1% instances), compound (32; 0% instances), obj (29; 0% instances), case (25; 0% instances), acl:relcl (22; 0% instances), compound:atov (22; 0% instances), appos (20; 0% instances), obl (19; 0% instances), xcomp:dir (15; 0% instances), obl:tmod (13; 0% instances), appos:nmod (8; 0% instances), obl:comp (8; 0% instances), list (6; 0% instances), obl:about (6; 0% instances), compound:prt (5; 0% instances), mark:pcomp (5; 0% instances), vocative (5; 0% instances), amod (4; 0% instances), csubj:pass (4; 0% instances), discourse (3; 0% instances), fixed (3; 0% instances), nmod:poss (3; 0% instances), xcomp:vcomp (3; 0% instances), compound:redup (2; 0% instances), compound:verbnoun (2; 0% instances), nmod (2; 0% instances), nsubj:xsubj (2; 0% instances), compound:amod (1; 0% instances), compound:pron (1; 0% instances), dislocated (1; 0% instances), flat (1; 0% instances), flat:redup (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances), nsubj:nn (1; 0% instances), nsubj:pass (1; 0% instances), obl:iobj (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: VERB (5035; 46% instances), (2682; 25% instances), NOUN (2627; 24% instances), ADJ (387; 4% instances), PROPN (62; 1% instances), PRON (24; 0% instances), ADV (17; 0% instances), ADP (14; 0% instances), X (8; 0% instances), NUM (5; 0% instances), AUX (3; 0% instances), SCONJ (3; 0% instances), SYM (1; 0% instances)

1886 (17%) VERB nodes are leaves.

2155 (20%) VERB nodes have one child.

1904 (18%) VERB nodes have two children.

4923 (45%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.

Children of VERB nodes are attached using 73 different relations: punct (5349; 19% instances), obj (4190; 15% instances), nsubj (3435; 12% instances), advmod (2364; 8% instances), xcomp (1880; 7% instances), conj (1377; 5% instances), obl:tmod (978; 3% instances), mark (895; 3% instances), obl (880; 3% instances), obl:comp (780; 3% instances), advcl (741; 3% instances), cc (599; 2% instances), ccomp (520; 2% instances), advmod:neg (511; 2% instances), compound:svc (508; 2% instances), parataxis (444; 2% instances), aux (370; 1% instances), aux:pass (339; 1% instances), compound:dir (275; 1% instances), advcl:objective (266; 1% instances), advmod:adj (265; 1% instances), mark:pcomp (210; 1% instances), nsubj:pass (209; 1% instances), discourse (191; 1% instances), obl:with (152; 1% instances), obl:iobj (122; 0% instances), compound:prt (89; 0% instances), csubj (86; 0% instances), case (79; 0% instances), obl:agent (75; 0% instances), compound:verbnoun (61; 0% instances), cop (61; 0% instances), obl:about (48; 0% instances), det:pmod (41; 0% instances), compound (36; 0% instances), csubj:vsubj (34; 0% instances), iobj (34; 0% instances), amod (31; 0% instances), dislocated (31; 0% instances), nmod (24; 0% instances), nmod:poss (22; 0% instances), appos (19; 0% instances), advmod:dir (18; 0% instances), xcomp:dir (15; 0% instances), fixed (11; 0% instances), acl:subj (10; 0% instances), compound:vmod (10; 0% instances), csubj:pass (10; 0% instances), vocative (10; 0% instances), appos:nmod (8; 0% instances), nsubj:xsubj (8; 0% instances), list (7; 0% instances), nummod (7; 0% instances), xcomp:adj (7; 0% instances), csubj:asubj (6; 0% instances), acl (5; 0% instances), obl:adj (5; 0% instances), compound:atov (4; 0% instances), compound:redup (4; 0% instances), compound:z (4; 0% instances), expl (4; 0% instances), nsubj:nn (4; 0% instances), dep (3; 0% instances), det (3; 0% instances), flat:redup (3; 0% instances), clf:det (2; 0% instances), compound:amod (2; 0% instances), xcomp:vcomp (2; 0% instances), acl:relcl (1; 0% instances), acl:tmod (1; 0% instances), clf (1; 0% instances), compound:pron (1; 0% instances), flat (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (9098; 32% instances), PUNCT (5349; 19% instances), VERB (5035; 17% instances), ADV (3114; 11% instances), ADJ (1153; 4% instances), PROPN (1093; 4% instances), SCONJ (1067; 4% instances), PRON (1012; 4% instances), AUX (782; 3% instances), ADP (466; 2% instances), CCONJ (369; 1% instances), PART (122; 0% instances), X (62; 0% instances), NUM (51; 0% instances), INTJ (13; 0% instances), DET (7; 0% instances), SYM (5; 0% instances)