This page pertains to UD version 2.

Treebank Statistics: UD_Cantonese-HK: POS Tags: VERB

There are 454 VERB lemmas (26%), 454 VERB types (26%) and 2484 VERB tokens (18%). Out of 15 observed tags, VERB ranks 2nd in number of lemmas, 2nd in number of types and 2nd in number of tokens.
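As a rough illustration of how counts like these can be derived, the sketch below tallies lemmas, types and tokens for one tag. It assumes that "types" means distinct word forms and that the token percentage is taken over all tokens in the treebank; the rows are hypothetical toy data, not taken from UD_Cantonese-HK, and `pos_summary` and `token_share` are names invented for this example.

```python
# Hypothetical toy rows (FORM, LEMMA, UPOS); not taken from UD_Cantonese-HK.
ROWS = [
    ("我", "我", "PRON"),
    ("去", "去", "VERB"),
    ("學校", "學校", "NOUN"),
    ("佢", "佢", "PRON"),
    ("係咪", "係", "VERB"),
    ("學生", "學生", "NOUN"),
    ("佢", "佢", "PRON"),
    ("係", "係", "VERB"),
]


def pos_summary(rows, upos):
    """Count distinct lemmas, distinct word forms (types) and tokens for one tag."""
    selected = [(form, lemma) for form, lemma, tag in rows if tag == upos]
    return {
        "lemmas": len({lemma for _, lemma in selected}),
        "types": len({form for form, _ in selected}),
        "tokens": len(selected),
    }


def token_share(rows, upos):
    """Percentage of all tokens that carry the given tag."""
    return 100 * sum(1 for _, _, tag in rows if tag == upos) / len(rows)


summary = pos_summary(ROWS, "VERB")
print(summary, f"{token_share(ROWS, 'VERB'):.1f}% of tokens")
```

In a real run the rows would come from the FORM, LEMMA and UPOS columns of the treebank's CoNLL-U files.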

The 10 most frequent VERB lemmas: 係、 有、 做、 去、 冇、 講、 畀、 同、 睇、 返

The 10 most frequent VERB types: 係、 有、 做、 去、 冇、 講、 畀、 同、 睇、 返

The 10 most frequent ambiguous lemmas: 係 (VERB 313, AUX 100, ADV 10, DET 1), 有 (VERB 124, AUX 10), 去 (VERB 84, SCONJ 6, ADP 1), 冇 (VERB 72, AUX 18), 畀 (VERB 54, ADP 21), 同 (VERB 43, ADP 4, CCONJ 2), 返 (VERB 36, ADV 3), 話 (VERB 29, SCONJ 4, PART 1), 倒 (VERB 27, ADV 1), 唻 (VERB 25, PART 16, SCONJ 8)

The 10 most frequent ambiguous types: 係 (VERB 312, AUX 99, DET 1), 有 (VERB 124, AUX 10), 去 (VERB 84, SCONJ 6, ADP 1), 冇 (VERB 72, AUX 18), 畀 (VERB 54, ADP 21), 同 (VERB 43, ADP 4, CCONJ 2), 返 (VERB 36, ADV 3), 話 (VERB 29, SCONJ 4, PART 1), 倒 (VERB 27, ADV 1), 嚟 (VERB 25, PART 14, SCONJ 5)

Morphology

The form / lemma ratio of VERB is 1.000000 (the average of all parts of speech is 1.001746).

The highest number of forms (2) was observed with the lemma “係”: 係, 係咪.

The second highest number of forms (2) was observed with the lemma “出唻”: 出唻, 出嚟.

The third highest number of forms (1) was observed with the lemma “Next-stop-is-Tin-Fu”: Next-stop-is-Tin-Fu.
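The form / lemma ratio and the per-lemma form lists above could be computed roughly as follows. This is a minimal sketch that assumes the ratio is the number of distinct forms divided by the number of distinct lemmas; the page does not state the exact definition. The two multi-form lemmas are the ones quoted above, the remaining pair and all repetitions are hypothetical, and the function names are invented for this example.

```python
from collections import defaultdict

# (FORM, LEMMA) pairs for VERB tokens. The forms of 係 and 出唻 are quoted from
# the page; the 去 pair and the repetitions are hypothetical filler.
PAIRS = [
    ("係", "係"),
    ("係咪", "係"),
    ("去", "去"),
    ("去", "去"),
    ("出嚟", "出唻"),
    ("出唻", "出唻"),
]


def forms_by_lemma(pairs):
    """Map each lemma to the set of distinct forms observed with it."""
    table = defaultdict(set)
    for form, lemma in pairs:
        table[lemma].add(form)
    return table


def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


table = forms_by_lemma(PAIRS)
# Rank lemmas by number of forms, breaking ties by the lemma string.
ranked = sorted(table.items(), key=lambda item: (-len(item[1]), item[0]))
for lemma, forms in ranked:
    print(lemma, len(forms), sorted(forms))
print(f"form / lemma ratio: {form_lemma_ratio(PAIRS):.6f}")
```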

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 25 different relations: root (784; 32% instances), ccomp (395; 16% instances), conj (371; 15% instances), advcl (206; 8% instances), acl (161; 6% instances), xcomp (128; 5% instances), parataxis (118; 5% instances), compound:vv (84; 3% instances), advcl:coverb (65; 3% instances), compound:dir (60; 2% instances), reparandum (57; 2% instances), csubj (11; 0% instances), obj (8; 0% instances), case (6; 0% instances), nmod (6; 0% instances), discourse:sp (5; 0% instances), obl (5; 0% instances), discourse (3; 0% instances), nsubj (3; 0% instances), amod (2; 0% instances), appos (2; 0% instances), compound (1; 0% instances), compound:ext (1; 0% instances), compound:vo (1; 0% instances), dislocated (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: VERB (1411; 57% instances), no parent, i.e. the node is the root (784; 32% instances), NOUN (188; 8% instances), ADJ (53; 2% instances), AUX (14; 1% instances), ADV (13; 1% instances), PRON (11; 0% instances), PROPN (5; 0% instances), SCONJ (2; 0% instances), ADP (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)
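The two distributions above (incoming relation and parent part of speech) can be tallied in one pass over each sentence. The sketch below is one possible way to do it, on hypothetical toy sentences; `attachment_stats` is a name invented here. A node whose HEAD is 0 has no parent word, so its parent tag is recorded as the empty string, which is why 784 parents in the list above match the 784 `root` relations.

```python
from collections import Counter

# Hypothetical toy sentences; each node is (ID, UPOS, HEAD, DEPREL).
SENTENCES = [
    [(1, "PRON", 2, "nsubj"), (2, "VERB", 0, "root"),
     (3, "VERB", 2, "ccomp"), (4, "NOUN", 3, "obj")],
    [(1, "NOUN", 0, "root"), (2, "VERB", 1, "acl")],
]


def attachment_stats(sentences, upos):
    """Count incoming relations and parent tags for all nodes with the given tag."""
    relations = Counter()
    parent_tags = Counter()
    for sentence in sentences:
        tag_of = {node_id: tag for node_id, tag, _, _ in sentence}
        for node_id, tag, head, deprel in sentence:
            if tag != upos:
                continue
            relations[deprel] += 1
            # HEAD 0 is the artificial root and has no tag of its own.
            parent_tags[tag_of.get(head, "")] += 1
    return relations, parent_tags


relations, parent_tags = attachment_stats(SENTENCES, "VERB")
total = sum(relations.values())
for deprel, count in relations.most_common():
    print(f"{deprel} ({count}; {round(100 * count / total)}% instances)")

# Sanity check against the figures quoted on this page: 784 of 2484 VERB nodes.
root_percent = round(100 * 784 / 2484)
```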

304 (12%) VERB nodes are leaves.

413 (17%) VERB nodes have one child.

398 (16%) VERB nodes have two children.

1369 (55%) VERB nodes have three or more children.

The highest child degree of a VERB node is 15.
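The four child-count figures above sum to 2484, the total number of VERB tokens. A minimal sketch of how such buckets could be produced is shown below; the sentences are hypothetical toy data and `child_degree_buckets` is a name invented for this example.

```python
from collections import Counter

# Hypothetical toy sentences; each node is (ID, HEAD, UPOS).
SENTENCES = [
    [(1, 2, "PRON"), (2, 0, "VERB"), (3, 2, "NOUN"), (4, 2, "PUNCT")],
    [(1, 0, "VERB"), (2, 1, "VERB"), (3, 2, "NOUN")],
]


def child_degree_buckets(sentences, upos):
    """Bucket nodes with the given tag by number of children: 0, 1, 2, or 3+."""
    buckets = Counter()
    max_degree = 0
    for sentence in sentences:
        # Number of children per node ID, counted from the HEAD column.
        n_children = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag != upos:
                continue
            degree = n_children[node_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1  # key 3 stands for "three or more"
    return buckets, max_degree


buckets, max_degree = child_degree_buckets(SENTENCES, "VERB")
labels = {0: "leaves", 1: "one child", 2: "two children", 3: "three or more children"}
for key, label in labels.items():
    print(f"{buckets[key]} VERB nodes: {label}")
print("highest child degree:", max_degree)

# The page's own figures add up to the VERB token count.
page_total = 304 + 413 + 398 + 1369
```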

Children of VERB nodes are attached using 46 different relations: punct (1584; 19% instances), advmod (1122; 14% instances), obj (885; 11% instances), nsubj (791; 10% instances), discourse:sp (788; 10% instances), ccomp (428; 5% instances), aux (387; 5% instances), conj (382; 5% instances), discourse (266; 3% instances), advcl (208; 3% instances), obl (164; 2% instances), xcomp (145; 2% instances), mark (138; 2% instances), obl:tmod (116; 1% instances), compound:vv (114; 1% instances), parataxis (111; 1% instances), reparandum (82; 1% instances), advcl:coverb (62; 1% instances), compound:dir (60; 1% instances), mark:rel (58; 1% instances), vocative (50; 1% instances), cc (36; 0% instances), obj:periph (28; 0% instances), compound:ext (15; 0% instances), compound:vo (14; 0% instances), iobj (14; 0% instances), case (13; 0% instances), compound:quant (13; 0% instances), dislocated (10; 0% instances), compound (9; 0% instances), csubj (9; 0% instances), nmod (8; 0% instances), nsubj:periph (7; 0% instances), det (6; 0% instances), advmod:df (5; 0% instances), amod (5; 0% instances), cop (3; 0% instances), obl:agent (3; 0% instances), acl (2; 0% instances), appos (2; 0% instances), case:loc (2; 0% instances), nummod (2; 0% instances), obl:patient (2; 0% instances), clf (1; 0% instances), mark:adv (1; 0% instances), nsubj:pass (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: PUNCT (1584; 19% instances), VERB (1411; 17% instances), ADV (1193; 15% instances), NOUN (1174; 14% instances), PART (898; 11% instances), PRON (887; 11% instances), AUX (404; 5% instances), INTJ (145; 2% instances), ADJ (127; 2% instances), SCONJ (99; 1% instances), PROPN (82; 1% instances), CCONJ (74; 1% instances), ADP (53; 1% instances), DET (11; 0% instances), NUM (10; 0% instances)