home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: VERB

There are 481 VERB lemmas (28%), 489 VERB types (28%) and 1872 VERB tokens (19%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent VERB lemmas: 有、 是、 說、 做、 請、 看、 去、 為、 議事、 進行

The 10 most frequent VERB types: 有、 是、 說、 沒有、 做、 請、 看、 去、 議事、 進行

The 10 most frequent ambiguous lemmas: 有 (VERB 122, AUX 10, ADJ 1, ADV 1), 是 (AUX 108, VERB 77), 去 (VERB 27, ADP 3), 為 (VERB 26, ADP 4, AUX 3), 議事 (VERB 25, NOUN 5), 選舉 (NOUN 24, VERB 24), 到 (VERB 23, ADP 21), 來 (VERB 20, SCONJ 4, ADP 1, PART 1), 宣誓 (VERB 18, NOUN 14), 發言 (VERB 14, NOUN 2)

The 10 most frequent ambiguous types: 有 (VERB 88, AUX 6, ADJ 1, ADV 1), 是 (AUX 108, VERB 77), 沒有 (VERB 34, AUX 4, ADV 1), 去 (VERB 27, ADP 3), 議事 (VERB 25, NOUN 5), 選舉 (NOUN 24, VERB 24), 到 (VERB 23, ADP 21), 來 (VERB 20, SCONJ 4, ADP 1, PART 1), 宣誓 (VERB 18, NOUN 14), 發言 (VERB 14, NOUN 2)

Morphology

The form / lemma ratio of VERB is 1.016632 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (6) was observed with the lemma “為”: 以為, 分為, 成為, 為, 為難, 認為.

The 2nd highest number of forms (3) was observed with the lemma “用”: 不用, 沒用, 用.

The 3rd highest number of forms (2) was observed with the lemma “有”: 有, 沒有.

VERB occurs with 1 features: Polarity (41; 2% instances)

VERB occurs with 1 feature-value pairs: Polarity=Neg

VERB occurs with 2 feature combinations. The most frequent feature combination is _ (1831 tokens). Examples: 有、 是、 說、 做、 請、 看、 去、 議事、 進行、 選舉

Relations

VERB nodes are attached to their parents using 20 different relations: root (770; 41% instances), ccomp (224; 12% instances), advcl (209; 11% instances), conj (183; 10% instances), acl (168; 9% instances), xcomp (132; 7% instances), parataxis (65; 3% instances), compound:vv (50; 3% instances), compound:dir (31; 2% instances), csubj (13; 1% instances), compound (5; 0% instances), nsubj (5; 0% instances), case (4; 0% instances), obj (4; 0% instances), obl (3; 0% instances), reparandum (2; 0% instances), appos (1; 0% instances), discourse:sp (1; 0% instances), nmod (1; 0% instances), obj:periph (1; 0% instances)

Parents of VERB nodes belong to 11 different parts of speech: VERB (843; 45% instances), (770; 41% instances), NOUN (197; 11% instances), ADJ (35; 2% instances), AUX (7; 0% instances), PRON (6; 0% instances), ADV (4; 0% instances), PROPN (4; 0% instances), ADP (3; 0% instances), DET (2; 0% instances), SCONJ (1; 0% instances)

229 (12%) VERB nodes are leaves.

235 (13%) VERB nodes have one child.

300 (16%) VERB nodes have two children.

1108 (59%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 41 different relations: punct (1115; 19% instances), advmod (928; 16% instances), obj (776; 14% instances), nsubj (709; 12% instances), aux (325; 6% instances), ccomp (248; 4% instances), discourse:sp (216; 4% instances), obl (213; 4% instances), advcl (201; 4% instances), conj (187; 3% instances), xcomp (148; 3% instances), mark (108; 2% instances), obl:tmod (88; 2% instances), compound:vv (78; 1% instances), mark:rel (76; 1% instances), parataxis (68; 1% instances), vocative (44; 1% instances), compound:dir (33; 1% instances), cc (32; 1% instances), obj:periph (27; 0% instances), discourse (20; 0% instances), case (11; 0% instances), compound:ext (10; 0% instances), csubj (8; 0% instances), obl:patient (8; 0% instances), iobj (7; 0% instances), compound:vo (5; 0% instances), acl (4; 0% instances), compound (4; 0% instances), dislocated (4; 0% instances), cop (3; 0% instances), det (3; 0% instances), nummod (3; 0% instances), appos (2; 0% instances), aux:pass (2; 0% instances), nmod (2; 0% instances), reparandum (2; 0% instances), amod (1; 0% instances), case:loc (1; 0% instances), nsubj:pass (1; 0% instances), obl:agent (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: PUNCT (1115; 19% instances), NOUN (1111; 19% instances), ADV (913; 16% instances), VERB (843; 15% instances), PRON (719; 13% instances), AUX (336; 6% instances), PART (312; 5% instances), ADJ (87; 2% instances), PROPN (83; 1% instances), SCONJ (63; 1% instances), ADP (61; 1% instances), CCONJ (53; 1% instances), INTJ (13; 0% instances), DET (7; 0% instances), NUM (6; 0% instances)