home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: VERB

There are 111 VERB lemmas (24%), 149 VERB types (26%) and 378 VERB tokens (20%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent VERB lemmas: _、 是、 有、 唱、 來、 做、 唱歌、 到、 沒有、 說

The 10 most frequent VERB types: 是、 有、 唱、 沒有、 做、 走、 來、 唱歌、 看、 說

The 10 most frequent ambiguous lemmas: _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1), 有 (VERB 21, AUX 1), 到 (VERB 6, ADP 2), 沒有 (VERB 6, AUX 1), 對 (VERB 4, ADJ 3), 在 (ADP 12, VERB 3), 了 (PART 19, AUX 4, VERB 2), 幫 (VERB 2, ADP 1), 用 (VERB 2, ADP 1), 像 (ADP 1, ADV 1, VERB 1)

The 10 most frequent ambiguous types: 有 (VERB 23, AUX 1), 沒有 (VERB 12, AUX 2), 來 (VERB 10, SCONJ 1), 到 (VERB 6, ADP 3), 給 (VERB 6, ADP 4), 去 (VERB 5, ADP 2), 對 (VERB 5, ADJ 3), 在 (ADP 14, VERB 4, ADV 1), 要 (AUX 11, VERB 4), 喜歡 (AUX 3, VERB 3)

Morphology

The form / lemma ratio of VERB is 1.342342 (the average of all parts of speech is 1.221258).

The 1st highest number of forms (60) was observed with the lemma “_”: 交換, 來, 做, 出來, 出現, 去, 取, 吃飯, 問問, 喜歡, 回, 回來, 在, 夠, 好, 對, 屬於, 弄, 扭, 找, 拿, 掛, 換, 搗蛋, 收, 收拾, 放工, 是, 有, 沒, 沒有, 沒用, 洗, 煮, 煮菜, 獎勵, 玩, 理, 看, 看看, 知, 知道, 等, 算, 管, 給, 繼續, 行, 要, 記, 說, 讀書, 讓, 走, 趕快, 跌, 過, 閃, 餘下, 默書.

The 2nd highest number of forms (1) was observed with the lemma “下”: 下.

The 3rd highest number of forms (1) was observed with the lemma “了”: 了.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 15 different relations: root (157; 42% instances), conj (49; 13% instances), advcl (33; 9% instances), parataxis (28; 7% instances), cop (25; 7% instances), ccomp (24; 6% instances), acl (20; 5% instances), compound:vv (15; 4% instances), xcomp (14; 4% instances), csubj (4; 1% instances), compound:dir (3; 1% instances), case (2; 1% instances), dislocated (2; 1% instances), goeswith (1; 0% instances), nsubj (1; 0% instances)

Parents of VERB nodes belong to 6 different parts of speech: (157; 42% instances), VERB (157; 42% instances), NOUN (40; 11% instances), ADJ (14; 4% instances), PRON (8; 2% instances), PROPN (2; 1% instances)

66 (17%) VERB nodes are leaves.

52 (14%) VERB nodes have one child.

50 (13%) VERB nodes have two children.

210 (56%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 30 different relations: punct (227; 21% instances), advmod (171; 16% instances), obj (145; 13% instances), nsubj (107; 10% instances), discourse:sp (82; 7% instances), aux (65; 6% instances), conj (43; 4% instances), obl (39; 4% instances), advcl (36; 3% instances), ccomp (27; 2% instances), parataxis (26; 2% instances), compound:vv (23; 2% instances), xcomp (21; 2% instances), dislocated (16; 1% instances), obl:tmod (14; 1% instances), mark:rel (11; 1% instances), vocative (9; 1% instances), discourse (5; 0% instances), mark (5; 0% instances), csubj (4; 0% instances), iobj (4; 0% instances), compound:dir (3; 0% instances), compound:vo (3; 0% instances), advmod:df (2; 0% instances), cc (2; 0% instances), compound:ext (2; 0% instances), cop (2; 0% instances), dep (1; 0% instances), goeswith (1; 0% instances), obl:patient (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: PUNCT (227; 21% instances), NOUN (190; 17% instances), ADV (173; 16% instances), VERB (157; 14% instances), PRON (134; 12% instances), PART (95; 9% instances), AUX (66; 6% instances), PROPN (21; 2% instances), ADJ (20; 2% instances), ADP (4; 0% instances), INTJ (3; 0% instances), CCONJ (2; 0% instances), SYM (2; 0% instances), NUM (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)