
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: VERB

There are 4603 VERB lemmas (21%), 4667 VERB types (21%) and 18219 VERB tokens (15%). Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 2 in number of tokens.
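The three counts above (lemmas, types, tokens) can be reproduced from any CoNLL-U file. The following is a minimal sketch, assuming that "types" means distinct word forms (FORM column) and "lemmas" means distinct values of the LEMMA column; the sample sentences and the function name `pos_counts` are hypothetical and are not taken from UD_Chinese-GSD.

```python
# Hypothetical toy data in CoNLL-U format (10 tab-separated columns per token).
SAMPLE = "\n".join([
    "# text = toy example 1",
    "1\t他\t他\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\t有\t有\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t書\t書\tNOUN\t_\t_\t2\tobj\t_\t_",
    "",
    "# text = toy example 2",
    "1\t她\t她\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\t沒有\t有\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t書\t書\tNOUN\t_\t_\t2\tobj\t_\t_",
])

def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens

verb_stats = pos_counts(SAMPLE, "VERB")
print(verb_stats)  # (1, 2, 2)
```

In the toy data the lemma 有 appears with two forms (有 and 沒有), so one lemma yields two types and two tokens.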

The 10 most frequent VERB lemmas: 有、 為、 在、 於、 是、 由、 成、 到、 位、 開始

The 10 most frequent VERB types: 有、 在、 於、 為、 是、 由、 成、 到、 位、 開始

The 10 most frequent ambiguous lemmas: 為 (VERB 609, AUX 581, ADP 131, PROPN 1), 在 (ADP 1060, VERB 556, ADV 28), 於 (VERB 497, ADP 398), 是 (AUX 1062, VERB 384), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 174, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 開始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 123, NOUN 1)

The 10 most frequent ambiguous types: 在 (ADP 1060, VERB 556, ADV 28), 於 (VERB 497, ADP 398), 為 (AUX 566, VERB 495, ADP 131, PROPN 1), 是 (AUX 884, VERB 322), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 173, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 開始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 121, NOUN 1)
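An item is listed as ambiguous when it occurs with VERB and with at least one other tag. A minimal sketch of that tally follows, assuming the input is a list of (form or lemma, UPOS) pairs; the pairs and the helper name `ambiguous_items` are hypothetical illustrations, not data from the treebank.

```python
from collections import Counter, defaultdict

# Hypothetical (item, UPOS) pairs; a real run would read them from a CoNLL-U file.
TOKENS = [("在", "ADP"), ("在", "VERB"), ("在", "ADP"), ("用", "VERB"), ("書", "NOUN")]

def ambiguous_items(pairs, upos):
    """Map each item seen with `upos` and at least one other tag to its tag counts."""
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    return {
        item: tags.most_common()  # tags ordered by descending frequency
        for item, tags in by_item.items()
        if upos in tags and len(tags) > 1
    }

ambiguous = ambiguous_items(TOKENS, "VERB")
print(ambiguous)  # {'在': [('ADP', 2), ('VERB', 1)]}
```

Here 用 is excluded because it occurs only as VERB in the toy data, and 書 is excluded because it never occurs as VERB.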

Morphology

The form / lemma ratio of VERB is 1.013904 (the average of all parts of speech is 1.004819).
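The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given earlier on this page. The short check below assumes that this is how the figure is derived; the variable names are illustrative.

```python
verb_types = 4667   # distinct VERB word forms reported on this page
verb_lemmas = 4603  # distinct VERB lemmas reported on this page

ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 1.013904
```

A ratio this close to 1 means that almost every VERB lemma appears with a single form.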

The 1st highest number of forms (21) was observed with the lemma “為”: 一分為二, 亦為, 以為, 任人為賢, 列為, 化整為零, 合而為一, 封為, 混為一談, 為, 為人, 為期, 無為, 無能為力, 爲, 相依為命, 行為, 見義勇為, 認為, 身為, 選為.

The 2nd highest number of forms (12) was observed with the lemma “是”: 不是, 也是, 像是, 則是, 卻是, 又是, 就是, 是, 是否, 正是, 而是, 都是.

The 3rd highest number of forms (3) was observed with the lemma “有”: 有, 未有, 沒有.

VERB occurs with 2 features: Voice (492; 3% instances), Polarity (215; 1% instances)

VERB occurs with 2 feature-value pairs: Polarity=Neg, Voice=Cau

VERB occurs with 3 feature combinations. The most frequent feature combination is _ (17512 tokens). Examples: 有、 在、 於、 為、 是、 由、 成、 到、 位、 開始
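The feature counts above add up to the total number of VERB tokens, which suggests that the three combinations are the empty set, Voice=Cau alone and Polarity=Neg alone. The check below is a sketch under that assumption; the page itself does not name the two non-empty combinations.

```python
verb_tokens = 18219   # total VERB tokens
with_voice = 492      # tokens carrying the Voice feature
with_polarity = 215   # tokens carrying the Polarity feature
no_features = 17512   # tokens with the empty combination "_"

# If no token carried both features, the three groups partition all VERB tokens.
partition_holds = with_voice + with_polarity + no_features == verb_tokens

def share(count):
    """Percentage of VERB tokens, rounded to a whole number as on this page."""
    return round(100 * count / verb_tokens)

print(partition_holds, share(with_voice), share(with_polarity))  # True 3 1
```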

Relations

VERB nodes are attached to their parents using 22 different relations: root (4112; 23% instances), advcl (3553; 20% instances), ccomp (1774; 10% instances), parataxis (1632; 9% instances), xcomp (1621; 9% instances), acl:relcl (1604; 9% instances), mark (1385; 8% instances), compound (1245; 7% instances), conj (328; 2% instances), csubj (309; 2% instances), amod (305; 2% instances), acl (169; 1% instances), obj (46; 0% instances), dislocated (39; 0% instances), appos (28; 0% instances), nmod (28; 0% instances), nsubj (20; 0% instances), obl (8; 0% instances), csubj:pass (6; 0% instances), nmod:tmod (4; 0% instances), discourse (2; 0% instances), reparandum (1; 0% instances)

Parents of VERB nodes belong to 13 different categories (12 parts of speech plus ROOT, which stands for root nodes that have no parent word): VERB (10010; 55% instances), ROOT (4112; 23% instances), NOUN (2094; 11% instances), PART (1574; 9% instances), ADJ (216; 1% instances), PROPN (91; 0% instances), ADP (52; 0% instances), NUM (24; 0% instances), ADV (16; 0% instances), X (16; 0% instances), PRON (7; 0% instances), AUX (4; 0% instances), DET (3; 0% instances)

3222 (18%) VERB nodes are leaves.

2485 (14%) VERB nodes have one child.

3106 (17%) VERB nodes have two children.

9406 (52%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.
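The four child-degree groups above cover every VERB token exactly once. The following sketch recomputes the percentages from the reported counts; the dictionary keys and variable names are illustrative.

```python
# Reported number of VERB nodes by number of children.
degree_buckets = {"0": 3222, "1": 2485, "2": 3106, "3+": 9406}

total = sum(degree_buckets.values())
percent = {bucket: round(100 * count / total) for bucket, count in degree_buckets.items()}

print(total)    # 18219
print(percent)  # {'0': 18, '1': 14, '2': 17, '3+': 52}
```

The rounded percentages sum to 101 rather than 100, which is an effect of rounding each group separately.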

Children of VERB nodes are attached using 36 different relations: punct (9169; 18% instances), obj (7698; 15% instances), nsubj (7438; 14% instances), mark (4324; 8% instances), advcl (3777; 7% instances), obl (3113; 6% instances), advmod (2464; 5% instances), ccomp (2080; 4% instances), xcomp (1861; 4% instances), aux (1788; 3% instances), parataxis (1734; 3% instances), mark:rel (1662; 3% instances), nmod:tmod (1493; 3% instances), case (664; 1% instances), aux:pass (424; 1% instances), conj (329; 1% instances), nsubj:pass (275; 1% instances), csubj (267; 1% instances), obl:patient (194; 0% instances), cc (184; 0% instances), discourse (143; 0% instances), iobj (78; 0% instances), nmod (27; 0% instances), compound:ext (24; 0% instances), mark:adv (22; 0% instances), appos (20; 0% instances), cop (9; 0% instances), discourse:sp (7; 0% instances), amod (6; 0% instances), csubj:pass (6; 0% instances), compound (5; 0% instances), nummod (5; 0% instances), dislocated (4; 0% instances), det (3; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (14743; 29% instances), VERB (10010; 20% instances), PUNCT (9169; 18% instances), SCONJ (4169; 8% instances), PART (2729; 5% instances), PROPN (2445; 5% instances), ADV (2360; 5% instances), AUX (2226; 4% instances), ADP (1194; 2% instances), PRON (1063; 2% instances), ADJ (630; 1% instances), CCONJ (182; 0% instances), NUM (171; 0% instances), X (135; 0% instances), DET (67; 0% instances), SYM (6; 0% instances)