home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: VERB

There are 4605 VERB lemmas (21%), 4668 VERB types (21%) and 18217 VERB tokens (15%). Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: 有、 在、 為、 於、 是、 由、 成、 到、 位、 開始

The 10 most frequent VERB types: 有、 在、 於、 為、 是、 由、 成、 到、 位、 開始

The 10 most frequent ambiguous lemmas: 在 (ADP 1061, VERB 555, ADV 28), 為 (AUX 580, VERB 524, ADP 131, PROPN 1), 於 (VERB 497, ADP 398), 是 (AUX 1064, VERB 382), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 174, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 開始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 123, NOUN 1)

The 10 most frequent ambiguous types: 在 (ADP 1061, VERB 555, ADV 28), 於 (VERB 497, ADP 398), 為 (AUX 566, VERB 495, ADP 131, PROPN 1), 是 (AUX 885, VERB 321), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 173, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 開始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 121, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.013681 (the average of all parts of speech is 1.004732).

The 1st highest number of forms (20) was observed with the lemma “為”: 一分為二, 亦為, 以為, 任人為賢, 列為, 化整為零, 合而為一, 封為, 混為一談, 為, 為人, 為期, 無為, 無能為力, 爲, 相依為命, 行為, 見義勇為, 身為, 選為.

The 2nd highest number of forms (12) was observed with the lemma “是”: 不是, 也是, 像是, 則是, 卻是, 又是, 就是, 是, 是否, 正是, 而是, 都是.

The 3rd highest number of forms (3) was observed with the lemma “有”: 有, 未有, 沒有.

VERB occurs with 2 features: Voice (492; 3% instances), Polarity (215; 1% instances)

VERB occurs with 2 feature-value pairs: Polarity=Neg, Voice=Cau

VERB occurs with 3 feature combinations. The most frequent feature combination is _ (17510 tokens). Examples: 有、 在、 於、 為、 是、 由、 成、 到、 位、 開始

Relations

VERB nodes are attached to their parents using 23 different relations: root (4112; 23% instances), advcl (3548; 19% instances), ccomp (1769; 10% instances), parataxis (1633; 9% instances), xcomp (1621; 9% instances), acl:relcl (1603; 9% instances), mark (1384; 8% instances), compound (1245; 7% instances), conj (334; 2% instances), amod (305; 2% instances), csubj (305; 2% instances), acl (174; 1% instances), obj (46; 0% instances), dislocated (39; 0% instances), appos (28; 0% instances), nmod (28; 0% instances), nsubj (20; 0% instances), obl (9; 0% instances), csubj:pass (6; 0% instances), nmod:tmod (4; 0% instances), discourse (2; 0% instances), case (1; 0% instances), reparandum (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: VERB (10004; 55% instances), (4112; 23% instances), NOUN (2100; 12% instances), PART (1575; 9% instances), ADJ (215; 1% instances), PROPN (92; 1% instances), ADP (52; 0% instances), NUM (24; 0% instances), X (17; 0% instances), ADV (16; 0% instances), PRON (6; 0% instances), AUX (4; 0% instances)

3221 (18%) VERB nodes are leaves.

2487 (14%) VERB nodes have one child.

3098 (17%) VERB nodes have two children.

9411 (52%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 38 different relations: punct (9173; 18% instances), obj (7699; 15% instances), nsubj (7363; 14% instances), mark (4323; 8% instances), advcl (3773; 7% instances), obl (3115; 6% instances), advmod (2464; 5% instances), ccomp (2075; 4% instances), xcomp (1861; 4% instances), aux (1789; 3% instances), parataxis (1733; 3% instances), mark:rel (1661; 3% instances), nmod:tmod (1494; 3% instances), case (668; 1% instances), aux:pass (425; 1% instances), conj (336; 1% instances), nsubj:pass (277; 1% instances), csubj (263; 1% instances), obl:patient (194; 0% instances), cc (184; 0% instances), discourse (143; 0% instances), iobj (78; 0% instances), obl:agent (70; 0% instances), nmod (27; 0% instances), compound:ext (24; 0% instances), mark:adv (22; 0% instances), appos (20; 0% instances), cop (9; 0% instances), discourse:sp (7; 0% instances), amod (6; 0% instances), csubj:pass (6; 0% instances), compound (5; 0% instances), nummod (5; 0% instances), dislocated (4; 0% instances), det (3; 0% instances), nsubj:outer (3; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (14745; 29% instances), VERB (10004; 19% instances), PUNCT (9173; 18% instances), SCONJ (4168; 8% instances), PART (2732; 5% instances), PROPN (2444; 5% instances), ADV (2360; 5% instances), AUX (2228; 4% instances), ADP (1194; 2% instances), PRON (1063; 2% instances), ADJ (632; 1% instances), CCONJ (182; 0% instances), NUM (171; 0% instances), X (135; 0% instances), DET (67; 0% instances), SYM (6; 0% instances)