home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: VERB

There are 4575 VERB lemmas (21%), 4638 VERB types (21%) and 18219 VERB tokens (15%). Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: 有、 为、 在、 于、 是、 由、 成、 到、 位、 开始

The 10 most frequent VERB types: 有、 在、 于、 为、 是、 由、 成、 到、 位、 开始

The 10 most frequent ambiguous lemmas: 为 (VERB 609, AUX 581, ADP 133, PROPN 1), 在 (ADP 1060, VERB 556, ADV 28), 于 (VERB 497, ADP 399, PROPN 3), 是 (AUX 1062, VERB 384), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 174, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 开始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 123, NOUN 1)

The 10 most frequent ambiguous types: 在 (ADP 1060, VERB 556, ADV 28), 于 (VERB 497, ADP 399, PROPN 3), 为 (AUX 568, VERB 496, ADP 133, PROPN 1), 是 (AUX 884, VERB 322), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 173, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 开始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 121, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.013770 (the average of all parts of speech is 1.004660).

The 1st highest number of forms (20) was observed with the lemma “为”: 一分为二, 为, 为人, 为期, 亦为, 以为, 任人为贤, 列为, 化整为零, 合而为一, 封为, 无为, 无能为力, 混为一谈, 相依为命, 行为, 见义勇为, 认为, 身为, 选为.

The 2nd highest number of forms (12) was observed with the lemma “是”: 不是, 也是, 像是, 则是, 却是, 又是, 就是, 是, 是否, 正是, 而是, 都是.

The 3rd highest number of forms (3) was observed with the lemma “有”: 有, 未有, 没有.

VERB occurs with 2 features: Voice (492; 3% instances), Polarity (215; 1% instances)

VERB occurs with 2 feature-value pairs: Polarity=Neg, Voice=Cau

VERB occurs with 3 feature combinations. The most frequent feature combination is _ (17512 tokens). Examples: 有、 在、 于、 为、 是、 由、 成、 到、 位、 开始

Relations

VERB nodes are attached to their parents using 22 different relations: root (4113; 23% instances), advcl (3553; 20% instances), ccomp (1773; 10% instances), parataxis (1632; 9% instances), xcomp (1621; 9% instances), acl:relcl (1604; 9% instances), mark (1385; 8% instances), compound (1245; 7% instances), conj (329; 2% instances), csubj (309; 2% instances), amod (305; 2% instances), acl (169; 1% instances), obj (46; 0% instances), dislocated (39; 0% instances), nmod (28; 0% instances), appos (27; 0% instances), nsubj (20; 0% instances), obl (8; 0% instances), csubj:pass (6; 0% instances), nmod:tmod (4; 0% instances), discourse (2; 0% instances), reparandum (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: VERB (10010; 55% instances), (4113; 23% instances), NOUN (2108; 12% instances), PART (1572; 9% instances), ADJ (216; 1% instances), PROPN (80; 0% instances), ADP (52; 0% instances), NUM (24; 0% instances), ADV (16; 0% instances), X (14; 0% instances), PRON (7; 0% instances), AUX (4; 0% instances), DET (3; 0% instances)

3222 (18%) VERB nodes are leaves.

2485 (14%) VERB nodes have one child.

3106 (17%) VERB nodes have two children.

9406 (52%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 36 different relations: punct (9169; 18% instances), obj (7699; 15% instances), nsubj (7438; 15% instances), mark (4324; 8% instances), advcl (3776; 7% instances), obl (3113; 6% instances), advmod (2464; 5% instances), ccomp (2080; 4% instances), xcomp (1861; 4% instances), aux (1788; 3% instances), parataxis (1734; 3% instances), mark:rel (1662; 3% instances), nmod:tmod (1493; 3% instances), case (663; 1% instances), aux:pass (424; 1% instances), conj (329; 1% instances), nsubj:pass (275; 1% instances), csubj (267; 1% instances), obl:patient (194; 0% instances), cc (184; 0% instances), discourse (143; 0% instances), iobj (77; 0% instances), compound:ext (24; 0% instances), mark:adv (22; 0% instances), nmod (21; 0% instances), appos (20; 0% instances), cop (9; 0% instances), discourse:sp (7; 0% instances), amod (6; 0% instances), csubj:pass (6; 0% instances), compound (5; 0% instances), nummod (5; 0% instances), dislocated (4; 0% instances), det (3; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (14738; 29% instances), VERB (10010; 20% instances), PUNCT (9169; 18% instances), SCONJ (4169; 8% instances), PART (2728; 5% instances), PROPN (2443; 5% instances), ADV (2360; 5% instances), AUX (2226; 4% instances), ADP (1194; 2% instances), PRON (1063; 2% instances), ADJ (630; 1% instances), CCONJ (182; 0% instances), NUM (171; 0% instances), X (135; 0% instances), DET (67; 0% instances), SYM (6; 0% instances)