
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: VERB

There are 4642 VERB lemmas (21%), 4642 VERB types (21%) and 18673 VERB tokens (15%). Out of 15 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 2 in number of tokens.
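As a rough illustration of where figures like these come from, the sketch below counts distinct lemmas, distinct types (word forms) and tokens for one UPOS tag in CoNLL-U text. The `SAMPLE` data and the `pos_summary` helper are hypothetical and are not the script that generated this page; only the token share is computed, since the exact denominators used for the lemma and type percentages are not stated here.

```python
# Toy CoNLL-U fragment; hypothetical data, not taken from UD_Chinese-GSD.
SAMPLE = (
    "1\t我\t我\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\t有\t有\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t書\t書\tNOUN\t_\t_\t2\tobj\t_\t_\n"
    "\n"
    "1\t他\t他\tPRON\t_\t_\t2\tnsubj\t_\t_\n"
    "2\t在\t在\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\t家\t家\tNOUN\t_\t_\t2\tobj\t_\t_\n"
)


def pos_summary(conllu_text, upos):
    """Return (lemma count, type count, token count, token share in %) for one UPOS tag."""
    lemmas, types = set(), set()
    tokens = total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        total += 1
        if cols[3] == upos:
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    share = 100 * tokens / total if total else 0.0
    return len(lemmas), len(types), tokens, share


n_lemmas, n_types, n_tokens, share = pos_summary(SAMPLE, "VERB")
print(f"{n_lemmas} VERB lemmas, {n_types} VERB types, {n_tokens} VERB tokens ({share:.0f}%)")
```

In the toy fragment two of the six tokens are verbs, so the token share is one third.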

The 10 most frequent VERB lemmas: 有、 在、 為、 於、 被、 是、 由、 成、 到、 位

The 10 most frequent VERB types: 有、 在、 為、 於、 被、 是、 由、 成、 到、 位

The 10 most frequent ambiguous lemmas: 有 (VERB 602, AUX 1), 在 (ADP 1059, VERB 557, ADV 28), 為 (AUX 553, VERB 507, ADP 131, PROPN 1, X 1), 於 (VERB 497, ADP 398), 是 (AUX 883, VERB 322, X 1), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 173, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 開始 (VERB 144, ADP 6, AUX 1, NOUN 1)

The 10 most frequent ambiguous types: 有 (VERB 602, AUX 1), 在 (ADP 1059, VERB 557, ADV 28), 為 (AUX 553, VERB 507, ADP 131, PROPN 1, X 1), 於 (VERB 497, ADP 398), 是 (AUX 883, VERB 322, X 1), 由 (VERB 213, ADP 54, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 173, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 開始 (VERB 144, ADP 6, AUX 1, NOUN 1)
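An "ambiguous" lemma in the lists above is one that occurs with more than one POS tag. A minimal sketch of that bookkeeping follows, assuming the input is a sequence of (lemma, UPOS) pairs; the `ambiguous_lemmas` and `format_entry` helpers and the sample pairs are hypothetical, and ranking by VERB frequency is an assumption based on the order shown on this page.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(pairs, upos):
    """Return [(lemma, Counter of tags)] for lemmas seen with `upos` and at least one other tag."""
    tags_by_lemma = defaultdict(Counter)
    for lemma, tag in pairs:
        tags_by_lemma[lemma][tag] += 1
    hits = [(lemma, tags) for lemma, tags in tags_by_lemma.items()
            if upos in tags and len(tags) > 1]
    # Assumed ranking: most frequent with the tag of interest first.
    return sorted(hits, key=lambda item: -item[1][upos])


def format_entry(lemma, tags):
    """Render one entry in the style used above, e.g. '是 (VERB 2, AUX 1)'."""
    parts = ", ".join(f"{tag} {count}" for tag, count in tags.most_common())
    return f"{lemma} ({parts})"


# Hypothetical (lemma, UPOS) pairs, not real treebank counts.
pairs = [("在", "ADP"), ("在", "ADP"), ("在", "VERB"),
         ("有", "VERB"),
         ("是", "AUX"), ("是", "VERB"), ("是", "VERB")]

for lemma, tags in ambiguous_lemmas(pairs, "VERB"):
    print(format_entry(lemma, tags))
```

The same function applies to types if word forms are passed in place of lemmas.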

Morphology

The form / lemma ratio of VERB is 1.000000 (the average of all parts of speech is 1.000266).

The 1st highest number of forms (1) was observed with the lemma “一分為二”: 一分為二.

The 2nd highest number of forms (1) was observed with the lemma “一到”: 一到.

The 3rd highest number of forms (1) was observed with the lemma “一反其道”: 一反其道.
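The form / lemma ratio can be read as the average number of distinct word forms per lemma; that reading is an assumption here, not a documented definition. Under it, a tag whose lemmas each have exactly one form has a ratio of exactly 1, which matches the VERB figures above. The helpers and sample pairs below are hypothetical.

```python
from collections import defaultdict


def forms_by_lemma(pairs):
    """Map each lemma to the set of distinct forms it occurs with."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma (assumed definition)."""
    forms = forms_by_lemma(pairs)
    return sum(len(f) for f in forms.values()) / len(forms)


# Hypothetical (form, lemma) pairs: every lemma has a single form.
uninflected = [("有", "有"), ("有", "有"), ("在", "在")]
# Hypothetical pairs for an inflecting language, for contrast.
inflected = [("runs", "run"), ("ran", "run"), ("go", "go")]

print(f"{form_lemma_ratio(uninflected):.6f}")
print(f"{form_lemma_ratio(inflected):.6f}")
```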

VERB occurs with 1 feature: Voice (1112; 6% instances)

VERB occurs with 2 feature-value pairs: Voice=Cau, Voice=Pass

VERB occurs with 3 feature combinations. The most frequent feature combination is _ (no features; 17561 tokens). Examples: 有、 在、 於、 為、 是、 由、 成、 到、 位、 開始

Relations

VERB nodes are attached to their parents using 25 different relations: root (4075; 22% instances), acl (3368; 18% instances), dep (1808; 10% instances), ccomp (1747; 9% instances), xcomp (1612; 9% instances), acl:relcl (1597; 9% instances), mark (1385; 7% instances), case:suff (1241; 7% instances), aux:pass (424; 2% instances), conj (321; 2% instances), csubj (306; 2% instances), amod (300; 2% instances), aux:caus (198; 1% instances), advcl (87; 0% instances), obj (45; 0% instances), dislocated (40; 0% instances), nmod (30; 0% instances), appos (23; 0% instances), advmod (20; 0% instances), nsubj (20; 0% instances), obl (8; 0% instances), csubj:pass (6; 0% instances), det (6; 0% instances), nmod:tmod (5; 0% instances), discourse (1; 0% instances)
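The percentages in the relation list are each count divided by the total number of VERB tokens (18673), rounded to a whole percent. A small sketch of that formatting follows, using two counts taken from the list above; the `format_distribution` helper is hypothetical.

```python
from collections import Counter


def format_distribution(counts, total):
    """Render counts as 'label (count; pct% instances)', most frequent first."""
    return ", ".join(
        f"{label} ({count}; {100 * count / total:.0f}% instances)"
        for label, count in counts.most_common()
    )


# Two of the relation counts listed above; 18673 is the VERB token total.
relations = Counter({"root": 4075, "acl": 3368})
print(format_distribution(relations, total=18673))
```

Because of rounding, relations with fewer than about 93 instances show as 0% even though they do occur.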

Parents of VERB nodes belong to 13 different parts of speech: VERB (10457; 56% instances), no parent, i.e. the node is the root (4075; 22% instances), NOUN (2054; 11% instances), PART (1566; 8% instances), ADJ (256; 1% instances), PROPN (96; 1% instances), ADP (72; 0% instances), NUM (43; 0% instances), ADV (21; 0% instances), X (20; 0% instances), PRON (9; 0% instances), AUX (3; 0% instances), PUNCT (1; 0% instances)

3825 (20%) VERB nodes are leaves.

3367 (18%) VERB nodes have one child.

3332 (18%) VERB nodes have two children.

8149 (44%) VERB nodes have three or more children.

The highest child degree of a VERB node is 18.
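The leaf and child-count figures above can be derived from the HEAD column alone: a node's child degree is the number of tokens in the same sentence whose head is that node. A minimal sketch for a single sentence follows, with tokens given as hypothetical (id, UPOS, head) triples; `child_degrees` is an illustrative helper, not part of any UD tool.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Return (bucket counts, highest degree) for nodes tagged `upos` in one sentence.

    Buckets are 0 (leaf), 1, 2 and 3 (meaning three or more children).
    """
    children = Counter(head for _, _, head in sentence)
    degrees = [children[tok_id] for tok_id, tag, _ in sentence if tag == upos]
    buckets = Counter(min(degree, 3) for degree in degrees)
    return buckets, max(degrees, default=0)


# Hypothetical sentence: token 2 is the root verb with four dependents,
# token 4 is a dependent verb with no children of its own.
sentence = [
    (1, "PRON", 2),
    (2, "VERB", 0),
    (3, "NOUN", 2),
    (4, "VERB", 2),
    (5, "PUNCT", 2),
]

buckets, highest = child_degrees(sentence, "VERB")
print(dict(buckets), highest)
```

For a whole treebank the bucket counters from each sentence would be summed and the maximum taken across sentences.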

Children of VERB nodes are attached using 38 different relations: punct (10879; 21% instances), obj (7886; 15% instances), nsubj (7350; 14% instances), mark (4302; 8% instances), acl (3507; 7% instances), advmod (2974; 6% instances), obl (2576; 5% instances), ccomp (2080; 4% instances), dep (1898; 4% instances), xcomp (1697; 3% instances), mark:relcl (1652; 3% instances), nmod:tmod (1493; 3% instances), case:aspect (946; 2% instances), aux (837; 2% instances), case (654; 1% instances), aux:pass (421; 1% instances), conj (327; 1% instances), nsubj:pass (273; 1% instances), csubj (260; 0% instances), aux:caus (194; 0% instances), cc (182; 0% instances), discourse (149; 0% instances), mark:advb (93; 0% instances), iobj (78; 0% instances), appos (26; 0% instances), acl:relcl (10; 0% instances), det (10; 0% instances), cop (9; 0% instances), amod (6; 0% instances), case:dec (6; 0% instances), csubj:pass (6; 0% instances), nummod (6; 0% instances), mark:comp (5; 0% instances), dislocated (4; 0% instances), case:suff (3; 0% instances), advcl (1; 0% instances), case:pref (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (14404; 27% instances), PUNCT (10878; 21% instances), VERB (10457; 20% instances), PART (5204; 10% instances), ADV (4782; 9% instances), PROPN (2680; 5% instances), ADP (1184; 2% instances), PRON (1045; 2% instances), AUX (852; 2% instances), ADJ (575; 1% instances), X (333; 1% instances), CCONJ (182; 0% instances), NUM (152; 0% instances), DET (67; 0% instances), SYM (7; 0% instances)