home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: VERB

There are 1393 VERB lemmas (24%), 1408 VERB types (24%) and 3528 VERB tokens (16%). Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: 有、 在、 是、 為、 於、 說、 開始、 來、 到、 讓

The 10 most frequent VERB types: 在、 有、 於、 是、 為、 說、 沒有、 開始、 來、 到

The 10 most frequent ambiguous lemmas: 在 (ADP 272, VERB 118, ADV 25), 是 (AUX 167, VERB 98), 為 (VERB 95, ADP 35, AUX 29), 於 (VERB 80, ADP 45), 開始 (VERB 27, NOUN 1), 來 (ADV 40, VERB 26, ADP 2), 到 (VERB 26, ADP 4, CCONJ 4), 由 (VERB 20, ADP 1), 發生 (VERB 19, NOUN 2), 建立 (VERB 16, NOUN 1)

The 10 most frequent ambiguous types: 在 (ADP 272, VERB 118, ADV 25), 於 (VERB 80, ADP 45), 是 (AUX 148, VERB 67), 為 (VERB 63, ADP 35, AUX 29), 開始 (VERB 27, NOUN 1), 來 (ADV 40, VERB 26, ADP 2), 到 (VERB 26, ADP 4, CCONJ 4), 由 (VERB 20, ADP 1), 發生 (VERB 19, NOUN 2), 建立 (VERB 16, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.010768 (the average of all parts of speech is 1.006233).

The 1st highest number of forms (9) was observed with the lemma “是”: 不是, 也是, 像是, 就是, 是, 是不是, 是否, 正是, 都是.

The 2nd highest number of forms (6) was observed with the lemma “為”: 一分為二, 以為, 引以為豪, 為, 為期, 認為.

The 3rd highest number of forms (2) was observed with the lemma “有”: 有, 沒有.

VERB occurs with 2 features: Voice (90; 3% instances), Polarity (48; 1% instances)

VERB occurs with 2 feature-value pairs: Polarity=Neg, Voice=Cau

VERB occurs with 3 feature combinations. The most frequent feature combination is _ (3390 tokens). Examples: 在、 有、 於、 是、 為、 說、 開始、 來、 到、 認為

Relations

VERB nodes are attached to their parents using 24 different relations: root (890; 25% instances), advcl (482; 14% instances), xcomp (443; 13% instances), acl:relcl (409; 12% instances), ccomp (336; 10% instances), dep (308; 9% instances), mark:prt (240; 7% instances), obj (84; 2% instances), nsubj (69; 2% instances), csubj (62; 2% instances), conj (51; 1% instances), compound (39; 1% instances), amod (36; 1% instances), acl (17; 0% instances), nmod (16; 0% instances), obl (15; 0% instances), appos (10; 0% instances), nsubj:pass (8; 0% instances), dislocated (5; 0% instances), parataxis (3; 0% instances), obl:patient (2; 0% instances), iobj (1; 0% instances), obl:agent (1; 0% instances), obl:tmod (1; 0% instances)

Parents of VERB nodes belong to 11 different parts of speech: VERB (1968; 56% instances), (890; 25% instances), NOUN (530; 15% instances), ADJ (89; 3% instances), PROPN (23; 1% instances), ADP (10; 0% instances), X (7; 0% instances), PRON (4; 0% instances), NUM (3; 0% instances), PART (3; 0% instances), DET (1; 0% instances)

420 (12%) VERB nodes are leaves.

518 (15%) VERB nodes have one child.

605 (17%) VERB nodes have two children.

1985 (56%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 39 different relations: punct (1763; 16% instances), nsubj (1535; 14% instances), obj (1438; 13% instances), advmod (998; 9% instances), aux (667; 6% instances), obl (651; 6% instances), xcomp (504; 5% instances), advcl (494; 5% instances), mark:rel (424; 4% instances), ccomp (402; 4% instances), dep (322; 3% instances), mark:prt (321; 3% instances), mark (269; 3% instances), obl:tmod (200; 2% instances), compound (109; 1% instances), aux:pass (79; 1% instances), discourse:sp (72; 1% instances), nsubj:pass (71; 1% instances), conj (51; 0% instances), csubj (45; 0% instances), obl:patient (39; 0% instances), cc (37; 0% instances), appos (30; 0% instances), mark:adv (22; 0% instances), obl:agent (22; 0% instances), clf (20; 0% instances), acl:relcl (19; 0% instances), amod (18; 0% instances), case (16; 0% instances), iobj (15; 0% instances), case:loc (10; 0% instances), flat:name (10; 0% instances), cop (8; 0% instances), det (8; 0% instances), nummod (7; 0% instances), flat (2; 0% instances), parataxis (2; 0% instances), discourse (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (2985; 28% instances), VERB (1968; 18% instances), PUNCT (1763; 16% instances), ADV (1056; 10% instances), AUX (754; 7% instances), PART (558; 5% instances), PROPN (524; 5% instances), PRON (465; 4% instances), ADP (265; 2% instances), ADJ (177; 2% instances), X (66; 1% instances), NUM (43; 0% instances), CCONJ (37; 0% instances), SCONJ (23; 0% instances), DET (18; 0% instances)