home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-CFL: POS Tags: VERB

There are 420 VERB lemmas (25%), 429 VERB types (26%) and 1327 VERB tokens (18%). Out of 15 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent VERB lemmas: 到、 去、 有、 是、 看、 说、 让、 来、 见、 吃

The 10 most frequent VERB types: 到、 去、 有、 是、 看、 说、 让、 来、 见、 吃

The 10 most frequent ambiguous lemmas: 到 (VERB 89, ADP 7), 有 (VERB 64, AUX 2), 是 (AUX 75, VERB 51), 来 (VERB 27, SCONJ 10, ADP 1), 为 (VERB 13, ADP 4), 想 (AUX 14, VERB 12), 过 (VERB 11, AUX 10), 上 (ADP 28, VERB 10, NOUN 3), 旅行 (NOUN 16, VERB 10), 感觉 (VERB 8, NOUN 1)

The 10 most frequent ambiguous types: 到 (VERB 89, ADP 7), 有 (VERB 52, ADV 1), 是 (AUX 74, VERB 49), 来 (VERB 27, SCONJ 10, ADP 1), 想 (AUX 14, VERB 12), 没有 (VERB 12, AUX 2), 过 (VERB 11, AUX 10), 上 (ADP 28, VERB 10, NOUN 3), 旅行 (NOUN 16, VERB 10), 感觉 (VERB 8, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.021429 (the average of all parts of speech is 1.009709).

The 1st highest number of forms (5) was observed with the lemma “为”: 为了, 以为, 作为, 成为, 认为.

The 2nd highest number of forms (3) was observed with the lemma “是”: 是, 真是, 自以为是.

The 3rd highest number of forms (2) was observed with the lemma “听”: 停, 听.

VERB occurs with 1 features: Polarity (16; 1% instances)

VERB occurs with 1 feature-value pairs: Polarity=Neg

VERB occurs with 2 feature combinations. The most frequent feature combination is _ (1311 tokens). Examples: 到、 去、 有、 是、 看、 说、 让、 来、 见、 吃

Relations

VERB nodes are attached to their parents using 21 different relations: root (373; 28% instances), conj (200; 15% instances), advcl (171; 13% instances), parataxis (129; 10% instances), acl (103; 8% instances), compound:vv (101; 8% instances), xcomp (100; 8% instances), ccomp (85; 6% instances), compound:dir (28; 2% instances), obl:tmod (9; 1% instances), nsubj (7; 1% instances), csubj (5; 0% instances), obl (4; 0% instances), dep (3; 0% instances), amod (2; 0% instances), obj (2; 0% instances), case (1; 0% instances), case:loc (1; 0% instances), nmod (1; 0% instances), obl:patient (1; 0% instances), reparandum (1; 0% instances)

Parents of VERB nodes belong to 6 different parts of speech: VERB (771; 58% instances), (373; 28% instances), NOUN (129; 10% instances), ADJ (47; 4% instances), PRON (4; 0% instances), PROPN (3; 0% instances)

181 (14%) VERB nodes are leaves.

158 (12%) VERB nodes have one child.

184 (14%) VERB nodes have two children.

804 (61%) VERB nodes have three or more children.

The highest child degree of a VERB node is 10.

Children of VERB nodes are attached using 37 different relations: punct (680; 16% instances), obj (558; 13% instances), advmod (551; 13% instances), nsubj (534; 13% instances), aux (256; 6% instances), obl (201; 5% instances), conj (192; 5% instances), advcl (179; 4% instances), xcomp (136; 3% instances), parataxis (135; 3% instances), obl:tmod (134; 3% instances), ccomp (109; 3% instances), compound:vv (106; 3% instances), mark:rel (101; 2% instances), mark (84; 2% instances), discourse:sp (77; 2% instances), cc (52; 1% instances), compound:dir (28; 1% instances), compound:vo (25; 1% instances), dislocated (14; 0% instances), case (12; 0% instances), case:loc (11; 0% instances), compound:ext (8; 0% instances), dep (8; 0% instances), iobj (7; 0% instances), obl:patient (7; 0% instances), acl (5; 0% instances), cop (5; 0% instances), obl:agent (5; 0% instances), nummod (4; 0% instances), nsubj:pass (3; 0% instances), csubj (2; 0% instances), mark:adv (2; 0% instances), nsubj:outer (2; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), reparandum (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (865; 20% instances), VERB (771; 18% instances), PUNCT (680; 16% instances), PRON (566; 13% instances), ADV (494; 12% instances), AUX (262; 6% instances), PART (195; 5% instances), ADJ (140; 3% instances), PROPN (97; 2% instances), ADP (60; 1% instances), CCONJ (53; 1% instances), SCONJ (45; 1% instances), NUM (4; 0% instances), DET (2; 0% instances), INTJ (2; 0% instances)