Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: VERB
There are 4577 VERB lemmas (21%), 4639 VERB types (21%) and 18216 VERB tokens (15%).
Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 2 in number of tokens.
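The three counts above can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that a "type" is a distinct surface form and that multiword-token ranges and empty nodes are skipped. The helper names `read_tokens` and `pos_counts` are hypothetical, and the two sample sentences are made up for illustration rather than taken from the treebank.

```python
# Toy CoNLL-U fragment (invented for illustration, not treebank data).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = """\
1\t他\t他\tPRON\t_\t_\t2\tnsubj\t_\t_
2\t有\t有\tVERB\t_\t_\t0\troot\t_\t_
3\t书\t书\tNOUN\t_\t_\t2\tobj\t_\t_

1\t她\t她\tPRON\t_\t_\t2\tnsubj\t_\t_
2\t没有\t有\tVERB\t_\tPolarity=Neg\t0\troot\t_\t_
3\t书\t书\tNOUN\t_\t_\t2\tobj\t_\t_
"""


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for a tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


verb_stats = pos_counts(SAMPLE, "VERB")
print(verb_stats)  # (1, 2, 2)
```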
The 10 most frequent VERB lemmas: 有、 在、 为、 于、 是、 由、 成、 到、 位、 开始
The 10 most frequent VERB types: 有、 在、 于、 为、 是、 由、 成、 到、 位、 开始
The 10 most frequent ambiguous lemmas: 在 (ADP 1061, VERB 555, ADV 28), 为 (AUX 580, VERB 524, ADP 133, PROPN 1), 于 (VERB 497, ADP 399, PROPN 3), 是 (AUX 1064, VERB 382), 由 (VERB 212, ADP 55, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 174, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 开始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 123, NOUN 1)
The 10 most frequent ambiguous types: 在 (ADP 1061, VERB 555, ADV 28), 于 (VERB 497, ADP 399, PROPN 3), 为 (AUX 568, VERB 496, ADP 133, PROPN 1), 是 (AUX 885, VERB 321), 由 (VERB 212, ADP 55, NOUN 2), 成 (VERB 183, PROPN 1), 到 (VERB 173, CCONJ 28, ADP 22), 位 (VERB 171, NOUN 83, PART 2), 开始 (VERB 145, ADP 6, NOUN 1), 用 (VERB 121, NOUN 1)
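The ambiguity lists above appear to be ranked by how often each item is tagged VERB, with the per-tag counts in descending order inside the parentheses. A minimal sketch of that ranking follows, assuming an item counts as ambiguous when it occurs with VERB and at least one other tag. The function name `ambiguous_items` and the observation counts are hypothetical.

```python
from collections import Counter, defaultdict

# Hypothetical (lemma, UPOS) observations; the counts are toy values.
OBSERVATIONS = (
    [("在", "ADP")] * 4
    + [("在", "VERB")] * 3
    + [("用", "VERB")] * 2
    + [("用", "NOUN")]
    + [("有", "VERB")] * 2
)


def ambiguous_items(observations, upos):
    """List items seen with `upos` and another tag, most frequent as `upos` first."""
    by_item = defaultdict(Counter)
    for item, tag in observations:
        by_item[item][tag] += 1
    ranked = sorted(
        (item for item, tags in by_item.items() if upos in tags and len(tags) > 1),
        key=lambda item: -by_item[item][upos],
    )
    # most_common() orders the tags of each item by descending count.
    return [(item, by_item[item].most_common()) for item in ranked]


ranking = ambiguous_items(OBSERVATIONS, "VERB")
for item, tags in ranking:
    print(item, tags)
```

In this toy input 有 is left out because it only ever occurs as VERB.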
Morphology
The form / lemma ratio of VERB is 1.013546 (the average of all parts of speech is 1.004572).
The highest number of forms (19) was observed with the lemma “为”: 一分为二, 为, 为人, 为期, 亦为, 以为, 任人为贤, 列为, 化整为零, 合而为一, 封为, 无为, 无能为力, 混为一谈, 相依为命, 行为, 见义勇为, 身为, 选为.
The 2nd highest number of forms (12) was observed with the lemma “是”: 不是, 也是, 像是, 则是, 却是, 又是, 就是, 是, 是否, 正是, 而是, 都是.
The 3rd highest number of forms (3) was observed with the lemma “有”: 有, 未有, 没有.
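The reported form / lemma ratio is consistent with dividing the number of VERB types (4639) by the number of VERB lemmas (4577). The sketch below checks that arithmetic and shows one way to rank lemmas by their number of distinct forms. The helper `forms_per_lemma` and the (form, lemma) pairs are hypothetical, and treating the ratio as types divided by lemmas is an assumption that happens to match the figures on this page.

```python
from collections import defaultdict

# Figures quoted in the summary of this page.
VERB_TYPES = 4639
VERB_LEMMAS = 4577

ratio = VERB_TYPES / VERB_LEMMAS
print(f"{ratio:.6f}")  # 1.013546

# Hypothetical (form, lemma) pairs, invented for illustration.
PAIRS = [
    ("有", "有"), ("没有", "有"), ("未有", "有"), ("有", "有"),
    ("是", "是"), ("不是", "是"),
]


def forms_per_lemma(pairs):
    """Return (lemma, set of forms) sorted by descending number of forms."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))


ranking = forms_per_lemma(PAIRS)
for lemma, forms in ranking:
    print(lemma, len(forms), sorted(forms))
```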
VERB occurs with 2 features: Voice (492; 3% instances), Polarity (215; 1% instances)
VERB occurs with 2 feature-value pairs: Polarity=Neg, Voice=Cau
VERB occurs with 3 feature combinations.
The most frequent feature combination is _ (no features; 17509 tokens).
Examples: 有、 在、 于、 为、 是、 由、 成、 到、 位、 开始
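The feature figures are mutually consistent: 18216 VERB tokens minus 492 with Voice and 215 with Polarity leaves the 17509 tokens without features, which also accounts for the three feature combinations. The sketch below shows how such counts could be collected from the FEATS column. The function name `feature_stats` and the sample values are hypothetical.

```python
from collections import Counter

# Consistency check on the figures quoted on this page.
assert 18216 - 492 - 215 == 17509

# Hypothetical FEATS column values for a handful of VERB tokens.
FEATS = ["_", "_", "Polarity=Neg", "Voice=Cau", "_", "Voice=Cau"]


def feature_stats(feats_column):
    """Count feature names, feature=value pairs, and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        if feats == "_":  # "_" means the token carries no features
            continue
        for pair in feats.split("|"):
            features[pair.split("=", 1)[0]] += 1
            pairs[pair] += 1
    return features, pairs, combos


features, pairs, combos = feature_stats(FEATS)
print(features.most_common())
print(combos.most_common(1))
```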
Relations
VERB nodes are attached to their parents using 22 different relations: root (4112; 23% instances), advcl (3548; 19% instances), ccomp (1768; 10% instances), parataxis (1633; 9% instances), xcomp (1621; 9% instances), acl:relcl (1603; 9% instances), mark (1384; 8% instances), compound (1245; 7% instances), conj (335; 2% instances), amod (305; 2% instances), csubj (305; 2% instances), acl (174; 1% instances), obj (46; 0% instances), dislocated (39; 0% instances), appos (28; 0% instances), nmod (28; 0% instances), nsubj (20; 0% instances), obl (9; 0% instances), csubj:pass (6; 0% instances), nmod:tmod (4; 0% instances), discourse (2; 0% instances), reparandum (1; 0% instances)
Parents of VERB nodes belong to 12 different categories (11 parts of speech plus the empty category for root nodes, which have no parent): VERB (10004; 55% instances), [root, no parent] (4112; 23% instances), NOUN (2114; 12% instances), PART (1574; 9% instances), ADJ (215; 1% instances), PROPN (80; 0% instances), ADP (52; 0% instances), NUM (24; 0% instances), ADV (16; 0% instances), X (15; 0% instances), PRON (6; 0% instances), AUX (4; 0% instances)
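Incoming relations and parent tags can be tallied by looking up each VERB token's HEAD within its sentence. The following is a minimal sketch under that reading. The names `parse_sentences` and `parent_stats` are hypothetical, the sample sentence is invented, and a root token (HEAD 0) is recorded with an empty parent tag, which matches the 4112 root entries in both lists above.

```python
from collections import Counter

# Toy sentence (invented): "他 说 她 有 书".
SAMPLE = """\
1\t他\t他\tPRON\t_\t_\t2\tnsubj\t_\t_
2\t说\t说\tVERB\t_\t_\t0\troot\t_\t_
3\t她\t她\tPRON\t_\t_\t4\tnsubj\t_\t_
4\t有\t有\tVERB\t_\t_\t2\tccomp\t_\t_
5\t书\t书\tNOUN\t_\t_\t4\tobj\t_\t_
"""


def parse_sentences(text):
    """Yield each sentence as a list of column lists (regular tokens only)."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
                sentence = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():
                sentence.append(cols)
    if sentence:
        yield sentence


def parent_stats(text, upos):
    """Count incoming relations and parent UPOS tags for tokens tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sentence in parse_sentences(text):
        tag_by_id = {cols[0]: cols[3] for cols in sentence}
        for cols in sentence:
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            # HEAD "0" has no entry, so root tokens get the empty tag.
            parent_tags[tag_by_id.get(cols[6], "")] += 1
    return relations, parent_tags


relations, parent_tags = parent_stats(SAMPLE, "VERB")
print(relations, parent_tags)
```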
3220 (18%) VERB nodes are leaves.
2487 (14%) VERB nodes have one child.
3098 (17%) VERB nodes have two children.
9411 (52%) VERB nodes have three or more children.
The highest child degree of a VERB node is 12.
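The four child-count buckets add up to the 18216 VERB tokens (3220 + 2487 + 3098 + 9411). A minimal sketch of how such buckets could be computed for one sentence follows. The function `child_degree_buckets` and the (id, upos, head) triples are hypothetical.

```python
from collections import Counter

# Consistency check on the figures quoted on this page.
assert 3220 + 2487 + 3098 + 9411 == 18216

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
SENTENCE = [
    (1, "PRON", 2),
    (2, "VERB", 0),
    (3, "PRON", 4),
    (4, "VERB", 2),
    (5, "NOUN", 4),
    (6, "PUNCT", 2),
]


def child_degree_buckets(sentence, upos):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2, or 3 (= 3+)."""
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, tag, _ in sentence:
        if tag == upos:
            buckets[min(n_children[node_id], 3)] += 1
    return buckets


buckets = child_degree_buckets(SENTENCE, "VERB")
print(dict(buckets))  # {3: 1, 2: 1}
```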
Children of VERB nodes are attached using 38 different relations: punct (9173; 18% instances), obj (7700; 15% instances), nsubj (7362; 14% instances), mark (4323; 8% instances), advcl (3772; 7% instances), obl (3115; 6% instances), advmod (2464; 5% instances), ccomp (2075; 4% instances), xcomp (1861; 4% instances), aux (1789; 3% instances), parataxis (1733; 3% instances), mark:rel (1661; 3% instances), nmod:tmod (1494; 3% instances), case (667; 1% instances), aux:pass (425; 1% instances), conj (336; 1% instances), nsubj:pass (277; 1% instances), csubj (263; 1% instances), obl:patient (194; 0% instances), cc (184; 0% instances), discourse (143; 0% instances), iobj (77; 0% instances), obl:agent (70; 0% instances), compound:ext (24; 0% instances), mark:adv (22; 0% instances), nmod (21; 0% instances), appos (20; 0% instances), cop (9; 0% instances), discourse:sp (7; 0% instances), amod (6; 0% instances), csubj:pass (6; 0% instances), compound (5; 0% instances), nummod (5; 0% instances), dislocated (4; 0% instances), det (3; 0% instances), nsubj:outer (3; 0% instances), reparandum (1; 0% instances), vocative (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (14740; 29% instances), VERB (10004; 20% instances), PUNCT (9173; 18% instances), SCONJ (4168; 8% instances), PART (2730; 5% instances), PROPN (2442; 5% instances), ADV (2360; 5% instances), AUX (2228; 4% instances), ADP (1194; 2% instances), PRON (1063; 2% instances), ADJ (632; 1% instances), CCONJ (182; 0% instances), NUM (171; 0% instances), X (135; 0% instances), DET (67; 0% instances), SYM (6; 0% instances)
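The percentages in the list above can be recomputed from the raw counts. The sketch below copies the counts from this page and assumes that each percentage is the count divided by the total number of children, rounded to the nearest integer. That rounding rule is an assumption; it reproduces the figures shown.

```python
# Counts of children of VERB nodes by part of speech, copied from this page.
CHILD_POS_COUNTS = {
    "NOUN": 14740, "VERB": 10004, "PUNCT": 9173, "SCONJ": 4168,
    "PART": 2730, "PROPN": 2442, "ADV": 2360, "AUX": 2228,
    "ADP": 1194, "PRON": 1063, "ADJ": 632, "CCONJ": 182,
    "NUM": 171, "X": 135, "DET": 67, "SYM": 6,
}

total_children = sum(CHILD_POS_COUNTS.values())

# Assumed rule: share of all children, rounded to the nearest whole percent.
percentages = {
    tag: round(100 * count / total_children)
    for tag, count in CHILD_POS_COUNTS.items()
}

for tag, count in CHILD_POS_COUNTS.items():
    print(f"{tag} ({count}; {percentages[tag]}% instances)")
```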