Treebank Statistics: UD_Japanese-PUDLUW: POS Tags: VERB
There are 934 VERB lemmas (17%), 1250 VERB types (21%) and 2269 VERB tokens (10%).
Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent VERB lemmas: 有る, 成る, する, 言う, 持つ, 見る, 述べる, 知る, 受ける, 考える
The 10 most frequent VERB types: ある, し, なっ, なる, あり, 述べ, 言っ, 受け, 考え, あっ
The 10 most frequent ambiguous lemmas: する (VERB 70, SCONJ 5), 答える (NOUN 3, VERB 2), 通る (NOUN 1, VERB 1)
The 10 most frequent ambiguous types: ある (VERB 73, DET 6), し (VERB 49, SCONJ 5), なる (VERB 36, AUX 1), あり (VERB 32, NOUN 1), 考え (VERB 18, NOUN 1), い (VERB 10, PART 1), でき (VERB 8, AUX 1), なら (VERB 3, AUX 1), 助け (VERB 2, NOUN 1), 行い (VERB 2, NOUN 1)
- ある
- し
- なる
- あり
- 考え
- い
- でき
- なら
- 助け
- 行い
Morphology
The form / lemma ratio of VERB is 1.338330 (the average of all parts of speech is 1.079832).
The 1st highest number of forms (6) was observed with the lemma “取る”: とっ, とる, 取っ, 取ら, 取る, 執り.
The 2nd highest number of forms (6) was observed with the lemma “成る”: なっ, なら, なり, なる, なれ, 成っ.
The 3rd highest number of forms (6) was observed with the lemma “行く”: いか, いく, ゆく, 行き, 行く, 行っ.
VERB does not occur with any features.
Relations
VERB nodes are attached to their parents using 9 different relations: root (844; 37% instances), acl (731; 32% instances), advcl (617; 27% instances), ccomp (49; 2% instances), csubj (8; 0% instances), nmod (7; 0% instances), compound (6; 0% instances), obj (5; 0% instances), obl (2; 0% instances)
Parents of VERB nodes belong to 9 different parts of speech: (844; 37% instances), NOUN (699; 31% instances), VERB (624; 28% instances), ADJ (41; 2% instances), PROPN (40; 2% instances), ADV (8; 0% instances), NUM (6; 0% instances), PRON (5; 0% instances), AUX (2; 0% instances)
27 (1%) VERB nodes are leaves.
208 (9%) VERB nodes have one child.
375 (17%) VERB nodes have two children.
1659 (73%) VERB nodes have three or more children.
The highest child degree of a VERB node is 14.
Children of VERB nodes are attached using 20 different relations: aux (2102; 24% instances), obl (1472; 17% instances), punct (1338; 15% instances), nsubj (1218; 14% instances), obj (828; 9% instances), advcl (797; 9% instances), mark (323; 4% instances), case (234; 3% instances), advmod (219; 2% instances), cc (87; 1% instances), ccomp (68; 1% instances), nsubj:outer (58; 1% instances), compound (49; 1% instances), nmod (22; 0% instances), nummod (13; 0% instances), csubj (2; 0% instances), iobj (2; 0% instances), dep (1; 0% instances), det (1; 0% instances), discourse (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (2729; 31% instances), AUX (2102; 24% instances), PUNCT (1338; 15% instances), VERB (624; 7% instances), PROPN (507; 6% instances), PRON (267; 3% instances), SCONJ (260; 3% instances), ADP (234; 3% instances), ADV (223; 3% instances), NUM (207; 2% instances), ADJ (191; 2% instances), CCONJ (87; 1% instances), PART (63; 1% instances), DET (1; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)