home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUDLUW: POS Tags: VERB

There are 934 VERB lemmas (17%), 1250 VERB types (21%) and 2269 VERB tokens (10%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: 有る, 成る, する, 言う, 持つ, 見る, 述べる, 知る, 受ける, 考える

The 10 most frequent VERB types: ある, し, なっ, なる, あり, 述べ, 言っ, 受け, 考え, あっ

The 10 most frequent ambiguous lemmas: する (VERB 70, SCONJ 5), 答える (NOUN 3, VERB 2), 通る (NOUN 1, VERB 1)

The 10 most frequent ambiguous types: ある (VERB 73, DET 6), し (VERB 49, SCONJ 5), なる (VERB 36, AUX 1), あり (VERB 32, NOUN 1), 考え (VERB 18, NOUN 1), い (VERB 10, PART 1), でき (VERB 8, AUX 1), なら (VERB 3, AUX 1), 助け (VERB 2, NOUN 1), 行い (VERB 2, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.338330 (the average of all parts of speech is 1.079803).

The 1st highest number of forms (6) was observed with the lemma “取る”: とっ, とる, 取っ, 取ら, 取る, 執り.

The 2nd highest number of forms (6) was observed with the lemma “成る”: なっ, なら, なり, なる, なれ, 成っ.

The 3rd highest number of forms (6) was observed with the lemma “行く”: いか, いく, ゆく, 行き, 行く, 行っ.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 10 different relations: root (844; 37% instances), acl (731; 32% instances), advcl (615; 27% instances), ccomp (49; 2% instances), csubj (8; 0% instances), nmod (7; 0% instances), compound (6; 0% instances), obj (5; 0% instances), appos (2; 0% instances), obl (2; 0% instances)

Parents of VERB nodes belong to 9 different parts of speech: (844; 37% instances), NOUN (699; 31% instances), VERB (622; 27% instances), ADJ (41; 2% instances), PROPN (41; 2% instances), ADV (8; 0% instances), NUM (6; 0% instances), PRON (5; 0% instances), AUX (3; 0% instances)

26 (1%) VERB nodes are leaves.

208 (9%) VERB nodes have one child.

376 (17%) VERB nodes have two children.

1659 (73%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.

Children of VERB nodes are attached using 20 different relations: aux (2100; 24% instances), obl (1432; 16% instances), punct (1338; 15% instances), nsubj (1217; 14% instances), obj (832; 9% instances), advcl (819; 9% instances), mark (322; 4% instances), case (231; 3% instances), advmod (219; 2% instances), cc (87; 1% instances), ccomp (68; 1% instances), compound (58; 1% instances), nsubj:outer (57; 1% instances), nmod (22; 0% instances), nummod (13; 0% instances), appos (2; 0% instances), csubj (2; 0% instances), dep (1; 0% instances), det (1; 0% instances), discourse (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (2726; 31% instances), AUX (2100; 24% instances), PUNCT (1338; 15% instances), VERB (622; 7% instances), PROPN (505; 6% instances), PRON (267; 3% instances), SCONJ (259; 3% instances), ADP (231; 3% instances), ADV (223; 3% instances), NUM (207; 2% instances), ADJ (191; 2% instances), CCONJ (87; 1% instances), PART (63; 1% instances), DET (1; 0% instances), INTJ (1; 0% instances), SYM (1; 0% instances)