home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-Modern: POS Tags: VERB

There are 464 VERB lemmas (18%), 756 VERB types (25%) and 2275 VERB tokens (16%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: す, あり, 以つ, 然り, 至る, 於く, 得, 行ふ, 立つ, 用ふ

The 10 most frequent VERB types: し, する, 以, せ, す, あり, 於, 得, あら, ある

The 10 most frequent ambiguous lemmas: 爲 (NOUN 3, VERB 2), ある (PRON 1, VERB 1)

The 10 most frequent ambiguous types: し (VERB 215, AUX 18), せ (VERB 76, AUX 1), ある (VERB 27, PRON 1), 非 (VERB 15, NOUN 5), 然 (VERB 12, PART 1), なる (AUX 57, VERB 5), 見 (VERB 5, NOUN 1), 有 (VERB 4, NOUN 1), なり (AUX 139, ADP 21, VERB 3), 云 (VERB 3, NOUN 2)

Morphology

The form / lemma ratio of VERB is 1.629310 (the average of all parts of speech is 1.139839).

The 1st highest number of forms (7) was observed with the lemma “あり”: あつ, あら, あり, ある, あれ, 非, 非ら.

The 2nd highest number of forms (6) was observed with the lemma “欲す”: 欲し, 欲す, 欲する, 欲すれ, 欲せ, 欲る.

The 3rd highest number of forms (6) was observed with the lemma “立つ”: 立, 立た, 立ち, 立つ, 立て, 立る.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 10 different relations: advcl (727; 32% instances), root (495; 22% instances), acl (425; 19% instances), compound (275; 12% instances), dep (100; 4% instances), obl (68; 3% instances), nmod (67; 3% instances), nsubj (47; 2% instances), obj (38; 2% instances), iobj (33; 1% instances)

Parents of VERB nodes belong to 11 different parts of speech: VERB (893; 39% instances), NOUN (788; 35% instances), (495; 22% instances), AUX (28; 1% instances), ADJ (26; 1% instances), PRON (25; 1% instances), PART (12; 1% instances), ADV (4; 0% instances), PROPN (2; 0% instances), NUM (1; 0% instances), X (1; 0% instances)

325 (14%) VERB nodes are leaves.

266 (12%) VERB nodes have one child.

652 (29%) VERB nodes have two children.

1032 (45%) VERB nodes have three or more children.

The highest child degree of a VERB node is 8.

Children of VERB nodes are attached using 16 different relations: obj (783; 14% instances), nmod (735; 13% instances), aux (696; 12% instances), advcl (690; 12% instances), iobj (481; 9% instances), mark (386; 7% instances), case (385; 7% instances), cc (336; 6% instances), advmod (328; 6% instances), obl (307; 5% instances), nsubj (149; 3% instances), dep (117; 2% instances), amod (83; 1% instances), punct (80; 1% instances), compound (66; 1% instances), nummod (6; 0% instances)

Children of VERB nodes belong to 14 different parts of speech: NOUN (1962; 35% instances), VERB (893; 16% instances), AUX (727; 13% instances), SCONJ (386; 7% instances), ADP (385; 7% instances), ADV (353; 6% instances), CCONJ (336; 6% instances), PRON (284; 5% instances), ADJ (101; 2% instances), PUNCT (80; 1% instances), PART (68; 1% instances), NUM (26; 0% instances), SYM (17; 0% instances), PROPN (10; 0% instances)