home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: VERB

There are 3388 VERB lemmas (12%), 5344 VERB types (17%) and 15875 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 3 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: 成る, 有る, する, 言う, 行う, 思う, 持つ, 行く, 見る, 受ける

The 10 most frequent VERB types: なっ, し, ある, なる, あり, する, なり, いう, あっ, 行っ

The 10 most frequent ambiguous lemmas: する (VERB 697, SCONJ 44), 通る (NOUN 26, VERB 17), ている (AUX 2072, VERB 4)

The 10 most frequent ambiguous types: し (VERB 426, SCONJ 44), ある (VERB 414, DET 33), なる (VERB 209, AUX 4), あり (VERB 179, INTJ 1, NOUN 1), なり (VERB 142, AUX 1), あっ (VERB 109, INTJ 2), さ (VERB 100, PART 1), 思い (VERB 100, NOUN 10), 考え (VERB 49, NOUN 8), でき (VERB 38, AUX 3)

Morphology

The form / lemma ratio of VERB is 1.577332 (the average of all parts of speech is 1.095294).

The 1st highest number of forms (19) was observed with the lemma “取る”: とっ, とら, とり, とる, とろう, 取っ, 取ら, 取り, 取る, 取れ, 取ろう, 執っ, 執ら, 執り, 執る, 採ら, 採る, 撮ら, 撮り.

The 2nd highest number of forms (12) was observed with the lemma “行く”: いか, いき, いく, いけ, いこう, いっ, 行か, 行き, 行く, 行け, 行こう, 行っ.

The 3rd highest number of forms (11) was observed with the lemma “作る”: つくら, つくり, つくる, 作っ, 作ら, 作り, 作る, 作ろう, 創ら, 造ら, 造る.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 10 different relations: root (5160; 33% instances), acl (5117; 32% instances), advcl (4961; 31% instances), ccomp (252; 2% instances), csubj (144; 1% instances), compound (101; 1% instances), nmod (90; 1% instances), obl (38; 0% instances), csubj:outer (6; 0% instances), obj (6; 0% instances)

Parents of VERB nodes belong to 11 different parts of speech: (5160; 33% instances), NOUN (5040; 32% instances), VERB (4830; 30% instances), ADJ (379; 2% instances), PROPN (293; 2% instances), NUM (79; 0% instances), ADV (42; 0% instances), PRON (27; 0% instances), SCONJ (21; 0% instances), AUX (3; 0% instances), INTJ (1; 0% instances)

251 (2%) VERB nodes are leaves.

1441 (9%) VERB nodes have one child.

2962 (19%) VERB nodes have two children.

11221 (71%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.

Children of VERB nodes are attached using 20 different relations: aux (13545; 23% instances), obl (10550; 18% instances), punct (8314; 14% instances), advcl (6325; 11% instances), nsubj (5684; 10% instances), obj (4982; 9% instances), mark (3154; 5% instances), advmod (1590; 3% instances), case (1505; 3% instances), cc (547; 1% instances), compound (513; 1% instances), ccomp (360; 1% instances), nsubj:outer (340; 1% instances), nmod (98; 0% instances), nummod (85; 0% instances), dep (34; 0% instances), discourse (8; 0% instances), amod (5; 0% instances), csubj (3; 0% instances), det (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (18875; 33% instances), AUX (13545; 23% instances), PUNCT (8314; 14% instances), VERB (4830; 8% instances), SCONJ (2655; 5% instances), PROPN (2063; 4% instances), ADV (1597; 3% instances), ADP (1505; 3% instances), NUM (1291; 2% instances), ADJ (1275; 2% instances), PRON (603; 1% instances), CCONJ (547; 1% instances), PART (519; 1% instances), SYM (15; 0% instances), INTJ (8; 0% instances), DET (1; 0% instances)