
This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: VERB

There are 2831 VERB lemmas (13%), 4286 VERB types (18%) and 21226 VERB tokens (11%). Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: 居る, 有る, 成る, 為る, 言う, 因る, 行く, 行う, 思う, 来る

The 10 most frequent VERB types: いる, ある, い, なっ, いう, し, あり, あっ, なる, おり

The 10 most frequent ambiguous lemmas: 為る (AUX 5115, VERB 675, SCONJ 12, CCONJ 3), 言う (VERB 665, SCONJ 5), 因る (VERB 428, CCONJ 2), 出来る (AUX 237, VERB 91), 使用 (VERB 79, NOUN 27), 利用 (VERB 78, NOUN 25), 参加 (VERB 41, NOUN 38), 存在 (VERB 41, NOUN 21), 発表 (VERB 36, NOUN 25), 登場 (NOUN 36, VERB 34)

The 10 most frequent ambiguous types: ある (VERB 1008, DET 33), し (AUX 2933, VERB 417, SCONJ 54), あり (VERB 327, NOUN 2), あっ (VERB 251, INTJ 2), なる (VERB 236, AUX 4), つい (VERB 169, ADV 1), なり (VERB 166, NOUN 2, AUX 1), する (AUX 1029, VERB 144, CCONJ 3), よっ (VERB 136, CCONJ 2), き (VERB 131, NOUN 1)

Morphology

The form / lemma ratio of VERB is 1.513953 (the average of all parts of speech is 1.115220).
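The reported ratio is consistent with dividing the number of distinct VERB types by the number of distinct VERB lemmas given above (4286 / 2831). The following is a minimal sketch of that calculation, assuming tokens are available as `(form, lemma, upos)` tuples; the function name and the toy sample are illustrative and are not taken from the script that generated this page.

```python
from typing import Iterable, Tuple


def form_lemma_ratio(tokens: Iterable[Tuple[str, str, str]], upos: str) -> float:
    """Distinct forms divided by distinct lemmas for one UPOS tag (assumed definition)."""
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Hypothetical toy sample: three VERB forms over two VERB lemmas.
sample = [
    ("行っ", "行く", "VERB"),
    ("行く", "行く", "VERB"),
    ("取り", "取る", "VERB"),
    ("本", "本", "NOUN"),
]
print(form_lemma_ratio(sample, "VERB"))  # 1.5

# Check against the counts reported on this page: 4286 types, 2831 lemmas.
print(f"{4286 / 2831:.6f}")  # 1.513953
```

Under this reading, a ratio above the all-tag average of 1.115220 means a VERB lemma surfaces in more distinct forms than a typical lemma, partly because of kana and kanji spelling variants such as those listed below.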

The highest number of forms (20) was observed with the lemma “取る”: とっ, とら, とり, とる, とろう, 取っ, 取ら, 取り, 取る, 取れ, 取ろう, 執っ, 執ら, 執り, 執る, 採ら, 採る, 撮ら, 撮り, 撮れ.

The 2nd highest number of forms (16) was observed with the lemma “行く”: いか, いき, いく, いけ, いける, いこう, いっ, ゆか, ゆく, 行か, 行き, 行く, 行け, 行ける, 行こう, 行っ.

The 3rd highest number of forms (16) was observed with the lemma “言う”: いい, いう, いえ, いえよう, いえる, いっ, いわ, 云う, 云える, 言い, 言う, 言え, 言えよう, 言える, 言っ, 言わ.

VERB does not occur with any features.

Relations

VERB nodes are attached to their parents using 14 different relations: fixed (5194; 24% instances), acl (5124; 24% instances), root (5097; 24% instances), advcl (4966; 23% instances), ccomp (272; 1% instances), compound (237; 1% instances), csubj (143; 1% instances), nmod (126; 1% instances), obl (40; 0% instances), obj (9; 0% instances), nsubj (7; 0% instances), csubj:outer (6; 0% instances), dep (3; 0% instances), nsubj:outer (2; 0% instances)
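A distribution like the one above could be recomputed from the treebank's CoNLL-U files by counting the DEPREL column for every token whose UPOS is VERB. The sketch below uses only the standard library and relies on the ten tab-separated CoNLL-U columns; the function name and the four-token sample sentence are invented for illustration and do not come from UD_Japanese-GSD.

```python
from collections import Counter


def deprel_counts(conllu_text: str, upos: str = "VERB") -> Counter:
    """Count incoming relations (DEPREL) of tokens with the given UPOS."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        # Keep regular tokens only: multiword ranges ("1-2") and
        # empty nodes ("1.1") do not have a purely numeric ID.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:      # column 4: UPOS
            counts[cols[7]] += 1  # column 8: DEPREL
    return counts


# Invented toy sentence (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
rows = [
    ["1", "本", "本", "NOUN", "_", "_", "3", "obj", "_", "_"],
    ["2", "を", "を", "ADP", "_", "_", "1", "case", "_", "_"],
    ["3", "読ん", "読む", "VERB", "_", "_", "0", "root", "_", "_"],
    ["4", "だ", "た", "AUX", "_", "_", "3", "aux", "_", "_"],
]
sample_conllu = "# text = 本を読んだ\n" + "\n".join("\t".join(r) for r in rows) + "\n"

print(deprel_counts(sample_conllu))  # Counter({'root': 1})
```

As a sanity check on the figures above, the 14 relation counts sum to 21226, the total number of VERB tokens.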

Parents of VERB nodes belong to 14 different parts of speech: NOUN (5488; 26% instances), [no parent: root] (5097; 24% instances), VERB (4901; 23% instances), SCONJ (2883; 14% instances), ADP (1378; 6% instances), AUX (817; 4% instances), ADJ (366; 2% instances), PROPN (190; 1% instances), ADV (50; 0% instances), PRON (26; 0% instances), PART (14; 0% instances), NUM (13; 0% instances), CCONJ (2; 0% instances), INTJ (1; 0% instances)

5698 (27%) VERB nodes are leaves.

1042 (5%) VERB nodes have one child.

2561 (12%) VERB nodes have two children.

11925 (56%) VERB nodes have three or more children.

The highest child degree of a VERB node is 12.
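The leaf, one-child, two-child and three-or-more figures above amount to a bucketed child-degree count. A minimal sketch of how such buckets could be derived follows, assuming each sentence is a list of `(id, head, upos)` tuples; the function name and the toy sentences are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="VERB"):
    """Bucket nodes of one UPOS by number of children: 0, 1, 2, or 3 (meaning 3+)."""
    buckets = Counter()
    for sent in sentences:
        # Each token contributes one child to its head.
        children = Counter(head for _tok_id, head, _tag in sent)
        for tok_id, _head, tag in sent:
            if tag == upos:
                buckets[min(children[tok_id], 3)] += 1
    return buckets


# Hypothetical toy data: one VERB with three children, one VERB leaf.
toy = [
    [(1, 3, "NOUN"), (2, 1, "ADP"), (3, 0, "VERB"), (4, 3, "AUX"), (5, 3, "PUNCT")],
    [(1, 2, "VERB"), (2, 0, "NOUN")],
]
print(child_degree_buckets(toy))

# Recompute the rounded percentages from the counts reported on this page.
reported = {0: 5698, 1: 1042, 2: 2561, 3: 11925}
total = sum(reported.values())  # 21226, the number of VERB tokens
shares = {k: round(100 * v / total) for k, v in reported.items()}
print(shares)  # {0: 27, 1: 5, 2: 12, 3: 56}
```

The four reported counts add up to 21226, so every VERB token falls into exactly one bucket.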

Children of VERB nodes are attached using 20 different relations: aux (14646; 24% instances), obl (11295; 18% instances), punct (8243; 13% instances), mark (6333; 10% instances), advcl (6323; 10% instances), nsubj (5435; 9% instances), obj (4941; 8% instances), case (1518; 2% instances), advmod (1486; 2% instances), cc (544; 1% instances), compound (526; 1% instances), ccomp (389; 1% instances), nsubj:outer (298; 0% instances), nmod (127; 0% instances), dep (18; 0% instances), amod (9; 0% instances), discourse (8; 0% instances), csubj (3; 0% instances), det (1; 0% instances), nummod (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (21201; 34% instances), AUX (14646; 24% instances), PUNCT (8243; 13% instances), SCONJ (5799; 9% instances), VERB (4901; 8% instances), ADV (1510; 2% instances), ADP (1499; 2% instances), PROPN (1398; 2% instances), ADJ (1150; 2% instances), PRON (573; 1% instances), PART (554; 1% instances), CCONJ (544; 1% instances), NUM (95; 0% instances), SYM (26; 0% instances), INTJ (4; 0% instances), DET (1; 0% instances)