
This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: ADJ

There are 750 ADJ lemmas (4%), 985 ADJ types (4%) and 3839 ADJ tokens (2%). Out of 16 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 9 in number of tokens.

The 10 most frequent ADJ lemmas: 無い, 良い, 多い, 可能, 高い, 美味しい, 同じ, 大きい, 大きな, 少ない

The 10 most frequent ADJ types: ない, 可能, 多い, なく, 同じ, いい, 多く, 高い, 大きな, 良い

The 10 most frequent ambiguous lemmas: 無い (ADJ 325, AUX 188), 良い (ADJ 179, AUX 23), 必要 (NOUN 44, ADJ 37), 別 (ADJ 26, NOUN 17), 特別 (ADJ 19, ADV 1), 親切 (ADJ 19, NOUN 2), 最高 (NOUN 20, ADJ 16), 十分 (ADJ 15, ADV 4), 普通 (ADJ 15, NOUN 12), 満足 (VERB 19, ADJ 13, NOUN 3)

The 10 most frequent ambiguous types: ない (AUX 562, ADJ 161), なく (AUX 109, ADJ 84), いい (ADJ 55, AUX 14, VERB 9), 多く (ADJ 54, NOUN 51), 良い (ADJ 44, AUX 3), 必要 (NOUN 44, ADJ 37), 無い (ADJ 28, AUX 2), なかっ (AUX 116, ADJ 27), 別 (ADJ 26, NOUN 17), 特別 (ADJ 19, ADV 1)

Morphology

The form / lemma ratio of ADJ is 1.313333 (the average of all parts of speech is 1.115220).
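The reported ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas given at the top of this page (985 and 750). A minimal sketch of that arithmetic, assuming "forms" here means distinct types; the function name is illustrative and not part of any UD tooling:

```python
def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_forms / n_lemmas


# Counts taken from this page: 985 ADJ types, 750 ADJ lemmas.
adj_ratio = form_lemma_ratio(985, 750)
print(f"{adj_ratio:.6f}")  # 1.313333
```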

The highest number of forms (12) was observed with the lemma “良い”: いい, よい, よかっ, よく, よけれ, よさ, イイ, 良, 良い, 良かっ, 良く, 良さ.

The 2nd highest number of forms (10) was observed with the lemma “無い”: ない, なかっ, なき, なく, なけれ, なさ, 亡き, 無い, 無かっ, 無く.

The 3rd highest number of forms (9) was observed with the lemma “美味しい”: おいし, おいしい, おいしかっ, おいしく, 美味し, 美味しい, 美味しかっ, 美味しく, 美味しゅう.
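Rankings like the three above could be reproduced by grouping the FORM column by the LEMMA column for tokens tagged ADJ in the treebank's CoNLL-U files. The sketch below is one possible way to do that, not the script that generated this page. The sample rows are made up for illustration (they are not real treebank sentences), and it assumes forms are compared without any case folding or normalization.

```python
from collections import defaultdict


def forms_per_lemma(conllu_text: str, upos: str = "ADJ") -> dict[str, set[str]]:
    """Map each lemma with the given UPOS tag to the set of its observed forms."""
    forms = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            forms[lemma].add(form)
    return dict(forms)


# Hypothetical one-token rows in CoNLL-U column order (ID, FORM, LEMMA, UPOS, ...).
SAMPLE_ROWS = [
    ["1", "よく", "良い", "ADJ", "_", "_", "0", "root", "_", "_"],
    ["1", "いい", "良い", "ADJ", "_", "_", "0", "root", "_", "_"],
    ["1", "ない", "無い", "ADJ", "_", "_", "0", "root", "_", "_"],
    ["1", "ない", "無い", "AUX", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

result = forms_per_lemma(SAMPLE)
# Rank lemmas by how many distinct forms they have, largest first.
ranking = sorted(result, key=lambda lemma: len(result[lemma]), reverse=True)
print(ranking[0], len(result[ranking[0]]))
```

The AUX row is ignored because only tokens tagged ADJ are counted, which mirrors how an ambiguous lemma such as 無い contributes to this page only through its ADJ occurrences.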

ADJ does not occur with any features.

Relations

ADJ nodes are attached to their parents using 14 different relations: advcl (1306; 34% instances), acl (1198; 31% instances), root (529; 14% instances), amod (445; 12% instances), nmod (150; 4% instances), ccomp (79; 2% instances), nsubj (27; 1% instances), dep (22; 1% instances), obj (22; 1% instances), obl (22; 1% instances), compound (16; 0% instances), fixed (13; 0% instances), csubj (9; 0% instances), dislocated (1; 0% instances)
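The percentages above appear to be each relation's count divided by the total number of ADJ tokens, rounded to a whole number, which is why relations with fewer than about 19 instances show as 0%. A short sketch that recomputes them from the counts listed above, assuming round-to-nearest (the page does not state its rounding rule):

```python
relation_counts = {
    "advcl": 1306, "acl": 1198, "root": 529, "amod": 445, "nmod": 150,
    "ccomp": 79, "nsubj": 27, "dep": 22, "obj": 22, "obl": 22,
    "compound": 16, "fixed": 13, "csubj": 9, "dislocated": 1,
}


def percentages(counts: dict[str, int]) -> dict[str, int]:
    """Share of each key in the total, as a whole-number percentage."""
    total = sum(counts.values())
    return {key: round(100 * value / total) for key, value in counts.items()}


total = sum(relation_counts.values())
shares = percentages(relation_counts)
print(total)  # 3839, the number of ADJ tokens
print(shares["advcl"], shares["nsubj"], shares["compound"])  # 34 1 0
```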

Parents of ADJ nodes belong to 10 different parts of speech, counting the artificial root (which has no part of speech) as one category: NOUN (1834; 48% instances), VERB (1152; 30% instances), no part of speech, i.e. the parent is the artificial root (529; 14% instances), ADJ (278; 7% instances), ADV (13; 0% instances), PROPN (12; 0% instances), ADP (11; 0% instances), PRON (6; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances)

980 (26%) ADJ nodes are leaves.

1218 (32%) ADJ nodes have one child.

548 (14%) ADJ nodes have two children.

1093 (28%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 9.

Children of ADJ nodes are attached using 18 different relations: aux (1707; 25% instances), nsubj (992; 15% instances), punct (968; 14% instances), advcl (695; 10% instances), obl (611; 9% instances), case (486; 7% instances), mark (455; 7% instances), advmod (331; 5% instances), compound (197; 3% instances), nmod (123; 2% instances), cc (49; 1% instances), dislocated (38; 1% instances), csubj (16; 0% instances), amod (14; 0% instances), obj (14; 0% instances), det (8; 0% instances), dep (6; 0% instances), nummod (2; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (1912; 28% instances), AUX (1707; 25% instances), PUNCT (968; 14% instances), ADP (486; 7% instances), VERB (370; 6% instances), ADV (339; 5% instances), SCONJ (311; 5% instances), ADJ (278; 4% instances), PART (150; 2% instances), PRON (65; 1% instances), PROPN (53; 1% instances), CCONJ (49; 1% instances), SYM (10; 0% instances), DET (8; 0% instances), NUM (6; 0% instances)
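The two breakdowns of children (by relation and by part of speech) describe the same set of nodes, so their totals should agree. The sketch below is a simple consistency check over the counts listed above; the derived mean is not a figure reported on this page, only arithmetic on its numbers.

```python
child_relations = {
    "aux": 1707, "nsubj": 992, "punct": 968, "advcl": 695, "obl": 611,
    "case": 486, "mark": 455, "advmod": 331, "compound": 197, "nmod": 123,
    "cc": 49, "dislocated": 38, "csubj": 16, "amod": 14, "obj": 14,
    "det": 8, "dep": 6, "nummod": 2,
}
child_pos = {
    "NOUN": 1912, "AUX": 1707, "PUNCT": 968, "ADP": 486, "VERB": 370,
    "ADV": 339, "SCONJ": 311, "ADJ": 278, "PART": 150, "PRON": 65,
    "PROPN": 53, "CCONJ": 49, "SYM": 10, "DET": 8, "NUM": 6,
}

total_by_relation = sum(child_relations.values())
total_by_pos = sum(child_pos.values())
adj_tokens = 3839  # number of ADJ tokens reported on this page

# Both breakdowns should count every child of an ADJ node exactly once.
print(total_by_relation, total_by_pos)  # 6712 6712

# Average number of children per ADJ node, derived from the totals above.
mean_children = total_by_relation / adj_tokens
print(f"{mean_children:.2f}")  # 1.75
```

The 278 ADJ children of ADJ nodes also match the 278 ADJ nodes whose parent is an ADJ, as expected when the same edges are counted from both ends.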