home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-Modern: POS Tags: NOUN

There are 1603 NOUN lemmas (62%), 1603 NOUN types (54%) and 4311 NOUN tokens (30%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: 者, ヿ, 國, 政治, 法, 事, 人, 曰く, 所, 時

The 10 most frequent NOUN types: 者, ヿ, 國, 政治, 法, 事, 人, 曰く, 所, 時

The 10 most frequent ambiguous lemmas: 者 (NOUN 122, PART 2), 國 (NOUN 65, PART 9), 法 (NOUN 50, PART 6, PROPN 1), 人 (NOUN 45, PART 19), 所 (NOUN 43, PART 2), 民 (NOUN 33, PART 3), 上 (PART 27, NOUN 9), 中 (PART 12, NOUN 8), 力 (NOUN 8, PART 1), 如何 (NOUN 8, ADV 1)

The 10 most frequent ambiguous types: 者 (NOUN 122, PART 2), 國 (NOUN 65, PART 9), 法 (NOUN 50, PART 6, PROPN 1), 人 (NOUN 45, PART 19), 所 (NOUN 43, PART 2), 民 (NOUN 33, PART 3), 説 (NOUN 23, VERB 3), 可 (NOUN 11, AUX 2), 上 (PART 27, NOUN 9), 習 (NOUN 9, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.139839).

The 1st highest number of forms (1) was observed with the lemma “〓々”: 〓々.

The 2nd highest number of forms (1) was observed with the lemma “いかん”: いかん.

The 3rd highest number of forms (1) was observed with the lemma “かづき”: かづき.

NOUN occurs with 1 features: Polarity (3; 0% instances)

NOUN occurs with 1 feature-value pairs: Polarity=Neg

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (4308 tokens). Examples: 者, ヿ, 國, 政治, 法, 事, 人, 曰く, 所, 時

Relations

NOUN nodes are attached to their parents using 8 different relations: nmod (1791; 42% instances), compound (700; 16% instances), obj (682; 16% instances), iobj (443; 10% instances), obl (285; 7% instances), root (213; 5% instances), nsubj (179; 4% instances), dep (18; 0% instances)

Parents of NOUN nodes belong to 11 different parts of speech: VERB (1962; 46% instances), NOUN (1701; 39% instances), (213; 5% instances), PART (152; 4% instances), ADJ (98; 2% instances), AUX (95; 2% instances), NUM (48; 1% instances), PRON (21; 0% instances), PROPN (15; 0% instances), ADV (5; 0% instances), X (1; 0% instances)

1060 (25%) NOUN nodes are leaves.

1383 (32%) NOUN nodes have one child.

1172 (27%) NOUN nodes have two children.

696 (16%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 13.

Children of NOUN nodes are attached using 18 different relations: case (2267; 35% instances), nmod (1479; 23% instances), compound (767; 12% instances), aux (440; 7% instances), acl (420; 7% instances), amod (146; 2% instances), obj (133; 2% instances), nummod (127; 2% instances), cc (112; 2% instances), nsubj (101; 2% instances), obl (93; 1% instances), advmod (87; 1% instances), dep (81; 1% instances), punct (69; 1% instances), mark (68; 1% instances), iobj (48; 1% instances), det (14; 0% instances), discourse (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (2267; 35% instances), NOUN (1701; 26% instances), VERB (788; 12% instances), AUX (449; 7% instances), PRON (403; 6% instances), ADJ (158; 2% instances), NUM (144; 2% instances), CCONJ (112; 2% instances), ADV (95; 1% instances), PART (79; 1% instances), PUNCT (69; 1% instances), SCONJ (68; 1% instances), PROPN (62; 1% instances), SYM (42; 1% instances), DET (14; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)