home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: PRON

There are 35 PRON lemmas (0%), 55 PRON types (0%) and 1108 PRON tokens (1%). Out of 16 observed tags, the rank of PRON is: 9 in number of lemmas, 8 in number of types and 13 in number of tokens.

The 10 most frequent PRON lemmas: 此れ, 其れ, 彼, 私, 此処, 何, 此方, 其処, 何時, 何れ

The 10 most frequent PRON types: これ, それ, 彼, 私, ここ, 何, こちら, そこ, いつ, 彼女

The 10 most frequent ambiguous lemmas: 其れ (PRON 137, CCONJ 18), 私 (PRON 99, NOUN 2), 何 (PRON 73, NUM 27), 何れ (PRON 36, ADV 1), そんな (PRON 24, ADJ 2), こんな (PRON 14, ADJ 1), どんな (PRON 8, ADJ 1), 君 (NOUN 20, PRON 3)

The 10 most frequent ambiguous types: それ (PRON 136, CCONJ 18), 私 (PRON 93, NOUN 2), 何 (PRON 50, NUM 27), そんな (PRON 24, ADJ 2), いずれ (PRON 23, ADV 1), こんな (PRON 14, ADJ 1), どんな (PRON 8, ADJ 1), あれ (VERB 25, PRON 5), か (PART 242, ADP 182, PRON 1), 余 (ADV 1, NOUN 1, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.571429 (the average of all parts of speech is 1.115220).

The 1st highest number of forms (3) was observed with the lemma “何”: なに, なん, 何.

The 2nd highest number of forms (3) was observed with the lemma “何処”: いずこ, どこ, 何処.

The 3rd highest number of forms (3) was observed with the lemma “僕”: ぼく, ボク, 僕.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 9 different relations: nmod (334; 30% instances), obl (331; 30% instances), nsubj (309; 28% instances), obj (95; 9% instances), root (14; 1% instances), advcl (9; 1% instances), dislocated (8; 1% instances), compound (5; 0% instances), acl (3; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (572; 52% instances), NOUN (426; 38% instances), ADJ (65; 6% instances), ADV (16; 1% instances), (14; 1% instances), PRON (7; 1% instances), PROPN (6; 1% instances), NUM (2; 0% instances)

180 (16%) PRON nodes are leaves.

687 (62%) PRON nodes have one child.

181 (16%) PRON nodes have two children.

60 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.

Children of PRON nodes are attached using 17 different relations: case (1022; 80% instances), punct (101; 8% instances), cop (31; 2% instances), acl (29; 2% instances), nmod (27; 2% instances), mark (24; 2% instances), nsubj (10; 1% instances), aux (8; 1% instances), advmod (6; 0% instances), advcl (3; 0% instances), obl (3; 0% instances), csubj (2; 0% instances), dep (2; 0% instances), det (2; 0% instances), cc (1; 0% instances), compound (1; 0% instances), dislocated (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: ADP (1022; 80% instances), PUNCT (101; 8% instances), AUX (39; 3% instances), NOUN (36; 3% instances), VERB (26; 2% instances), PART (14; 1% instances), SCONJ (10; 1% instances), PRON (7; 1% instances), ADJ (6; 0% instances), ADV (6; 0% instances), DET (2; 0% instances), PROPN (2; 0% instances), CCONJ (1; 0% instances), SYM (1; 0% instances)