home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: PRON

There are 44 PRON lemmas (0%), 61 PRON types (0%) and 1050 PRON tokens (1%). Out of 17 observed tags, the rank of PRON is: 9 in number of lemmas, 9 in number of types and 11 in number of tokens.

The 10 most frequent PRON lemmas: 此れ, 其れ, 此処, 私, 彼, 何, 此方, 其処, 何れ, 何時

The 10 most frequent PRON types: これ, それ, ここ, 私, 彼, こちら, そこ, これら, 何, いつ

The 10 most frequent ambiguous lemmas: 何れ (PRON 36, ADV 1), そんな (PRON 24, ADJ 2), こんな (PRON 14, ADJ 1), どんな (PRON 8, ADJ 1), 何等 (ADV 4, PRON 4)

The 10 most frequent ambiguous types: そんな (PRON 24, ADJ 2), いずれ (PRON 23, ADV 1), こんな (PRON 14, ADJ 1), どんな (PRON 8, ADJ 1), あれ (VERB 6, PRON 5), 何ら (ADV 4, PRON 4), か (PART 229, ADP 160, PRON 1), 余 (ADV 1, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.386364 (the average of all parts of speech is 1.095294).

The 1st highest number of forms (3) was observed with the lemma “何”: なに, なん, 何.

The 2nd highest number of forms (3) was observed with the lemma “何処”: いずこ, どこ, 何処.

The 3rd highest number of forms (3) was observed with the lemma “僕”: ぼく, ボク, 僕.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 8 different relations: obl (330; 31% instances), nsubj (319; 30% instances), nmod (258; 25% instances), obj (98; 9% instances), nsubj:outer (21; 2% instances), root (14; 1% instances), advcl (8; 1% instances), acl (2; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (603; 57% instances), NOUN (332; 32% instances), ADJ (64; 6% instances), (14; 1% instances), NUM (12; 1% instances), ADV (11; 1% instances), PRON (8; 1% instances), PROPN (6; 1% instances)

68 (6%) PRON nodes are leaves.

746 (71%) PRON nodes have one child.

175 (17%) PRON nodes have two children.

61 (6%) PRON nodes have three or more children.

The highest child degree of a PRON node is 11.

Children of PRON nodes are attached using 16 different relations: case (1072; 81% instances), punct (105; 8% instances), acl (31; 2% instances), nmod (28; 2% instances), cop (25; 2% instances), mark (23; 2% instances), aux (10; 1% instances), nsubj (10; 1% instances), advmod (6; 0% instances), obl (3; 0% instances), advcl (2; 0% instances), csubj (2; 0% instances), dep (2; 0% instances), det (2; 0% instances), cc (1; 0% instances), nsubj:outer (1; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (1072; 81% instances), PUNCT (105; 8% instances), AUX (35; 3% instances), NOUN (31; 2% instances), VERB (27; 2% instances), PART (13; 1% instances), SCONJ (10; 1% instances), PRON (8; 1% instances), ADJ (6; 0% instances), ADV (6; 0% instances), PROPN (5; 0% instances), DET (2; 0% instances), CCONJ (1; 0% instances), NUM (1; 0% instances), SYM (1; 0% instances)