home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUD: POS Tags: PRON

There are 18 PRON lemmas (0%), 21 PRON types (0%) and 443 PRON tokens (2%). Out of 16 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: 彼, 其れ, 此れ, 私, 彼女, 其処, 誰, 何, 何れ, 何処

The 10 most frequent PRON types: 彼, それ, これ, 私, 彼女, そこ, 誰, 何, あなた, ここ

The 10 most frequent ambiguous lemmas: 其れ (PRON 68, CCONJ 4), 何 (NUM 12, PRON 9)

The 10 most frequent ambiguous types: それ (PRON 68, CCONJ 4), 何 (NUM 12, PRON 8)

Morphology

The form / lemma ratio of PRON is 1.166667 (the average of all parts of speech is 1.068660).

The 1st highest number of forms (2) was observed with the lemma “何”: なん, 何.

The 2nd highest number of forms (2) was observed with the lemma “何れ”: いずれ, どれ.

The 3rd highest number of forms (2) was observed with the lemma “彼”: かれ, 彼.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 8 different relations: nsubj (190; 43% instances), nmod (156; 35% instances), obl (61; 14% instances), obj (23; 5% instances), dislocated (6; 1% instances), advcl (3; 1% instances), compound (2; 0% instances), root (2; 0% instances)

Parents of PRON nodes belong to 7 different parts of speech: VERB (239; 54% instances), NOUN (183; 41% instances), ADJ (12; 3% instances), ADV (3; 1% instances), PROPN (3; 1% instances), (2; 0% instances), PRON (1; 0% instances)

88 (20%) PRON nodes are leaves.

265 (60%) PRON nodes have one child.

70 (16%) PRON nodes have two children.

20 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 5.

Children of PRON nodes are attached using 12 different relations: case (369; 78% instances), punct (69; 14% instances), nmod (9; 2% instances), cop (7; 1% instances), acl (6; 1% instances), mark (6; 1% instances), nsubj (4; 1% instances), advmod (2; 0% instances), aux (1; 0% instances), compound (1; 0% instances), det (1; 0% instances), obl (1; 0% instances)

Children of PRON nodes belong to 10 different parts of speech: ADP (369; 78% instances), PUNCT (69; 14% instances), NOUN (15; 3% instances), AUX (8; 2% instances), VERB (5; 1% instances), SCONJ (4; 1% instances), ADV (2; 0% instances), PART (2; 0% instances), DET (1; 0% instances), PRON (1; 0% instances)