home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-PUD: POS Tags: PRON

There are 21 PRON lemmas (0%), 21 PRON types (0%) and 446 PRON tokens (2%). Out of 16 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent PRON lemmas: 彼, それ, これ, 私, 彼女, そこ, 誰, 何, ある, あなた

The 10 most frequent PRON types: 彼, それ, これ, 私, 彼女, そこ, 誰, 何, ある, あなた

The 10 most frequent ambiguous lemmas: それ (PRON 68, CCONJ 4), 何 (NUM 12, PRON 8), ある (VERB 308, PRON 6)

The 10 most frequent ambiguous types: それ (PRON 68, CCONJ 4), 何 (NUM 12, PRON 8), ある (VERB 196, PRON 6)

Morphology

The form / lemma ratio of PRON is 1.000000 (the average of all parts of speech is 1.050009).

The 1st highest number of forms (1) was observed with the lemma “You”: You.

The 2nd highest number of forms (1) was observed with the lemma “あなた”: あなた.

The 3rd highest number of forms (1) was observed with the lemma “ある”: ある.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 8 different relations: nsubj (190; 43% instances), nmod (160; 36% instances), obl (60; 13% instances), obj (23; 5% instances), dislocated (6; 1% instances), advcl (3; 1% instances), compound (2; 0% instances), root (2; 0% instances)

Parents of PRON nodes belong to 7 different parts of speech: VERB (239; 54% instances), NOUN (186; 42% instances), ADJ (12; 3% instances), PROPN (4; 1% instances), ADV (2; 0% instances), (2; 0% instances), PRON (1; 0% instances)

91 (20%) PRON nodes are leaves.

265 (59%) PRON nodes have one child.

70 (16%) PRON nodes have two children.

20 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 5.

Children of PRON nodes are attached using 12 different relations: case (369; 78% instances), punct (69; 14% instances), nmod (9; 2% instances), cop (7; 1% instances), acl (6; 1% instances), mark (6; 1% instances), nsubj (4; 1% instances), advmod (2; 0% instances), aux (1; 0% instances), compound (1; 0% instances), det (1; 0% instances), obl (1; 0% instances)

Children of PRON nodes belong to 10 different parts of speech: ADP (369; 78% instances), PUNCT (69; 14% instances), NOUN (15; 3% instances), AUX (8; 2% instances), VERB (5; 1% instances), SCONJ (4; 1% instances), ADV (2; 0% instances), PART (2; 0% instances), DET (1; 0% instances), PRON (1; 0% instances)