home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PRON

There are 39 PRON lemmas (2%), 41 PRON types (2%) and 875 PRON tokens (9%). Out of 16 observed tags, the rank of PRON is: 7 in number of lemmas, 7 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: 我、 你、 他、 大家、 這、 自己、 這裡、 什麼、 別人、 你們

The 10 most frequent PRON types: 我、 你、 我們、 他、 大家、 這、 他們、 自己、 這裡、 什麼

The 10 most frequent ambiguous lemmas: 這 (DET 52, PRON 19), 自己 (PRON 13, ADV 2), 什麼 (DET 22, PRON 10), 這樣 (ADV 12, PRON 9), 甚麼 (DET 5, PRON 4, NOUN 1), 哪 (PRON 3, ADV 1), 這些 (DET 13, PRON 3), 那 (DET 28, ADV 9, PRON 3), 那邊 (PRON 3, NOUN 1), 此 (DET 6, PRON 2)

The 10 most frequent ambiguous types: 這 (DET 52, PRON 19), 自己 (PRON 13, ADV 2), 什麼 (DET 22, PRON 10), 這樣 (ADV 12, PRON 9), 甚麼 (DET 5, PRON 4, NOUN 1), 哪 (PRON 3, ADV 1), 這些 (DET 13, PRON 3), 那 (DET 28, ADV 9, PRON 3), 那邊 (PRON 3, NOUN 1), 此 (DET 6, PRON 2)

Morphology

The form / lemma ratio of PRON is 1.051282 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (2) was observed with the lemma “他”: 他, 他們.

The 2nd highest number of forms (2) was observed with the lemma “我”: 我, 我們.

The 3rd highest number of forms (1) was observed with the lemma “人人”: 人人.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 14 different relations: nsubj (554; 63% instances), obj (143; 16% instances), nmod (78; 9% instances), obl (61; 7% instances), root (16; 2% instances), iobj (7; 1% instances), compound (5; 1% instances), appos (4; 0% instances), det (2; 0% instances), ccomp (1; 0% instances), conj (1; 0% instances), dislocated (1; 0% instances), obj:periph (1; 0% instances), obl:patient (1; 0% instances)

Parents of PRON nodes belong to 9 different parts of speech: VERB (719; 82% instances), NOUN (117; 13% instances), (16; 2% instances), ADJ (12; 1% instances), ADV (3; 0% instances), AUX (3; 0% instances), PRON (2; 0% instances), PROPN (2; 0% instances), ADP (1; 0% instances)

739 (84%) PRON nodes are leaves.

119 (14%) PRON nodes have one child.

7 (1%) PRON nodes have two children.

10 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 6.

Children of PRON nodes are attached using 18 different relations: case (107; 62% instances), punct (17; 10% instances), advmod (8; 5% instances), cop (7; 4% instances), conj (6; 3% instances), acl (4; 2% instances), discourse:sp (4; 2% instances), nsubj (4; 2% instances), appos (3; 2% instances), clf (3; 2% instances), obl:tmod (2; 1% instances), parataxis (2; 1% instances), aux (1; 1% instances), case:loc (1; 1% instances), discourse (1; 1% instances), flat (1; 1% instances), mark:rel (1; 1% instances), nmod (1; 1% instances)

Children of PRON nodes belong to 11 different parts of speech: ADP (60; 35% instances), PART (51; 29% instances), PUNCT (17; 10% instances), NOUN (15; 9% instances), ADV (8; 5% instances), AUX (8; 5% instances), VERB (6; 3% instances), ADJ (3; 2% instances), PRON (2; 1% instances), PROPN (2; 1% instances), INTJ (1; 1% instances)