home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PRON

There are 20 PRON lemmas (4%), 22 PRON types (4%) and 170 PRON tokens (9%). Out of 17 observed tags, the rank of PRON is: 7 in number of lemmas, 5 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: 我、 _、 別人、 這裡、 你、 我們、 自己、 他們、 他、 什麼

The 10 most frequent PRON types: 我、 你、 別人、 我們、 他、 這裡、 自己、 他們、 什麼、 這

The 10 most frequent ambiguous lemmas: _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1), 自己 (PRON 7, ADV 1), 什麼 (DET 3, PRON 3), 這樣 (ADV 2, PRON 2), 那 (DET 6, ADV 2, PRON 2), 怎樣 (ADV 1, PRON 1), 這 (DET 12, PRON 1)

The 10 most frequent ambiguous types: 自己 (PRON 7, ADV 1), 什麼 (DET 6, PRON 4), 這 (DET 12, PRON 3), 這些 (PRON 3, DET 2), 這樣 (ADV 2, PRON 2), 那 (DET 7, ADV 3, PRON 2), 多少 (DET 2, PRON 1), 怎樣 (ADV 1, PRON 1)

Morphology

The form / lemma ratio of PRON is 1.100000 (the average of all parts of speech is 1.221258).

The 1st highest number of forms (9) was observed with the lemma “_”: 什麼, 他, 你, 哪兒, 多少, 我, 我們, 這, 這些.

The 2nd highest number of forms (1) was observed with the lemma “人人”: 人人.

The 3rd highest number of forms (1) was observed with the lemma “什麼”: 什麼.

PRON does not occur with any features.

Relations

PRON nodes are attached to their parents using 9 different relations: nsubj (90; 53% instances), obj (34; 20% instances), obl (17; 10% instances), nmod (13; 8% instances), dislocated (5; 3% instances), root (5; 3% instances), iobj (4; 2% instances), appos (1; 1% instances), conj (1; 1% instances)

Parents of PRON nodes belong to 6 different parts of speech: VERB (134; 79% instances), NOUN (23; 14% instances), ADJ (5; 3% instances), (5; 3% instances), PROPN (2; 1% instances), PRON (1; 1% instances)

139 (82%) PRON nodes are leaves.

26 (15%) PRON nodes have one child.

2 (1%) PRON nodes have two children.

3 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 6.

Children of PRON nodes are attached using 11 different relations: case (22; 50% instances), punct (6; 14% instances), cop (3; 7% instances), discourse:sp (3; 7% instances), appos (2; 5% instances), obl:tmod (2; 5% instances), parataxis (2; 5% instances), acl (1; 2% instances), advmod (1; 2% instances), det (1; 2% instances), nmod (1; 2% instances)

Children of PRON nodes belong to 9 different parts of speech: ADP (13; 30% instances), PART (10; 23% instances), VERB (8; 18% instances), PUNCT (6; 14% instances), NOUN (3; 7% instances), ADJ (1; 2% instances), ADV (1; 2% instances), DET (1; 2% instances), PRON (1; 2% instances)