This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: PRON

There is 1 PRON lemma (7%), 37 PRON types (1%), and 710 PRON tokens (3%). Out of 15 observed tags, PRON ranks 10th in number of lemmas, 11th in number of types, and 9th in number of tokens.

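The 10 most frequent PRON lemmas: _ (the underscore is the only lemma value observed for PRON; in CoNLL-U it marks an unspecified field)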

The 10 most frequent PRON types: 他、 他們、 她、 我、 這、 其、 它、 我們、 自己、 此

The 10 most frequent ambiguous lemmas: _ (NOUN 5410, VERB 3467, PUNCT 2902, PART 1881, PROPN 1361, ADP 1288, ADV 1283, NUM 873, PRON 710, ADJ 650, AUX 618, DET 355, X 306, CCONJ 283, SCONJ 28)

The 10 most frequent ambiguous types: 這 (DET 107, PRON 45), 此 (PRON 20, DET 9), 之 (PART 21, PRON 6), 那 (DET 21, PRON 6, ADV 2), 怎麼 (ADV 2, PRON 2), 個人 (ADJ 2, PRON 1)

Morphology

The form / lemma ratio of PRON is 37.000000 (the average of all parts of speech is 388.466667).

The highest number of forms (37) was observed with the lemma “_”: 之, 什麼, 他, 他們, 何, 你, 你們, 個人, 其, 哪, 哪兒, 大家, 她, 她們, 它, 它們, 對方, 怎麼, 您, 我, 我們, 此, 牠們, 甚麽, 自己, 自身, 誰, 這, 這樣, 這裡, 這麼, 那, 那兒, 那樣, 那裡, 那里, 閣下.
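The form / lemma ratio reported above can be reproduced with a few lines of code. The following is a minimal sketch, assuming the ratio is the number of distinct word forms divided by the number of distinct lemmas for a tag (which matches 37 / 1 = 37 here). The function name `form_lemma_ratio` and the toy triples are hypothetical and are not taken from the treebank or from the script that generated this page.

```python
def form_lemma_ratio(tokens, upos):
    """Return distinct forms / distinct lemmas for one UPOS tag.

    tokens: iterable of (form, lemma, upos) triples.
    Returns 0.0 when the tag does not occur at all.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        return 0.0
    return len(forms) / len(lemmas)


# Toy data for illustration only (assumption: lemmas are all "_",
# as on this page).
sample = [
    ("他", "_", "PRON"),
    ("她", "_", "PRON"),
    ("他", "_", "PRON"),
    ("看", "_", "VERB"),
]

ratio = form_lemma_ratio(sample, "PRON")
print(f"{ratio:.6f}")  # 2.000000
```

Because every token carries the same placeholder lemma, the ratio simply equals the number of distinct forms, which is why the value on this page is so much larger than in treebanks with real lemmas.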

PRON occurs with 2 features: Person (543; 76% instances), Number (144; 20% instances)

PRON occurs with 4 feature-value pairs: Number=Plur, Person=1, Person=2, Person=3

PRON occurs with 7 feature combinations. The most frequent feature combination is Person=3 (323 tokens). Examples: 他、 她、 其、 他們、 它、 它們、 她們、 牠們

Relations

PRON nodes are attached to their parents using 16 different relations: nsubj (393; 55% instances), nmod (105; 15% instances), compound (73; 10% instances), obj (53; 7% instances), obl (43; 6% instances), nsubj:pass (9; 1% instances), appos (8; 1% instances), advmod (6; 1% instances), obl:patient (6; 1% instances), det (3; 0% instances), root (3; 0% instances), ccomp (2; 0% instances), obl:agent (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), iobj (1; 0% instances)
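Counts of this kind can be derived directly from a CoNLL-U file. The sketch below is one possible way to do it, not the script that produced this page: it reads the UPOS (column 4) and DEPREL (column 8) fields, counts relations for one tag, and reports rounded percentages (the figures above are consistent with rounding, e.g. 105 / 710 ≈ 15%). The names `read_tokens` and `relation_report` and the toy sentences are hypothetical.

```python
from collections import Counter


def read_tokens(conllu_text):
    """Yield (upos, deprel) for every regular word line in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        yield cols[3], cols[7]


def relation_report(conllu_text, upos):
    """Return [(deprel, count, rounded percent)] for one tag, most frequent first."""
    counts = Counter(rel for tag, rel in read_tokens(conllu_text) if tag == upos)
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Two toy sentences for illustration only; not treebank data.
rows_1 = [
    ["1", "他", "_", "PRON", "_", "Person=3", "2", "nsubj", "_", "_"],
    ["2", "看", "_", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "她", "_", "PRON", "_", "Person=3", "2", "obj", "_", "_"],
]
rows_2 = [
    ["1", "我", "_", "PRON", "_", "Person=1", "2", "nsubj", "_", "_"],
    ["2", "笑", "_", "VERB", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in rows) for rows in (rows_1, rows_2)
) + "\n"

for rel, n, pct in relation_report(SAMPLE, "PRON"):
    print(f"{rel} ({n}; {pct}% instances)")
```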

Parents of PRON nodes belong to 10 different parts of speech, counting the untagged artificial root: VERB (456; 64% instances), NOUN (212; 30% instances), ADJ (26; 4% instances), ADP (3; 0% instances), PRON (3; 0% instances), PROPN (3; 0% instances), artificial root, no tag (3; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), X (1; 0% instances)

529 (75%) PRON nodes are leaves.

163 (23%) PRON nodes have one child.

11 (2%) PRON nodes have two children.

7 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.
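The leaf and child-degree figures above amount to a histogram of how many dependents each PRON node has. A minimal sketch of that computation follows, assuming each sentence is already available as a list of `(id, upos, head)` triples; the function name `child_degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos):
    """Map number of children -> how many nodes with the given tag have it.

    sentences: list of sentences, each a list of (id, upos, head) triples,
    where head is the id of the parent (0 for the root).
    """
    histogram = Counter()
    for sentence in sentences:
        # Count how many tokens point at each head id within this sentence.
        children = Counter(head for _, _, head in sentence)
        for tok_id, tag, _ in sentence:
            if tag == upos:
                histogram[children[tok_id]] += 1
    return histogram


# Toy sentence for illustration only: an adposition attached to a pronoun,
# a verb as root, and a second pronoun with no dependents.
toy = [[(1, "ADP", 2), (2, "PRON", 3), (3, "VERB", 0), (4, "PRON", 3)]]

hist = child_degree_histogram(toy, "PRON")
print(hist[0], "leaves;", hist[1], "with one child; highest degree:", max(hist))
```

Child ids are counted per sentence because token ids restart at 1 in every sentence.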

Children of PRON nodes are attached using 13 different relations: case (154; 74% instances), punct (11; 5% instances), case:loc (9; 4% instances), appos (7; 3% instances), cop (7; 3% instances), nsubj (7; 3% instances), conj (5; 2% instances), acl:relcl (3; 1% instances), mark (2; 1% instances), acl (1; 0% instances), advmod (1; 0% instances), det (1; 0% instances), mark:relcl (1; 0% instances)

Children of PRON nodes belong to 12 different parts of speech: PART (108; 52% instances), ADP (56; 27% instances), NOUN (14; 7% instances), PUNCT (11; 5% instances), AUX (7; 3% instances), VERB (4; 2% instances), PRON (3; 1% instances), SCONJ (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), PROPN (1; 0% instances), X (1; 0% instances)