home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PRON

There are 40 PRON lemmas (1%), 40 PRON types (1%) and 7985 PRON tokens (6%). Out of 13 observed tags, the rank of PRON is: 6 in number of lemmas, 6 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: 之、 其、 是、 何、 吾、 此、 我、 諸、 自、 斯

The 10 most frequent PRON types: 之、 其、 是、 何、 吾、 此、 我、 諸、 自、 斯

The 10 most frequent ambiguous lemmas: 之 (SCONJ 3385, PRON 2476, VERB 81, PROPN 1), 其 (PRON 2131, PART 117), 是 (PRON 643, NOUN 14), 何 (PRON 375, ADV 114), 吾 (PRON 348, PROPN 1), 我 (PRON 271, PROPN 9), 諸 (NOUN 320, PRON 151, PART 2), 自 (PRON 136, ADP 121, VERB 17, ADV 1), 斯 (PRON 126, ADV 58, PART 2, NOUN 1, PROPN 1), 子 (NOUN 1534, PRON 115, VERB 13, NUM 1)

The 10 most frequent ambiguous types: 之 (SCONJ 3385, PRON 2476, VERB 81, PROPN 1), 其 (PRON 2131, PART 117), 是 (PRON 643, NOUN 14), 何 (PRON 375, ADV 114), 吾 (PRON 348, PROPN 1), 我 (PRON 271, PROPN 9), 諸 (NOUN 320, PRON 151, PART 2), 自 (PRON 136, ADP 121, VERB 17, ADV 1), 斯 (PRON 126, ADV 58, PART 2, NOUN 1, PROPN 1), 子 (NOUN 1534, PRON 115, VERB 13, NUM 1)

Morphology

The form / lemma ratio of PRON is 1.000000 (the average of all parts of speech is 1.002166).

The 1st highest number of forms (1) was observed with the lemma “乃”: 乃.

The 2nd highest number of forms (1) was observed with the lemma “之”: 之.

The 3rd highest number of forms (1) was observed with the lemma “予”: 予.

PRON occurs with 3 features: PronType (7985; 100% instances), Person (5613; 70% instances), Reflex (241; 3% instances)

PRON occurs with 7 feature-value pairs: Person=1, Person=2, Person=3, PronType=Dem, PronType=Int, PronType=Prs, Reflex=Yes

PRON occurs with 7 feature combinations. The most frequent feature combination is Person=3|PronType=Prs (4627 tokens). Examples: 之、 其、 厥

Relations

PRON nodes are attached to their parents using 18 different relations: obj (2729; 34% instances), det (2237; 28% instances), nsubj (1852; 23% instances), iobj (329; 4% instances), obl (319; 4% instances), expl (283; 4% instances), root (105; 1% instances), xcomp (85; 1% instances), flat (16; 0% instances), conj (8; 0% instances), obl:lmod (7; 0% instances), advcl (3; 0% instances), dislocated (3; 0% instances), vocative (3; 0% instances), ccomp (2; 0% instances), parataxis (2; 0% instances), compound (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of PRON nodes belong to 9 different parts of speech: VERB (5444; 68% instances), NOUN (2281; 29% instances), (105; 1% instances), PART (54; 1% instances), NUM (34; 0% instances), AUX (24; 0% instances), PRON (21; 0% instances), PROPN (20; 0% instances), INTJ (2; 0% instances)

7560 (95%) PRON nodes are leaves.

334 (4%) PRON nodes have one child.

71 (1%) PRON nodes have two children.

20 (0%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 22 different relations: case (285; 53% instances), discourse:sp (92; 17% instances), nsubj (50; 9% instances), csubj (29; 5% instances), advmod (15; 3% instances), nmod (11; 2% instances), cop (8; 1% instances), acl (7; 1% instances), discourse (7; 1% instances), flat (7; 1% instances), conj (6; 1% instances), amod (5; 1% instances), cc (5; 1% instances), det (4; 1% instances), expl (2; 0% instances), nummod (2; 0% instances), parataxis (2; 0% instances), advcl (1; 0% instances), list (1; 0% instances), mark (1; 0% instances), obl (1; 0% instances), obl:lmod (1; 0% instances)

Children of PRON nodes belong to 12 different parts of speech: ADP (241; 44% instances), PART (118; 22% instances), SCONJ (46; 8% instances), NOUN (40; 7% instances), VERB (36; 7% instances), PRON (21; 4% instances), ADV (14; 3% instances), AUX (10; 2% instances), PROPN (7; 1% instances), CCONJ (5; 1% instances), NUM (3; 1% instances), INTJ (1; 0% instances)