
This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PART

There are 39 PART lemmas (0%), 39 PART types (0%) and 12365 PART tokens (5%). Out of 13 observed tags, the rank of PART is: 7 in number of lemmas, 7 in number of types and 5 in number of tokens.
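As a rough illustration of how figures like these can be tallied, the following Python sketch counts lemmas, types and tokens per tag and derives a rank. It assumes that a type is a distinct word form and that ties in the ranking are broken by input order. The toy rows and the names `pos_counts` and `rank` are hypothetical; this is not the script that generated the page.

```python
from collections import defaultdict

# Hypothetical toy rows (FORM, LEMMA, UPOS); not drawn from the treebank.
TOKENS = [
    ("仁", "仁", "NOUN"),
    ("者", "者", "PART"),
    ("愛", "愛", "VERB"),
    ("人", "人", "NOUN"),
    ("也", "也", "PART"),
    ("知", "知", "VERB"),
    ("者", "者", "PART"),
]

def pos_counts(tokens):
    """Map each tag to (distinct lemmas, distinct forms, token count)."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    n_tokens = defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)
        types[upos].add(form)
        n_tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), n_tokens[u]) for u in n_tokens}

def rank(counts, upos, column):
    """1-based rank of `upos` when tags are sorted by `column`, descending.

    column: 0 = lemmas, 1 = types, 2 = tokens. Ties keep input order
    (an assumption; the page does not say how ties are resolved).
    """
    ordered = sorted(counts, key=lambda u: counts[u][column], reverse=True)
    return ordered.index(upos) + 1

counts = pos_counts(TOKENS)
print(counts["PART"], rank(counts, "PART", 2))
```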

The 10 most frequent PART lemmas: 也、 者、 所、 矣、 乎、 焉、 哉、 夫、 與、 已

The 10 most frequent PART types: 也、 者、 所、 矣、 乎、 焉、 哉、 夫、 與、 已

The 10 most frequent ambiguous lemmas: 所 (PART 1082, NOUN 40), 乎 (PART 559, ADP 197), 焉 (PART 472, PRON 63, ADV 46), 夫 (NOUN 736, PART 221, PRON 37), 與 (ADP 725, VERB 182, PART 173, ADV 46, NOUN 1), 已 (ADV 172, PART 163, VERB 79), 然 (ADV 269, VERB 138, PART 124), 而 (CCONJ 3804, PART 124, PRON 7), 其 (PRON 3000, PART 121), 耳 (PART 109, NOUN 31, PROPN 7)

The 10 most frequent ambiguous types: 者 (PART 3026, ADP 3), 所 (PART 1082, NOUN 40), 乎 (PART 559, ADP 197), 焉 (PART 472, PRON 63, ADV 46), 夫 (NOUN 736, PART 221, PRON 37), 與 (ADP 725, VERB 182, PART 173, ADV 46, NOUN 1), 已 (ADV 172, PART 163, VERB 79), 然 (ADV 269, VERB 138, PART 124), 而 (CCONJ 3804, PART 124, PRON 7, ADP 1), 其 (PRON 3000, PART 121)
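The ambiguity lists can be approximated with a small Python sketch. It treats a lemma as ambiguous when it occurs with PART and at least one other tag, and it orders lemmas by their PART frequency, which is an assumption inferred from the order of the lists above. The counts for 所, 夫 and 耳 are the ones reported above; the count for 也 is a placeholder, and the name `ambiguous_lemmas` is hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, upos="PART", top=10):
    """Return lemmas seen with `upos` and at least one other tag."""
    by_lemma = defaultdict(Counter)
    for lemma, tag in tokens:
        by_lemma[lemma][tag] += 1
    candidates = [
        (lemma, tags)
        for lemma, tags in by_lemma.items()
        if upos in tags and len(tags) > 1
    ]
    # Assumed ordering: most frequent as `upos` first.
    candidates.sort(key=lambda item: item[1][upos], reverse=True)
    return [(lemma, tags.most_common()) for lemma, tags in candidates[:top]]

# (LEMMA, UPOS) pairs rebuilt from the counts listed above.
TOKENS = (
    [("所", "PART")] * 1082 + [("所", "NOUN")] * 40
    + [("夫", "NOUN")] * 736 + [("夫", "PART")] * 221 + [("夫", "PRON")] * 37
    + [("耳", "PART")] * 109 + [("耳", "NOUN")] * 31 + [("耳", "PROPN")] * 7
    + [("也", "PART")] * 3  # placeholder count; only ever PART here, so filtered out
)

for lemma, tags in ambiguous_lemmas(TOKENS):
    print(lemma, tags)
```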

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.011910).

No PART lemma has more than one form. The ranking of lemmas by number of forms therefore begins with three lemmas that have a single form each: “乎” (乎), “也” (也) and “于” (于).
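A form / lemma ratio of this kind could be computed as in the Python sketch below. It assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas for the tag; the page does not spell out the exact definition. The toy rows, including the placeholder lemma `X`, and the function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical toy rows (FORM, LEMMA, UPOS); "X" is a placeholder lemma
# with two forms, included only to show a ratio above 1.
TOKENS = [
    ("乎", "乎", "PART"),
    ("也", "也", "PART"),
    ("于", "于", "PART"),
    ("也", "也", "PART"),
    ("X1", "X", "NOUN"),
    ("X2", "X", "NOUN"),
]

def forms_per_lemma(tokens, upos):
    """Map each lemma tagged `upos` to the set of forms it occurs in."""
    forms = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            forms[lemma].add(form)
    return forms

def form_lemma_ratio(tokens, upos):
    """Distinct (lemma, form) pairs divided by distinct lemmas (assumed definition)."""
    forms = forms_per_lemma(tokens, upos)
    if not forms:
        raise ValueError(f"no tokens tagged {upos}")
    return sum(len(f) for f in forms.values()) / len(forms)

print(f"{form_lemma_ratio(TOKENS, 'PART'):.6f}")
```

A ratio of exactly 1 means that every lemma of the tag occurs in a single form, which matches the maximum of one form per lemma reported above.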

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 22 different relations: discourse:sp (7153; 58% instances), nsubj (1918; 16% instances), mark (916; 7% instances), case (648; 5% instances), obj (606; 5% instances), discourse (362; 3% instances), fixed (236; 2% instances), root (228; 2% instances), advmod (75; 1% instances), nmod (67; 1% instances), obl (67; 1% instances), conj (40; 0% instances), dislocated (32; 0% instances), ccomp (5; 0% instances), nsubj:pass (3; 0% instances), flat (2; 0% instances), iobj (2; 0% instances), advcl (1; 0% instances), cc (1; 0% instances), list (1; 0% instances), obl:lmod (1; 0% instances), parataxis (1; 0% instances)

Parents of PART nodes fall into 11 categories (10 parts of speech, plus root nodes, which have no parent): VERB (8970; 73% instances), NOUN (1894; 15% instances), ADV (322; 3% instances), PART (275; 2% instances), NUM (274; 2% instances), no parent, i.e. root (228; 2% instances), PROPN (194; 2% instances), PRON (140; 1% instances), AUX (60; 0% instances), INTJ (7; 0% instances), ADP (1; 0% instances)
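The two attachment distributions can be tallied from dependency rows as in the Python sketch below. A node attached as root has HEAD 0 and therefore no parent token, so the sketch records `None` as its parent tag. The toy sentence and its annotation are illustrative and not copied from the treebank, and `attachment_stats` and `describe` are hypothetical names.

```python
from collections import Counter

# Hypothetical toy sentence as (ID, FORM, UPOS, HEAD, DEPREL).
SENTENCE = [
    (1, "仁", "VERB", 2, "acl"),
    (2, "者", "PART", 3, "nsubj"),
    (3, "愛", "VERB", 0, "root"),
    (4, "人", "NOUN", 3, "obj"),
    (5, "也", "PART", 3, "discourse:sp"),
]

def attachment_stats(sentence, upos="PART"):
    """Count incoming relations and parent tags for nodes tagged `upos`."""
    tag_of = {tok_id: tag for tok_id, _form, tag, _head, _rel in sentence}
    relations, parent_tags = Counter(), Counter()
    for _id, _form, tag, head, deprel in sentence:
        if tag != upos:
            continue
        relations[deprel] += 1
        # HEAD 0 marks the root; tag_of.get(0) is None (no parent token).
        parent_tags[tag_of.get(head)] += 1
    return relations, parent_tags

def describe(counter):
    """Format counts in the 'key (n; p% instances)' style used on this page."""
    total = sum(counter.values())
    return ", ".join(
        f"{key} ({n}; {round(100 * n / total)}% instances)"
        for key, n in counter.most_common()
    )

relations, parent_tags = attachment_stats(SENTENCE)
print(describe(relations))
print(describe(parent_tags))
```

Over a whole treebank the same function would be applied sentence by sentence and the counters summed.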

9348 (76%) PART nodes are leaves.

2531 (20%) PART nodes have one child.

311 (3%) PART nodes have two children.

175 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 7.
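The child-degree figures above can be reproduced in outline with the Python sketch below, which counts the children of each node and groups PART nodes into leaves, one child, two children, and three or more. The toy sentence is illustrative rather than taken from the treebank, and `child_degrees` and `degree_summary` are hypothetical names.

```python
from collections import Counter

# Hypothetical toy sentence as (ID, UPOS, HEAD); HEAD 0 marks the root.
SENTENCE = [
    (1, "VERB", 2),
    (2, "PART", 3),
    (3, "VERB", 0),
    (4, "NOUN", 3),
    (5, "PART", 3),
]

def child_degrees(sentence, upos="PART"):
    """Number of children of each node tagged `upos`, in sentence order."""
    n_children = Counter(head for _id, _tag, head in sentence if head != 0)
    return [n_children[tok_id] for tok_id, tag, _head in sentence if tag == upos]

def degree_summary(degrees):
    """Bucket degrees the way this page reports them."""
    buckets = Counter(min(d, 3) for d in degrees)  # 3 stands for "three or more"
    return {
        "leaves": buckets[0],
        "one": buckets[1],
        "two": buckets[2],
        "three_plus": buckets[3],
        "max": max(degrees, default=0),
    }

print(degree_summary(child_degrees(SENTENCE)))
```

As a consistency check on the reported numbers, 9348 + 2531 + 311 + 175 = 12365, which equals the total number of PART tokens.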

Children of PART nodes are attached using 22 different relations: acl (1712; 46% instances), amod (698; 19% instances), nmod (565; 15% instances), discourse:sp (188; 5% instances), nsubj (116; 3% instances), nummod (109; 3% instances), case (86; 2% instances), advmod (66; 2% instances), flat (48; 1% instances), det (41; 1% instances), conj (29; 1% instances), cc (20; 1% instances), discourse (16; 0% instances), advcl (11; 0% instances), csubj (8; 0% instances), cop (5; 0% instances), dislocated (3; 0% instances), obl:tmod (3; 0% instances), compound (2; 0% instances), obl (2; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)

Children of PART nodes belong to 11 different parts of speech: VERB (2259; 61% instances), NOUN (645; 17% instances), PART (275; 7% instances), NUM (141; 4% instances), PROPN (136; 4% instances), ADV (72; 2% instances), PRON (71; 2% instances), ADP (59; 2% instances), SCONJ (45; 1% instances), AUX (14; 0% instances), CCONJ (13; 0% instances)