home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PART

There are 35 PART lemmas (1%), 35 PART types (1%) and 9523 PART tokens (7%). Out of 13 observed tags, the rank of PART is: 7 in number of lemmas, 7 in number of types and 4 in number of tokens.

The 10 most frequent PART lemmas: 也、 者、 矣、 所、 乎、 焉、 哉、 夫、 與、 已

The 10 most frequent PART types: 也、 者、 矣、 所、 乎、 焉、 哉、 夫、 與、 已

The 10 most frequent ambiguous lemmas: 所 (PART 692, NOUN 25), 乎 (PART 463, ADP 160), 焉 (PART 383, PRON 61, ADV 45), 夫 (NOUN 446, PART 182, PRON 32), 與 (ADP 337, PART 161, VERB 132, ADV 39), 已 (PART 131, VERB 67, ADV 52), 其 (PRON 2131, PART 117), 而 (CCONJ 2749, PART 96, PRON 2), 然 (ADV 224, VERB 107, PART 81), 蓋 (PART 41, NOUN 5, VERB 4, PROPN 2, ADV 1)

The 10 most frequent ambiguous types: 者 (PART 2113, ADP 3), 所 (PART 692, NOUN 25), 乎 (PART 463, ADP 160), 焉 (PART 383, PRON 61, ADV 45), 夫 (NOUN 446, PART 182, PRON 32), 與 (ADP 337, PART 161, VERB 132, ADV 39), 已 (PART 131, VERB 67, ADV 52), 其 (PRON 2131, PART 117), 而 (CCONJ 2749, PART 96, PRON 2, ADP 1), 然 (ADV 224, VERB 107, PART 81)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.002166).

The 1st highest number of forms (1) was observed with the lemma “乎”: 乎.

The 2nd highest number of forms (1) was observed with the lemma “也”: 也.

The 3rd highest number of forms (1) was observed with the lemma “于”: 于.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 21 different relations: discourse:sp (5749; 60% instances), nsubj (1294; 14% instances), mark (660; 7% instances), case (493; 5% instances), obj (422; 4% instances), discourse (317; 3% instances), root (182; 2% instances), fixed (178; 2% instances), advmod (73; 1% instances), obl (45; 0% instances), nmod (38; 0% instances), conj (30; 0% instances), dislocated (30; 0% instances), ccomp (3; 0% instances), nsubj:pass (3; 0% instances), cc (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances), list (1; 0% instances), obl:lmod (1; 0% instances), parataxis (1; 0% instances)

Parents of PART nodes belong to 11 different parts of speech: VERB (7001; 74% instances), NOUN (1425; 15% instances), ADV (259; 3% instances), PART (219; 2% instances), (182; 2% instances), PROPN (163; 2% instances), PRON (118; 1% instances), NUM (100; 1% instances), AUX (52; 1% instances), INTJ (3; 0% instances), ADP (1; 0% instances)

7419 (78%) PART nodes are leaves.

1756 (18%) PART nodes have one child.

229 (2%) PART nodes have two children.

119 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 21 different relations: acl (1209; 47% instances), amod (456; 18% instances), nmod (377; 15% instances), discourse:sp (153; 6% instances), nummod (89; 3% instances), nsubj (84; 3% instances), case (65; 3% instances), advmod (56; 2% instances), det (30; 1% instances), cc (17; 1% instances), conj (15; 1% instances), discourse (13; 1% instances), advcl (10; 0% instances), csubj (7; 0% instances), cop (4; 0% instances), flat (4; 0% instances), dislocated (3; 0% instances), obl (2; 0% instances), obl:tmod (2; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)

Children of PART nodes belong to 11 different parts of speech: VERB (1556; 60% instances), NOUN (454; 17% instances), PART (219; 8% instances), NUM (121; 5% instances), ADV (60; 2% instances), PRON (54; 2% instances), ADP (46; 2% instances), SCONJ (34; 1% instances), PROPN (33; 1% instances), CCONJ (11; 0% instances), AUX (10; 0% instances)