This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: PART

There are 26 PART lemmas (0%), 26 PART types (0%) and 1483 PART tokens (7%). Out of 15 observed tags, the rank of PART is: 11 in number of lemmas, 12 in number of types and 4 in number of tokens.
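As a rough illustration of how such counts could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens per UPOS tag from CoNLL-U text, then ranks a tag on any of the three counts. The names `pos_stats`, `rank` and `SAMPLE` are hypothetical, the three-token sample is invented and not taken from UD_Chinese-PUD, and the tie handling in `rank` is an assumption that may differ from the script that generated this page.

```python
from collections import defaultdict

# Toy CoNLL-U fragment, invented for illustration (not from UD_Chinese-PUD).
SAMPLE = (
    "# text = 我的書\n"
    "1\t我\t我\tPRON\t_\t_\t3\tnmod\t_\t_\n"
    "2\t的\t的\tPART\t_\tCase=Gen\t1\tcase\t_\t_\n"
    "3\t書\t書\tNOUN\t_\t_\t0\troot\t_\t_\n"
)


def pos_stats(conllu_text):
    """Return {upos: {"lemmas": n, "types": n, "tokens": n}}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {
        tag: {
            "lemmas": len(lemmas[tag]),
            "types": len(types[tag]),
            "tokens": tokens[tag],
        }
        for tag in tokens
    }


def rank(stats, tag, key):
    """1-based rank of `tag` when tags are sorted by `key`, highest first.

    Ties keep their first-seen order; this is an assumption.
    """
    ordered = sorted(stats, key=lambda t: stats[t][key], reverse=True)
    return ordered.index(tag) + 1


stats = pos_stats(SAMPLE)
```

Calling `rank(stats, "PART", "tokens")` on a full treebank would give the kind of figure reported above, where PART ranks 4th by token count.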

The 10 most frequent PART lemmas: 的、 了、 地、 之、 得、 嗎、 人、 者、 黨、 區

The 10 most frequent PART types: 的、 了、 地、 之、 得、 嗎、 人、 者、 黨、 區

The 10 most frequent ambiguous lemmas: 的 (PART 1361, X 1), 了 (AUX 339, PART 41), 地 (PART 22, NOUN 2), 之 (PART 21, PRON 6), 得 (PART 9, AUX 5, VERB 3), 人 (NOUN 105, PART 3), 者 (NOUN 2, PART 2), 區 (NOUN 2, PART 1), 家 (NOUN 9, PART 1), 法 (PROPN 2, PART 1)

The 10 most frequent ambiguous types: 的 (PART 1361, X 1), 了 (AUX 339, PART 41), 地 (PART 22, NOUN 2), 之 (PART 21, PRON 6), 得 (PART 9, VERB 3), 人 (NOUN 91, PART 3), 者 (NOUN 2, PART 2), 區 (NOUN 2, PART 1), 家 (NOUN 9, PART 1), 法 (PROPN 2, PART 1)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.006233).
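The ratio can be sketched as follows, assuming it is the number of distinct word forms divided by the number of distinct lemmas for the part of speech. That definition is an assumption, though it agrees with the 26 types and 26 lemmas reported for PART. The helper name `form_lemma_ratio` is hypothetical, and the input uses three PART lemmas listed on this page, each with a single form.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) tuples for one part of speech."""
    forms_by_lemma = defaultdict(set)
    forms = set()
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
        forms.add(form)
    # Assumed definition: distinct forms divided by distinct lemmas.
    ratio = len(forms) / len(forms_by_lemma)
    return ratio, forms_by_lemma


# Repeated tokens do not change the ratio because only distinct values count.
pairs = [("的", "的"), ("了", "了"), ("之", "之"), ("的", "的")]
ratio, forms_by_lemma = form_lemma_ratio(pairs)
print(f"{ratio:.6f}")
```

The `forms_by_lemma` mapping also yields the per-lemma form counts; for PART every lemma has exactly one form.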

The 1st highest number of forms (1) was observed with the lemma “之”: 之.

The 2nd highest number of forms (1) was observed with the lemma “了”: 了.

The 3rd highest number of forms (1) was observed with the lemma “人”: 人.

PART occurs with 1 feature: Case (709; 48% instances)

PART occurs with 1 feature-value pair: Case=Gen

PART occurs with 2 feature combinations. The most frequent feature combination is _ (774 tokens). Examples: 的、 了、 地、 得、 之、 嗎、 人、 者、 黨、 區

Relations

PART nodes are attached to their parents using 13 different relations: case (709; 48% instances), mark:rel (626; 42% instances), discourse:sp (87; 6% instances), mark:adv (22; 1% instances), mark:prt (16; 1% instances), obj (8; 1% instances), appos (4; 0% instances), compound (3; 0% instances), conj (3; 0% instances), nsubj (2; 0% instances), dep (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)
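The percentages above can be reproduced from the raw counts. The short check below copies the counts from this page, confirms that they sum to the 1483 PART tokens, and rounds each share to a whole percent. Rounding to the nearest integer is an assumption about how the page was generated.

```python
# Relation counts for PART nodes, copied from the list above.
relations = {
    "case": 709, "mark:rel": 626, "discourse:sp": 87, "mark:adv": 22,
    "mark:prt": 16, "obj": 8, "appos": 4, "compound": 3, "conj": 3,
    "nsubj": 2, "dep": 1, "nmod": 1, "obl": 1,
}

total = sum(relations.values())

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

for rel, n in relations.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

Because of rounding, a relation such as obj (8 of 1483, about 0.5%) is shown as 1%, while appos (4 of 1483) is shown as 0%.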

Parents of PART nodes belong to 9 different parts of speech: VERB (558; 38% instances), NOUN (366; 25% instances), ADJ (222; 15% instances), PROPN (139; 9% instances), PRON (108; 7% instances), ADP (63; 4% instances), X (15; 1% instances), DET (6; 0% instances), NUM (6; 0% instances)

1461 (99%) PART nodes are leaves.

11 (1%) PART nodes have one child.

4 (0%) PART nodes have two children.

7 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 5.
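A minimal sketch of how the child-degree breakdown might be computed from the ID and HEAD columns: count how often each node ID appears as a head, then bucket the nodes of the target tag into 0, 1, 2 and 3 or more children. The function name `child_degree_buckets` and the one-sentence toy input are hypothetical and not drawn from the treebank.

```python
from collections import Counter


def child_degree_buckets(sentences, target="PART"):
    """sentences: list of sentences, each a list of (id, head, upos) tuples."""
    buckets = Counter()
    for sent in sentences:
        # Number of children per node = number of times its ID occurs as a head.
        n_children = Counter(head for _, head, _ in sent)
        for node_id, _, upos in sent:
            if upos == target:
                degree = n_children[node_id]
                buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets


# Toy sentence (invented): token 2 is a PART leaf attached to token 1.
toy = [[(1, 3, "PRON"), (2, 1, "PART"), (3, 0, "NOUN")]]
buckets = child_degree_buckets(toy)

# Counts reported on this page for 0, 1, 2 and 3+ children.
reported = [1461, 11, 4, 7]
```

The four reported bucket sizes add up to the 1483 PART tokens, so every PART node falls into exactly one bucket.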

Children of PART nodes are attached using 13 different relations: dep (11; 25% instances), compound (10; 23% instances), punct (6; 14% instances), case (2; 5% instances), case:loc (2; 5% instances), cc (2; 5% instances), conj (2; 5% instances), nmod (2; 5% instances), nsubj (2; 5% instances), nummod (2; 5% instances), clf (1; 2% instances), cop (1; 2% instances), obl (1; 2% instances)

Children of PART nodes belong to 10 different parts of speech: NOUN (19; 43% instances), PUNCT (6; 14% instances), ADP (4; 9% instances), PROPN (4; 9% instances), VERB (3; 7% instances), CCONJ (2; 5% instances), NUM (2; 5% instances), X (2; 5% instances), AUX (1; 2% instances), PRON (1; 2% instances)