home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PART

There are 25 PART lemmas (1%), 25 PART types (1%) and 567 PART tokens (6%). Out of 16 observed tags, the rank of PART is: 11 in number of lemmas, 11 in number of types and 6 in number of tokens.

The 10 most frequent PART lemmas: 的、 了、 嗎、 吧、 呢、 啊、 呀、 啦、 得、 地

The 10 most frequent PART types: 的、 了、 嗎、 吧、 呢、 啊、 呀、 啦、 得、 地

The 10 most frequent ambiguous lemmas: 的 (PART 306, ADP 1), 了 (PART 65, AUX 39, VERB 2), 啊 (PART 17, INTJ 1), 得 (PART 10, AUX 1, VERB 1), 沒 (VERB 12, PART 6, ADV 3), 等 (VERB 3, PART 2), 中 (ADP 5, NOUN 1, PART 1, VERB 1), 來 (VERB 20, SCONJ 4, ADP 1, PART 1), 個 (NOUN 54, NUM 1, PART 1), 就 (ADV 55, ADP 7, PART 1)

The 10 most frequent ambiguous types: 的 (PART 306, ADP 1), 了 (PART 65, AUX 39, VERB 2), 啊 (PART 17, INTJ 1), 得 (PART 10, AUX 1, VERB 1), 沒 (VERB 12, PART 6, ADV 3), 等 (VERB 3, PART 2), 中 (ADP 5, NOUN 1, PART 1, VERB 1), 來 (VERB 20, SCONJ 4, ADP 1, PART 1), 個 (NOUN 54, NUM 1, PART 1), 就 (ADV 55, ADP 7, PART 1)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (1) was observed with the lemma “丫”: 丫.

The 2nd highest number of forms (1) was observed with the lemma “中”: 中.

The 3rd highest number of forms (1) was observed with the lemma “之”: 之.

PART occurs with 1 features: Polarity (7; 1% instances)

PART occurs with 1 feature-value pairs: Polarity=Neg

PART occurs with 2 feature combinations. The most frequent feature combination is _ (560 tokens). Examples: 的、 了、 嗎、 吧、 呢、 啊、 呀、 啦、 得、 地

Relations

PART nodes are attached to their parents using 10 different relations: discourse:sp (291; 51% instances), case (140; 25% instances), mark:rel (101; 18% instances), compound:ext (10; 2% instances), advmod (8; 1% instances), mark (8; 1% instances), mark:adv (5; 1% instances), conj (2; 0% instances), case:loc (1; 0% instances), discourse (1; 0% instances)

Parents of PART nodes belong to 10 different parts of speech: VERB (312; 55% instances), NOUN (92; 16% instances), ADJ (59; 10% instances), PRON (51; 9% instances), PROPN (22; 4% instances), AUX (19; 3% instances), ADV (8; 1% instances), DET (2; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances)

566 (100%) PART nodes are leaves.

1 (0%) PART nodes have one child.

The highest child degree of a PART node is 1.

Children of PART nodes are attached using 1 different relations: punct (1; 100% instances)

Children of PART nodes belong to 1 different parts of speech: PUNCT (1; 100% instances)