home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PART

There are 12 PART lemmas (3%), 12 PART types (2%) and 131 PART tokens (7%). Out of 17 observed tags, the rank of PART is: 12 in number of lemmas, 12 in number of types and 6 in number of tokens.

The 10 most frequent PART lemmas: _、 的、 了、 呢、 嗎、 啊、 呀、 嘛、 得、 之

The 10 most frequent PART types: 的、 了、 吧、 嗎、 呢、 啊、 呀、 嘛、 得、 之

The 10 most frequent ambiguous lemmas: _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1), 了 (PART 19, AUX 4, VERB 2)

The 10 most frequent ambiguous types: 了 (PART 37, AUX 5, VERB 2), 等 (PART 1, VERB 1)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.221258).

The 1st highest number of forms (7) was observed with the lemma “_”: 了, 吧, 呢, 啊, 啦, 嗎, 的.

The 2nd highest number of forms (1) was observed with the lemma “之”: 之.

The 3rd highest number of forms (1) was observed with the lemma “了”: 了.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 6 different relations: discourse:sp (97; 74% instances), case (16; 12% instances), mark:rel (14; 11% instances), compound:ext (2; 2% instances), conj (1; 1% instances), discourse (1; 1% instances)

Parents of PART nodes belong to 6 different parts of speech: VERB (95; 73% instances), ADJ (11; 8% instances), PRON (10; 8% instances), NOUN (9; 7% instances), PROPN (5; 4% instances), AUX (1; 1% instances)

131 (100%) PART nodes are leaves.

The highest child degree of a PART node is 0.