home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: AUX

There are 20 AUX lemmas (4%), 21 AUX types (4%) and 69 AUX tokens (4%). Out of 17 observed tags, the rank of AUX is: 6 in number of lemmas, 7 in number of types and 7 in number of tokens.

The 10 most frequent AUX lemmas: _、 要、 能、 過、 了、 可以、 喜歡、 愛、 著、 不用

The 10 most frequent AUX types: 要、 可以、 了、 能、 過、 愛、 著、 不要、 別、 喜歡

The 10 most frequent ambiguous lemmas: _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1), 要 (AUX 8, VERB 1), 了 (PART 19, AUX 4, VERB 2), 喜歡 (AUX 3, VERB 1), 想 (AUX 2, VERB 1), 好 (ADJ 4, ADV 1, AUX 1), 有 (VERB 21, AUX 1), 沒有 (VERB 6, AUX 1)

The 10 most frequent ambiguous types: 要 (AUX 11, VERB 4), 了 (PART 37, AUX 5, VERB 2), 過 (AUX 5, VERB 1), 喜歡 (AUX 3, VERB 3), 想 (AUX 2, VERB 1), 沒有 (VERB 12, AUX 2), 好 (ADJ 10, ADV 1, AUX 1, VERB 1), 有 (VERB 23, AUX 1)

Morphology

The form / lemma ratio of AUX is 1.050000 (the average of all parts of speech is 1.221258).

The 1st highest number of forms (11) was observed with the lemma “_”: 不要, 了, 別, 可以, 愛, 應, 會, 沒有, 著, 要, 該.

The 2nd highest number of forms (1) was observed with the lemma “不用”: 不用.

The 3rd highest number of forms (1) was observed with the lemma “不要”: 不要.

AUX does not occur with any features.

Relations

AUX nodes are attached to their parents using 3 different relations: aux (67; 97% instances), conj (1; 1% instances), root (1; 1% instances)

Parents of AUX nodes belong to 3 different parts of speech: VERB (66; 96% instances), ADJ (2; 3% instances), (1; 1% instances)

56 (81%) AUX nodes are leaves.

11 (16%) AUX nodes have one child.

1 (1%) AUX nodes have two children.

1 (1%) AUX nodes have three or more children.

The highest child degree of a AUX node is 3.

Children of AUX nodes are attached using 5 different relations: advmod (11; 69% instances), punct (2; 13% instances), conj (1; 6% instances), discourse:sp (1; 6% instances), dislocated (1; 6% instances)

Children of AUX nodes belong to 4 different parts of speech: ADV (12; 75% instances), PUNCT (2; 13% instances), NOUN (1; 6% instances), PART (1; 6% instances)