
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: ADV

There are 63 ADV lemmas (14%), 75 ADV types (13%) and 237 ADV tokens (13%). Out of 17 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent ADV lemmas: _、 不、 就、 都、 很、 因為、 也、 又、 最、 便

The 10 most frequent ADV types: 不、 就、 都、 很、 也、 因為、 又、 快、 才、 便

The 10 most frequent ambiguous lemmas: _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1), 這樣 (ADV 2, PRON 2), 那 (DET 6, ADV 2, PRON 2), 但 (CCONJ 3, ADV 1), 像 (ADP 1, ADV 1, VERB 1), 好 (ADJ 4, ADV 1, AUX 1), 怎樣 (ADV 1, PRON 1), 自己 (PRON 7, ADV 1)

The 10 most frequent ambiguous types: 多 (ADV 3, DET 1), 那 (DET 7, ADV 3, PRON 2), 這樣 (ADV 2, PRON 2), 但 (CCONJ 3, ADV 1), 像 (ADP 1, ADV 1, VERB 1), 在 (ADP 14, VERB 4, ADV 1), 好 (ADJ 10, ADV 1, AUX 1, VERB 1), 怎樣 (ADV 1, PRON 1), 自己 (PRON 7, ADV 1)

Morphology

The form / lemma ratio of ADV is 1.190476 (the average of all parts of speech is 1.221258).
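The reported ratio matches the counts given earlier on this page: 75 ADV types divided by 63 ADV lemmas. The following is a minimal sketch of that calculation, assuming tokens are available as (form, lemma, UPOS) triples. The function name `form_lemma_ratio` and the toy token list are hypothetical, not part of the UD tooling or the treebank.

```python
def form_lemma_ratio(tokens, upos):
    """Return (# distinct forms) / (# distinct lemmas) for one UPOS tag.

    tokens: iterable of (form, lemma, upos) triples (assumed input shape).
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        raise ValueError(f"no tokens tagged {upos}")
    return len(forms) / len(lemmas)


# Toy data for illustration only (not taken from the treebank).
toy_tokens = [
    ("不", "_", "ADV"),
    ("就", "_", "ADV"),
    ("一下", "一下", "ADV"),
    ("好", "好", "ADJ"),
]
print(form_lemma_ratio(toy_tokens, "ADV"))  # 3 forms / 2 lemmas

# The counts reported on this page: 75 ADV types, 63 ADV lemmas.
print(round(75 / 63, 6))
```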

The highest number of forms (30) was observed with the lemma “_” (the CoNLL-U placeholder for an unspecified lemma): 一起, 不, 乖乖, 也, 便, 再, 又, 只, 只是, 否, 在, 多, 天天, 就, 常, 常常, 很, 快, 才, 早, 曾經, 本, 真, 真的, 總, 這麼, 那, 都, 非常, 順便.

The 2nd highest number of forms (1) was observed with the lemma “一下”: 一下.

The 3rd highest number of forms (1) was observed with the lemma “一向”: 一向.

ADV does not occur with any features.

Relations

ADV nodes are attached to their parents using 3 different relations: advmod (233; 98% of instances), discourse (3; 1% of instances), conj (1; <1% of instances)

Parents of ADV nodes belong to 9 different parts of speech: VERB (173; 73% of instances), ADJ (35; 15% of instances), AUX (12; 5% of instances), NOUN (11; 5% of instances), PROPN (2; 1% of instances), ADV (1; <1% of instances), DET (1; <1% of instances), PRON (1; <1% of instances), SYM (1; <1% of instances)
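Counts like the relation and parent-POS figures above can be gathered from a CoNLL-U file by reading the UPOS, HEAD and DEPREL columns. The sketch below uses only the standard library and is not the script that generated this page. The helper names `read_sentences` and `attachment_stats` are hypothetical, and `SAMPLE` is a made-up sentence rather than one from the treebank.

```python
from collections import Counter

# Made-up three-token sentence in CoNLL-U format (10 tab-separated columns:
# ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
SAMPLE = (
    "# text = 我不去\n"
    "1\t我\t_\tPRON\t_\t_\t3\tnsubj\t_\t_\n"
    "2\t不\t_\tADV\t_\t_\t3\tadvmod\t_\t_\n"
    "3\t去\t_\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
)


def read_sentences(text):
    """Yield each sentence as a list of column lists."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        sentence.append(cols)
    if sentence:
        yield sentence


def attachment_stats(text, upos="ADV"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    deprels = Counter()
    parent_pos = Counter()
    for sent in read_sentences(text):
        pos_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] != upos:
                continue
            deprels[cols[7]] += 1
            # HEAD "0" has no entry in pos_by_id, so it is reported as ROOT.
            parent_pos[pos_by_id.get(cols[6], "ROOT")] += 1
    return deprels, parent_pos


deprels, parent_pos = attachment_stats(SAMPLE)
print(deprels)
print(parent_pos)
```

Percentages such as those on this page can then be derived as `round(100 * count / total)`, which is why a single instance out of 237 rounds down to 0.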

235 (99%) ADV nodes are leaves.

2 (1%) ADV nodes have one child.

The highest child degree of an ADV node is 1.
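The leaf and child-degree figures above amount to a histogram of how many dependents each ADV node has. A minimal sketch follows, assuming each sentence is given as a list of (id, head, UPOS) triples with integer ids and head 0 for the root. The function name `child_degree_histogram` and the toy sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, upos="ADV"):
    """Map child count -> number of `upos` nodes with that many children."""
    # Number of dependents per node id (a missing key counts as 0).
    n_children = Counter(head for _, head, _ in sentence)
    return Counter(n_children[node_id] for node_id, _, tag in sentence if tag == upos)


# Toy sentence for illustration only: node 4 (ADV) modifies node 5 (ADV).
toy_sentence = [
    (1, 3, "PRON"),
    (2, 3, "ADV"),
    (3, 0, "VERB"),
    (4, 5, "ADV"),
    (5, 3, "ADV"),
]
histogram = child_degree_histogram(toy_sentence)
print(histogram[0], "leaves;", histogram[1], "with one child")
print("highest child degree:", max(histogram))
```

Summing such histograms over all sentences would give treebank-level totals like the 235 leaves and 2 one-child nodes reported here, which together account for all 237 ADV tokens.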

Children of ADV nodes are attached using 2 different relations: advmod (1; 50% of instances), punct (1; 50% of instances)

Children of ADV nodes belong to 2 different parts of speech: ADV (1; 50% of instances), PUNCT (1; 50% of instances)