
This page pertains to UD version 2.

Treebank Statistics: UD_Cantonese-HK: POS Tags: ADV

There are 111 ADV lemmas (10%), 180 ADV types (10%) and 1578 ADV tokens (11%). Out of 15 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.
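As a rough illustration of how the three counts above relate, the following sketch tallies distinct lemmas, distinct word forms (types) and running tokens for one tag in CoNLL-U text. The function name `pos_counts` and the toy sentences are invented for this example and are not taken from UD_Cantonese-HK. The script that generated this page may normalize forms differently.

```python
def pos_counts(conllu: str, upos: str) -> tuple[int, int, int]:
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:  # column 4 is UPOS
            types.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
            tokens += 1
    return len(lemmas), len(types), tokens


# Toy data invented for illustration (assumption: not real treebank rows).
SAMPLE = "\n".join("\t".join(row.split()) for row in [
    "1 我 我 PRON _ _ 3 nsubj _ _",
    "2 唔 唔 ADV _ _ 3 advmod _ _",
    "3 去 去 VERB _ _ 0 root _ _",
    "",
    "1 佢 佢 PRON _ _ 4 nsubj _ _",
    "2 都 都 ADV _ _ 4 advmod _ _",
    "3 唔 唔 ADV _ _ 4 advmod _ _",
    "4 去 去 VERB _ _ 0 root _ _",
])

print(pos_counts(SAMPLE, "ADV"))
```

In the toy sample, 唔 occurs twice and 都 once, so the tag has two lemmas, two types and three tokens.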

The 10 most frequent ADV lemmas: _、 唔、 都、 就、 噉、 好、 又、 啲、 下、 咁

The 10 most frequent ADV types: 唔、 就、 都、 噉、 好、 未、 因為、 又、 即係、 已經

The 10 most frequent ambiguous lemmas: _ (PUNCT 1377, VERB 1352, NOUN 1283, ADV 853, PART 764, PRON 662, AUX 335, DET 217, ADJ 209, ADP 140, NUM 124, SCONJ 101, CCONJ 93, INTJ 92, PROPN 52), 好 (ADV 34, ADJ 22, AUX 1), 啲 (NOUN 56, ADV 26, DET 1, PART 1), 下 (ADV 19, DET 1, INTJ 1, PART 1), 喺度 (ADV 14, VERB 3), 先 (ADV 10, PART 10), 快 (ADJ 12, ADV 9), 點 (ADV 8, NOUN 1), 咪 (ADV 7, VERB 1), 一陣 (ADV 6, NOUN 1)

The 10 most frequent ambiguous types: 就 (ADV 191, ADP 1), 噉 (ADV 136, VERB 2, PART 1), 好 (ADV 54, ADJ 37, AUX 4), 因為 (ADV 36, ADP 1), 啲 (NOUN 62, ADV 30, DET 2, PART 1), 下 (ADV 27, DET 6, INTJ 2, PART 2), 喺度 (ADV 22, VERB 3), 先 (PART 20, ADV 16), 剛才 (ADV 15, NOUN 2), 係咪 (ADV 14, VERB 2, AUX 1)
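A minimal sketch of how such a list of ambiguous types could be derived: group tag counts by word form, keep the forms that carry the target tag plus at least one other, and order them by the target tag's count (the ordering the list above appears to follow). The helper name `ambiguous_types` and the toy pairs are assumptions made for this example.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged, upos):
    """List (form, tag counts) for forms seen with `upos` and another tag."""
    by_form = defaultdict(Counter)
    for form, tag in tagged:
        by_form[form][tag] += 1
    hits = [(f, c) for f, c in by_form.items() if upos in c and len(c) > 1]
    # Most frequent under the target tag first.
    return sorted(hits, key=lambda item: -item[1][upos])


# Toy (form, tag) pairs invented for illustration, not real treebank counts.
TOY = [("好", "ADV"), ("好", "ADJ"), ("好", "ADV"),
       ("就", "ADV"), ("先", "PART"), ("先", "ADV")]

for form, tags in ambiguous_types(TOY, "ADV"):
    print(form, dict(tags))
```

Here 就 is dropped because it only ever appears as ADV in the toy data.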

Morphology

The form / lemma ratio of ADV is 1.621622 (the average of all parts of speech is 1.624294).
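The ratio is consistent with dividing the number of ADV types by the number of ADV lemmas reported earlier on this page (180 and 111). The short check below reproduces the figure. The function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the summary at the top of this page.
adv_ratio = form_lemma_ratio(180, 111)
print(f"{adv_ratio:.6f}")  # 1.621622
```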

The 1st highest number of forms (111) was observed with the lemma “_”: 一下, 一來, 一定, 一早, 一而再,再而三, 一路, 一陣, 下, 不, 不論, 不過, 並且, 事實上, 亦, 亦都, 仍然, 以, 仲, 似乎, 但, 但係, 依樣, 依法, 係咪, 先, 先至, 其中, 其實, 再, 剛才, 即, 即係, 即係話, 即刻, 原來, 原先, 去到, 又, 及後, 只, 同時, 咁, 咪, 唔, 唔該, 啲, 喺度, 噉, 噉樣, 噉解, 嚴重, 因此, 因為, 埋, 好, 如此, 完全, 將, 尚未, 就, 就係, 已經, 急速, 我怕, 所以, 打直, 故意, 是否, 暫時, 更加, 最, 最好, 最後, 未, 未必, 未曾, 本身, 根本, 正式, 正正, 比較, 然後, 特別, 由於, 當然, 直接, 真, 確實, 祇, 祇不過, 祇有, 究竟, 絕對, 總之, 而, 自行, 落去, 起碼, 跟住, 返, 連, 進一步, 都, 重新, 鑒於, 非常, 順利, 首先, 點, 點樣, 點解.

The 2nd highest number of forms (1) was observed with the lemma “一”: 一.

The 3rd highest number of forms (1) was observed with the lemma “一味”: 一味.

ADV does not occur with any features.

Relations

ADV nodes are attached to their parents using 23 different relations: advmod (1380; 87% instances), discourse (116; 7% instances), root (15; 1% instances), reparandum (14; 1% instances), amod (8; 1% instances), conj (7; 0% instances), mark:adv (7; 0% instances), advmod:df (5; 0% instances), advcl (3; 0% instances), ccomp (3; 0% instances), parataxis (3; 0% instances), cc (2; 0% instances), compound:vv (2; 0% instances), mark (2; 0% instances), mark:rel (2; 0% instances), nsubj (2; 0% instances), acl (1; 0% instances), compound:ext (1; 0% instances), compound:quant (1; 0% instances), csubj (1; 0% instances), discourse:sp (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)
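The percentages in the list above look like each relation's count divided by the 1578 ADV tokens, rounded to a whole percent, which is why rare relations show as 0%. The sketch below checks this on the six most frequent relations. The helper name `share` is an assumption, and the generating script's exact rounding rule is not documented here.

```python
ADV_TOKENS = 1578  # total ADV tokens reported on this page

# Six most frequent incoming relations, copied from the list above.
RELATION_COUNTS = {
    "advmod": 1380,
    "discourse": 116,
    "root": 15,
    "reparandum": 14,
    "amod": 8,
    "conj": 7,
}


def share(count: int, total: int) -> int:
    """Whole-number percentage of `count` within `total`."""
    return round(100 * count / total)


for relation, count in RELATION_COUNTS.items():
    print(f"{relation}: {count} ({share(count, ADV_TOKENS)}%)")
```

The 23 relation counts listed above add up to 1578, so every ADV token is accounted for exactly once.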

Parents of ADV nodes belong to 12 different parts of speech: VERB (1164; 74% instances), ADJ (195; 12% instances), NOUN (83; 5% instances), AUX (59; 4% instances), ADV (34; 2% instances), the artificial root node, which has no tag (15; 1% instances), PROPN (12; 1% instances), PRON (9; 1% instances), ADP (4; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)

1374 (87%) ADV nodes are leaves.

137 (9%) ADV nodes have one child.

46 (3%) ADV nodes have two children.

21 (1%) ADV nodes have three or more children.

The highest child degree of an ADV node is 7.
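The child-degree figures can be pictured as a histogram over the number of dependents each ADV node has. The sketch below computes such a histogram for a single sentence given as parallel lists of tags and head indices. The name `child_degrees` and the toy sentence are invented for illustration.

```python
from collections import Counter


def child_degrees(tags, heads, upos):
    """Histogram of child counts for nodes tagged `upos`.

    tags[i] and heads[i] describe token i+1; head 0 is the artificial root.
    """
    n_children = Counter(heads)  # how many tokens point at each head id
    return Counter(
        n_children[token_id]
        for token_id, tag in enumerate(tags, start=1)
        if tag == upos
    )


# Toy sentence invented for illustration (assumption: not from the treebank).
TAGS = ["ADV", "PRON", "ADV", "VERB", "PART", "PUNCT"]
HEADS = [4, 4, 4, 0, 3, 3]

print(child_degrees(TAGS, HEADS, "ADV"))
```

In the toy sentence the first ADV is a leaf and the second has two children. On this page, the four buckets (1374 leaves, 137 with one child, 46 with two, 21 with three or more) sum to 1578, the total number of ADV tokens.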

Children of ADV nodes are attached using 24 different relations: punct (156; 50% instances), discourse:sp (58; 19% instances), advmod (27; 9% instances), nsubj (11; 4% instances), discourse (10; 3% instances), reparandum (9; 3% instances), mark:rel (6; 2% instances), advcl (5; 2% instances), aux (4; 1% instances), mark:adv (4; 1% instances), conj (3; 1% instances), cop (3; 1% instances), parataxis (3; 1% instances), case (2; 1% instances), cc (2; 1% instances), obj (2; 1% instances), advcl:coverb (1; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances), mark (1; 0% instances), nmod (1; 0% instances), nsubj:periph (1; 0% instances), obl:tmod (1; 0% instances), vocative (1; 0% instances)

Children of ADV nodes belong to 12 different parts of speech: PUNCT (156; 50% instances), PART (69; 22% instances), ADV (34; 11% instances), VERB (13; 4% instances), PRON (11; 4% instances), INTJ (9; 3% instances), AUX (7; 2% instances), NOUN (5; 2% instances), ADJ (3; 1% instances), PROPN (3; 1% instances), CCONJ (2; 1% instances), ADP (1; 0% instances)