This page pertains to UD version 2.

Treebank Statistics: UD_Cantonese-HK: POS Tags: ADV

There are 182 ADV lemmas (11%), 180 ADV types (10%) and 1578 ADV tokens (11%). Out of 15 observed tags, the rank of ADV is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.
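As a rough sketch of how such counts could be derived, the snippet below tallies distinct lemmas, distinct word forms (types) and tokens for one tag in CoNLL-U text. The function name `count_upos` and the two toy sentences are invented for illustration and are not taken from the treebank; it also assumes that "types" means distinct word forms.

```python
def count_upos(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            tokens += 1
            types.add(form)
            lemmas.add(lemma)
    return len(lemmas), len(types), tokens


# Invented toy data (ID, FORM, LEMMA, UPOS); the other six columns are "_".
sentences = [
    [("1", "我", "我", "PRON"), ("2", "唔", "唔", "ADV"), ("3", "知", "知", "VERB")],
    [("1", "佢", "佢", "PRON"), ("2", "都", "都", "ADV"),
     ("3", "唔", "唔", "ADV"), ("4", "去", "去", "VERB")],
]
sample = "\n\n".join(
    "\n".join("\t".join(row + ("_",) * 6) for row in sent) for sent in sentences
)

print(count_upos(sample, "ADV"))  # (2, 2, 3)
```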

The 10 most frequent ADV lemmas: 唔、 就、 都、 噉、 好、 未、 因為、 又、 即係、 已經

The 10 most frequent ADV types: 唔、 就、 都、 噉、 好、 未、 因為、 又、 即係、 已經

The 10 most frequent ambiguous lemmas: 就 (ADV 191, ADP 1), 噉 (ADV 136, VERB 2, PART 1), 好 (ADV 54, ADJ 37, AUX 4), 因為 (ADV 36, ADP 1), 啲 (NOUN 62, ADV 30, DET 2, PART 1), 下 (ADV 27, DET 6, INTJ 2, PART 2), 喺度 (ADV 22, VERB 3), 先 (PART 20, ADV 16), 剛才 (ADV 15, NOUN 2), 點 (NOUN 12, ADV 11)

The 10 most frequent ambiguous types: 就 (ADV 191, ADP 1), 噉 (ADV 136, VERB 2, PART 1), 好 (ADV 54, ADJ 37, AUX 4), 因為 (ADV 36, ADP 1), 啲 (NOUN 62, ADV 30, DET 2, PART 1), 下 (ADV 27, DET 6, INTJ 2, PART 2), 喺度 (ADV 22, VERB 3), 先 (PART 20, ADV 16), 剛才 (ADV 15, NOUN 2), 係咪 (ADV 14, VERB 2, AUX 1)

Morphology

The form / lemma ratio of ADV is 0.989011 (the average of all parts of speech is 1.001746).
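The ratio appears to be the number of ADV types divided by the number of ADV lemmas. This is an assumption, but it reproduces the reported figure from the counts given earlier on this page:

```python
adv_types = 180   # distinct ADV word forms reported on this page
adv_lemmas = 182  # distinct ADV lemmas reported on this page

ratio = adv_types / adv_lemmas
print(f"{ratio:.6f}")  # 0.989011
```

A value below 1 means there are fewer distinct forms than lemmas, so under this reading at least one ADV form must be shared by two or more lemmas.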

The 1st highest number of forms (1) was observed with the lemma “一”: 一.

The 2nd highest number of forms (1) was observed with the lemma “一下”: 一下.

The 3rd highest number of forms (1) was observed with the lemma “一來”: 一來.

ADV does not occur with any features.

Relations

ADV nodes are attached to their parents using 23 different relations: advmod (1380; 87% instances), discourse (116; 7% instances), root (15; 1% instances), reparandum (14; 1% instances), amod (8; 1% instances), conj (7; 0% instances), mark:adv (7; 0% instances), advmod:df (5; 0% instances), advcl (3; 0% instances), ccomp (3; 0% instances), parataxis (3; 0% instances), cc (2; 0% instances), compound:vv (2; 0% instances), mark (2; 0% instances), mark:rel (2; 0% instances), nsubj (2; 0% instances), acl (1; 0% instances), compound:ext (1; 0% instances), compound:quant (1; 0% instances), csubj (1; 0% instances), discourse:sp (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)
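The percentages are consistent with each count being divided by the 1578 ADV tokens and rounded to the nearest whole percent, which is why rare relations show as 0%. A minimal sketch under that assumption (the helper name `share` is hypothetical):

```python
ADV_TOKENS = 1578  # total ADV tokens reported for this treebank


def share(count, total=ADV_TOKENS):
    """Percentage of `total`, rounded to a whole number."""
    return round(100 * count / total)


for relation, count in [("advmod", 1380), ("discourse", 116), ("root", 15), ("conj", 7)]:
    print(f"{relation}: {share(count)}%")
# advmod: 87%
# discourse: 7%
# root: 1%
# conj: 0%
```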

Parents of ADV nodes belong to 12 different parts of speech: VERB (1193; 76% instances), ADJ (194; 12% instances), NOUN (84; 5% instances), ADV (35; 2% instances), AUX (28; 2% instances), [root, no tag] (15; 1% instances), PROPN (12; 1% instances), PRON (10; 1% instances), ADP (4; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)

1374 (87%) ADV nodes are leaves.

137 (9%) ADV nodes have one child.

46 (3%) ADV nodes have two children.

21 (1%) ADV nodes have three or more children.

The highest child degree of an ADV node is 7.
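Child degree here is the number of tokens whose HEAD points at a given node. The sketch below counts it for one sentence represented as (id, head, upos) tuples; the function name `child_degrees` and the toy tree are invented for illustration and are not drawn from the treebank.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Map each token of the given UPOS to its number of children.

    `sentence` is a list of (id, head, upos) tuples for one tree.
    """
    # Count how many tokens name each id as their head.
    n_children = Counter(head for _, head, _ in sentence)
    return {tid: n_children[tid] for tid, _, tag in sentence if tag == upos}


# Invented toy tree: token 3 is the root (head 0).
toy = [
    (1, 3, "PRON"),
    (2, 3, "ADV"),
    (3, 0, "VERB"),
    (4, 2, "PART"),  # attached to the ADV at id 2
    (5, 3, "ADV"),   # a leaf ADV
]

degrees = child_degrees(toy, "ADV")
print(degrees)  # {2: 1, 5: 0}
```

Nodes with degree 0 correspond to the leaves counted above, and the maximum over all sentences gives the highest child degree.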

Children of ADV nodes are attached using 25 different relations: punct (157; 50% instances), discourse:sp (58; 18% instances), advmod (27; 9% instances), discourse (11; 3% instances), nsubj (11; 3% instances), reparandum (9; 3% instances), mark:rel (6; 2% instances), advcl (5; 2% instances), aux (4; 1% instances), mark:adv (4; 1% instances), conj (3; 1% instances), cop (3; 1% instances), parataxis (3; 1% instances), case (2; 1% instances), cc (2; 1% instances), obj (2; 1% instances), advcl:coverb (1; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances), mark (1; 0% instances), nmod (1; 0% instances), nsubj:periph (1; 0% instances), obl (1; 0% instances), obl:tmod (1; 0% instances), vocative (1; 0% instances)

Children of ADV nodes belong to 12 different parts of speech: PUNCT (157; 50% instances), PART (69; 22% instances), ADV (35; 11% instances), VERB (13; 4% instances), PRON (11; 3% instances), INTJ (10; 3% instances), AUX (7; 2% instances), NOUN (5; 2% instances), ADJ (3; 1% instances), PROPN (3; 1% instances), CCONJ (2; 1% instances), ADP (1; 0% instances)