
This page pertains to UD version 2.

Treebank Statistics: UD_Cantonese-HK: POS Tags: AUX

There are 24 AUX lemmas (1%), 26 AUX types (2%) and 548 AUX tokens (4%). Out of 15 observed tags, the rank of AUX is: 12 in number of lemmas, 12 in number of types and 7 in number of tokens.

The 10 most frequent AUX lemmas: 係、 咗、 可以、 要、 能夠、 會、 想、 應該、 冇、 過

The 10 most frequent AUX types: 係、 咗、 可以、 要、 能夠、 會、 想、 應該、 冇、 過

The 10 most frequent ambiguous lemmas: 係 (VERB 313, AUX 100, ADV 10, DET 1), 要 (AUX 55, VERB 6), 會 (AUX 29, NOUN 2), 想 (AUX 28, VERB 2), 冇 (VERB 72, AUX 18), 過 (AUX 17, VERB 4, ADP 2), 有 (VERB 124, AUX 10), 緊 (AUX 9, ADJ 1), 中意 (AUX 8, VERB 5), 可能 (AUX 7, NOUN 2)

The 10 most frequent ambiguous types: 係 (VERB 312, AUX 99, DET 1), 要 (AUX 55, VERB 6), 會 (AUX 29, NOUN 2), 想 (AUX 28, VERB 2), 冇 (VERB 72, AUX 18), 過 (AUX 17, VERB 4, ADP 2), 有 (VERB 124, AUX 10), 緊 (AUX 9, ADJ 1), 中意 (AUX 8, VERB 5), 可 (AUX 8, INTJ 4, PART 1)

Morphology

The form / lemma ratio of AUX is 1.083333 (the average of all parts of speech is 1.001746).
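The reported ratio appears consistent with dividing the number of AUX types (26) by the number of AUX lemmas (24) given in the summary above. A minimal sketch of that arithmetic, assuming this is how the figure is derived (the variable names are illustrative only):

```python
# Counts taken from the summary line of this page.
aux_forms = 26   # distinct AUX word forms (types)
aux_lemmas = 24  # distinct AUX lemmas

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = aux_forms / aux_lemmas

# Format to six decimal places, matching the precision used on this page.
ratio_text = f"{ratio:.6f}"
print(ratio_text)
```

The two extra forms come from the lemmas 係 and 可以, which each have two forms, while the other lemmas have one form each.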

The 1st highest number of forms (2) was observed with the lemma “係”: 係, 係咪.

The 2nd highest number of forms (2) was observed with the lemma “可以”: 可, 可以.

The 3rd highest number of forms (1) was observed with the lemma “中意”: 中意.

AUX does not occur with any features.

Relations

AUX nodes are attached to their parents using 10 different relations: aux (409; 75% instances), cop (93; 17% instances), conj (20; 4% instances), root (8; 1% instances), ccomp (7; 1% instances), reparandum (6; 1% instances), acl (2; 0% instances), advcl (1; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances)
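The percentages above can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of AUX tokens, rounded to the nearest whole percent; the dictionary simply restates the counts listed above.

```python
# Relation counts for AUX nodes, copied from the list above.
relation_counts = {
    "aux": 409, "cop": 93, "conj": 20, "root": 8, "ccomp": 7,
    "reparandum": 6, "acl": 2, "advcl": 1, "obj": 1, "parataxis": 1,
}

# The counts should add up to the number of AUX tokens (548).
total = sum(relation_counts.values())

# Assumed rounding: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in shares.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

Relations with only one or two instances round down to 0%, which is why acl, advcl, obj and parataxis are shown as "0% instances".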

Parents of AUX nodes belong to 9 different parts of speech: VERB (404; 74% instances), NOUN (60; 11% instances), ADJ (27; 5% instances), AUX (23; 4% instances), PROPN (11; 2% instances), no part of speech, i.e. the artificial root node (8; 1% instances), ADV (7; 1% instances), PRON (7; 1% instances), PART (1; 0% instances)

491 (90%) AUX nodes are leaves.

33 (6%) AUX nodes have one child.

8 (1%) AUX nodes have two children.

16 (3%) AUX nodes have three or more children.

The highest child degree of an AUX node is 11.
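The leaf and child-degree figures above could be derived by counting, for each AUX token, how many other tokens name it as their head. The following is a rough sketch under that assumption, not the script that generated this page; the function name, bucket labels and the two toy sentences are hypothetical and are not drawn from the treebank.

```python
from collections import Counter

def child_degree_buckets(sentences, upos="AUX"):
    """Bucket tokens of the given UPOS by their number of children.

    sentences: list of sentences, each a list of (id, head, upos) tuples,
    where head 0 denotes the artificial root.
    """
    buckets = Counter()
    for tokens in sentences:
        # Number of children per token id = how often it appears as a head.
        n_children = Counter(head for _, head, _ in tokens)
        for tok_id, _, tag in tokens:
            if tag != upos:
                continue
            degree = n_children[tok_id]
            if degree == 0:
                buckets["leaf"] += 1
            elif degree == 1:
                buckets["one"] += 1
            elif degree == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets

# Hypothetical toy input: an AUX leaf attached to a verb, and an AUX root
# with two children.
toy = [
    [(1, 3, "PRON"), (2, 3, "AUX"), (3, 0, "VERB"), (4, 3, "PUNCT")],
    [(1, 2, "PRON"), (2, 0, "AUX"), (3, 2, "PUNCT")],
]
result = child_degree_buckets(toy)
print(dict(result))
```

On the real treebank the four buckets reported above (491, 33, 8 and 16) add up to the 548 AUX tokens.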

Children of AUX nodes are attached using 18 different relations: punct (27; 21% instances), advmod (25; 19% instances), conj (23; 18% instances), nsubj (11; 9% instances), discourse:sp (8; 6% instances), obj (8; 6% instances), ccomp (5; 4% instances), discourse (5; 4% instances), advcl (3; 2% instances), reparandum (3; 2% instances), aux (2; 2% instances), cc (2; 2% instances), xcomp (2; 2% instances), case (1; 1% instances), compound:vo (1; 1% instances), mark (1; 1% instances), obl (1; 1% instances), parataxis (1; 1% instances)

Children of AUX nodes belong to 11 different parts of speech: ADV (28; 22% instances), PUNCT (27; 21% instances), AUX (23; 18% instances), PRON (14; 11% instances), VERB (14; 11% instances), PART (9; 7% instances), NOUN (8; 6% instances), CCONJ (3; 2% instances), ADJ (1; 1% instances), INTJ (1; 1% instances), SCONJ (1; 1% instances)