home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Cantonese-HK: POS Tags: ADP

There are 20 ADP lemmas (2%), 49 ADP types (3%) and 219 ADP tokens (2%). Out of 15 observed tags, the rank of ADP is: 12 in number of lemmas, 7 in number of types and 11 in number of tokens.

The 10 most frequent ADP lemmas: _、 畀、 到、 喺、 之後、 同、 嗰陣時、 由、 响、 嗰時

The 10 most frequent ADP types: 喺、 到、 畀、 根據、 之後、 由、 有關、 上、 同、 嗰陣時

The 10 most frequent ambiguous lemmas: _ (PUNCT 1377, VERB 1352, NOUN 1283, ADV 853, PART 764, PRON 662, AUX 335, DET 217, ADJ 209, ADP 140, NUM 124, SCONJ 101, CCONJ 93, INTJ 92, PROPN 52), 畀 (VERB 31, ADP 16), 到 (ADP 14, VERB 9), 喺 (ADP 13, VERB 10), 之後 (ADP 4, NOUN 1), 同 (VERB 18, ADP 4), 嗰陣時 (NOUN 8, ADP 4), 嗰時 (PRON 4, ADP 3), 度 (ADP 3, NOUN 1), 好似 (ADV 5, ADP 2)

The 10 most frequent ambiguous types: 喺 (ADP 62, VERB 13), 到 (ADP 24, VERB 23, PART 1), 畀 (VERB 54, ADP 21), 根據 (ADP 11, SCONJ 1, VERB 1), 之後 (ADP 9, NOUN 3), 有關 (ADP 7, ADJ 3), 上 (ADP 4, DET 1), 同 (VERB 43, ADP 4, CCONJ 2), 嗰陣時 (NOUN 9, ADP 4, SCONJ 1), 將 (ADP 4, ADV 1)

Morphology

The form / lemma ratio of ADP is 2.450000 (the average of all parts of speech is 1.624294).

The 1st highest number of forms (38) was observed with the lemma “_”: 上, 上高, 之內, 之前, 之後, 以至, 作為, 依照, 到, 到到, 去, 去到, 同埋, 向, 喺, 因為, 將, 對, 對於, 就, 就住, 底下, 度, 建基於, 後, 按照, 時候, 有關, 根據, 比, 為咗, 由, 由於, 畀, 至於, 透過, 關於, 除咗.

The 2nd highest number of forms (1) was observed with the lemma “之後”: 之後.

The 3rd highest number of forms (1) was observed with the lemma “之間”: 之間.

ADP does not occur with any features.

Relations

ADP nodes are attached to their parents using 13 different relations: case (143; 65% instances), mark (34; 16% instances), case:loc (16; 7% instances), advcl:coverb (5; 2% instances), acl (4; 2% instances), reparandum (4; 2% instances), ccomp (3; 1% instances), compound:ext (3; 1% instances), mark:rel (2; 1% instances), root (2; 1% instances), advcl (1; 0% instances), cc (1; 0% instances), obl:tmod (1; 0% instances)

Parents of ADP nodes belong to 10 different parts of speech: NOUN (116; 53% instances), VERB (54; 25% instances), PRON (26; 12% instances), PROPN (12; 5% instances), ADJ (5; 2% instances), (2; 1% instances), ADP (1; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances), PART (1; 0% instances)

198 (90%) ADP nodes are leaves.

13 (6%) ADP nodes have one child.

5 (2%) ADP nodes have two children.

3 (1%) ADP nodes have three or more children.

The highest child degree of a ADP node is 7.

Children of ADP nodes are attached using 9 different relations: obj (13; 34% instances), punct (9; 24% instances), advmod (5; 13% instances), nsubj (4; 11% instances), mark:rel (3; 8% instances), advcl:coverb (1; 3% instances), amod (1; 3% instances), discourse (1; 3% instances), reparandum (1; 3% instances)

Children of ADP nodes belong to 11 different parts of speech: NOUN (12; 32% instances), PUNCT (9; 24% instances), ADV (4; 11% instances), PART (3; 8% instances), PRON (3; 8% instances), PROPN (2; 5% instances), ADJ (1; 3% instances), ADP (1; 3% instances), CCONJ (1; 3% instances), INTJ (1; 3% instances), VERB (1; 3% instances)