
This page pertains to UD version 2.

UD Cantonese

Language: Cantonese (code: yue)
Family: Sino-Tibetan

This treebank has been part of Universal Dependencies since the UD v2.1 release.

The following people have contributed to making this treebank part of UD: Kim Gerdes, John Lee, Herman Leung, Tak-sum Wong.

Repository: UD_Cantonese

License: CC BY-SA 4.0

Genre: spoken

Questions, comments? General annotation questions (either Cantonese-specific or cross-linguistic) can be raised in the main UD issue tracker. Bugs in this treebank can be reported in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [tswong-c (æt) my • cityu • edu • hk; jsylee (æt) cityu • edu • hk].

Annotation: Source
Lemmas: not available
UPOS: annotated manually, natively in UD style
XPOS: not available
Features: not available
Relations: annotated manually, natively in UD style
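
As a rough illustration of what this annotation summary means in practice, the sketch below reports which CoNLL-U columns carry a real value. It relies only on the general CoNLL-U convention of ten tab-separated fields per token, with "_" marking an unspecified value. The function name filled_layers and the three-token sample sentence are made up for this example and are not taken from the treebank; the actual files may also fill other columns (for instance MISC), which this sketch does not assume either way.

```python
# Column names defined by the CoNLL-U format (ten tab-separated fields per token).
CONLLU_FIELDS = ["id", "form", "lemma", "upos", "xpos",
                 "feats", "head", "deprel", "deps", "misc"]


def filled_layers(conllu_text):
    """Return the set of CoNLL-U columns that hold a real value in at least one token."""
    filled = set()
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence boundaries) and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != len(CONLLU_FIELDS):
            continue  # not a well-formed token line
        for name, value in zip(CONLLU_FIELDS, cols):
            if value != "_":  # "_" means the value is unspecified
                filled.add(name)
    return filled


# Hypothetical sample sentence, written for illustration only (not treebank data).
SAMPLE = "\n".join([
    "# text = 我食飯",
    "1\t我\t_\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\t食\t_\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t飯\t_\tNOUN\t_\t_\t2\tobj\t_\t_",
])

print(sorted(filled_layers(SAMPLE)))
```

Run against a real file from the repository, a check like this would be expected to show UPOS and relation columns filled and the lemma, XPOS and feature columns empty, if the file matches the summary above.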




This work was partially supported by a grant from the PROCORE-France/Hong Kong Joint Research Scheme sponsored by the Research Grants Council and the Consulate General of France in Hong Kong (Reference No.: F-CityU107/15 and N 35322RG); and by two Strategic Research Grants (Project No. 7004494 and No. 7004736) from City University of Hong Kong.

Statistics of UD Cantonese

POS Tags





Tokenization and Word Segmentation



Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features


Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
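
The restriction described here can be sketched as a filter over CoNLL-U token lines: keep a relation only when the parent is tagged VERB and the child is a noun or pronoun. This is a minimal sketch, not the script used to generate the statistics. The function name verb_nominal_relations is hypothetical, the choice to count PROPN alongside NOUN and PRON is an assumption, and the sample sentence is invented for illustration.

```python
from collections import Counter

# Assumption: "nouns or pronouns" is taken to cover these three UPOS tags.
NOMINAL_UPOS = {"NOUN", "PROPN", "PRON"}


def verb_nominal_relations(sentence_lines):
    """Count relation labels where the parent is a VERB and the child is nominal."""
    tokens = {}
    for line in sentence_lines:
        cols = line.split("\t")
        # Keep only plain token lines; this skips comments, multiword-token
        # ranges such as "1-2", and empty nodes such as "1.1".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        tokens[cols[0]] = (cols[3], cols[6], cols[7])  # (UPOS, HEAD, DEPREL)

    counts = Counter()
    for upos, head, deprel in tokens.values():
        parent = tokens.get(head)  # None for the root, whose HEAD is "0"
        if parent is not None and parent[0] == "VERB" and upos in NOMINAL_UPOS:
            counts[deprel] += 1
    return counts


# Hypothetical sentence, written for illustration only (not treebank data).
EXAMPLE = [
    "# text = 我食飯",
    "1\t我\t_\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\t食\t_\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t飯\t_\tNOUN\t_\t_\t2\tobj\t_\t_",
]

print(dict(verb_nominal_relations(EXAMPLE)))
```

The function handles one sentence at a time, so a whole file would first need to be split on blank lines, since token IDs restart in every sentence.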

Relations Overview