home edit page issue tracker

This page pertains to UD version 2.

UD Uyghur UDT

Language: Uyghur (code: ug)
Family: Turkic, Southeastern

This treebank has been part of Universal Dependencies since the UD v1.4 release.

The following people have contributed to making this treebank part of UD: Marhaba Eli, Daniel Zeman, Francis Tyers.

Repository: UD_Uyghur-UDT
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.14

License: CC BY-SA 4.0

Genre: fiction

Questions, comments? General annotation questions (either Uyghur-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [marhaba (æt) xju • edu • cn]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas assigned by a program, not checked manually
UPOS annotated manually, natively in UD style
XPOS assigned by a program, with some manual corrections, but not a full manual verification
Features assigned by a program, not checked manually
Relations annotated manually, natively in UD style


The Uyghur UD treebank is based on the Uyghur Dependency Treebank (UDT), created at the Xinjiang University in Ürümqi, China.

The sentences come from literature texts / reading material for primary and middle school, including stories, records and reports.


Statistics of UD Uyghur UDT

POS Tags






Tokenization and Word Segmentation



Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features


Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview