home edit page issue tracker

This page pertains to UD version 2.

Universal Dependencies v2

Executive summary of changes from v1 to v2

This is the online documentation for Universal Dependencies, version 2 (2016-12-01). Note: The treebanks listed below still follow the v1 guidelines available here.

Upcoming UD-related events

Want to know more about UD?

If you want to receive news about Universal Dependencies, you can subscribe to the UD mailing list.

UD Treebanks

Ancient Greek 219K
Ancient Greek-PROIEL 188K -
Arabic 217K -
Basque 97K
Bulgarian 140K
Catalan 472K
Chinese 111K
Coptic 4K
Croatian 132K -
Czech 1,330K
Czech-CAC 482K
Czech-CLTT 30K
Danish 94K
Dutch 203K -
Dutch-LassySmall 93K -
English 229K
English-ESL 88K
English-LinES 74K
Estonian 210K -
Faroese 132K -
Finnish 171K
Finnish-FTB 143K -
French 384K
Galician 109K
Galician-TreeGal 21K
German 277K -
Gothic 50K -
Greek 53K
Hebrew 106K -
Hindi 316K -
Hungarian 37K
Indonesian 110K -
Irish 19K
Italian 262K
Japanese 213K -
Japanese-KTC 189K
Kazakh 5K
Korean 89K - - -
Latin 42K -
Latin-ITTB 284K -
Latin-PROIEL 150K -
Latvian 16K -
Norwegian-Bokmaal 280K
Norwegian-Nynorsk 280K
Old Church Slavonic 52K -
Persian 135K
Polish 76K -
Portuguese 204K -
Portuguese-BR 268K -
Portuguese-Bosque 208K
Romanian 191K
Russian 89K
Russian-SynTagRus 960K
Sanskrit <1K -
Slovak 93K -
Slovenian 126K
Slovenian-SST 26K
Spanish 415K
Spanish-AnCora 495K
Swedish 76K
Swedish-LinES 71K
Swedish Sign Language <1K -
Tamil 6K -
Turkish 48K
Ukrainian 1K
Uyghur 10K -
Vietnamese 37K -

Upcoming UD Treebanks

Amharic - - ? -
Arabic-LDC - -
Buryat - -
Cantonese - -
Chinese-HK - -
Kurmanji - - ?
Marathi - -
Serbian - -
Somali - -
Sorani - - ?
Urdu - -

Disclaimer: Our use of flags to symbolise languages is only intended as a visual enhancement of the website and should not be interpreted as a political statement in any way.


The data is released through LINDAT/CLARIN.

Query online

You can query the UD treebanks on-line using

Language family documentation (experimental)