home edit page issue tracker

This page pertains to UD version 2.

UD Ottoman Turkish BOUN

Language: Ottoman Turkish (code: ota)
Family: Turkic, Southwestern

This treebank has been part of Universal Dependencies since the UD v2.14 release.

The following people have contributed to making this treebank part of UD: Şaziye Betül Özateş, Tarık Emre Tıraş, Efe Eren Genç, Esma Fatıma Bilgin Taşdemir.

Repository: UD_Ottoman_Turkish-BOUN
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.14

License: CC BY-SA 4.0

Genre: fiction, nonfiction

Questions, comments? General annotation questions (either Ottoman Turkish-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [saziye • ozates (æt) bogazici • edu • tr]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas assigned by a program, with some manual corrections, but not a full manual verification
UPOS assigned by a program, with some manual corrections, but not a full manual verification
XPOS assigned by a program, with some manual corrections, but not a full manual verification
Features assigned by a program, not checked manually
Relations annotated manually, natively in UD style


An Ottoman Turkish dependency treebank annotated in UD style. Created by Şaziye Betül Özateş, Tarık Emre Tıraş, Efe Eren Genç from Boğaziçi University, and Esma Fatıma Bilgin Taşdemir from Medeniyet University.

This is an Ottoman Turkish dependency treebank in the Universal Dependencies (UD) annotation style. Ottoman Turkish is one of the historical versions of modern Turkish. The OTA-BOUN Treebank includes 514 manually annotated sentences from ten different texts by seven different writers. All of the texts are from literature published between 1900 and 1928.


You can use the following reference for the treebank:

title = "Dependency Annotation of {O}ttoman {T}urkish with Multilingual {BERT}",
author = {{\"O}zate{\c{s}}, {\c{S}}aziye and T{\i}ra{\c{s}}, Tar{\i}k and Gen{\c{c}}, Efe and Bilgin Tasdemir, Esma},
booktitle = "Proceedings of The 18th Linguistic Annotation Workshop (LAW-XVIII)",
month = mar,
year = "2024",
address = {St. Julians, Malta},
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.law-1.18",
pages = "188--196",


Statistics of UD Ottoman Turkish BOUN

POS Tags






Tokenization and Word Segmentation



Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features


Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview