home edit page issue tracker

This page pertains to UD version 2.

UD for Ottoman Turkish


This is a short introduction of the UD annotation for Ottoman Turkish.

Tokenization and Word Segmentation


In general, tokens are delimited by whitespace characters and punctuation. This includes clitics and grammatical particles such as the ezafeh, “ki,” and the question clitic “mI.” Copulas can either be adjoined to words or be separate words, which determines whether they’re seen as distinct tokens or not.

Morphology


Tags

Features

Syntax


Treebanks


As of UD 2.14, there are two Ottoman Turkish UD treebanks: