This page pertains to UD version 2.

UD for Ottoman Turkish

This is a short introduction to the UD annotation of Ottoman Turkish.

Tokenization and Word Segmentation

In general, tokens are delimited by whitespace characters and punctuation. This also applies to clitics and grammatical particles such as the ezafeh, “ki,” and the question clitic “mI.” Copulas may be written either attached to a word or as separate words; they are treated as distinct tokens only when written separately.
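
As a rough illustration of the whitespace-and-punctuation rule, a naive splitter can be sketched in Python. The function name `naive_tokenize` and the transliterated input are hypothetical and chosen only for illustration. This is not the tool used to produce the annotation, it makes no attempt to handle copulas attached to words, and on Arabic-script input it may split off combining diacritics as separate tokens.

```python
import re
from typing import List

# One token per run of word characters, or per single punctuation mark.
_TOKEN_RE = re.compile(r"\w+|[^\w\s]")

def naive_tokenize(text: str) -> List[str]:
    """Split on whitespace and treat each punctuation mark as its own token."""
    return _TOKEN_RE.findall(text)

# Hypothetical transliterated input: a verb followed by the question clitic.
print(naive_tokenize("geldi mi?"))  # ['geldi', 'mi', '?']
```

In this sketch the question clitic, written as a separate word, comes out as its own token, as does the final punctuation mark.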

Morphology

Features

The main inflectional features of nouns are Case, Number, and Person.
The notable inflectional features of verbs are Aspect, Case (if nominalized), Evident, Mood, Number, Person, Polarity (negative or positive depending on the semantic content), Tense, VerbForm, and Voice.
Subject-verb agreement in Number and Person is reflected in the morphological features of nouns and verbs.
Note that the feature Voice has two values (Cau and Pass). A verb can be both causative and passive at once, but this combination is not reflected in the features.
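
To make the feature inventory concrete, here is a minimal Python sketch that reads a FEATS string in the standard CoNLL-U layout (`Name=Value` pairs joined by `|`, with `_` for an empty column). The helper `parse_feats` and the sample feature string are hypothetical and not taken from a treebank; only the feature names come from the lists above.

```python
from typing import Dict

def parse_feats(feats: str) -> Dict[str, str]:
    """Turn a CoNLL-U FEATS string into a feature-name -> value mapping."""
    if feats == "_":  # "_" marks an empty FEATS column
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical feature string for a finite passive verb (illustration only).
sample = "Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Past|Voice=Pass"
features = parse_feats(sample)

print(features["Voice"])     # Pass
print(features["Polarity"])  # Pos
```

In this sketch each feature name maps to a single value, so the mapping holds one `Voice` entry per verb.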

Syntax

The following relation subtypes are used in Ottoman Turkish:
- advmod:emph
- cc:preconj
- compound:lvc
- compound:redup
- csubj:pass
- dep:der
- discourse:q
- nmod:part
- nmod:poss
- nsubj:pass
- obl:agent
- obl:cau
- obl:tmod
- flat:name
- aux:q
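
The list above can be used, for example, to check the relation labels in annotated data. The following Python sketch is only an illustration under assumptions: the names `LISTED_SUBTYPES`, `split_deprel`, and `is_listed_subtype` are hypothetical, and the set simply copies the subtypes listed on this page.

```python
from typing import Optional, Tuple

# Relation subtypes copied from the list above.
LISTED_SUBTYPES = {
    "advmod:emph", "cc:preconj", "compound:lvc", "compound:redup",
    "csubj:pass", "dep:der", "discourse:q", "nmod:part", "nmod:poss",
    "nsubj:pass", "obl:agent", "obl:cau", "obl:tmod", "flat:name", "aux:q",
}

def split_deprel(deprel: str) -> Tuple[str, Optional[str]]:
    """Split a label such as 'nmod:poss' into its base relation and subtype."""
    base, _, subtype = deprel.partition(":")
    return base, (subtype or None)

def is_listed_subtype(deprel: str) -> bool:
    """Return True if the full label is one of the subtypes listed above."""
    return deprel in LISTED_SUBTYPES

print(split_deprel("nmod:poss"))      # ('nmod', 'poss')
print(split_deprel("nsubj"))          # ('nsubj', None)
print(is_listed_subtype("obl:tmod"))  # True
```

A label without a colon, such as `nsubj`, is a plain relation rather than a subtype, so `is_listed_subtype` returns `False` for it.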

Treebanks