UD for Azerbaijani
Tokenization and Word Segmentation
- In general, words are delimited by whitespace characters.
- Punctuation marks are tokenized as separate tokens (words).
- Hyphenated fixed expressions and compounds are kept as three tokens (the two parts and the hyphen): tez-tez, sil-süpür.
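The three rules above can be sketched as a small regular-expression tokenizer. This is only an illustration, not the treebank's actual tokenizer: the name `tokenize` and the pattern are assumptions, and the pattern treats every non-letter, non-space character (including apostrophes) as punctuation, which a real pipeline would probably refine.

```python
import re

# Assumed simplification: a word is a run of letters/digits (\w is
# Unicode-aware in Python 3, so ə, ü, ş, etc. are covered); every other
# non-space character, including the hyphen, becomes its own token.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")


def tokenize(text):
    """Split text on whitespace and separate punctuation from words."""
    return TOKEN_RE.findall(text)


print(tokenize("tez-tez"))     # ['tez', '-', 'tez']
print(tokenize("sil-süpür."))  # ['sil', '-', 'süpür', '.']
```

Because the hyphen is matched by the punctuation branch, a hyphenated compound naturally comes out as three tokens without any special casing.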
Morphology
Tags
- Azerbaijani uses 14 universal POS categories, including:
  - ADJ: yaxçı, böyük, etc.
  - ADP: qədər, için, -ki, etc.
  - ADV: daha, də, fәqәt, etc.
  - AUX: i, dəyil, etc.
  - CCONJ: amma, və, ya, etc.
  - DET: o, bir, bu, etc.
  - INTJ: Pəs.
  - NOUN: pәncәrә, maşɪn, etc.
  - NUM: bir, neçә, beş, etc.
  - PRON: o, mən, etc.
  - PROPN: Deniz, Sam, etc.
  - PUNCT: -, ?, ., etc.
  - SCONJ: ki, əyər, etc.
  - VERB: getdi, fikr eliyәm, yudurtdu, yazdığını, etc.
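As a quick cross-check, the tag list above can be compared against the full UD inventory of 17 universal POS tags. The sketch below assumes the 14 tags listed here are the complete set used for Azerbaijani; the names `UD_UPOS` and `AZ_UPOS` are chosen only for this illustration.

```python
# The 17 universal POS tags defined by UD.
UD_UPOS = {
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
}

# The 14 tags listed for Azerbaijani in this section (assumed complete).
AZ_UPOS = {
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PRON", "PROPN", "PUNCT", "SCONJ", "VERB",
}

print(len(AZ_UPOS))               # 14
print(sorted(UD_UPOS - AZ_UPOS))  # ['PART', 'SYM', 'X']
```

Under that assumption, a validator could flag any token tagged PART, SYM, or X as unexpected for this language.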
Features
- Azerbaijani nouns are typically suffixed. However, a small number of prefixes are borrowed mostly from Persian.
- Azerbaijani, like all other Turkic languages, is devoid of grammatical gender. Gender can, however, be expressed lexically if required.
- In Number marking of nouns, there are two values: Singular and Plural; Singular forms are unmarked.
- There are seven Cases in Azerbaijani: NOM, GEN, DAT, ACC, COM/INS, LOC, ABL. The NOM case does not take a suffix.
- Yes-no questions can be constructed in two ways: (i) by using an interrogative intonation contour, and (ii) by adding the interrogative particle aya.
Inflectional Structure Representation
- Noun + (Plural marker) + (Possessive pronoun) + (Case marker)
- Adjective + (Comparative/Superlative degree marker)
- Verb stem + (Reciprocal marker) + (Causative marker) + (Voice marker) + (Reflexive marker) + (Negation marker) + (Mood marker) + {(Imperfective Aspect marker) + (Perfective Aspect marker)} + (Tense marker) + Person and Number marker.
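The templates above describe ordered slots in which the parenthesized markers are optional. One way to make that concrete is to store a template as an ordered list of slot names and check that the markers observed on a word appear in that order. The sketch below does this for the noun template only; `NOUN_SLOTS` and `follows_template` are hypothetical names, and the slot labels are taken from the template rather than from any annotation scheme.

```python
# Ordered optional slots that may follow a noun stem, per the noun template.
NOUN_SLOTS = ["Plural", "Possessive", "Case"]


def follows_template(template, observed):
    """Return True if `observed` is an in-order subsequence of `template`.

    Every slot is treated as optional, so any subset is accepted as long
    as the relative order of the template is preserved.
    """
    remaining = iter(template)
    # `slot in remaining` advances the iterator until the slot is found,
    # so a later observed slot can only match later template positions.
    return all(slot in remaining for slot in observed)


print(follows_template(NOUN_SLOTS, ["Plural", "Case"]))  # True
print(follows_template(NOUN_SLOTS, ["Case", "Plural"]))  # False
print(follows_template(NOUN_SLOTS, []))                  # True (bare stem)
```

The verb template could be handled the same way with a longer slot list, plus an extra check that the obligatory person and number marker is present.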
Syntax
Core Arguments, Oblique Arguments and Adjuncts
- Nominal subject (nsubj) is a noun phrase in the nominative case, without adposition.
  - A subordinate clause may serve as the subject and is labeled csubj.
- Object (obj) is a noun phrase without adposition and typically in the accusative case.
- Oblique (obl) is a non-core nominal (noun, pronoun, noun phrase) argument or adjunct.
Relations Overview
- The following relation subtypes are used in Azerbaijani:
  - compound:redup for reduplicated compounds
  - csubj:outer for the clausal subject of an outer clause
  - nsubj:outer for the nominal subject of an outer clause
  - advmod:emph for emphasizing words and intensifiers
  - compound:lvc for light-verb constructions
- The following relation types are not used in Azerbaijani at all:
  - dep, dislocated, expl, goeswith, iobj, list, reparandum
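A list of unused relations like this one lends itself to a simple consistency check over CoNLL-U data. The sketch below scans the DEPREL column (the eighth of the ten tab-separated CoNLL-U columns) and reports any row whose base relation is on the list. The function name `find_unused_relations` is an assumption for this example, and the sample rows are made up purely to exercise the check; they are not taken from the treebank.

```python
# Relation types stated above as not used in Azerbaijani.
UNUSED = {"dep", "dislocated", "expl", "goeswith", "iobj", "list", "reparandum"}


def find_unused_relations(conllu_text):
    """Return (line_number, form, deprel) for rows using an unused relation."""
    hits = []
    for line_no, line in enumerate(conllu_text.splitlines(), start=1):
        if not line.strip() or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U token row
        deprel = cols[7]
        # Compare the base relation so that subtypes are caught as well.
        if deprel.split(":")[0] in UNUSED:
            hits.append((line_no, cols[1], deprel))
    return hits


# Made-up rows for illustration only; the last one is deliberately wrong.
sample = "\n".join([
    "# illustrative fragment, not real treebank data",
    "\t".join(["1", "o", "_", "PRON", "_", "_", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "getdi", "_", "VERB", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["3", ".", "_", "PUNCT", "_", "_", "2", "dep", "_", "_"]),
])

print(find_unused_relations(sample))  # [(4, '.', 'dep')]
```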
Treebanks
There is 1 Azerbaijani UD treebank: