home edit page issue tracker

This page pertains to UD version 2.

UD Classical Armenian CAVaL

Language: Classical Armenian (code: xcl)
Family: IE

This treebank has been part of Universal Dependencies since the UD v2.13 release.

The following people have contributed to making this treebank part of UD: Petr Kocharov, Lilit Kharatyan.

Repository: UD_Classical_Armenian-CAVaL
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15

License: CC BY-NC-ND 4.0

Genre: bible, fiction

Questions, comments? General annotation questions (either Classical Armenian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [petr • kocharov (æt) uni-wuerzburg • de]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
UPOS annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
XPOS not available
Features annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
Relations annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion

Description

The present release includes the Classical Armenian translation of the Gospels and the first ten chapters of the “History of the Armenians” by Movses Khorenatsi. The annotation of the Gospels results from a rule-based conversion from the PROIEL annotation, manually corrected and extended with additional information. The annotation of the “History of the Armenians” has been performed by a UDPipe2 annotator and manually corrected.

The present release includes a treebank of the Classical Armenian Gospels and the first ten chapters of the “History of the Armenians” by Movses Khorenatsi. The treebank results from a rule-based conversion of the PROIEL annotation (see Dag T. T. Haug and Marius L. Jøhndal. 2008. ‘Creating a Parallel Treebank of the Old Indo-European Bible Translations’, in: Caroline Sporleder and Kiril Ribarov (eds.), Proceedings of the Second Workshop on Language Technology for Cultural Heritage Data (LaTeCH 2008) (2008), pp. 27-34). The PROIEL annotation is based on a digitalized version of Beda O. Künzle “Das altarmenische Evangelium” (Bern/Frankfurt am Main/New York: Peter Lang, 1984); https://titus.fkidg1.uni-frankfurt.de/texte/etcc/arm/armntbk/armnt.htm. The conversion from the PROIEL to UD annotation has been performed using a rule-based convertor (Petr Kocharov, Lilit Kharatyan). The conversion result has been manually corrected (Petr Kocharov) and extended with additional morphological features and relation subtypes, spelling in the Armenian alphabet, English glosses and sentence translations. The treebank of the “History of the Armenians” is based on the digital edition of the Arak29 Project. The editions of Arak29 are adapted from the American University of Armenia’s Digital Library and other published sources. The morphological annotation of Arak29 has been automatically converted to UD with a rule-based convertor (Petr Kocharov, Lilit Kharatyan); the syntactic annotation is performed by a UDPipe2 model (Kharatyan & Kocharov 2024). All annotation has been manually corrected (Petr Kocharov).

Acknowledgments

The treebank is developed by Petr Kocharov and Lilit Kharatyan at the University of Würzburg as part of the “CAVaL: Classical Armenian Valency Lexicon” project (PI Dr. Petr Kocharov), funded by the Deutsche Forschungsgemeinschaft (DFG), project number 518003859. We thank Professor Dr. Dag T. T. Haug and the PROIEL team for the permission to reuse the PROIEL annotation of the Classical Armenian Gospels for the purposes of the UD Classical Armenian-CAVaL treebank. We thank Dr. Daniil Kocharov (Tampere University) for advisory support and programming of the module for the processing of punctuation tokens for the convertor of the Gospels. We acknowledge the permission of the Arak29 Charitable Foundation for a non-commercial use of their digital editions of Classical Armenian texts.

Statistics of UD Classical Armenian CAVaL

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJVERBX

Features

AnimacyAspectCaseConnegativeDefiniteDeixisForeignMoodNumberNumTypePersonPolarityPossPronTypeReflexTenseVerbFormVoice

Relations

acladvcladvmodamodapposauxaux:causcaseccccompcompoundcompound:redupconjcopcsubjcsubj:causcsubj:passdetdiscoursedislocatedfixedflatiobjmarknmodnsubjnsubj:causnsubj:passnummodobjoblobl:agentobl:argorphanparataxispunctrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview