home edit page issue tracker

This page pertains to UD version 2.

UD Old Russian RNC

Language: Old Russian (code: orv)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v2.4 release.

The following people have contributed to making this treebank part of UD: Olga Lyashevskaya.

Repository: UD_Old_Russian-RNC
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.2

License: CC BY-SA 4.0

Genre: legal, nonfiction

Questions, comments? General annotation questions (either Old Russian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [olesar (æt) yandex • ru]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually, natively in UD style
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually, natively in UD style

Description

UD_Old_Russian-RNC is a sample of the Middle Russian corpus (1300-1700), a part of the Russian National Corpus. The data were originally annotated according to the RNC and extended UD-Russian morphological schemas and UD 2.4 dependency schema.

Middle Russian Corpus (http://ruscorpora.ru/search-mid_rus.html) is part of the Russian National Corpus included in the collection of historical corpora [Sichinava 2014]. The lists of part-of-speech and core grammatical tags is available at: https://github.com/olesar/UD_MidRussian/blob/master/MidRussianUD.md, the document also shows the mapping between the RNC and UD tags. The annotation project is maintained by Vinogradov Institute of the Russian Language RAS (Moscow) in collaboration with researchers and students of the National Research University Higher School of Economics (Moscow) and Lomonosov Moscow State University.

Acknowledgments

We are immensely grateful to Irina Juryeva, Roman Ilushin, Maria Skachedubova, and Elizaveta Bunina who contributed to the annotation of the original Middle Russian Corpus data. We would like to thank Dmitri Sitchinava, Anna Pichhadze, Alexandr Moldovan, Vladimir Plungian, Roman Krivko, Yves Scherrer, Achim Rabus, Hanne Eckhoff for fruitful discussion and advice.

References

Statistics of UD Old Russian RNC

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJSYMVERBX

Features

AnalytAnimacyAspectCaseDegreeGenderMoodNumberNumFormPersonPolarityPossPronTypeReflexTenseVariantVerbFormVoice

Relations

aclacl:relcladvcladvmodamodapposauxaux:passcaseccccompcompoundconjcopcsubjdepdetdiscoursedislocatedexplfixedflatflat:nameiobjmarknmodnsubjnsubj:passnummodnummod:govobjoblobl:agentorphanparataxispunctrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview