
This page pertains to UD version 2.

UD Old East Slavic RNC

Language: Old East Slavic (code: orv)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v2.4 release.

The following people have contributed to making this treebank part of UD: Olga Lyashevskaya, Dmitri Sitchinava.

Repository: UD_Old_East_Slavic-RNC
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.14

License: CC BY-SA 4.0

Genre: legal, nonfiction

Questions, comments? General annotation questions (either Old East Slavic-specific or cross-linguistic) can be raised in the main UD issue tracker. Bugs in this treebank can be reported in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [olesar (æt) yandex • ru]. Development of the treebank happens outside the UD repository, so bugs must be fixed either in the original data source or in the conversion procedure. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually
UPOS annotated manually, natively in UD style
XPOS annotated manually
Features annotated manually, natively in UD style
Relations annotated manually, natively in UD style

Description

UD_Old_East_Slavic-RNC is a sample of the Middle Russian corpus (1300-1700), a part of the Russian National Corpus. The data were originally annotated according to the RNC and extended UD-Russian morphological schemas and the UD 2.4 dependency schema.

The Middle Russian Corpus is a part of the Russian National Corpus and belongs to its collection of historical corpora (Sichinava 2014). The list of part-of-speech and core grammatical tags is available in a separate document, which also shows the mapping between the RNC and UD tags. The annotation project is maintained by the Vinogradov Institute of the Russian Language RAS (Moscow) in collaboration with researchers and students of the National Research University Higher School of Economics (Moscow) and Lomonosov Moscow State University.

Acknowledgments

We are immensely grateful to Irina Juryeva, Maria Ermolova, Roman Ilushin, and Elizaveta Bunina, who contributed to the annotation of the original Middle Russian Corpus data. We would like to thank Anna Pichhadze, Alexandr Moldovan, Vladimir Plungian, Roman Krivko, Yves Scherrer, Achim Rabus, and Hanne Eckhoff for fruitful discussion and advice.

References

Text collection

Statistics of UD Old East Slavic RNC

POS Tags

ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, SYM, VERB, X

Features

Abbr, Analyt, Animacy, Aspect, Case, Clitic, Degree, Foreign, Gender, InflClass, Mood, NameType, Number, NumForm, NumType, Person, Polarity, Poss, PronType, Reflex, Tense, Typo, Variant, VerbForm, Voice

Relations

acl, acl:relcl, advcl, advmod, amod, appos, aux, aux:pass, case, cc, ccomp, compound, conj, cop, csubj, dep, det, discourse, dislocated, expl, expl:pv, fixed, flat, flat:foreign, flat:name, goeswith, iobj, list, mark, nmod, nsubj, nsubj:outer, nsubj:pass, nummod, nummod:gov, obj, obl, obl:agent, obl:depict, obl:float, obl:pronmod, obl:tmod, orphan, parataxis, parataxis:discourse, punct, reparandum, root, vocative, xcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
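
The restriction above (verb parent, noun or pronoun child) can be illustrated with a minimal sketch that counts such relations in CoNLL-U data. This is not the script used to produce the UD statistics. The function names and the sample sentence are hypothetical, the token forms are placeholders and not treebank data, and the sketch assumes that "nouns or pronouns" means the UPOS tags NOUN and PRON only (PROPN could be added to the set if proper nouns should count).

```python
from collections import Counter

# Assumption: "nouns or pronouns" is read as these two UPOS tags.
CHILD_UPOS = {"NOUN", "PRON"}

# Invented CoNLL-U fragment with placeholder forms (not from the treebank).
SAMPLE = "\n".join([
    "# text = placeholder sentence",
    "1\ttok1\ttok1\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\ttok2\ttok2\tVERB\t_\t_\t0\troot\t_\t_",
    "3\ttok3\ttok3\tADP\t_\t_\t4\tcase\t_\t_",
    "4\ttok4\ttok4\tNOUN\t_\t_\t2\tobl\t_\t_",
    "5\ttok5\ttok5\tPRON\t_\t_\t2\tobj\t_\t_",
    "6\ttok6\ttok6\tADJ\t_\t_\t4\tamod\t_\t_",
    "7\ttok7\ttok7\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])


def count_sentence(rows):
    """Count relations whose parent is a VERB and whose child is NOUN/PRON."""
    upos_by_id = {row[0]: row[3] for row in rows}
    counts = Counter()
    for row in rows:
        child_upos, head, deprel = row[3], row[6], row[7]
        if child_upos in CHILD_UPOS and upos_by_id.get(head) == "VERB":
            counts[deprel] += 1
    return counts


def verb_nominal_relations(conllu_text):
    """Aggregate verb-to-nominal relation counts over all sentences."""
    totals = Counter()
    sentence = []
    # The extra empty string flushes the last sentence if the text
    # does not end with a blank line.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue  # sentence-level comment
        if not line.strip():
            totals.update(count_sentence(sentence))
            sentence = []
            continue
        cols = line.split("\t")
        # Keep only regular tokens; skip multiword ranges ("1-2")
        # and empty nodes ("1.1").
        if len(cols) == 10 and cols[0].isdigit():
            sentence.append(cols)
    return totals


if __name__ == "__main__":
    for deprel, n in sorted(verb_nominal_relations(SAMPLE).items()):
        print(deprel, n)
```

To apply the sketch to real data, the contents of a CoNLL-U file from the repository can be read into a string and passed to `verb_nominal_relations` in place of `SAMPLE`.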

Reflexive Verbs

Verbs with Reflexive Core Objects

Relations Overview