home edit page issue tracker

This page pertains to UD version 2.

UD Ukrainian IU

Language: Ukrainian (code: uk)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v1.4 release.

The following people have contributed to making this treebank part of UD: Natalia Kotsyba, Bohdan Moskalevskyi, Mykhailo Romanenko.

Repository: UD_Ukrainian-IU
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.6

License: CC BY-NC-SA 4.0

Genre: blog, email, fiction, grammar-examples, legal, news, reviews, social, web, wiki

Questions, comments? General annotation questions (either Ukrainian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [org (æt) mova • institute]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually
UPOS annotated manually, natively in UD style
XPOS annotated manually
Features annotated manually, natively in UD style
Relations annotated manually, natively in UD style

Description

Gold standard Universal Dependencies corpus for Ukrainian, developed for UD originally, by Institute for Ukrainian, NGO. [українською]

UD Ukrainian comprises 122K tokens in 7000 sentences of fiction, news, opinion articles, Wikipedia, legal documents, letters, posts, and comments — from the last 15 years, as well as from the first half of the 20th century.

Consider using the latest version at ‘dev’ branch on GitHub. It contains the latest stable improvements while the official releases are up to 6 month old [discussion].

Acknowledgments

Major contributors: Natalia Kotsyba, Bohdan Moskalevskyi, Mykhailo Romanenko.

Large portion of annotation was made by Halyna Samoridna, Ivanka Kosovska, Olha Lytvyn, Oksana Orlenko and by students of Kyiv-Mohyla Academy department of Ukrainian language (headed by Liudmyla Dyka): Hanna Brovko, Bohdana Matushko, Natalia Onyshchuk, Valeriia Pareviazko, Yaroslava Rychyk, Anastasiia Stetsenko, Snizhana Umanets.

We thank Prof. Larysa Masenko for guidance.

Statistics of UD Ukrainian IU

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJSYMVERBX

Features

AbbrAnimacyAnimacy[gram]AspectCaseDegreeForeignGenderHyphMoodNameTypeNumberNumTypeOrthPartTypePersonPolarityPossPronTypePunctTypeReflexTenseUninflectVariantVerbFormVoice

Relations

aclacl:advacl:relcladvcladvcl:spadvcl:svcadvmodadvmod:detamodapposauxcaseccccompcompoundcompound:svcconjconj:svccopcsubjdetdet:numgovdet:nummoddiscoursedislocatedexplfixedflatflat:absflat:foreignflat:nameflat:rangeflat:repeatflat:siblflat:titlegoeswithiobjlistmarknmodnsubjnummodnummod:govobjoblorphanparataxisparataxis:discourseparataxis:newsentparataxis:relpunctreparandumrootvocativevocative:clxcompxcomp:sp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview