home edit page issue tracker

This page pertains to UD version 2.

UD Lithuanian ALKSNIS

Language: Lithuanian (code: lt)
Family: Indo-European, Baltic

This treebank has been part of Universal Dependencies since the UD v2.4 release.

The following people have contributed to making this treebank part of UD: Andrius Utka, Erika Rimkutė, Agnė Bielinskienė, Jolanta Kovalevskaitė, Loïc Boizou, Gabrielė Aleksandravičiūtė, Kristina Brokaitė, Daniel Zeman, Natalia Perkova, Bernadeta Griciūtė.

Repository: UD_Lithuanian-ALKSNIS
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13

License: CC BY-SA 4.0

Genre: news, nonfiction, legal, fiction

Questions, comments? General annotation questions (either Lithuanian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [semantikalt (æt) gmail • com, andrius • utka (æt) vdu • lt, zeman (æt) ufal • mff • cuni • cz]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually in non-UD style, automatically converted to UD

Description

The Lithuanian dependency treebank ALKSNIS v3.0 (Vytautas Magnus University).

This is a conversion of the ALKSNIS treebank to Universal Dependencies. The original treebank was annotated in a style derived from the Prague Dependency Treebank of Czech. The original treebank is available at https://github.com/Semantika2/Alksnis-v.3.0. ALKSNIS v2.1 is published in the CLARIN LT repository at http://hdl.handle.net/20.500.11821/10. (Some users experience DNS errors when trying to access the repository; configuring the client machine to use 8.8.8.8 as the DNS server may help. See also http://clarin-lt.lt/?page_id=86.)

ALKSNIS v3 consists of 3,643 syntactically annotated sentences. Each node of a tree corresponds to a word, a punctuation mark or other text element (symbol, digit etc.) within a sentence. The following information is presented for each node: 1) a used form; 2) a lemma; 3) a morphology tag, and 4) a syntactic function (subject, object, etc.). Dependencies are shown by links between words.

The morphology tag set Jablonskis is used since ALKSNIS v2.2 and in the XPOS column of the UD conversion. This is a change from ALKSNIS 2 where a version of the MULTEXT-East tag set was used. Syntactically annotated sentences are corrected according to guidelines that were created by scientists of VMU CCL, following rules of Prague Dependency Treebank. All the sentences are being manually checked and corrected by a group of linguists.

Acknowledgments

From v2.1 to v3.0 the treebank was developed within the project “Semantika2” (Nr. 02.3.1-CPVA-V-527-01-0002). The project was funded by European Structural Funds.

References

Statistics of UD Lithuanian ALKSNIS

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJSYMVERBX

Features

AbbrAdpTypeAspectCaseDefiniteDegreeForeignGenderHyphMoodNumberNumFormNumTypePersonPolarityPronTypePunctTypeReflexTenseVerbFormVoice

Relations

aclacl:relcladvcladvmodadvmod:emphamodapposcaseccccompcompoundconjcopcsubjcsubj:passdepdetdiscourseflatflat:foreigniobjmarknmodnsubjnsubj:passnummodnummod:govobjoblobl:argorphanparataxispunctrootxcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview