home edit page issue tracker

This page pertains to UD version 2.

UD Ruuli RDT

Language: Ruuli (code: ruc)
Family: Niger-Congo

This treebank has been part of Universal Dependencies since the UD v2.18 release.

The following people have contributed to making this treebank part of UD: Kira Tulchynska, Anna Veselovsky, Alena Witzlack-Makarevich.

Repository: UD_Ruuli-RDT
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.18

License: CC BY-NC-SA 4.0

Genre: fiction, grammar-examples, nonfiction, spoken

Questions, comments? General annotation questions (either Ruuli-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [kira • tulchynska (æt) mail • huji • ac • il]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas annotated manually in non-UD style, automatically converted to UD
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually in non-UD style, automatically converted to UD
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually, natively in UD style

Description

UD_Ruuli-RDT is a Universal Dependencies (UD) treebank for the Ruruuli-Lunyala (Ruuli) language. The annotation was converted from interlinear glossed text and manually annotated for syntactic relations. The treebank includes texts from various sources: conversations, oral folktales, biographic monologue, movie subtitles, grammar examples, and factual prose. The treebank contains approximately 6,000 tokens.

The UD_Ruuli-RDT treebank consists of texts recorded in Ruuli or translated into it by native speakers, and subsequently glossed and annotated. The included texts are:

All sentences were converted from interlinear glossed text into CoNLL-U format using a custom conversion script. The syntactic relations were subsequently manually annotated following the UD framework.

Sentences from written texts and conversations were shuffled to anonymize the data.

Acknowledgments

References

Statistics of UD Ruuli RDT

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJVERBX

Features

AbbrAspectAspect[add]DeixisExtPosForeignHortInfStructMoodNounClassNounClass[iobj]NounClass[obj]NounClass[psed]NounClass[psor]NumberNumber[iobj]Number[obj]NumFormNumTypePersonPerson[iobj]Person[obj]Person[psed]Person[psor]PolarityPossPronTypeRedReferentTenseVerbFormVoiceVoice[add]

Relations

aclacl:relcladvcladvcl:relcladvmodadvmod:copadvmod:emphadvmod:locamodapposauxcaseccccompcompoundconjcopcsubjcsubj:outerdetdiscoursedislocatedfixedflatflat:foreignflat:nameflat:numiobjiobj:applmarknmodnmod:descnmod:possnsubjnsubj:outernummodobjobj:applobj:causoblobl:agentparataxispunctreparandumrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview