home edit page issue tracker

This page pertains to UD version 2.

UD Nepali BK

Language: Nepali (code: ne)
Family: IE

This treebank has been part of Universal Dependencies since the UD v2.18 release.

The following people have contributed to making this treebank part of UD: Samuel BK, Luigi Talamo, Annemarie Verkerk.

Repository: UD_Nepali-BK
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.18

License: CC BY-SA 4.0

Genre: fiction, nonfiction

Questions, comments? General annotation questions (either Nepali-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [luigi • talamo (æt) uni-saarland • de]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas annotated manually
UPOS annotated manually, natively in UD style
XPOS not available
Features annotated manually, natively in UD style
Relations annotated manually, natively in UD style

Description

UD_Nepali-BK is a manually annotated Universal Dependencies treebank for Nepali, an Indo-Aryan language written in Devanagari. The treebank contains sentences from a fictional narrative story and an argumentative discourse text, and follows the Universal Dependencies v2 guidelines.

The UD_Nepali-BK treebank consists of Nepali sentences annotated according to the Universal Dependencies guidelines. The data comes from two text types: a fictional narrative story, Bhoot ko Katha, and an argumentative/discussion text titled Adhikar Thulo Ki Kartabya? (अधिकार ठूलो कि कर्तव्य?) from a Grade 10 Nepali school textbook.

The treebank covers constructions such as head-final SOV clause structure, case and postpositional marking, copular clauses, participial modifiers, converbs, coordination, discourse particles, and reported/evidential forms.

The annotation was carried out manually and includes lemmas, universal part-of-speech tags, morphological features, and dependency relations.

Acknowledgments

We would like to thank Jun.-Prof. Dr. Annemarie Verkerk for leading and supervising the project and Luigi Talamo for coordinating the treebank submission, providing guidance during the annotation process, answering queries, and helping with the tests. We also thank Saarland University, Saarland, Germany, for supporting the work on this treebank.

References

Statistics of UD Nepali BK

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTVERBX

Features

AspectCaseEvidentForeignGenderMoodNumberNumTypePersonPolarityPronTypeReflexTenseVerbFormVoice

Relations

aclacl:relcladvcladvmodamodapposauxcaseccccompcompoundcompound:redupconjcopdepdetdiscoursedislocatedflat:nameiobjnmodnmod:possnsubjnummodobjoblparataxispunctreparandumrootxcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview