home edit page issue tracker

This page pertains to UD version 2.

UD Coptic Bohairic

Language: Coptic (code: cop)
Family: Afro-Asiatic

This treebank has been part of Universal Dependencies since the UD v2.16 release.

The following people have contributed to making this treebank part of UD: Amir Zeldes, Nina Speransky.

Repository: UD_Coptic-Bohairic
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17

License: CC BY 4.0

Genre: bible, fiction, nonfiction

Questions, comments? General annotation questions (either Coptic-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [amir • zeldes (æt) georgetown • edu]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually
Features assigned by a program, not checked manually
Relations annotated manually, natively in UD style

Description

UD_Coptic-Bohairic contains manually annotated Bohairic Coptic texts, including Biblical narrative and poetic texts, epistles, and hagiography.

The Bohairic Coptic Universal Dependency Treebank is a manually annotated corpus of Bohairic Coptic texts, currently containing excerpts from the Bohairic New Testament Gospel of Mark, 1 Corinthians, the Old Testament Book of Habakkuk, lives of Sts. Isaac and Shenoute, and the Coptic version of the Lausiac History. Detailed information about the treebank is available in the reference paper cited below.

The data was digitized or previously available in digital format, and annotated manually for part of speech in the project Coptic Scriptorium. For individual credit and further information see:

http://copticscriptorium.org/

Native Coptic XPOS tags come from the Coptic Scriptorium tag set, which is available from the project and treebank websites.

Acknowledgments

The underlying POS tagged material was produced as part of the project Coptic Scriptorium, funded by the NEH (see http://copticscriptorium.org/ for more details). Treebank annotation was done mainly by Nina Speransky and Amir Zeldes. Thanks are also due to Nicholas Wagner for contributions to the annotation of the underlying texts and their POS and entity tagging.

Statistics of UD Coptic Bohairic

POS Tags

ADJADPADVAUXCCONJDETNOUNNUMPARTPRONPROPNPUNCTSCONJVERBX

Features

DefiniteExtPosForeignGenderGender[psor]MoodNumberNumber[psor]NumTypePersonPolarityPossPronTypeReflexVerbForm

Relations

aclacl:relcladvcladvmodamodapposauxcaseccccompconjcopcsubjdetdiscoursedislocatedexplfixedflatiobjmarknmodnmod:possnmod:unmarkednsubjnummodobjoblobl:unmarkedorphanparataxispunctreparandumrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview