home edit page issue tracker

This page pertains to UD version 2.

Introduction

These guidelines describe the application of the Stanford Universal Dependency scheme (de Marneffe et al. 2014) to Sahidic Coptic sentences. Where cases are left unspecified, the principles governing the Universal Dependency guidelines for languages other than Coptic may be consulted (see http://universaldependencies.org/).

In general, the attempt has been made to remain ‘lexico-centric’ in preferring lexical items as the heads of sub-trees, i.e. Coptic prepositions are analyzed as ‘case markers’, and the lexical infinitive is seen as the verbal root, with conjugation bases forming auxiliaries.

Some functions from the Universal Dependencies are not used, most notably labels for passives. Although Coptic has some form of passive constructions, they are not easily distinguishable and often ambiguous (actional passive identical with third person active sentence in the absence of a singular agent phrase; stative passive depending on transitivity of the verb for interpretation; see the nsubj label below). In these, and other cases, the realistic demands that accurate automatic parsing must satisfy have played a role in the decision in favor of a simpler analysis.

The dependency guidelines assume part of speech tagging based on the Coptic Scriptorium Guidelines (Zeldes & Schroeder 2016). Please consult the tagging guidelines for background on tagging, as well as tokenization decisions (e.g. portmanteau tokens and tags, such as fused ⲉⲣⲉ for 2nd person feminine singular, etc.; see Section 3). In general, these guidelines have been formed to complement the POS tag’s expressivity, e.g. favoring distinctions that cannot be easily obtained from the POS tags (for example the label amod for the archaic attributive adjectives, which have no distinct POS tag).

List of dependency labels

Label Example
acl – adjunct clause ⲡⲣⲱⲙⲉ ⲉⲧⲥⲱⲧⲙ
advcl – adverbial clause ⲉϥϫⲱ, ϫⲉⲕⲁⲁⲥ ϥⲟⲩⲱϣ
advmod – adverbial modifier ⲟⲛ, ⲕⲁⲗⲱⲥ
amod – adjective modifier ϣⲏⲣⲉ ϣⲏⲙ
appos – appositions ⲡⲉⲕⲉⲓⲱⲧ ⲙⲁⲙⲙⲱⲛⲁⲥ
aux – auxiliary ⲁ, ⲙⲡⲁⲧ, ϣⲁⲣⲉ
case – case marking/preposition ⲥⲱⲧⲡ ⲙⲡⲣⲱⲙⲉ, ⲙⲙⲟϥ, ϩⲙⲡⲏⲓ
cc – coordination ⲁⲩⲱ, ⲙⲛ, ϩⲓ, ⲏ
ccomp – complement clause ⲡⲉϫⲁ ϥ ϫⲉ ⲁ ⲓ ⲥⲱⲧⲙ
compound – nominal or verbal ϣⲃⲣ ⲣϩⲱⲃ
conj – conjunct in coordination ϣⲏⲣⲉ ⲙⲛϣⲉⲉⲣⲉ, ⲁϥⲛⲁⲩ ⲁⲩⲱ ⲁϥⲥⲱⲧⲙ
cop – the copula ⲡⲉ, ⲧⲉ, ⲛⲉ
csubj – clausal subject (fin./inf.) ⲡ ⲉⲧ ϣϣⲉ ⲉ ⲁⲛⲉⲭⲉ ⲙⲙⲟ ⲕ
det – article or other determiner ⲡⲣⲱⲙⲉ, ⲕⲉⲥⲟⲡ, ⲥⲟⲛ ⲛⲓⲙ
discourse – interjections etc. ⲉϩⲉ, ⲟⲩⲟⲉⲓ!
dislocated – extraposed argument ϥ … ⲛϭⲓⲡϩⲗⲗⲟ, ⲡⲣⲱⲙⲉ ⲁϥⲥⲱⲧⲙ
obj – direct object ⲥⲱⲧⲡ ⲙⲡⲣⲱⲙⲉ, ϯⲛⲁϩⲟⲧⲃⲉⲕ
obl - oblique verb modifier ⲟⲩⲏϩ ϩⲙⲡⲏⲓ
iobj – indirect object (possessor) ⲟⲩⲛⲧⲁⲩ ⲁⲡⲟⲧ
mark – clause marker or converter ⲡⲉϫⲁ ϥ ϫⲉ ⲉⲧⲃⲉ ⲟⲩ, ⲉ ⲓ ⲥⲱⲧⲙ ⲉⲣⲟ ⲕ
mwe – multiword expression ⲉⲃⲟⲗ ϩⲛ
name – multi-word name ⲁⲡⲁ ⲡⲁⲡⲛⲟⲩⲧⲉ
neg – negation ⲧⲙ ⲥⲱⲧⲙ, ⲟⲩ ⲣⲱⲙⲉ ⲁⲛ
nmod – nominal modifier ⲟⲩ ⲣⲱⲙⲉ ⲛ ⲁⲡⲓⲥⲧⲟⲥ
nummod – number modifier ϩⲙⲉ ⲛ ϩⲟⲟⲩ
nsubj – nominal subject ⲡⲣⲱⲙⲉ ⲥⲱⲧⲙ, ⲁⲓⲣⲛⲟⲃⲉ
parataxis – loose clausal joint ⲛⲁ ⲉⲡⲓⲥⲧⲟⲗⲏ ⲛⲉ ⲏ ϯ ⲥⲟⲟⲩⲛ ⲁⲛ ϫⲉ ⲕ ⲛⲁ ⲡⲁϩ ⲟⲩ
punct – punctuation · . ,
orphan – dependent of elliptical word ⲙⲡⲓⲥⲱⲧⲙ, ⲁⲗⲗⲁ ⲛⲧⲟϥ
root – root/predicate of utterance ⲥⲱⲧⲙ, ⲛⲁⲛⲟⲩ ϥ, ⲡ ⲣⲣⲟ ⲡⲉ
vocative – appellation ⲡ ⲡⲟⲛⲏⲣⲟⲥ!
xcomp – subjectless clause ⲙⲡ ϥ ⲟⲩⲱϣ ⲉ ⲱⲛϩ

dep – undefined dependency – used when no other definition applies