UD Coptic Bohairic
Language: Coptic (code: cop)
Family: Afro-Asiatic
This treebank has been part of Universal Dependencies since the UD v2.16 release.
The following people have contributed to making this treebank part of UD: Amir Zeldes, Nina Speransky.
Repository: UD_Coptic-Bohairic
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY 4.0
Genre: bible, fiction, nonfiction
Questions, comments? General annotation questions (either Coptic-specific or cross-linguistic) can be raised in the main UD issue tracker. Bugs in this treebank can be reported in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [amir • zeldes (æt) georgetown • edu]. Development of the treebank happens outside the UD repository, so a bug has to be fixed either in the original data source or in the conversion procedure. Do not submit pull requests against the UD repository.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually |
| UPOS | annotated manually in non-UD style, automatically converted to UD |
| XPOS | annotated manually |
| Features | assigned by a program, not checked manually |
| Relations | annotated manually, natively in UD style |
Description
UD_Coptic-Bohairic contains manually annotated Bohairic Coptic texts, including Biblical narrative and poetic texts, epistles, and hagiography.
The Bohairic Coptic Universal Dependency Treebank is a manually annotated corpus of Bohairic Coptic texts, currently containing excerpts from the Bohairic New Testament Gospel of Mark, 1 Corinthians, the Old Testament Book of Habakkuk, lives of Sts. Isaac and Shenoute, and the Coptic version of the Lausiac History. Detailed information about the treebank is available in the reference paper cited below.
The data was either newly digitized or already available in digital form, and was manually annotated for part of speech within the Coptic Scriptorium project. For individual credit and further information see:
http://copticscriptorium.org/
Native Coptic XPOS tags come from the Coptic Scriptorium tag set, which is available from the project and treebank websites.
Acknowledgments
The underlying POS tagged material was produced as part of the project Coptic Scriptorium, funded by the NEH (see http://copticscriptorium.org/ for more details). Treebank annotation was done mainly by Nina Speransky and Amir Zeldes. Thanks are also due to Nicholas Wagner for contributions to the annotation of the underlying texts and their POS and entity tagging.
Statistics of UD Coptic Bohairic
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
Definite – ExtPos – Foreign – Gender – Gender[psor] – Mood – Number – Number[psor] – NumType – Person – Polarity – Poss – PronType – Reflex – VerbForm
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – case – cc – ccomp – conj – cop – csubj – det – discourse – dislocated – expl – fixed – flat – iobj – mark – nmod – nmod:poss – nmod:unmarked – nsubj – nummod – obj – obl – obl:unmarked – orphan – parataxis – punct – reparandum – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 1001 sentences, 15255 tokens and 32723 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus does not contain words that contain both letters and punctuation.
- This corpus contains 10307 multi-word tokens. On average, one multi-word token consists of 2.69 syntactic words.
- There are 5601 types of multi-word tokens. Examples: ⲙⲙⲟⲥ, ⲛⲁϥ, ⲙⲙⲟϥ, ⲉⲑⲟⲩⲁⲃ, ⲛⲱⲟⲩ, ⲉⲣⲟϥ, ⲡⲉϫⲁϥ, ⲙⲙⲱⲟⲩ, ⲛⲧⲉⲫⲛⲟⲩϯ, ⲧⲏⲣⲟⲩ, ⲉⲣⲱⲟⲩ, ⲙⲫⲣⲏϯ, ⲛⲁⲕ, ⲛⲉⲙⲁϥ, ⲁϥⲓ, ⲉϥϫⲱ, ⲛⲏⲓ, ⲙⲫⲛⲟⲩϯ, ⲛϧⲏⲧϥ, ⲡⲭⲣⲓⲥⲧⲟⲥ, ⲙⲡⲁⲓⲣⲏϯ, ⲛϩⲏⲧ, ⲛⲁϥϫⲱ, ⲛⲱⲧⲉⲛ, ⲛⲉⲙⲛⲏ, ⲥⲁⲧⲟⲧϥ, ⲡⲁⲓⲣⲏϯ, ⲁⲥϣⲱⲡⲓ, ⲉⲧⲁϥⲓ, ⲛⲛⲏ, ⲙⲙⲟⲓ, ⲙⲙⲟⲕ, ⲛⲉϩⲟⲟⲩ, ⲛⲧⲟⲧϥ, ⲕⲁⲧⲁⲫⲣⲏϯ, ⲛϧⲏⲧⲟⲩ, ⲡⲓⲟⲩⲁⲓ, ϫⲉⲟⲩ, ϫⲉⲟⲩⲏⲓ, ⲁⲩⲓ, ⲉⲧⲉⲙⲙⲁⲩ, ⲛⲉⲙⲱⲟⲩ, ⲛⲥⲱϥ, ϧⲉⲛⲑⲏⲛⲟⲩ, ⲁϥϣⲉ, ⲉⲧϣⲟⲡ, ⲙⲫⲏ, ⲛⲉⲥⲱⲟⲩ, ⲟⲩⲙⲏϣ, ⲫⲛⲟⲩϯ.
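The counts above follow the CoNLL-U convention in which a multi-word token occupies an ID range (such as `1-2`) and its syntactic words follow on integer-ID lines. The sketch below shows one way such counts could be reproduced. The function names (`conllu_counts`, `words_per_mwt`, `row`) are hypothetical, and the toy sentence uses an assumed segmentation of ⲡⲉϫⲁϥ for illustration; it is not copied from the treebank files.

```python
def conllu_counts(text):
    """Count sentences, surface tokens, syntactic words and multi-word tokens."""
    sentences = tokens = words = mwts = 0
    covered_until = 0        # highest word ID covered by the latest ID range
    in_sentence = False
    for line in text.splitlines():
        if not line.strip():             # a blank line closes the sentence
            in_sentence = False
            continue
        if line.startswith("#"):         # sentence-level comment line
            continue
        if not in_sentence:
            in_sentence = True
            sentences += 1
            covered_until = 0
        tok_id = line.split("\t")[0]
        if "-" in tok_id:                # multi-word token, e.g. "1-2"
            mwts += 1
            tokens += 1
            covered_until = int(tok_id.split("-")[1])
        elif "." in tok_id:              # empty node: neither token nor word
            continue
        else:                            # syntactic word
            words += 1
            if int(tok_id) > covered_until:
                tokens += 1              # a word that is its own surface token
    return {"sentences": sentences, "tokens": tokens,
            "words": words, "mwts": mwts}


def words_per_mwt(tokens, words, mwts):
    """Average number of syntactic words per multi-word token."""
    standalone = tokens - mwts           # tokens that are single words
    return (words - standalone) / mwts


def row(tok_id, form):
    """Build a 10-column CoNLL-U line with unused columns left as '_'."""
    return "\t".join([tok_id, form] + ["_"] * 8)


# Toy sentence (assumed segmentation, for illustration only).
toy = "\n".join([
    "# text = ⲡⲉϫⲁϥ .",
    row("1-2", "ⲡⲉϫⲁϥ"),
    row("1", "ⲡⲉϫⲁ"),
    row("2", "ϥ"),
    row("3", "."),
])

print(conllu_counts(toy))
# Consistency check on the figures reported above.
print(round(words_per_mwt(15255, 32723, 10307), 2))  # 2.69
```

With the reported figures, 15255 - 10307 = 4948 tokens are single words, so the 10307 multi-word tokens account for 32723 - 4948 = 27775 syntactic words, which gives the stated average of 2.69.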
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: INTJ, SYM
- This corpus contains 34 word types tagged as particles (PART): ϩⲁⲣⲁ, ϩⲏⲡⲡⲉ, ϫⲉ, ⲁ, ⲁϩⲏ, ⲁⲙⲏⲛ, ⲁⲙⲟⲓ, ⲁⲛ, ⲁⲣⲉ, ⲅⲁⲣ, ⲅⲉ, ⲇⲉ, ⲉ, ⲉϩⲟⲧⲉ, ⲉⲑⲃⲉ, ⲉⲣⲉ, ⲉⲧ, ⲉⲧⲉ, ⲓⲉ, ⲓⲥ, ⲓⲥϫⲉⲛ, ⲕⲉ, ⲙ, ⲙⲉⲛ, ⲙⲉⲛⲉⲛⲥⲁ, ⲙⲉⲛⲧⲟⲓ, ⲙⲙⲟⲛ, ⲛ, ⲛϫⲉ, ⲟⲩⲛ, ⲟⲩⲟⲓ, ⲣⲱ, ⲥⲉ, ⲱ
- This corpus contains 45 lemmas tagged as pronouns (PRON): ϥ, ϧⲁⲧⲉⲛ_ⲁⲛⲟⲕ, ϧⲉⲛ_ⲁⲛⲟⲕ, ϩⲓⲧⲉⲛ_ⲁⲛⲟⲕ, ϩⲱ_ⲁⲛⲟⲕ, ϩⲱⲱ, ϩⲱⲱ_ⲁⲛⲟⲕ, ⲁϣ, ⲁⲛⲟⲕ, ⲁⲛⲟⲛ, ⲁⲟⲩⲏⲣ, ⲁⲣⲉ_ⲛⲑⲟ, ⲁⲣⲉϣⲁⲛ_ⲁⲛⲟⲕ, ⲁⲣⲉϣⲁⲛ_ⲛⲑⲟϥ, ⲁⲣⲉϣⲁⲛ_ⲛⲑⲟⲕ, ⲁⲣⲉϣⲁⲛ_ⲛⲑⲟⲥ, ⲁⲣⲉϣⲁⲛ_ⲛⲑⲱⲟⲩ, ⲉ_ⲛⲑⲟ, ⲉⲣⲁⲧ_ⲁⲛⲟⲕ, ⲉⲣⲉ_ⲁⲛⲟⲕ, ⲉⲣⲉ_ⲛⲑⲟϥ, ⲉⲣⲉ_ⲛⲑⲟⲕ, ⲉⲣⲉ_ⲛⲑⲟⲥ, ⲉⲣⲉ_ⲛⲑⲱⲟⲩ, ⲑⲛⲁⲩ, ⲑⲱⲛ, ⲓⲉⲣ_ⲁⲛⲟⲕ, ⲓⲥⲁⲁⲕ, ⲙⲁⲩⲁⲧ_ⲁⲛⲟⲕ, ⲙⲙⲁⲩⲁⲧ_ⲁⲛⲟⲕ, ⲛⲑⲟ, ⲛⲑⲟϥ, ⲛⲑⲟⲕ, ⲛⲑⲟⲥ, ⲛⲑⲱⲟⲩ, ⲛⲑⲱⲧⲉⲛ, ⲛⲓⲃⲉⲛ, ⲛⲓⲙ, ⲛⲟⲩϯ, ⲛⲧⲉⲛ_ⲁⲛⲟⲕ, ⲟⲩ, ⲟⲩⲏⲣ, ⲡⲉ, ⲡⲟⲩ, ⲥ
- This corpus contains 35 lemmas tagged as determiners (DET): ϯ, ⲁⲡⲁ, ⲕⲉ, ⲛⲉⲥ, ⲛⲉⲧⲉⲛ, ⲛⲑⲟϥ, ⲛⲑⲱⲟⲩ, ⲛⲑⲱⲧⲉⲛ, ⲛⲟⲩϫ, ⲛⲧⲟϥ, ⲟⲩ, ⲡ, ⲡⲁ, ⲡⲁⲓ, ⲡⲁⲧϣⲉⲗⲉⲧ, ⲡⲉ, ⲡⲉϥ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲥ, ⲡⲉⲧⲉⲛ, ⲡⲉⲧⲛ, ⲡⲓ, ⲡⲟⲩ, ⲣⲱⲙⲓ, ⲧⲁⲓ, ⲧⲟⲩ, ⲫⲁ, ⲫⲁⲓ, ⲫⲏ, ⲫⲱϥ, ⲫⲱⲕ, ⲫⲱⲟⲩ, ⲫⲱⲧⲉⲛ, ⲫⲱⲧⲉⲧⲉⲛ
- Out of the above, 6 lemmas occurred sometimes as PRON and sometimes as DET: ⲛⲑⲟϥ, ⲛⲑⲱⲟⲩ, ⲛⲑⲱⲧⲉⲛ, ⲟⲩ, ⲡⲉ, ⲡⲟⲩ
- This corpus contains 18 lemmas tagged as auxiliaries (AUX): ϣ, ϣⲁⲣⲉ, ϣⲁⲧⲉ, ⲁ, ⲁⲣⲉϣⲁⲛ, ⲉⲣⲉ, ⲉⲧ, ⲙⲁⲣⲉ, ⲙⲙⲟⲛ, ⲙⲡⲁⲣⲉ, ⲙⲡⲁⲧⲉ, ⲙⲡⲉ, ⲙⲡⲉⲛⲑⲣⲉ, ⲛⲁ, ⲛⲁⲣⲉ, ⲛⲛⲉ, ⲛⲧⲉ, ⲟⲩⲟⲛ
- Out of the above, 3 lemmas occurred sometimes as AUX and sometimes as VERB: ⲙⲙⲟⲛ, ⲛⲁ, ⲟⲩⲟⲛ
- There are 2 (de)verbal forms (values of VerbForm):
- Fin
- AUX: ϣ, ⲉϣ
- VERB: ⲓ, ϣⲱⲡⲓ, ϫⲱ, ⲟⲩⲁⲃ, ⲛⲁⲩ, ⲡⲉϫⲁ, ϣⲉ, ⲥⲱⲧⲉⲙ, ⲟⲓ, ⲉⲣ
- Inf
- VERB: ⲉⲣ, ⲛⲁⲩ, ϯ, ϣⲱⲡⲓ, ϭⲓ, ϫⲟ, ⲉⲙⲓ, ⲥⲁϫⲓ, ⲥⲱⲧⲉⲙ, ϧⲟⲑⲃⲉ
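The overlap statements above (lemmas that occur sometimes as PRON and sometimes as DET, or sometimes as AUX and sometimes as VERB) amount to set intersections over the LEMMA and UPOS columns. A minimal sketch, assuming CoNLL-U input, is given below; `lemmas_by_upos`, `shared_lemmas` and `row` are hypothetical helper names, and the toy rows only reuse lemma and tag pairs listed on this page.

```python
from collections import defaultdict


def lemmas_by_upos(text):
    """Map each UPOS tag to the set of lemmas that carry it."""
    by_upos = defaultdict(set)
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue                     # skip ID ranges and empty nodes
        by_upos[cols[3]].add(cols[2])    # column 4 = UPOS, column 3 = LEMMA
    return by_upos


def shared_lemmas(by_upos, tag_a, tag_b):
    """Lemmas attested with both tags, sorted for stable output."""
    return sorted(by_upos[tag_a] & by_upos[tag_b])


def row(tok_id, lemma, upos):
    """10-column CoNLL-U line; FORM is set equal to LEMMA in this toy data."""
    return "\t".join([tok_id, lemma, lemma, upos] + ["_"] * 6)


toy = "\n".join([
    row("1", "ⲡⲉ", "PRON"),
    row("2", "ⲡⲉ", "DET"),
    row("3", "ⲟⲩⲟⲛ", "AUX"),
    row("4", "ⲟⲩⲟⲛ", "VERB"),
    row("5", "ⲁⲛⲟⲕ", "PRON"),
])

table = lemmas_by_upos(toy)
print(shared_lemmas(table, "PRON", "DET"))
print(shared_lemmas(table, "AUX", "VERB"))
```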
Nominal Features
- Gender
- Fem
- DET: ϯ, ⲧ, ⲧⲉϥ, ⲧⲉⲕ, ⲧⲁ, ⲑⲏ, ⲧⲁⲓ, ⲑ, ⲑⲁⲓ, ⲧⲉⲥ
- PRON: ⲥ, ⲧⲉ, ⲉ, ⲉⲥⲉ, ⲁⲥϣⲁⲛ, ⲓ, ϥ, ⲁⲣⲉ, ⲉⲣⲟ, ⲧⲉⲣ
- Masc
- DET: ⲡⲓ, ⲡ, ⲫ, ⲡⲉϥ, ⲫⲏ, ⲡⲁⲓ, ⲡⲁ, ⲫⲁⲓ, ⲡⲉⲕ, ⲡⲟⲩ
- PRON: ϥ, ⲕ, ⲡⲉ, ⲛⲑⲟϥ, ⲉϥⲉ, ⲡ, ⲭ, ⲛⲑⲟⲕ, ⲁϥϣⲁⲛ, ⲉⲕⲉ
- Number
- Plur
- DET: ⲛⲓ, ⲛⲏ, ⲛⲉϥ, ⲛⲁⲓ, ⲛⲁ, ⲛⲟⲩ, ⲛⲉⲕ, ⲛ, ⲛⲉⲛ, ⲛⲉⲧⲉⲛ
- PRON: ⲩ, ⲟⲩ, ⲧⲉⲛ, ⲥⲉ, ⲛ, ⲑⲏⲛⲟⲩ, ⲧⲉⲧⲉⲛ, ⲛⲉ, ⲉⲩⲉ, ⲛⲑⲱⲧⲉⲛ
- Sing
- DET: ⲡⲓ, ⲟⲩ, ⲡ, ⲫ, ϯ, ϩⲁⲛ, ⲡⲉϥ, ⲫⲏ, ⲡⲁⲓ, ⲡⲁ
- PRON: ϥ, ⲥ, ⲓ, ⲕ, ⲡⲉ, ϯ, ⲁⲛⲟⲕ, ⲛⲑⲟϥ, ⲉϥⲉ, ⲡ
- Definite
- Def
- DET: ⲡⲓ, ⲛⲓ, ⲡ, ⲫ, ϯ, ⲛⲏ, ⲡⲉϥ, ⲫⲏ, ⲡⲁⲓ, ⲛⲉϥ
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ
- PRON: ϥ, ⲟⲩ, ⲩ, ⲥ, ⲓ, ⲕ, ⲧⲉⲛ, ϯ, ⲛ, ⲥⲉ
- Ind
- DET: ⲟⲩ, ϩⲁⲛ, ⲩ
Degree and Polarity
- Polarity
- Neg
- ADV: ⲁⲛ, ⲛ, ϣⲧⲉⲙ, ⲙⲡⲉⲣ, ⲙⲡⲣ, ⲟⲩ, ⲙⲫⲏ, ⲟⲩⲕ, ⲟⲩⲭⲓ
- AUX: ⲙⲡⲉ, ⲙⲡ, ⲙⲡⲁ, ⲛⲛⲉ, ⲙⲡⲉⲛⲑⲣⲉ, ⲙⲙⲟⲛ, ⲙⲡⲁⲧⲉ, ⲙⲡⲁⲣⲉ, ⲛⲛ
- CCONJ: ⲟⲩⲇⲉ
- PART: ⲙⲙⲟⲛ
- SCONJ: ⲟⲩⲇⲉ
- VERB: ⲙⲙⲟⲛ, ⲙⲙⲟⲛⲧⲉ, ⲙⲙⲟⲛⲧ, ⲙⲙⲟⲛⲛⲧⲁ, ⲙⲙⲟⲛⲧⲱ
- X: ⲟⲩ
Verbal Features
- Mood
- Cnd
- VERB-Fin: ϫⲟ, ϣⲱⲡⲓ, ϫⲉⲙϩⲏⲟⲩ, ϭⲓ, ⲓ, ⲥⲱⲧⲉⲙ, ⲫⲟϩ, ϩⲉⲛ, ϫⲟⲟ, ϭⲓϩⲟ
- Imp
- VERB-Fin: ⲙⲁϣⲉ, ⲁⲛⲁⲩ, ⲙⲁ, ⲁⲙⲟⲩ, ϣⲱⲡⲓ, ⲁⲙⲱⲓⲛⲓ, ⲁⲛⲓⲧ, ϫⲟϫ, ⲁϫⲟ, ⲁⲣⲓⲉⲙⲓ
- Ind
- VERB: ⲓ, ϣⲱⲡⲓ, ϫⲱ, ⲟⲩⲁⲃ, ⲛⲁⲩ, ⲡⲉϫⲁ, ϣⲉ, ⲑⲣⲉ, ⲉⲣ, ⲟⲓ
- VERB-Fin: ⲓ, ϣⲱⲡⲓ, ϫⲱ, ⲟⲩⲁⲃ, ⲛⲁⲩ, ⲡⲉϫⲁ, ϣⲉ, ⲟⲓ, ⲉⲣ, ⲥⲱⲧⲉⲙ
- VERB-Inf: ⲉⲣ, ⲛⲁⲩ, ϯ, ϣⲱⲡⲓ, ϭⲓ, ϫⲟ, ⲉⲙⲓ, ⲥⲁϫⲓ, ⲥⲱⲧⲉⲙ, ϧⲟⲑⲃⲉ
- Jus
- VERB-Fin: ϣⲱⲡⲓ, ⲭⲁ, ⲙⲟϣⲓ, ⲥⲱⲧⲉⲙ, ϣⲟⲩϣⲟⲩ, ϭⲓ, ϯ, ⲙⲟⲩⲛⲕ, ⲟϩⲓ, ϣⲉⲛ
- Opt
- VERB-Fin: ϣⲱⲡⲓ, ⲓ, ϯ, ⲛⲁⲩ, ϩⲟⲃⲥ, ⲉⲙⲓ, ⲉⲣ, ⲑⲱⲟⲩϯ, ⲙⲟϣⲓ, ⲟⲩⲛⲟϥ
- Pot
- AUX-Fin: ϣ, ⲉϣ
Pronouns, Determiners, Quantifiers
- PronType
- Art
- DET: ⲡⲓ, ⲟⲩ, ⲛⲓ, ⲡ, ⲫ, ϯ, ϩⲁⲛ, ⲕⲉ, ⲧ, ⲩ
- Dem
- DET: ⲛⲏ, ⲫⲏ, ⲡⲁⲓ, ⲫⲁⲓ, ⲛⲁⲓ, ⲑⲏ, ⲧⲁⲓ, ⲑⲁⲓ, ⲛⲓ, ⲡⲓ
- PRON: ⲡⲉ, ⲛⲉ, ⲡ, ⲧⲉ
- Ind
- PRON: ⲛⲓⲃⲉⲛ, ⲟⲩ
- Int
- ADV: ⲑⲱⲛ, ⲁⲟⲩⲏⲣ, ⲧⲱⲛ, ⲡⲱⲥ
- PRON: ⲟⲩ, ⲛⲓⲙ, ⲟⲩⲏⲣ, ⲁϣ, ⲑⲱⲛ, ⲑⲛⲁⲩ, ⲁⲟⲩⲏⲣ, ⲛⲑⲟⲕ
- Prs
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲡⲁ, ⲧⲉϥ, ⲛⲁ, ⲡⲉⲕ, ⲡⲟⲩ, ⲡⲉⲛ, ⲛⲟⲩ, ⲧⲉⲕ
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ
- PRON: ϥ, ⲟⲩ, ⲩ, ⲥ, ⲓ, ⲕ, ⲧⲉⲛ, ϯ, ⲛ, ⲥⲉ
- Rcp
- NOUN: ⲉⲣⲏⲟⲩ
- Tot
- NOUN: ⲧⲏⲣ
- NumType
- Card
- NUM: ⲟⲩⲁⲓ, ⲃ, ⲓⲃ, ⲅ, ⲟⲩⲓ, ϣⲟ, ⲍ, ⲣ, ⲉ, ⲋ
- Poss
- Yes
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲡⲁ, ⲧⲉϥ, ⲛⲁ, ⲡⲉⲕ, ⲡⲟⲩ, ⲡⲉⲛ, ⲛⲟⲩ, ⲧⲉⲕ
- PRON: ϥ, ⲟⲩ, ⲥ, ⲕ, ⲓ, ⲛ, ⲧⲉⲛ, ⲧ, ⲩ
- Reflex
- Yes
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ
- Person
- 1
- DET: ⲡⲁ, ⲡⲉⲛ, ⲛⲁ, ⲧⲁ, ⲛⲉⲛ, ⲧⲉⲛ
- PRON: ⲓ, ϯ, ⲛ, ⲁⲛⲟⲕ, ⲧⲉⲛ, ⲁⲛⲟⲛ, ⲧⲁ, ⲧ, ϩⲱ, ϧⲁⲧⲟⲧ
- 2
- DET: ⲡⲉⲕ, ⲡⲟⲩ, ⲛⲟⲩ, ⲧⲉⲕ, ⲛⲉⲕ, ⲛⲉⲧⲉⲛ, ⲡⲉⲧⲉⲛ, ⲧⲟⲩ, ⲛⲟⲩⲧⲉⲛ, ⲛⲟⲩⲕ
- PRON: ⲕ, ⲧⲉⲛ, ⲑⲏⲛⲟⲩ, ⲧⲉⲧⲉⲛ, ⲛⲑⲱⲧⲉⲛ, ⲭ, ⲛⲑⲟⲕ, ⲉⲕⲉ, ⲁⲕϣⲁⲛ, ⲉ
- 3
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲥ, ⲧⲉⲥ, ⲛⲟⲩϥ, ⲑⲱϥ, ⲛⲟⲩⲟⲩ
- PRON: ϥ, ⲩ, ⲟⲩ, ⲥ, ⲥⲉ, ⲛⲑⲟϥ, ⲉϥⲉ, ⲉⲩⲉ, ⲛⲑⲱⲟⲩ, ⲁϥϣⲁⲛ
- Gender[psor]
- Fem
- DET: ⲡⲟⲩ, ⲛⲟⲩ, ⲡⲉⲥ, ⲧⲉⲥ, ⲧⲟⲩ
- Masc
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲛⲟⲩϥ, ⲛⲟⲩⲕ, ⲡⲉⲥ, ⲑⲱϥ
- Number[psor]
- Plur
- DET: ⲡⲉⲛ, ⲛⲉⲛ, ⲛⲉⲧⲉⲛ, ⲡⲉⲧⲉⲛ, ⲛⲟⲩⲧⲉⲛ, ⲧⲉⲛ, ⲧⲉⲧⲉⲛ, ⲛⲉⲧⲛ, ⲛⲟⲩⲟⲩ
- Sing
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲡⲁ, ⲧⲉϥ, ⲡⲉⲕ, ⲡⲟⲩ, ⲛⲟⲩ, ⲛⲁ, ⲧⲉⲕ, ⲛⲉⲕ
Other Features
- ExtPos
- ADP
- ADP: ⲉⲃⲏⲗ
- ADV: ⲉⲃⲟⲗ, ⲉϧⲟⲩⲛ, ⲉⲃⲏⲗ, ⲥⲁⲃⲟⲗ, ⲛϧⲣⲏⲓ, ϣⲁⲉϧⲟⲩⲛ, ⲥⲁϧⲟⲩⲛ, ⲉϧⲣⲏⲓ, ⲛϩⲣⲏⲓ
- SCONJ
- ADP: ⲉⲑⲃⲉ
- ADV: ⲉⲃⲏⲗ
- Foreign
- Yes
- ADJ: ϩⲁⲅⲓⲟⲥ, ϩⲟⲥⲟⲛ
- ADP: ⲕⲁⲧⲁ, ⲡⲁⲣⲁ, ⲡⲣⲟⲥ, ⲭⲱⲣⲓⲥ, ⲁⲡⲁ, ⲙⲉⲛ
- ADV: ⲧⲟⲧⲉ, ⲙⲁⲗⲗⲟⲛ, ⲕⲁⲗⲱⲥ, ⲡⲁⲗⲓⲛ, ⲡⲱⲥ, ⲗⲟⲓⲡⲟⲛ, ϩⲏⲇⲏ, ⲉⲧⲓ, ϩⲟⲗⲱⲥ, ⲁⲗⲏⲑⲱⲥ
- CCONJ: ⲁⲗⲗⲁ, ⲟⲩⲇⲉ, ⲓⲧⲉ, ⲙⲏ, ⲕⲁⲓ, ⲕⲁⲛ, ϩⲟⲥⲟⲛ, ϩⲱⲥ, ⲙⲏⲧⲓ, ⲟⲩⲭⲓ
- DET: ⲁⲡⲁ
- NOUN: ⲁⲡⲁ, ⲭⲣⲓⲥⲧⲟⲥ, ⲡⲛⲉⲩⲙⲁ, ⲙⲁⲑⲏⲧⲏⲥ, ⲕⲟⲥⲙⲟⲥ, ⲥⲱⲙⲁ, ⲯⲩⲭⲏ, ⲇⲁⲓⲙⲱⲛ, ⲁⲣⲉⲧⲏ, ⲡⲣⲟⲫⲏⲧⲏⲥ
- NUM: ⲇ
- PART: ⲇⲉ, ⲅⲁⲣ, ⲟⲩⲛ, ⲙⲉⲛ, ⲱ, ⲁⲙⲏⲛ, ⲟⲩⲟⲓ, ϩⲁⲣⲁ, ⲁⲛ, ⲅⲉ
- PRON: ⲓⲥⲁⲁⲕ
- PROPN: ⲓⲏⲥⲟⲩⲥ, ⲓⲱⲁⲛⲛⲏⲥ, ⲓⲥⲁⲁⲕ, ⲏⲗⲓⲁⲥ, ⲓⲁⲕⲱⲃⲟⲥ, ⲡⲉⲧⲣⲟⲥ, ⲡⲁⲩⲗⲟⲥ, ⲅⲁⲗⲓⲗⲉⲁ, ⲥⲁⲧⲁⲛⲁⲥ, ⲥⲓⲙⲱⲛ
- SCONJ: ϩⲓⲛⲁ, ϩⲱⲥⲧⲉ, ϩⲱⲥ, ⲉⲡⲓⲇⲏ, ⲕⲁⲛ, ⲙⲏⲡⲱⲥ, ϩⲟⲡⲱⲥ, ϩⲟⲧⲉ, ⲉⲡⲉⲓⲇⲏ, ϩⲏⲇⲏ
- VERB-Fin: ⲉⲣⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉⲥⲑⲉ, ⲉⲣⲭⲣⲓⲁ, ⲉⲣⲁⲛⲁⲭⲱⲣⲓⲛ, ⲉⲣϩⲉⲗⲡⲓⲥ, ⲉⲣⲉⲧⲓⲛ, ⲉⲣⲁⲡⲁⲛⲧⲁⲛ, ⲉⲣⲉⲡⲓⲧⲓⲙⲁⲛ, ⲉⲣⲛⲏⲥⲧⲉⲩⲓⲛ, ⲉⲣⲡⲣⲟⲕⲟⲡⲧⲉⲓⲛ, ⲧⲟⲓ
- VERB-Inf: ϩⲓⲇⲁⲓⲙⲱⲛ, ⲉⲣⲁⲡⲁⲛⲧⲁⲛ, ⲉⲣⲇⲓⲁⲕⲣⲓⲛⲓⲛ
- X: ⲉⲡⲫⲁⲑⲁ, ⲕⲉ, ⲕⲟⲩⲙ, ⲙⲟⲛⲟⲛ, ⲣⲁⲃⲃⲓ, ⲧⲁⲩⲧⲁ, ⲟⲩ
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemma as a copula (cop): ⲡⲉ.
- This corpus uses 18 lemmas as auxiliaries (aux). Examples: ⲁ, ⲛⲁⲣⲉ, ⲛⲧⲉ, ⲛⲁ, ⲉⲧ, ⲙⲡⲉ, ϣⲁⲣⲉ, ⲙⲁⲣⲉ, ϣ, ϣⲁⲧⲉ, ⲙⲡⲁⲣⲉ, ⲛⲛⲉ, ⲙⲡⲉⲛⲑⲣⲉ, ⲙⲙⲟⲛ, ⲁⲣⲉϣⲁⲛ, ⲙⲡⲁⲧⲉ, ⲟⲩⲟⲛ, ⲉⲣⲉ.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (94)
- VERB--PRON (13)
- VERB-Fin--NOUN (282)
- VERB-Fin--PRON (2611)
- obj
- VERB--NOUN (9)
- VERB--NOUN-ADP(ⲛ) (4)
- VERB--PRON (32)
- VERB--PRON-ADP(ⲛ) (1)
- VERB-Fin--NOUN (155)
- VERB-Fin--NOUN-ADP(ⲛ) (266)
- VERB-Fin--NOUN-ADP(ⲛ)-ADP(ⲛ) (4)
- VERB-Fin--NOUN-ADP(ⲥⲁⲃⲟⲗ) (2)
- VERB-Fin--PRON (385)
- VERB-Fin--PRON-ADP(ⲛ) (149)
- VERB-Fin--PRON-ADP(ⲣⲁⲛ) (1)
- VERB-Inf--NOUN (19)
- VERB-Inf--NOUN-ADP(ⲛ) (13)
- VERB-Inf--PRON (22)
- VERB-Inf--PRON-ADP(ⲛ) (1)
- iobj
- VERB--PRON (18)
- VERB--PRON-ADP(ⲛ) (3)
- VERB-Fin--NOUN (2)
- VERB-Fin--PRON (10)
- VERB-Fin--PRON-ADP(ⲛ) (1)
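The pattern labels above combine the parent's tag and VerbForm, the child's tag, and any adposition attached to the child. How the statistics script builds them is not documented here, so the sketch below is only one plausible reconstruction: it assumes the `ADP(...)` suffix comes from `case` dependents of the child and uses their lemmas. The function names and the toy sentence are invented for illustration.

```python
from collections import Counter


def verb_form(feats):
    """Return '-Fin', '-Inf', ... from a FEATS string, or '' if VerbForm is absent."""
    for pair in feats.split("|"):
        if pair.startswith("VerbForm="):
            return "-" + pair.split("=", 1)[1]
    return ""


def argument_patterns(text, relations=("nsubj", "obj", "iobj")):
    """Count (relation, 'PARENT--CHILD') labels for verb to noun/pronoun arcs."""
    counts = Counter()
    for block in text.strip().split("\n\n"):       # one block per sentence
        words = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or not cols[0].isdigit():
                continue                           # comments, ranges, empty nodes
            words[int(cols[0])] = {
                "lemma": cols[2], "upos": cols[3], "feats": cols[5],
                "head": int(cols[6]) if cols[6].isdigit() else None,
                "deprel": cols[7],
            }
        for wid, word in words.items():
            parent = words.get(word["head"])
            if parent is None or word["deprel"] not in relations:
                continue
            if parent["upos"] != "VERB" or word["upos"] not in ("NOUN", "PRON"):
                continue
            # Assumption: adpositions are the child's own 'case' dependents.
            cases = [w["lemma"] for _, w in sorted(words.items())
                     if w["head"] == wid and w["deprel"] == "case"
                     and w["upos"] == "ADP"]
            child = word["upos"] + "".join(f"-ADP({c})" for c in cases)
            label = f"VERB{verb_form(parent['feats'])}--{child}"
            counts[(word["deprel"], label)] += 1
    return counts


def row(tok_id, lemma, upos, feats, head, deprel):
    """10-column CoNLL-U line; FORM is set equal to LEMMA in this toy data."""
    return "\t".join([tok_id, lemma, lemma, upos, "_", feats, head, deprel, "_", "_"])


# Invented toy sentence, not taken from the treebank.
toy = "\n".join([
    row("1", "ⲁ", "AUX", "_", "3", "aux"),
    row("2", "ϥ", "PRON", "_", "3", "nsubj"),
    row("3", "ϭⲓ", "VERB", "VerbForm=Fin", "0", "root"),
    row("4", "ⲛ", "ADP", "_", "6", "case"),
    row("5", "ⲟⲩ", "DET", "_", "6", "det"),
    row("6", "ⲡⲛⲉⲩⲙⲁ", "NOUN", "_", "3", "obj"),
])

patterns = argument_patterns(toy)
for (relation, label), n in sorted(patterns.items()):
    print(relation, label, n)
```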
Relations Overview
- This corpus uses 4 relation subtypes: acl:relcl, nmod:poss, nmod:unmarked, obl:unmarked
- The following 5 relation types are not used in this corpus at all: clf, compound, list, goeswith, dep
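Both statements can be checked against the relation inventory listed near the top of this page. The sketch below treats any label containing a colon as a subtype and compares the remaining base labels with the 37 universal relations of UD v2; the variable names are illustrative.

```python
# The 37 universal dependency relations of UD v2.
UNIVERSAL_RELATIONS = {
    "acl", "advcl", "advmod", "amod", "appos", "aux", "case", "cc", "ccomp",
    "clf", "compound", "conj", "cop", "csubj", "dep", "det", "discourse",
    "dislocated", "expl", "fixed", "flat", "goeswith", "iobj", "list", "mark",
    "nmod", "nsubj", "nummod", "obj", "obl", "orphan", "parataxis", "punct",
    "reparandum", "root", "vocative", "xcomp",
}

# Relations used in this corpus, as listed in the Relations overview.
used = """
acl acl:relcl advcl advmod amod appos aux case cc ccomp conj cop csubj det
discourse dislocated expl fixed flat iobj mark nmod nmod:poss nmod:unmarked
nsubj nummod obj obl obl:unmarked orphan parataxis punct reparandum root
vocative xcomp
""".split()

# A subtype is any label of the form base:subtype.
subtypes = sorted(r for r in used if ":" in r)
# Base relations that appear at least once, with or without a subtype.
used_bases = {r.split(":")[0] for r in used}
unused = sorted(UNIVERSAL_RELATIONS - used_bases)

print(subtypes)  # ['acl:relcl', 'nmod:poss', 'nmod:unmarked', 'obl:unmarked']
print(unused)    # ['clf', 'compound', 'dep', 'goeswith', 'list']
```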