UD Coptic Scriptorium
Language: Coptic (code: cop)
Family: Afro-Asiatic
This treebank has been part of Universal Dependencies since the UD v1.4 release.
The following people have contributed to making this treebank part of UD: Mitchell Abrams, Elizabeth Davidson, Amir Zeldes.
Repository: UD_Coptic-Scriptorium
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.18
License: CC BY 4.0
Genre: bible, fiction, nonfiction
Questions, comments? General annotation questions (either Coptic-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [amir • zeldes (æt) georgetown • edu]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually |
| UPOS | annotated manually in non-UD style, automatically converted to UD |
| XPOS | annotated manually |
| Features | assigned by a program, not checked manually |
| Relations | annotated manually, natively in UD style |
Description
UD Coptic contains manually annotated Sahidic Coptic texts, including Biblical texts, sermons, letters, and hagiography.
The Coptic Universal Dependency Treebank is a manually annotated corpus of Sahidic Coptic texts, currently containing excerpts from the Sahidic New Testament Gospel of Mark and 1 Corinthians, the Old Testament Book of Ruth, Works by Archmandrite Shenoute of Atripe, the Letters of Besa, lives of Sts. Cyrus, Onnophrius and John the Kalybites, Epistle of Pseudo-Ephrem, homilies and discourses by Proclus, Pseudo-Athanasius and Pseudo-Flavianus, the Dormition of John the Apostle and short stories from the Apophthegmata Patrum (Sayings of the Desert Fathers). Detailed information about the treebank is available here:
http://copticscriptorium.org/treebank.html
The data was digitized or previously available in digital format, and annotated manually for part of speech in the project Coptic Scriptorium. For individual credit and further information see:
http://copticscriptorium.org/
Coptic POS tags come from the Coptic Scriptorium tag set, which is available from the project and treebank websites.
Acknowledgments
The underlying POS tagged material was produced as part of the projects Coptic Scriptorium, KOMeT and KELLIA, funded by the NEH in the USA and BMBF and DFG in Germany (see http://copticscriptorium.org/ for more details). Treebank annotation was done mainly by Mitchell Abrams, Liz Davidson and Amir Zeldes. Thanks are also due to Israel Avrahamy, Asael Benyami, Yinon Kahan and Oran Szachter for their contributions.
Statistics of UD Coptic Scriptorium
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
Definite – Emph – ExtPos – Foreign – Gender – Gender[psor] – Mood – Number – Number[psor] – NumType – Person – Polarity – Poss – PronType – Reflex – VerbForm
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – case – cc – ccomp – compound – conj – cop – csubj – dep – det – discourse – dislocated – expl – fixed – flat – iobj – mark – nmod – nmod:poss – nmod:unmarked – nsubj – nummod – obj – obl – obl:unmarked – orphan – parataxis – punct – reparandum – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 2252 sentences, 27693 tokens and 58974 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 11 types of words that contain both letters and punctuation. Examples: ·ⲻ, .....ⲟ..., [....]ⲥ, [...]ϥ, ϩⲏ[..]ⲉ, ⲉ......ⲙ...., ⲉ[.....], ⲉⲃ[........], ⲟⲩⲇ[.......], ⲡ[…]ⲡⲟⲥ, ⲡⲁ[…]ϥⲟϭⲉ
- This corpus contains 17419 multi-word tokens. On average, one multi-word token consists of 2.80 syntactic words.
- There are 9276 types of multi-word tokens. Examples: ⲛⲁϥ, ⲙⲙⲟⲥ, ⲙⲙⲟϥ, ⲉⲣⲟϥ, ⲡⲉϫⲁϥ, ⲙⲡⲛⲟⲩⲧⲉ, ⲛⲁⲩ, ⲛⲧϩⲉ, ⲛⲁⲓ, ⲙⲙⲟⲟⲩ, ⲧⲏⲣⲟⲩ, ⲉϥϫⲱ, ⲛⲁⲕ, ⲛϩⲏⲧϥ, ⲉⲣⲟⲓ, ⲉⲣⲟⲟⲩ, ⲉⲧⲙⲙⲁⲩ, ⲙⲙⲟⲕ, ⲛⲏⲧⲛ, ⲉⲧⲟⲩⲁⲁⲃ, ⲛⲁⲥ, ⲙⲡϫⲟⲉⲓⲥ, ⲛϩⲏⲧ, ⲉⲣⲟⲕ, ⲛⲧⲉⲩⲛⲟⲩ, ⲛⲙⲙⲁϥ, ⲛⲣⲱⲙⲉ, ⲛⲧⲉⲓϩⲉ, ⲁϥⲉⲓ, ⲛⲁⲛ, ⲡⲛⲟⲩⲧⲉ, ⲁϥⲃⲱⲕ, ⲉⲧⲃⲉⲡⲁⲓ, ⲛⲟⲩⲱⲧ, ⲛϩⲟⲩⲟ, ⲧⲏⲣϥ, ⲉⲣⲟⲛ, ⲙⲙⲟⲓ, ⲙⲡⲣⲱⲙⲉ, ⲛϩⲟⲟⲩ, ⲛⲥⲱϥ, ⲡⲉⲭⲣⲓⲥⲧⲟⲥ, ⲡⲟⲩⲁ, ⲛϩⲏⲧⲟⲩ, ⲁⲩⲉⲓ, ⲉⲣⲱⲧⲛ, ⲛⲧⲉⲡⲛⲟⲩⲧⲉ, ⲡϫⲟⲉⲓⲥ, ⲁϥϫⲟⲟⲥ, ⲉⲡⲉⲥⲏⲧ.
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: INTJ, SYM
- This corpus contains 42 word types tagged as particles (PART): ϩⲁⲙⲏⲛ, ϩⲏⲏⲛⲉ, ϩⲏⲏⲧⲉ, ϩⲛ, ϭⲉ, ⲁϩⲉ, ⲁϫⲛ, ⲁⲛⲧⲓ, ⲁⲣⲁ, ⲅⲁⲣ, ⲇⲉ, ⲉ, ⲉϩⲉ, ⲉⲓⲉ, ⲉⲓⲥ, ⲉⲛⲉ, ⲉⲛⲧ, ⲉⲣⲉ, ⲉⲧⲃⲉ, ⲙ, ⲙⲉⲛ, ⲙⲙⲟ, ⲙⲙⲟⲛ, ⲙⲛ, ⲙⲛⲛⲥⲁ, ⲛ, ⲛϭⲉ, ⲛϭⲓ, ⲛⲁ, ⲛⲉ, ⲛⲧ, ⲛⲧⲉ, ⲟⲩⲇⲉ, ⲟⲩⲛ, ⲟⲩⲟⲉⲓ, ⲟⲩⲟⲓ, ⲡⲉ, ⲣⲱ, ⲥⲉ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲱ
- This corpus contains 64 lemmas tagged as pronouns (PRON): ϩⲁ_ⲛⲧⲟ, ϩⲁϩⲧⲛ, ϩⲓϫⲛ_ⲛⲧⲟ, ϩⲓⲧⲛ_ⲁⲛⲟⲕ, ϩⲛ_ⲁⲛⲟⲕ, ϩⲱ_ⲁⲛⲟⲕ, ϩⲱⲱ_ⲁⲛⲟⲕ, ⲁ, ⲁ_ⲛⲧⲟ, ⲁϣ, ⲁⲛⲟⲕ, ⲁⲛⲟⲕ_ⲛⲧⲉ, ⲁⲛⲟⲛ, ⲁⲟⲩⲏⲣ, ⲅ, ⲉ_ⲛⲧⲟ, ⲉϫⲛ_ⲛⲧⲟ, ⲉⲓ, ⲉⲕⲉ, ⲉⲣϣⲁⲛ_ⲁⲛⲟⲕ, ⲉⲣϣⲁⲛ_ⲁⲛⲟⲛ, ⲉⲣϣⲁⲛ_ⲛⲧⲟ, ⲉⲣϣⲁⲛ_ⲛⲧⲟϥ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲕ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲟⲩ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲥ, ⲉⲣϣⲁⲛ_ⲛⲧⲱⲧⲛ, ⲉⲣⲉ_ⲁⲛⲟⲕ, ⲉⲣⲉ_ⲁⲛⲟⲛ, ⲉⲣⲉ_ⲛⲧⲟ, ⲉⲣⲉ_ⲛⲧⲟϥ, ⲉⲣⲉ_ⲛⲧⲟⲕ, ⲉⲣⲉ_ⲛⲧⲟⲟⲩ, ⲉⲣⲉ_ⲛⲧⲟⲥ, ⲉⲣⲉ_ⲛⲧⲱⲧⲛ, ⲉⲥ, ⲉⲧⲃⲉ_ⲁⲛⲟⲕ, ⲉⲧⲉⲣⲉ_ⲛⲧⲟ, ⲉⲧⲉⲧⲛϣⲁⲛ, ⲉⲧⲉⲧⲛⲉ, ⲉⲧⲛ_ⲁⲛⲟⲕ, ⲕ, ⲙⲉⲩ, ⲙⲙⲓⲛⲙⲙⲟ_ⲛⲧⲟ, ⲙⲡⲉ_ⲛⲧⲟ, ⲛ_ⲛⲧⲟ, ⲛⲉⲣⲉ_ⲛⲧⲟ, ⲛⲓⲙ, ⲛⲥⲁ_ⲛⲧⲟ, ⲛⲧⲉ_ⲁⲛⲟⲕ, ⲛⲧⲉⲧⲛ, ⲛⲧⲛ_ⲁⲛⲟⲕ, ⲛⲧⲟ, ⲛⲧⲟϥ, ⲛⲧⲟⲕ, ⲛⲧⲟⲟⲩ, ⲛⲧⲟⲥ, ⲛⲧⲱⲧⲛ, ⲟⲩ, ⲟⲩⲏⲣ, ⲡⲉ, ⲡⲱⲥ, ⲣⲁⲧ_ⲁⲛⲟⲕ, ⲣⲟ_ⲛⲧⲟ
- This corpus contains 29 lemmas tagged as determiners (DET): ϩⲛ, ϭⲉ, ϯ, ⲕⲉ, ⲛ, ⲛⲁ, ⲛⲁⲓ, ⲛⲉⲓ, ⲛⲟⲩⲓ, ⲟⲩ, ⲡ, ⲡⲁ, ⲡⲁⲓ, ⲡⲉϥ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲥ, ⲡⲉⲧⲛ, ⲡⲉⲩ, ⲡⲏ, ⲡⲓ, ⲡⲟⲩ, ⲡⲟⲩⲕ, ⲡⲱϥ, ⲡⲱⲓ, ⲡⲱⲕ, ⲡⲱⲧⲛ, ⲧ
- Out of the above, 1 lemmas occurred sometimes as PRON and sometimes as DET: ⲟⲩ
- This corpus contains 21 lemmas tagged as auxiliaries (AUX): ϣ, ϣⲁⲛⲧⲉ, ϣⲁⲣⲉ, ϫⲡⲓ, ⲁ, ⲉⲣϣⲁⲛ, ⲉⲣⲉ, ⲙⲁⲣⲉ, ⲙⲉⲣⲉ, ⲙⲛ, ⲙⲡⲁⲧⲉ, ⲙⲡⲉ, ⲙⲡⲣⲧⲣⲉ, ⲛⲁ, ⲛⲉϣ, ⲛⲉⲣⲉ, ⲛⲛⲉ, ⲛⲧⲉ, ⲛⲧⲉⲣⲉ, ⲟⲩⲛ, ⲧⲁⲣⲉ
- Out of the above, 4 lemmas occurred sometimes as AUX and sometimes as VERB: ϫⲡⲓ, ⲙⲛ, ⲛⲁ, ⲟⲩⲛ
- There are 2 (de)verbal forms:
- Fin
- AUX: ϣ, ⲉϣ, ϫⲡⲓ, ⲛⲉϣ
- VERB: ⲡⲉϫⲁ, ϫⲱ, ⲉⲓ, ϣⲱⲡⲉ, ⲃⲱⲕ, ϫⲟⲟ, ⲛⲁⲩ, ϯ, ⲣ, ⲥⲱⲧⲙ
- Inf
- VERB: ϯ, ⲃⲱⲕ, ⲛⲁⲩ, ⲣ, ϭⲱ, ⲁⲁ, ϫⲓ, ⲕⲁ, ⲧⲁⲙⲟ, ϣⲁϫⲉ
Nominal Features
- Fem
- DET: ⲧ, ⲧⲉ, ⲧⲉϥ, ⲧⲉⲓ, ⲧⲁ, ⲧⲉⲕ, ⲧⲁⲓ, ⲧⲉⲩ, ⲧⲉⲥ, ⲧⲟⲩ
- PRON: ⲥ, ⲧⲉ, ⲉ, ⲛⲧⲟⲥ, ⲉⲣⲟ, ⲁ, ⲛⲧⲟ, ⲙⲙⲟ, ⲣ, ⲁⲣ
- Masc
- DET: ⲡ, ⲡⲉ, ⲡⲁ, ⲡⲉϥ, ⲡⲁⲓ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲩ, ⲡⲉⲥ
- PRON: ϥ, ⲕ, ⲡⲉ, ⲅ, ⲛⲧⲟϥ, ⲡ, ⲛⲧⲟⲕ, ⲉⲕϣⲁⲛ, ⲉϥϣⲁⲛ, ⲉϥⲉ
- Plur
- DET: ⲛ, ⲛⲉ, ⲛⲉϥ, ⲛⲁⲓ, ⲛⲁ, ⲙ, ⲛⲉⲛ, ⲛⲉⲩ, ⲛⲉⲕ, ⲛⲉⲧⲛ
- PRON: ⲩ, ⲟⲩ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ, ⲛⲉ, ⲧⲏⲩⲧⲛ, ⲉⲩⲉ, ⲛⲧⲱⲧⲛ
- Sing
- DET: ⲡ, ⲧ, ⲟⲩ, ⲡⲉ, ϩⲉⲛ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ, ⲡⲉⲓ
- PRON: ϥ, ⲥ, ⲓ, ⲕ, ⲡⲉ, ϯ, ⲧⲉ, ⲁⲛⲟⲕ, ⲅ, ⲛⲧⲟϥ
- Def
- DET: ⲡ, ⲧ, ⲛ, ⲡⲉ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ, ⲛⲉ, ⲛⲉϥ
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ϥ, ⲩ, ⲥ, ⲓ, ⲟⲩ, ⲕ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ
- Ind
- DET: ⲟⲩ, ϩⲉⲛ, ⲩ
Degree and Polarity
- Neg
- ADV: ⲁⲛ, ⲛ, ⲙⲡⲣ, ⲧⲙ, ⲙ, ⲟⲩ, ⲟⲩⲕ, ⲙⲉ, ⲟⲩⲇⲉ
- AUX: ⲙⲡ, ⲙⲡⲉ, ⲛⲛⲉ, ⲙⲉ, ⲙⲡⲣⲧⲣⲉ, ⲙⲛ, ⲙⲡⲁⲧ, ⲛⲛ, ⲙⲁⲣⲉ, ⲙⲡⲁⲧⲉ
- CCONJ: ⲟⲩⲇⲉ
- PART: ⲙⲙⲟⲛ, ⲟⲩⲇⲉ
- PRON: ⲙⲡⲉ
- SCONJ: ⲟⲩⲇⲉ
- VERB: ⲙⲛ, ⲙⲛⲧ, ⲙⲛⲧⲁ, ⲙⲙⲛ, ⲙⲙⲛⲧ, ⲙⲛⲧⲏ, ⲙⲙⲛⲧⲁ, ⲙⲙⲛⲧⲉ, ⲙⲙⲛⲧⲏ
Verbal Features
- Cnd
- VERB-Fin: ⲉⲓ, ⲣ, ⲥⲱⲧⲙ, ⲛⲁⲩ, ⲟⲩⲱϣ, ϫⲟⲟ, ⲣⲛⲟⲃⲉ, ϣⲱⲡⲉ, ⲉⲓⲃⲉ, ⲡⲱⲣϫ
- Imp
- VERB-Fin: ⲁⲣⲓ, ⲁϫⲓ, ⲁⲙⲟⲩ, ⲃⲱⲕ, ⲙⲁ, ⲕⲁ, ϩⲁⲣⲉϩ, ⲁⲛⲁⲩ, ⲥⲱⲧⲙ, ϣⲱⲡⲉ
- Ind
- VERB: ⲧⲣⲉ, ⲡⲉϫⲁ, ϫⲱ, ⲉⲓ, ⲃⲱⲕ, ϣⲱⲡⲉ, ϫⲟⲟ, ⲛⲁⲩ, ϯ, ⲣ
- VERB-Fin: ⲡⲉϫⲁ, ϫⲱ, ⲉⲓ, ⲃⲱⲕ, ϣⲱⲡⲉ, ϫⲟⲟ, ⲛⲁⲩ, ϯ, ⲣ, ϣⲟⲟⲡ
- VERB-Inf: ϯ, ⲃⲱⲕ, ⲛⲁⲩ, ⲣ, ϭⲱ, ⲁⲁ, ϫⲓ, ⲕⲁ, ⲧⲁⲙⲟ, ϣⲁϫⲉ
- Jus
- VERB-Fin: ϣⲱⲡⲉ, ϫⲓ, ϭⲱ, ⲥⲱⲧⲙ, ϣⲗⲏⲗ, ϭⲱϣⲧ, ϯ, ⲉⲓⲙⲉ, ⲣ, ϣⲟⲩϣⲟⲩ
- Nec
- AUX-Fin: ϫⲡⲓ
- Opt
- VERB-Fin: ϣⲱⲡⲉ, ϫⲓ, ϣⲙϣⲉ, ϫⲱϩ, ϯ, ⲉⲓ, ⲕⲁⲁ, ⲛⲁⲩ, ϩⲁⲣⲉϩ, ϩⲉ
- Pot
- AUX-Fin: ϣ, ⲉϣ, ⲛⲉϣ
- Sub
- VERB-Fin: ⲥⲣϥⲉ, ϣⲱⲡⲉ, ⲉⲓⲙⲉ, ⲙⲧⲟⲛ, ϣⲟⲩϣⲟⲩ, ϭⲱ, ϯ, ϯϩⲏⲩ, ⲕⲁⲁ, ⲣⲭⲣⲏⲥⲧⲟⲥ
Pronouns, Determiners, Quantifiers
- Art
- DET: ⲡ, ⲧ, ⲟⲩ, ⲛ, ⲡⲉ, ϩⲉⲛ, ⲕⲉ, ⲧⲉ, ⲛⲉ, ⲩ
- Dem
- DET: ⲡⲁⲓ, ⲛⲁⲓ, ⲡⲉⲓ, ⲧⲉⲓ, ⲧⲁⲓ, ⲛⲉⲓ, ϯ, ⲡⲓ, ⲛⲓ, ⲡⲏ
- PRON: ⲡⲉ, ⲧⲉ, ⲡ, ⲛⲉ, ⲛ, ⲧ
- Ind
- PRON: ⲛⲓⲙ, ⲟⲩ
- Int
- ADV: ⲧⲱⲛ
- PRON: ⲟⲩ, ⲁϣ, ⲛⲓⲙ, ⲟⲩⲏⲣ, ⲁⲟⲩⲏⲣ, ⲡⲱⲥ
- Prs
- DET: ⲡⲁ, ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲡⲉⲛ, ⲧⲁ, ⲛⲉⲛ, ⲛⲉⲩ
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ϥ, ⲩ, ⲥ, ⲓ, ⲟⲩ, ⲕ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ
- Rcp
- NOUN: ⲉⲣⲏⲩ
- Tot
- NOUN: ⲧⲏⲣ
- Card
- NUM: ⲟⲩⲁ, ⲥⲛⲁⲩ, ϣⲉ, ϣⲟⲙⲛⲧ, ⲙⲛⲧⲥⲛⲟⲟⲩⲥ, ⲙⲏⲧ, ⲥⲁϣϥ, ⲧⲃⲁ, ⲟⲩⲉⲓ, ⲥⲛⲧⲉ
- Yes
- DET: ⲡⲁ, ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲡⲉⲛ, ⲧⲁ, ⲛⲉⲛ, ⲛⲉⲩ
- PRON: ϥ, ⲟⲩ, ⲕ, ⲥ, ⲛ, ⲧⲛ, ⲧ, ⲩ, ⲧⲏⲩⲧⲛ, ⲓ
- Yes
- NOUN: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ⲙⲙⲓⲛⲙⲙⲟ
- 1
- DET: ⲡⲁ, ⲡⲉⲛ, ⲛⲁ, ⲧⲁ, ⲛⲉⲛ, ⲧⲉⲛ, ⲡⲱⲓ, ⲛⲟⲩⲓ
- PRON: ⲓ, ⲛ, ϯ, ⲁⲛⲟⲕ, ⲧⲛ, ⲧ, ⲧⲁ, ⲁⲛⲅ, ⲁⲛⲟⲛ, ⲁ
- 2
- DET: ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲛⲉⲧⲛ, ⲡⲟⲩ, ⲧⲟⲩ, ⲡⲉⲧⲛ, ⲛⲟⲩ, ⲧⲉⲧⲛ, ⲡⲱⲧⲛ
- PRON: ⲕ, ⲧⲉⲧⲛ, ⲧⲛ, ⲅ, ⲧⲏⲩⲧⲛ, ⲉ, ⲛⲧⲟⲕ, ⲛⲧⲱⲧⲛ, ⲧⲉ, ⲉⲕϣⲁⲛ
- 3
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲛⲉⲩ, ⲡⲉⲩ, ⲧⲉⲩ, ⲡⲉⲥ, ⲧⲉⲥ, ⲛⲉⲥ, ⲡⲱϥ
- PRON: ϥ, ⲩ, ⲥ, ⲟⲩ, ⲥⲉ, ⲛⲧⲟϥ, ⲉⲩⲉ, ⲛⲧⲟⲟⲩ, ⲛⲧⲟⲥ, ⲉϥϣⲁⲛ
- Fem
- DET: ⲡⲉⲥ, ⲧⲉⲥ, ⲡⲟⲩ, ⲧⲟⲩ, ⲛⲟⲩ, ⲛⲉⲥ
- Masc
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲡⲱⲕ, ⲡⲱϥ, ⲛⲟⲩⲕ, ⲧⲱϥ
- Plur
- DET: ⲡⲉⲛ, ⲛⲉⲛ, ⲛⲉⲩ, ⲡⲉⲩ, ⲛⲉⲧⲛ, ⲧⲉⲩ, ⲡⲉⲧⲛ, ⲧⲉⲛ, ⲧⲉⲧⲛ, ⲡⲱⲧⲛ
- Sing
- DET: ⲡⲉϥ, ⲡⲁ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲧⲁ, ⲧⲉⲕ, ⲛⲉⲕ, ⲡⲉⲥ
Other Features
- Emph
- Yes
- PART: ⲉ, ⲛⲧ, ⲉⲣⲉ, ⲉⲛⲧ
- PRON: ⲉⲣ, ⲉⲣⲉ
- Yes
- ExtPos
- ADP
- ADP: ⲉⲧⲃⲉ, ϣⲁ
- ADV: ⲉⲃⲟⲗ, ⲉϩⲣⲁⲓ, ⲉϩⲟⲩⲛ, ⲉϩⲟⲩ, ⲛϩⲟⲩⲛ, ϣⲁϩⲣⲁⲓ, ϫⲓⲛ
- ADV
- ADP: ϣⲁ
- ADV: ⲟⲩ
- CCONJ: ⲕⲁⲓ
- PART: ⲉⲓⲥ
- SCONJ
- ADP: ⲉⲧⲃⲉ, ⲉ, ⲛ
- ADV: ⲉⲃⲟⲗ
- ADP
- Foreign
- Yes
- ADJ: ⲕⲁⲑⲟⲗⲓⲕⲏ
- ADP: ⲕⲁⲧⲁ, ⲭⲱⲣⲓⲥ, ⲡⲁⲣⲁ, ⲡⲣⲟⲥ, ⲕⲁⲧⲁⲣⲟ, ⲙ, ⲡⲁⲣⲁⲣⲟ
- ADV: ⲕⲁⲗⲱⲥ, ⲧⲟⲧⲉ, ⲗⲟⲓⲡⲟⲛ, ⲙⲁⲗⲓⲥⲧⲁ, ⲉⲧⲓ, ⲏⲇⲏ, ⲡⲁⲗⲓⲛ, ϩⲟⲗⲱⲥ, ⲉⲓⲧⲁ, ⲟⲩⲕ
- AUX: ⲟⲩⲛ
- CCONJ: ⲁⲗⲗⲁ, ⲏ, ⲟⲩⲇⲉ, ⲉⲓⲧⲉ, ⲙⲏ, ⲡⲗⲏⲛ, ⲕⲁⲓ, ⲟⲩⲧⲉ, ⲕⲁⲛ, ϩⲟⲧⲁⲛ
- DET: ⲛⲁⲓ
- NOUN: ⲭⲣⲓⲥⲧⲟⲥ, ⲡⲛⲉⲩⲙⲁ, ⲁⲡⲁ, ⲕⲟⲥⲙⲟⲥ, ⲥⲱⲙⲁ, ⲙⲁⲑⲏⲧⲏⲥ, ⲡⲟⲗⲓⲥ, ⲗⲁⲟⲥ, ⲥⲁⲣⲝ, ⲙⲟⲛⲁⲭⲟⲥ
- NUM: ⲟⲩⲁ, ⲥⲉ
- PART: ⲇⲉ, ⲅⲁⲣ, ⲱ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ, ⲁⲣⲁ, ⲟⲩⲛ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲁⲛⲧⲓ
- PRON: ⲟⲩ
- PROPN: ⲓⲏⲥⲟⲩⲥ, ⲓⲱϩⲁⲛⲛⲏⲥ, ⲃⲟⲉⲥ, ϩⲣⲟⲩⲑ, ⲛⲟⲉⲙⲓⲛ, ⲥⲁⲧⲁⲛⲁⲥ, ⲓⲁⲕⲱⲃ, ⲓⲱⲛⲁⲥ, ⲇⲏⲙⲏⲧⲣⲓⲟⲥ, ⲇⲓⲁⲃⲟⲗⲟⲥ
- SCONJ: ϩⲱⲥⲧⲉ, ϩⲱⲥ, ⲉⲓⲙⲏⲧⲓ, ⲉⲡⲉⲓⲇⲏ, ⲙⲏⲡⲟⲧⲉ, ϩⲟⲧⲁⲛ, ⲕⲁⲛ, ⲙⲏⲧⲓ, ⲉⲡⲉⲓ, ⲙⲏ
- VERB: ⲥⲧⲁⲩⲣⲟⲩ, ⲡⲓⲥⲧⲉⲩⲉ, ⲕⲣⲓⲛⲉ, ⲡⲁⲣⲁⲕⲁⲗⲉⲓ, ⲕⲏⲣⲩⲥⲥⲉ, ⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉ, ⲑⲩⲥⲓⲁⲍⲉ, ⲛⲏⲥⲧⲉⲩⲉ, ⲁⲛⲁⲭⲱⲣⲉⲓ, ⲁⲥⲡⲁⲍⲉ
- VERB-Fin: ⲥⲧⲁⲩⲣⲟⲩ, ⲡⲓⲥⲧⲉⲩⲉ, ⲡⲁⲣⲁⲕⲁⲗⲉⲓ, ⲕⲣⲓⲛⲉ, ⲕⲏⲣⲩⲥⲥⲉ, ⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉ, ⲑⲩⲥⲓⲁⲍⲉ, ⲛⲏⲥⲧⲉⲩⲉ, ⲁⲛⲁⲭⲱⲣⲉⲓ, ⲕⲗⲏⲣⲟⲛⲟⲙⲉⲓ
- VERB-Inf: ⲧⲁⲗⲉⲇⲱⲣⲟⲛ, ⲡⲁⲣⲁⲅⲉ, ϯⲛⲧⲟⲗⲏ, ⲁⲛⲉⲭⲉ, ⲁⲡⲟⲧⲁⲥⲥⲉ, ⲁⲥⲡⲁⲍⲉ, ⲃⲁⲡⲧⲓⲍⲉ, ⲇⲓⲁⲕⲣⲓⲛⲉ, ⲉⲡⲓⲃⲟⲩⲗⲉⲩⲉ, ⲉⲩⲁⲅⲅⲉⲗⲓⲍⲉ
- X: ⲙⲟⲛⲟⲛ, ⲁⲣⲭⲏⲉⲡⲓⲥⲕⲟⲡⲟⲩ, ⲁⲩⲧⲟⲩ, ⲉⲡⲓⲥⲧⲟⲗⲏ, ⲉⲡⲫⲁⲑⲁ, ⲕⲟⲩⲙ, ⲟⲩⲇ[.......], ⲧⲟⲩ, ⲭⲉⲓⲙⲏⲧⲓ
- Yes
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: ⲡⲉ.
- This corpus uses 21 lemmas as auxiliaries (aux). Examples: ⲁ, ⲛⲁ, ⲛⲧⲉ, ⲛⲧⲉⲣⲉ, ⲛⲉⲣⲉ, ⲙⲡⲉ, ϣⲁⲣⲉ, ⲙⲁⲣⲉ, ϣ, ⲛⲛⲉ, ⲉⲣϣⲁⲛ, ⲙⲉⲣⲉ, ϣⲁⲛⲧⲉ, ⲙⲛ, ⲙⲡⲣⲧⲣⲉ, ⲙⲡⲁⲧⲉ, ⲧⲁⲣⲉ, ⲟⲩⲛ, ⲉⲣⲉ, ϫⲡⲓ, ⲛⲉϣ.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (114)
- VERB--NOUN-ADP(ⲛ) (2)
- VERB--NOUN-ADP(ⲛ)-ADP(ⲛ) (1)
- VERB--PRON (32)
- VERB-Fin--NOUN (578)
- VERB-Fin--NOUN-ADP(ϩⲓⲣⲛ) (1)
- VERB-Fin--NOUN-ADP(ⲛ) (7)
- VERB-Fin--PRON (4479)
- obj
- VERB--NOUN (43)
- VERB--NOUN-ADP(ⲛ) (13)
- VERB--PRON (179)
- VERB-Fin--NOUN (410)
- VERB-Fin--NOUN-ADP(ⲙ) (3)
- VERB-Fin--NOUN-ADP(ⲛ) (498)
- VERB-Fin--NOUN-ADP(ⲛ)-ADP(ⲛ) (12)
- VERB-Fin--PRON (786)
- VERB-Fin--PRON-ADP(ⲉ) (1)
- VERB-Fin--PRON-ADP(ⲛ) (245)
- VERB-Fin--PRON-ADP(ⲧⲟⲟⲧ) (1)
- VERB-Inf--NOUN (25)
- VERB-Inf--NOUN-ADP(ⲛ) (19)
- VERB-Inf--NOUN-ADP(ⲛ)-ADP(ⲛ) (1)
- VERB-Inf--PRON (54)
- VERB-Inf--PRON-ADP(ⲛ) (4)
- iobj
- VERB--NOUN (5)
- VERB--PRON (73)
- VERB--PRON-ADP(ⲛ) (2)
- VERB-Fin--NOUN (1)
- VERB-Fin--PRON (2)
Relations Overview
- This corpus uses 4 relation subtypes: acl:relcl, nmod:poss, nmod:unmarked, obl:unmarked
- The following 3 relation types are not used in this corpus at all: clf, list, goeswith