UD Coptic Scriptorium
Language: Coptic (code: cop)
Family: Afro-Asiatic, Egyptian
This treebank has been part of Universal Dependencies since the UD v1.4 release.
The following people have contributed to making this treebank part of UD: Mitchell Abrams, Elizabeth Davidson, Amir Zeldes.
Repository: UD_Coptic-Scriptorium
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.12
License: CC BY 4.0
Genre: bible, fiction, nonfiction
Questions, comments? General annotation questions (either Coptic-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [amir • zeldes (æt) georgetown • edu]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually in non-UD style, automatically converted to UD |
XPOS | annotated manually |
Features | assigned by a program, not checked manually |
Relations | annotated manually, natively in UD style |
Description
UD Coptic contains manually annotated Sahidic Coptic texts, including Biblical texts, sermons, letters, and hagiography.
The Coptic Universal Dependency Treebank is a manually annotated corpus of Sahidic Coptic texts, currently containing excerpts from the Sahidic New Testament Gospel of Mark, works by Archimandrite Shenoute of Atripe, the Letters of Besa, the lives of Sts. Cyrus and Onnophrius, the Epistle of Pseudo-Ephrem, the Dormition of John the Apostle, and short stories from the Apophthegmata Patrum (Sayings of the Desert Fathers). Detailed information about the treebank is available here:
http://copticscriptorium.org/treebank.html
The data was either digitized or already available in digital format, and was annotated manually for part of speech as part of the Coptic Scriptorium project. For individual credit and further information see:
http://copticscriptorium.org/
Coptic POS tags come from the Coptic Scriptorium tag set, which is available from the project and treebank websites.
Acknowledgments
The underlying POS tagged material was produced as part of the projects Coptic Scriptorium, KOMeT and KELLIA, funded by the NEH in the USA and BMBF and DFG in Germany (see http://copticscriptorium.org/ for more details). Treebank annotation was done mainly by Mitchell Abrams, Liz Davidson and Amir Zeldes. Thanks are also due to Israel Avrahamy, Asael Benyami, Yinon Kahan and Oran Szachter for their contributions.
Statistics of UD Coptic Scriptorium
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
Definite – Foreign – Gender – Gender[psor] – Number – Number[psor] – NumType – Person – Polarity – Poss – PronType – Reflex – VerbForm
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – case – cc – ccomp – compound – conj – cop – csubj – dep – det – discourse – dislocated – fixed – flat – iobj – mark – nmod – nsubj – nummod – obj – obl – obl:npmod – orphan – parataxis – punct – reparandum – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 2163 sentences, 26330 tokens and 55858 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 11 types of words that contain both letters and punctuation. Examples: ·ⲻ, .....ⲟ..., [....]ⲥ, [...]ϥ, ϩⲏ[..]ⲉ, ⲉ......ⲙ...., ⲉ[.....], ⲉⲃ[........], ⲟⲩⲇ[.......], ⲡ[…]ⲡⲟⲥ, ⲡⲁ[…]ϥⲟϭⲉ
- This corpus contains 16475 multi-word tokens. On average, one multi-word token consists of 2.79 syntactic words.
- There are 8837 types of multi-word tokens. Examples: ⲛⲁϥ, ⲙⲙⲟⲥ, ⲉⲣⲟϥ, ⲙⲙⲟϥ, ⲡⲉϫⲁϥ, ⲙⲡⲛⲟⲩⲧⲉ, ⲛⲁⲩ, ⲛⲧϩⲉ, ⲛⲁⲓ, ⲙⲙⲟⲟⲩ, ⲧⲏⲣⲟⲩ, ⲉϥϫⲱ, ⲛⲁⲕ, ⲛϩⲏⲧϥ, ⲉⲣⲟⲓ, ⲉⲣⲟⲟⲩ, ⲉⲧⲙⲙⲁⲩ, ⲙⲙⲟⲕ, ⲛⲏⲧⲛ, ⲉⲧⲟⲩⲁⲁⲃ, ⲛⲁⲥ, ⲉⲣⲟⲕ, ⲛⲧⲉⲩⲛⲟⲩ, ⲛϩⲏⲧ, ⲛⲙⲙⲁϥ, ⲁϥⲉⲓ, ⲛⲁⲛ, ⲛⲣⲱⲙⲉ, ⲛⲧⲉⲓϩⲉ, ⲁϥⲃⲱⲕ, ⲛⲟⲩⲱⲧ, ⲡⲛⲟⲩⲧⲉ, ⲙⲡϫⲟⲉⲓⲥ, ⲧⲏⲣϥ, ⲉⲧⲃⲉⲡⲁⲓ, ⲙⲙⲟⲓ, ⲡⲉⲭⲣⲓⲥⲧⲟⲥ, ⲛⲥⲱϥ, ⲉⲣⲟⲛ, ⲙⲡⲣⲱⲙⲉ, ⲛϩⲏⲧⲟⲩ, ⲛϩⲟⲟⲩ, ⲁϥϫⲟⲟⲥ, ⲁⲩⲉⲓ, ⲛϩⲟⲩⲟ, ⲛⲧⲉⲡⲛⲟⲩⲧⲉ, ⲡⲉϫⲁⲥ, ⲉⲣⲱⲧⲛ, ⲡⲁϣⲏⲣⲉ, ⲉⲡⲉⲥⲏⲧ.
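The figures above follow the CoNLL-U convention in which a multi-word token occupies a range line (such as `1-3`) followed by the syntactic words it covers. The following is a minimal Python sketch of how such counts could be derived; the helper names `row` and `count_units` and the toy sentence are illustrative assumptions, not the script that generated this page and not an excerpt from the treebank files.

```python
def row(token_id, form, upos="_"):
    # Build one CoNLL-U line with 10 tab-separated columns; unused ones stay "_".
    return "\t".join([token_id, form, "_", upos, "_", "_", "_", "_", "_", "_"])


def count_units(conllu: str) -> dict:
    """Count sentences, surface tokens, multi-word tokens and syntactic words."""
    counts = {"sentences": 0, "tokens": 0, "mwts": 0, "words": 0}
    in_sentence = False
    covered_until = 0  # last word ID covered by the most recent range line
    for line in conllu.splitlines():
        if not line.strip():
            in_sentence = False  # a blank line closes the sentence
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        if not in_sentence:
            in_sentence = True
            covered_until = 0
            counts["sentences"] += 1
        token_id = line.split("\t")[0]
        if "." in token_id:
            continue  # empty node, neither a token nor a syntactic word
        if "-" in token_id:
            counts["mwts"] += 1
            counts["tokens"] += 1
            covered_until = int(token_id.split("-")[1])
        else:
            counts["words"] += 1
            if int(token_id) > covered_until:
                counts["tokens"] += 1  # a word that is its own surface token
    return counts


# Toy sentence (hypothetical): one multi-word token plus a punctuation mark.
TOY = "\n".join([
    "# text = ⲁϥⲃⲱⲕ .",
    row("1-3", "ⲁϥⲃⲱⲕ"),
    row("1", "ⲁ", "AUX"),
    row("2", "ϥ", "PRON"),
    row("3", "ⲃⲱⲕ", "VERB"),
    row("4", ".", "PUNCT"),
    "",
])
counts = count_units(TOY)
print(counts)

# Consistency check on the figures reported above.
tokens, words, mwts = 26330, 55858, 16475
single_word_tokens = tokens - mwts          # tokens that are not multi-word
words_in_mwts = words - single_word_tokens  # words covered by range lines
avg = words_in_mwts / mwts
print(round(avg, 2))  # 2.79
```

The final check shows that the reported token, word and multi-word token counts agree with the stated average of 2.79 syntactic words per multi-word token.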
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: INTJ, SYM
- This corpus contains 42 word types tagged as particles (PART): ϩⲁⲙⲏⲛ, ϩⲏⲏⲛⲉ, ϩⲏⲏⲧⲉ, ϩⲛ, ϭⲉ, ⲁϩⲉ, ⲁϫⲛ, ⲁⲛⲧⲓ, ⲁⲣⲁ, ⲅⲁⲣ, ⲇⲉ, ⲉ, ⲉϩⲉ, ⲉⲓⲉ, ⲉⲓⲥ, ⲉⲛⲉ, ⲉⲛⲧ, ⲉⲣⲉ, ⲉⲧⲃⲉ, ⲙ, ⲙⲉⲛ, ⲙⲙⲟ, ⲙⲙⲟⲛ, ⲙⲛ, ⲙⲛⲛⲥⲁ, ⲛ, ⲛϭⲉ, ⲛϭⲓ, ⲛⲁ, ⲛⲉ, ⲛⲧ, ⲛⲧⲉ, ⲟⲩⲇⲉ, ⲟⲩⲛ, ⲟⲩⲟⲉⲓ, ⲟⲩⲟⲓ, ⲡⲉ, ⲣⲱ, ⲥⲉ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲱ
- This corpus contains 72 lemmas tagged as pronouns (PRON): ϩⲁ_ⲛⲧⲟ, ϩⲁϩⲧⲛ, ϩⲓϫⲛ_ⲛⲧⲟ, ϩⲓⲧⲛ_ⲁⲛⲟⲕ, ϩⲛ_ⲁⲛⲟⲕ, ϩⲱ_ⲁⲛⲟⲕ, ϩⲱⲱ_ⲁⲛⲟⲕ, ϫⲓ_ⲁⲛⲟⲕ, ϫⲡⲟ_ⲛⲧⲟ, ⲁ, ⲁ_ⲛⲧⲟ, ⲁϣ, ⲁⲛⲟⲕ, ⲁⲛⲟⲕ_ⲛⲧⲉ, ⲁⲛⲟⲛ, ⲁⲟⲩⲏⲣ, ⲅ, ⲉ_ⲛⲧⲟ, ⲉϫⲛ_ⲛⲧⲟ, ⲉⲓ, ⲉⲕⲉ, ⲉⲛⲉ, ⲉⲣϣⲁⲛ_ⲁⲛⲟⲕ, ⲉⲣϣⲁⲛ_ⲁⲛⲟⲛ, ⲉⲣϣⲁⲛ_ⲛⲧⲟ, ⲉⲣϣⲁⲛ_ⲛⲧⲟϥ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲕ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲟⲩ, ⲉⲣϣⲁⲛ_ⲛⲧⲟⲥ, ⲉⲣϣⲁⲛ_ⲛⲧⲱⲧⲛ, ⲉⲣⲉ, ⲉⲣⲉ_ⲁⲛⲟⲕ, ⲉⲣⲉ_ⲁⲛⲟⲛ, ⲉⲣⲉ_ⲛⲧⲟ, ⲉⲣⲉ_ⲛⲧⲟϥ, ⲉⲣⲉ_ⲛⲧⲟⲕ, ⲉⲣⲉ_ⲛⲧⲟⲟⲩ, ⲉⲣⲉ_ⲛⲧⲟⲥ, ⲉⲣⲉ_ⲛⲧⲱⲧⲛ, ⲉⲥ, ⲉⲧⲃⲉ_ⲁⲛⲟⲕ, ⲉⲧⲉⲣⲉ_ⲛⲧⲟ, ⲉⲧⲉⲧⲛϣⲁⲛ, ⲉⲧⲉⲧⲛⲉ, ⲕ, ⲙⲉⲩ, ⲙⲙⲓⲛⲙⲙⲟ_ⲛⲧⲟ, ⲙⲡⲉ_ⲛⲧⲟ, ⲛ, ⲛ_ⲛⲧⲟ, ⲛⲉⲣⲉ_ⲛⲧⲟ, ⲛⲓⲙ, ⲛⲥⲁ_ⲛⲧⲟ, ⲛⲧⲉ_ⲁⲛⲟⲕ, ⲛⲧⲉⲧⲛ, ⲛⲧⲛ_ⲁⲛⲟⲕ, ⲛⲧⲟ, ⲛⲧⲟϥ, ⲛⲧⲟⲕ, ⲛⲧⲟⲟⲩ, ⲛⲧⲟⲥ, ⲛⲧⲱⲧⲛ, ⲟⲩ, ⲟⲩⲏⲣ, ⲡⲉ, ⲡⲱⲥ, ⲣⲁⲧ_ⲁⲛⲟⲕ, ⲣⲟ_ⲛⲧⲟ, ⲥϥ, ⲧⲉⲧ, ⲧⲉⲧⲛ, ⲧⲣⲉϥ
- This corpus contains 29 lemmas tagged as determiners (DET): ϩⲛ, ϭⲉ, ϯ, ⲕⲉ, ⲛ, ⲛⲁ, ⲛⲁⲓ, ⲛⲟⲩⲓ, ⲛⲧⲟⲟⲩ, ⲟⲩ, ⲡ, ⲡⲁ, ⲡⲁⲓ, ⲡⲉϥ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲥ, ⲡⲉⲧⲛ, ⲡⲉⲩ, ⲡⲏ, ⲡⲓ, ⲡⲟⲩ, ⲡⲟⲩⲕ, ⲡⲱϥ, ⲡⲱⲓ, ⲡⲱⲕ, ⲡⲱⲧⲛ, ⲧ
- Out of the above, 3 lemmas occurred sometimes as PRON and sometimes as DET: ⲛ, ⲛⲧⲟⲟⲩ, ⲟⲩ
- This corpus contains 26 lemmas tagged as auxiliaries (AUX): ϣ, ϣⲁ, ϣⲁⲛⲧⲉ, ϣⲁⲣⲉ, ϫⲡⲓ, ⲁ, ⲉϣ, ⲉⲣϣⲁⲛ, ⲉⲣⲉ, ⲙⲁⲣⲉ, ⲙⲉ, ⲙⲉⲣⲉ, ⲙⲛ, ⲙⲡⲁⲧⲉ, ⲙⲡⲉ, ⲙⲡⲣⲧⲣⲉ, ⲛⲁ, ⲛⲉ, ⲛⲉϣ, ⲛⲉⲣⲉ, ⲛⲛⲉ, ⲛⲧⲉ, ⲛⲧⲉⲣⲉ, ⲟⲩⲛ, ⲧⲁⲣ, ⲧⲁⲣⲉ
- Out of the above, 6 lemmas occurred sometimes as AUX and sometimes as VERB: ϣⲁ, ϫⲡⲓ, ⲙⲉ, ⲙⲛ, ⲛⲁ, ⲟⲩⲛ
- There are 2 (de)verbal forms:
- Fin
- AUX: ϣ, ⲉϣ, ϫⲡⲓ, ⲛⲉϣ
- PRON: ϫⲓⲧ, ϫⲡⲟ
- VERB: ⲡⲉϫⲁ, ⲉⲓ, ϫⲱ, ϣⲱⲡⲉ, ⲃⲱⲕ, ϫⲟⲟ, ⲛⲁⲩ, ϯ, ⲣ, ⲥⲱⲧⲙ
- Inf
- VERB: ϯ, ⲃⲱⲕ, ⲛⲁⲩ, ⲣ, ϫⲓ, ⲁⲁ, ⲕⲁ, ⲧⲁⲙⲟ, ϣⲁϫⲉ, ϣⲁⲁⲧ
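The tag inventory and the lemma overlaps reported above (for example, lemmas that occur both as PRON and as DET) can be reproduced with simple set operations. The sketch below is illustrative only: `shared_lemmas` is a hypothetical helper, and `TOY_PAIRS` is a hand-picked handful of (lemma, UPOS) pairs taken from the lists above rather than a full extraction from the treebank.

```python
from collections import defaultdict

# The 17 universal POS tags defined by UD.
UD_UPOS = {
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
}
# Tags reported as used in this corpus.
USED = set(
    "ADJ ADP ADV AUX CCONJ DET NOUN NUM PART PRON PROPN PUNCT SCONJ VERB X".split()
)
unused = sorted(UD_UPOS - USED)
print(len(USED), "of", len(UD_UPOS), "tags used; unused:", unused)


def shared_lemmas(pairs, tag_a, tag_b):
    """Return lemmas that occur with both tag_a and tag_b."""
    tags_by_lemma = defaultdict(set)
    for lemma, upos in pairs:
        tags_by_lemma[lemma].add(upos)
    return sorted(
        lemma for lemma, tags in tags_by_lemma.items()
        if tag_a in tags and tag_b in tags
    )


# Hypothetical sample of (lemma, UPOS) pairs, not the full corpus.
TOY_PAIRS = [
    ("ⲟⲩ", "PRON"), ("ⲟⲩ", "DET"), ("ⲡ", "DET"),
    ("ⲁⲛⲟⲕ", "PRON"), ("ϣⲁ", "AUX"), ("ϣⲁ", "VERB"),
]
print(shared_lemmas(TOY_PAIRS, "PRON", "DET"))
print(shared_lemmas(TOY_PAIRS, "AUX", "VERB"))
```

On the full treebank, the pairs would come from the LEMMA and UPOS columns of each word line.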
Nominal Features
- Gender=Fem
- DET: ⲧ, ⲧⲉ, ⲧⲉϥ, ⲧⲉⲓ, ⲧⲁ, ⲧⲉⲕ, ⲧⲁⲓ, ⲧⲉⲩ, ⲧⲉⲥ, ⲧⲟⲩ
- PRON: ⲥ, ⲧⲉ, ⲉ, ⲛⲧⲟⲥ, ⲉⲣⲟ, ⲁ, ⲛⲧⲟ, ⲙⲙⲟ, ⲣ, ⲁⲣ
- Gender=Masc
- DET: ⲡ, ⲡⲉ, ⲡⲁ, ⲡⲉϥ, ⲡⲁⲓ, ⲡⲉⲓ, ⲡⲉⲕ, ⲡⲉⲛ, ⲡⲉⲩ, ⲡⲟⲩ
- PRON: ϥ, ⲕ, ⲡⲉ, ⲅ, ⲛⲧⲟϥ, ⲡ, ⲛⲧⲟⲕ, ⲉϥϣⲁⲛ, ⲉⲕϣⲁⲛ, ⲉϥⲉ
- Number=Plur
- DET: ⲛ, ⲛⲉ, ⲛⲉϥ, ⲛⲁⲓ, ⲛⲁ, ⲙ, ⲛⲉⲛ, ⲛⲉⲩ, ⲛⲉⲕ, ⲛⲉⲧⲛ
- PRON: ⲩ, ⲟⲩ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ, ⲛⲉ, ⲧⲏⲩⲧⲛ, ⲉⲩⲉ, ⲛⲧⲱⲧⲛ
- Number=Sing
- DET: ⲡ, ⲧ, ⲟⲩ, ⲡⲉ, ϩⲉⲛ, ⲡⲁ, ⲡⲉϥ, ⲧⲉ, ⲡⲁⲓ, ⲡⲉⲓ
- PRON: ϥ, ⲥ, ⲓ, ⲕ, ⲡⲉ, ϯ, ⲧⲉ, ⲅ, ⲁⲛⲟⲕ, ⲛⲧⲟϥ
- Definite=Def
- ADV: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- DET: ⲡ, ⲧ, ⲛ, ⲡⲉ, ⲡⲁ, ⲡⲉϥ, ⲛⲉ, ⲧⲉ, ⲡⲁⲓ, ⲛⲉϥ
- PRON: ϥ, ⲩ, ⲥ, ⲓ, ⲟⲩ, ⲕ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ
- Definite=Ind
- DET: ⲟⲩ, ϩⲉⲛ, ⲩ
Degree and Polarity
- Polarity=Neg
- ADV: ⲁⲛ, ⲛ, ⲧⲙ, ⲙⲡⲣ, ⲙ, ⲟⲩⲕ, ⲙⲉ, ⲟⲩ, ⲟⲩⲇⲉ
- AUX: ⲙⲡ, ⲙⲡⲉ, ⲛⲛⲉ, ⲙⲉ, ⲙⲛ, ⲙⲡⲁⲧ, ⲙⲡⲣⲧⲣⲉ, ⲙⲁⲣⲉ, ⲙⲡⲁⲧⲉ, ⲛⲛ
- CCONJ: ⲟⲩⲇⲉ
- PART: ⲙⲙⲟⲛ, ⲟⲩⲇⲉ
- PRON: ⲙⲡⲉ
- SCONJ: ⲟⲩⲇⲉ
- VERB: ⲙⲛ, ⲙⲛⲧ, ⲙⲙⲛ, ⲙⲙⲛⲧ, ⲙⲛⲧⲁ, ⲙⲛⲧⲏ, ⲙⲙⲛⲧⲉ
- X: ⲟⲩ
Verbal Features
Pronouns, Determiners, Quantifiers
- PronType=Art
- DET: ⲡ, ⲧ, ⲛ, ⲟⲩ, ⲡⲉ, ϩⲉⲛ, ⲕⲉ, ⲛⲉ, ⲧⲉ, ⲩ
- PronType=Dem
- DET: ⲡⲁⲓ, ⲛⲁⲓ, ⲡⲉⲓ, ⲧⲉⲓ, ⲧⲁⲓ, ⲛⲉⲓ, ⲡⲓ, ⲛⲓ, ⲡⲏ, ⲛⲏ
- PronType=Ind
- PRON: ⲛⲓⲙ, ⲟⲩ
- PronType=Int
- ADV: ⲧⲱⲛ
- PRON: ⲟⲩ, ⲛⲓⲙ, ⲁϣ, ⲟⲩⲏⲣ, ⲁⲟⲩⲏⲣ, ⲡⲱⲥ
- PronType=Prs
- ADV: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- DET: ⲡⲁ, ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲡⲉⲛ, ⲧⲁ, ⲛⲉⲛ, ⲛⲉⲩ
- PRON: ϥ, ⲩ, ⲥ, ⲓ, ⲟⲩ, ⲕ, ⲛ, ⲧⲛ, ⲧⲉⲧⲛ, ⲥⲉ
- PronType=Rcp
- NOUN: ⲉⲣⲏⲩ
- PronType=Tot
- ADV: ⲧⲏⲣ
- NumType=Card
- NUM: ⲟⲩⲁ, ⲥⲛⲁⲩ, ϣⲉ, ϣⲟⲙⲛⲧ, ⲙⲛⲧⲥⲛⲟⲟⲩⲥ, ⲙⲏⲧ, ⲥⲁϣϥ, ⲧⲃⲁ, ⲥⲛⲧⲉ, ϩⲙⲉ
- Poss=Yes
- DET: ⲡⲁ, ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲡⲉⲛ, ⲧⲁ, ⲛⲉⲛ, ⲛⲉⲩ
- PRON: ϥ, ⲟⲩ, ⲕ, ⲥ, ⲛ, ⲧⲛ, ⲧ, ⲩ, ⲓ, ⲧⲏⲩⲧⲛ
- Reflex=Yes
- ADV: ⲙⲙⲓⲛⲙⲙⲟ, ⲙⲙⲓⲛⲙⲙⲱ
- PRON: ⲙⲙⲓⲛⲙⲙⲟ
- Person=1
- DET: ⲡⲁ, ⲡⲉⲛ, ⲛⲁ, ⲛⲉⲛ, ⲧⲁ, ⲧⲉⲛ, ⲡⲱⲓ, ⲛⲟⲩⲓ
- PRON: ⲓ, ⲛ, ϯ, ⲁⲛⲟⲕ, ⲧⲛ, ⲧⲁ, ⲧ, ⲁⲛⲅ, ⲁⲛⲟⲛ, ⲁ
- Person=2
- DET: ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲛⲉⲧⲛ, ⲡⲟⲩ, ⲧⲟⲩ, ⲡⲉⲧⲛ, ⲛⲟⲩ, ⲧⲉⲧⲛ, ⲡⲱⲧⲛ
- PRON: ⲕ, ⲧⲉⲧⲛ, ⲧⲛ, ⲅ, ⲧⲏⲩⲧⲛ, ⲉ, ⲛⲧⲱⲧⲛ, ⲧⲉ, ⲛⲧⲟⲕ, ⲉⲕϣⲁⲛ
- Person=3
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲛⲉⲩ, ⲡⲉⲩ, ⲧⲉⲩ, ⲧⲉⲥ, ⲡⲉⲥ, ⲛⲉⲥ, ⲡⲱϥ
- PRON: ϥ, ⲩ, ⲥ, ⲟⲩ, ⲥⲉ, ⲛⲧⲟϥ, ⲉⲩⲉ, ⲛⲧⲟⲟⲩ, ⲛⲧⲟⲥ, ⲉϥϣⲁⲛ
- Gender[psor]=Fem
- DET: ⲧⲉⲥ, ⲡⲟⲩ, ⲡⲉⲥ, ⲧⲟⲩ, ⲛⲟⲩ, ⲛⲉⲥ
- Gender[psor]=Masc
- DET: ⲡⲉϥ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲧⲉⲕ, ⲛⲉⲕ, ⲡⲱⲕ, ⲡⲱϥ, ⲛⲟⲩⲕ, ⲧⲱϥ
- Number[psor]=Plur
- DET: ⲡⲉⲛ, ⲛⲉⲛ, ⲛⲉⲩ, ⲡⲉⲩ, ⲧⲉⲩ, ⲛⲉⲧⲛ, ⲡⲉⲧⲛ, ⲧⲉⲛ, ⲧⲉⲧⲛ, ⲡⲱⲧⲛ
- Number[psor]=Sing
- DET: ⲡⲉϥ, ⲡⲁ, ⲛⲉϥ, ⲧⲉϥ, ⲡⲉⲕ, ⲛⲁ, ⲧⲁ, ⲧⲉⲕ, ⲛⲉⲕ, ⲧⲉⲥ
Other Features
- Foreign
- Yes
- ADJ: ⲕⲁⲑⲟⲗⲓⲕⲏ
- ADP: ⲕⲁⲧⲁ, ⲭⲱⲣⲓⲥ, ⲡⲁⲣⲁ, ⲕⲁⲧⲁⲣⲟ, ⲙ, ⲡⲁⲣⲁⲣⲟ
- ADV: ⲧⲟⲧⲉ, ⲕⲁⲗⲱⲥ, ⲗⲟⲓⲡⲟⲛ, ⲙⲁⲗⲓⲥⲧⲁ, ⲉⲧⲓ, ⲏⲇⲏ, ⲡⲁⲗⲓⲛ, ϩⲟⲗⲱⲥ, ⲉⲓⲧⲁ, ⲟⲩⲕ
- AUX: ⲟⲩⲛ
- CCONJ: ⲁⲗⲗⲁ, ⲏ, ⲟⲩⲇⲉ, ⲉⲓⲧⲉ, ⲙⲏ, ⲡⲗⲏⲛ, ⲕⲁⲓ, ⲟⲩⲧⲉ, ⲕⲁⲛ, ϩⲟⲧⲁⲛ
- DET: ⲛⲁⲓ
- NOUN: ⲭⲣⲓⲥⲧⲟⲥ, ⲡⲛⲉⲩⲙⲁ, ⲁⲡⲁ, ⲕⲟⲥⲙⲟⲥ, ⲙⲁⲑⲏⲧⲏⲥ, ⲥⲱⲙⲁ, ⲗⲁⲟⲥ, ⲙⲟⲛⲁⲭⲟⲥ, ⲥⲁⲣⲝ, ⲡⲟⲗⲓⲥ
- NUM: ⲟⲩⲁ, ⲥⲉ
- PART: ⲇⲉ, ⲅⲁⲣ, ⲱ, ⲙⲉⲛ, ϩⲁⲙⲏⲛ, ⲁⲣⲁ, ⲟⲩⲛ, ⲭⲁⲓⲣⲉ, ⲭⲱⲣⲓⲥ, ⲁⲛⲧⲓ
- PRON: ⲟⲩ
- PROPN: ⲓⲏⲥⲟⲩⲥ, ⲓⲱϩⲁⲛⲛⲏⲥ, ⲃⲟⲉⲥ, ϩⲣⲟⲩⲑ, ⲛⲟⲉⲙⲓⲛ, ⲓⲁⲕⲱⲃ, ⲥⲁⲧⲁⲛⲁⲥ, ⲇⲏⲙⲏⲧⲣⲓⲟⲥ, ⲇⲓⲁⲃⲟⲗⲟⲥ, ⲡⲁⲩⲗⲟⲥ
- SCONJ: ϩⲱⲥⲧⲉ, ϩⲱⲥ, ⲉⲓⲙⲏⲧⲓ, ⲉⲡⲉⲓⲇⲏ, ⲙⲏⲡⲟⲧⲉ, ϩⲟⲧⲁⲛ, ⲙⲏⲧⲓ, ⲕⲁⲛ, ⲙⲏ, ⲅⲁⲣ
- VERB: ⲥⲧⲁⲩⲣⲟⲩ, ⲡⲁⲣⲁⲕⲁⲗⲉⲓ, ⲡⲓⲥⲧⲉⲩⲉ, ⲕⲣⲓⲛⲉ, ⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉ, ⲑⲩⲥⲓⲁⲍⲉ, ⲁⲛⲁⲭⲱⲣⲉⲓ, ⲁⲥⲡⲁⲍⲉ, ⲕⲗⲏⲣⲟⲛⲟⲙⲉⲓ, ⲛⲏⲥⲧⲉⲩⲉ
- VERB-Fin: ⲥⲧⲁⲩⲣⲟⲩ, ⲡⲁⲣⲁⲕⲁⲗⲉⲓ, ⲡⲓⲥⲧⲉⲩⲉ, ⲕⲣⲓⲛⲉ, ⲥⲕⲁⲛⲇⲁⲗⲓⲍⲉ, ⲑⲩⲥⲓⲁⲍⲉ, ⲁⲛⲁⲭⲱⲣⲉⲓ, ⲕⲗⲏⲣⲟⲛⲟⲙⲉⲓ, ⲛⲏⲥⲧⲉⲩⲉ, ⲛⲟⲉⲓ
- VERB-Inf: ⲧⲁⲗⲉⲇⲱⲣⲟⲛ, ⲡⲁⲣⲁⲅⲉ, ϯⲛⲧⲟⲗⲏ, ⲁⲛⲉⲭⲉ, ⲁⲡⲟⲧⲁⲥⲥⲉ, ⲁⲥⲡⲁⲍⲉ, ⲃⲁⲡⲧⲓⲍⲉ, ⲇⲓⲁⲕⲣⲓⲛⲉ, ⲉⲡⲓⲃⲟⲩⲗⲉⲩⲉ, ⲉⲩⲁⲅⲅⲉⲗⲓⲍⲉ
- X: ⲙⲟⲛⲟⲛ, ⲁⲣⲭⲏⲉⲡⲓⲥⲕⲟⲡⲟⲩ, ⲁⲩⲧⲟⲩ, ⲉⲡⲓⲥⲧⲟⲗⲏ, ⲉⲡⲫⲁⲑⲁ, ⲕⲟⲩⲙ, ⲟⲩⲇ[.......], ⲧⲟⲩ, ⲭⲉⲓⲙⲏⲧⲓ, ⲟⲩ
- Yes
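The feature listings in this section group word forms by feature value and then by part of speech. A rough Python sketch of that grouping is given below, assuming the standard CoNLL-U FEATS syntax (`Name=Value` pairs joined by `|`). The function names and the feature strings in `TOY_ROWS` are assumptions assembled from the lists above for illustration; they are not copied from the treebank files.

```python
from collections import defaultdict


def parse_feats(feats: str) -> dict:
    """Split a FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


def index_forms(rows):
    """Map (feature, value) -> UPOS -> list of distinct forms, in input order."""
    index = defaultdict(lambda: defaultdict(list))
    for form, upos, feats in rows:
        for name, value in parse_feats(feats).items():
            forms = index[(name, value)][upos]
            if form not in forms:
                forms.append(form)
    return index


# Hypothetical (form, UPOS, FEATS) rows; the feature strings are assumed.
TOY_ROWS = [
    ("ⲡ", "DET", "Definite=Def|Gender=Masc|Number=Sing|PronType=Art"),
    ("ⲧ", "DET", "Definite=Def|Gender=Fem|Number=Sing|PronType=Art"),
    ("ϥ", "PRON", "Definite=Def|Gender=Masc|Number=Sing|Person=3|PronType=Prs"),
    ("ⲡⲉϥ", "DET",
     "Definite=Def|Gender=Masc|Gender[psor]=Masc|Number=Sing|"
     "Number[psor]=Sing|Person=3|Poss=Yes|PronType=Prs"),
]
index = index_forms(TOY_ROWS)
for (name, value), by_upos in sorted(index.items()):
    for upos, forms in sorted(by_upos.items()):
        print(f"{name}={value} {upos}: {', '.join(forms)}")
```

Layered features such as `Gender[psor]` need no special handling here because the bracketed layer is simply part of the feature name.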
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemma as a copula (cop): ⲡⲉ.
- This corpus uses 26 lemmas as auxiliaries (aux). Examples: ⲁ, ⲛⲁ, ⲛⲧⲉ, ⲛⲧⲉⲣⲉ, ⲛⲉⲣⲉ, ⲙⲡⲉ, ϣⲁⲣⲉ, ⲙⲁⲣⲉ, ⲛⲛⲉ, ϣ, ⲉⲣϣⲁⲛ, ⲙⲉⲣⲉ, ϣⲁⲛⲧⲉ, ⲙⲛ, ⲙⲡⲁⲧⲉ, ⲟⲩⲛ, ⲉⲣⲉ, ⲙⲡⲣⲧⲣⲉ, ⲧⲁⲣ, ⲧⲁⲣⲉ, ϣⲁ, ϫⲡⲓ, ⲉϣ, ⲙⲉ, ⲛⲉ, ⲛⲉϣ.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (106)
- VERB--NOUN-ADP(ⲛ) (2)
- VERB--NOUN-ADP(ⲛ)-ADP(ⲛ) (1)
- VERB--PRON (32)
- VERB-Fin--NOUN (551)
- VERB-Fin--NOUN-ADP(ϩⲓⲣⲛ) (1)
- VERB-Fin--NOUN-ADP(ⲛ) (7)
- VERB-Fin--PRON (4237)
- obj
- VERB--NOUN (39)
- VERB--NOUN-ADP(ⲛ) (10)
- VERB--PRON (174)
- VERB-Fin--NOUN (389)
- VERB-Fin--NOUN-ADP(ⲙ) (3)
- VERB-Fin--NOUN-ADP(ⲛ) (477)
- VERB-Fin--NOUN-ADP(ⲛ)-ADP(ⲛ) (12)
- VERB-Fin--PRON (868)
- VERB-Fin--PRON-ADP(ⲉ) (1)
- VERB-Fin--PRON-ADP(ⲛ) (376)
- VERB-Fin--PRON-ADP(ⲧⲟⲟⲧ) (1)
- VERB-Inf--NOUN (25)
- VERB-Inf--NOUN-ADP(ⲛ) (19)
- VERB-Inf--NOUN-ADP(ⲛ)-ADP(ⲛ) (1)
- VERB-Inf--PRON (55)
- VERB-Inf--PRON-ADP(ⲛ) (4)
- iobj
- VERB--NOUN (5)
- VERB--PRON (63)
- VERB--PRON-ADP(ⲛ) (2)
- VERB-Fin--NOUN (1)
- VERB-Fin--PRON (1)
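The pattern labels above can be read as follows: the parent is a VERB, optionally suffixed with its VerbForm value, and the child is a NOUN or PRON, optionally followed by the lemma of each adposition attached to it as `case`. This reading is an assumption based on the labels themselves, and the Python sketch below, with its hypothetical `Word` type, `argument_patterns` function and toy sentence, only illustrates how such counts might be produced.

```python
from collections import Counter
from typing import NamedTuple


class Word(NamedTuple):
    id: int
    lemma: str
    upos: str
    feats: str
    head: int
    deprel: str


def verb_form(feats: str):
    """Return the VerbForm value from a FEATS string, or None if absent."""
    for item in feats.split("|"):
        if item.startswith("VerbForm="):
            return item.split("=", 1)[1]
    return None


def argument_patterns(sentence, relations=("nsubj", "obj", "iobj")):
    """Count (relation, 'PARENT--CHILD') patterns for verbal arguments."""
    by_id = {w.id: w for w in sentence}
    patterns = Counter()
    for child in sentence:
        if child.deprel not in relations or child.upos not in ("NOUN", "PRON"):
            continue
        parent = by_id.get(child.head)
        if parent is None or parent.upos != "VERB":
            continue
        form = verb_form(parent.feats)
        parent_label = "VERB" + (f"-{form}" if form else "")
        # Adpositions attached to the child as case markers, in sentence order.
        case_markers = "".join(
            f"-ADP({w.lemma})" for w in sentence
            if w.head == child.id and w.deprel == "case" and w.upos == "ADP"
        )
        patterns[(child.deprel, f"{parent_label}--{child.upos}{case_markers}")] += 1
    return patterns


# Toy sentence (hypothetical annotation, not taken from the treebank).
TOY = [
    Word(1, "ⲁ", "AUX", "_", 3, "aux"),
    Word(2, "ϥ", "PRON", "PronType=Prs", 3, "nsubj"),
    Word(3, "ϫⲓ", "VERB", "VerbForm=Fin", 0, "root"),
    Word(4, "ⲛ", "ADP", "_", 6, "case"),
    Word(5, "ⲟⲩ", "DET", "PronType=Art", 6, "det"),
    Word(6, "ⲣⲱⲙⲉ", "NOUN", "_", 3, "obj"),
]
patterns = argument_patterns(TOY)
for (relation, label), n in sorted(patterns.items()):
    print(f"{relation}: {label} ({n})")
```

Summing the resulting counters over all sentences would give per-relation tallies in the same shape as the lists above.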