UD Egyptian UJaen
Language: Egyptian (code: egy)
Family: Afro-Asiatic
This treebank has been part of Universal Dependencies since the UD v2.14 release.
The following people have contributed to making this treebank part of UD: Roberto Antonio Díaz Hernández, Bruno Guillaume, Daniel Zeman.
Repository: UD_Egyptian-UJaen
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY-SA 4.0
Genre: bible, fiction, nonfiction, government
Questions, comments? General annotation questions (either Egyptian-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [radiaz (æt) ujaen • es]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually |
| UPOS | annotated manually, natively in UD style |
| XPOS | not available |
| Features | annotated manually, natively in UD style |
| Relations | annotated manually, natively in UD style |
Description
Egyptian-UJaen is the first dependency treebank created for the morphosyntactic annotation of pre-Coptic Egyptian. Its current state (UD v2.17) consists of 2,347 sentences and 24,375 tokens manually annotated from texts written in Old Egyptian, mainly from the Pyramid Texts.
The Egyptian-UJaen treebank (henceforth EUJA treebank) contains a corpus of Egyptian texts manually annotated at the University of Jaén following the Tübingen transcription system (see below). It aims to contribute to the Universal Dependencies (UD) project and to the PARSEME corpora of multiword expressions in order to compare Egyptian morphosyntactic features with those from other languages. The EUJA treebank started as UD release 2.14 with 5,515 words and 707 sentences. It contained Old Egyptian multiword expressions and sentences from the Pyramid Texts (see list of sources, below). The systematic annotation of the Pyramid Texts begins with EUJA-44. Unas’s Pyramid Texts were annotated in the EUJA treebank for the UD release 2.15, and Teti’s Pyramid Texts for the UD release 2.16. Annotation of Pepi I’s Pyramid Texts began for the UD release 2.17. Data exploration in these texts can be carried out using GrewPT
The treebank will contain texts from various historical stages: Old Egyptian, Middle Egyptian, Late Egyptian and Demotic. For an overall description of these linguistic stages, see the Language Page for Egyptian; and the bibliography below.
Acknowledgments
This work received support from the CA21167 COST action UniDive, funded by COST (European Cooperation in Science and Technology). I thank Agata Savary (UniDive/PARSEME), Daniel Zeman (UniDive/UD) and Marco Carlo Passarotti (CIRCSE) for introducing me to computational linguistics.
Statistics of UD Egyptian UJaen
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
AdvType – Aspect – Case – ExtPos – Foreign – Gender – Mood – Nominal – Number – NumType – PartType – Person – Polarity – Poss – Prefix – PronType – Reflex – SubForm – Tense – Typo – VerbClass – VerbForm – VerbType – Voice
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – case – cc – ccomp – compound – conj – cop – csubj – csubj:outer – csubj:pass – dep – det – discourse – dislocated – expl – expl:pv – fixed – flat – flat:foreign – list – mark – nmod – nmod:poss – nsubj – nsubj:outer – nsubj:pass – nummod – obj – obl – obl:agent – obl:arg – orphan – parataxis – punct – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 2347 sentences, 23565 tokens and 24375 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 2095 types of words that contain both letters and punctuation. Examples: ⸗k, ⸗f, Ḥr.w, č̣(ṭ), n(.ꞽ), wśr(.w), ⸗śn, ⸗ś, ꞽr.t, ⸗č, nčr(.w), p.t, ꞽm(.ꞽ), ⸗čn, ꞽt(ꞽ), ḥnw.t, ś(ꞽ), ꜣḫ.t, rč̣.n, č̣.t, Nw.t, wr.t, (Ꞽ)tm(.w), (ꞽ)m, mw.t, ś.t, ꜣḫ(.w), ḥw.t, n.t, ḫnt(.ꞽ), ꞽm.(ꞽ)w, śk(.w), fꜣ.t, Č̣ḥw.tꞽ, ꞽw.n, ꞽm.t, nčr(.ꞽ), ꞽꞽ.n, <⸗k>, ꜥw(.wꞽ), pśč̣.t, ꜥꜣ(.wꞽ), [⸗k], śḫ.t, ḥr(.ꞽ), ḥꜣ.t, Ꜣś.t, ꞽꜣr.w, [⸗f], kꜣ(.w)
- This corpus contains 709 multi-word tokens. On average, one multi-word token consists of 2.14 syntactic words.
- There are 289 types of multi-word tokens. Examples: m-ꜥw, m-m, m-ḥtp, Mr-n(.ꞽ)-ḫꜣ, Ḥr.w-ꜣḫ.tꞽ, m-ḫt, ꞽ:nč̣-(⸗ꞽ)-ḥr, ꞽ:ḫm-śk(.w), m-bꜣḥ, p(w)-nn, ꞽ:(n)č̣-(⸗ꞽ)-ḥr, m-ꜥb, ẖnw-ꜥw(.wꞽ), ꞽ:ḫm(.w)-śk(.w), m-ẖnw, Nb.t-ḥw.t, n-ꞽw.t(ꞽ), Ḫnt(.ꞽ)-ꞽmn.t(ꞽ)w, ꞽ:ḫm.w-śk(.w), Rꜥw-(Ꞽ)tm(.w), ḥr.t-ꞽb, ḥtp-č̣i̯-nsw, Nb(.t)-ḥw.t, m-ḫnt, m-ṭp, nčr-ꜥꜣ, Śḫ.(w)t-ꞽꜣr.w, ḥtp-č̣i̯, ṭp-ꜥw(.wꞽ), Wr.t-ḥkꜣ(.w), Wꜣč̣-wr, m-ḫśf(.w), n-n.tt, sꜣ-tꜣ, č̣ꜣ-t(ꞽ), ḥr(.ꞽ)-ṭp, ḥr-ṭp, ḥw.t-ꜥnḫ, ṭp-ꜥw.w(ꞽ), ꞽm.(ꞽ)w-ḫt, (ꞽ)m(.ꞽw)-ḫt, Nḥb(.w)-kꜣ(.w), Sḫn-wr, Tꜣ-wr, Wr-ꜥw, pw-nn, pśč̣.t-ꜥꜣ.t, r-gś, wꜣč̣-ꜥn, č̣śr-ṭp.
Morphology
Tags
- This corpus uses 16 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: SYM
- This corpus contains 26 word types tagged as particles (PART): [nꞽ], [ꞽn], [ꞽw], m, my, nn, ny, nꞽ, rr, tr, w, wn.t, wnn.t, śk, śwt, ḥ(w), ḥm, ḥw, ꜣ, ꞽ(w), ꞽgr, ꞽn, ꞽr, ꞽw, ꞽś, ꞽḫ
- This corpus contains 35 lemmas tagged as pronouns (PRON): f, k, kw, n, n.tt, n.tꞽ, ntk, ntśn, nꞽ, sy, wꞽ, č, čm, čmt, čn, čnꞽ, čw, čwt, ś, ś(ꞽ), śn, śnꞽ, śtt, św, śwt, śꞽ, ⸗f, ⸗k, ⸗ś, ⸗śn, ⸗ꞽ, ꜥ, ꞽ, ꞽnk, ꞽw.tꞽ
- This corpus contains 16 lemmas tagged as determiners (DET):
f, [p]w, nw, pf, pn, pw, pꞽ, sy, tf, tn, tw, ꞽpf, ꞽpn, ꞽptw, ꞽpw, ꞽtn</li> </ul>
- Out of the above, 1 lemmas occurred sometimes as PRON and sometimes as DET: sy
- This corpus contains 4 lemmas tagged as auxiliaries (AUX): tm, wnn, ꞽmi̯, ꞽw
- Out of the above, 4 lemmas occurred sometimes as AUX and sometimes as VERB: tm, wnn, ꞽmi̯, ꞽw
- There are 3 (de)verbal forms:
- Fin
- AUX: (ꞽ)m, ꞽm, tm, wn.t, ꞽ:tm, [ꞽm], tm.tn, tm.ḫr, wnn
- PROPN: Ꞽ:pꜣ
- VERB: m, rč̣.n, ꞽꞽ, č̣i̯, ꜥḥꜥ, ꜥnḫ, pr, ꞽr, č̣ꜣ, ꞽw.n
- Inf
- NOUN: č̣ꜣu̯.t, mśw.t, č̣ꜣ.t, hꜣb.t, m(w)t.t, mnḫ.t, mwt.t, pr(.t), pẖr, č̣(ṭ)
- VERB: č̣(ṭ), śk(.w), fꜣ.t, ḫm(.w), [č̣(ṭ)], ḥr(.w), ꞽr(.w), ꞽw(.w), pr.t, ꞽr(i̯).t
- Part
- ADJ: ꞽ:ḫm, ꜥꜣ, wč̣ꜣ, wꜥb.t, čn
- AUX: ꞽ:tm.w
- NOUN: ꞽ:ḫm(.w), ꞽ:ḫm.w, ś:(w)ꜣ.tꞽ, ꞽr.tꞽ, mry.tꞽ, mḥ, ꞽ:ḫm, nꜥꜥ, pgꜣ, pr.tꞽ
- PROPN: Pgꜣ, Wbꜣ, Wꜣč̣, Čhn
- VERB: pr, mr.y, č̣śr, wr, ꜥnḫ, ꞽr.w, wp, nfr.t, ḫbč̣, ꜥꜣ
Nominal Features
- Com
- PRON: ⸗śn, (⸗ꞽ), ⸗čn, śn, ⸗čn(ꞽ), ⸗ꞽ, ꞽnk, čn, ⸗śn(ꞽ), [(⸗ꞽ)]
- VERB-Fin: bꜣ.tꞽ, č̣śr.t(ꞽ), śḫm.tꞽ, ḥr.t(ꞽ), wr.t(ꞽ), ꜥnḫ.t(ꞽ), pr.w, rnpw.t(ꞽ), sꞽ.t(ꞽ), tm.tꞽ
- Fem
- ADJ: n.t, ꞽm.t, wr.t, nb(.t), ḥr.t, ṭp.t, ꜥꜣ.t, nb(.wt), nb.t, bnꞽ.t
- ADJ-Part: wꜥb.t
- AUX-Fin: wn.t, tm.tn
- DET: tn, tw, ꞽptw,
f, tf, ꞽtn</li> - NOUN: ꞽr.t, p.t, ḥnw.t, ꜣḫ.t, č̣.t, mw.t, ś.t, ḥw.t, pśč̣.t, ẖ.t
- NOUN-Part: pr.tꞽ, t(w)[t].tꞽ, t(w)t.tꞽ
- NUM: fṭ.t, śfḫ.t{t}, 7.t, fṭ.(w)t, śn.t, šn.(w)t, ḫmn.t, ḫmt.t, ṭꞽ.t, ꞽfṭ.t
- PRON: ⸗ś, ⸗č, ś(ꞽ), ⸗k, čm, [⸗č], čn, śtt, <⸗ś>, (⸗ś)
- PROPN: Nw.t, Ꜣś.t, N(ꞽ).t, Wr.t, Ṭ(w)ꜣ.t, Nb.t, Nb.t-ḥw.t, Śpṭ.t, Mꜣfṭ.t, Nb(.t)
- VERB-Fin: wč̣ꜣ.t(ꞽ), wnm.t, ꜥnḫ.t, ꞽr.tn, ꞽ:rḫ.t(ꞽ), wnm.tn, ꞽtḥ.tn, prr.t, swr.t, ḫnf.tn
- VERB-Part: nfr.t, pr.t, mś.t, šsp.t, ꞽ:ḫm.t, bnꞽ.t, nkn.t, nḥm.t, rm.t, sn.(w)t
</ul> </li> </ul>- Masc
- ADJ: n(.ꞽ), ꞽm(.ꞽ), wr, nb, ꞽm.(ꞽ)w, ḫnt(.ꞽ), ḥr(.ꞽ), ꜥꜣ, ꞽ:ḫm, ẖr(.ꞽ)
- ADJ-Part: ꞽ:ḫm, ꜥꜣ, wč̣ꜣ, čn
- AUX-Part: ꞽ:tm.w
- DET: pn, pw, pꞽ, pf, ꞽpw, p(w), ꞽpf, ꞽpn, [pn], p(ꞽ)
- NOUN: mṭw, nčr(.w), rn, nčr, ꜥw, tꜣ, ꞽb, ḥr, sp, kꜣ
- NOUN-Inf: č̣ꜣu̯.t, mśw.t, č̣ꜣ.t, hꜣb.t, m(w)t.t, mnḫ.t, mwt.t, pr(.t), pẖr, č̣(ṭ)
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, ś:(w)ꜣ.tꞽ, ꞽr.tꞽ, mry.tꞽ, mḥ, ꞽ:ḫm, nꜥꜥ, pgꜣ, psḥ
- NUM: ḫꜣ, fṭ.w, wꜥ, ḫꜣ(.w), 4, fṭ(.w), ḫtm.nw, 3, 6, 7
- PRON: ⸗k, ⸗f, čw, św, kw, čwt, <⸗k>, [⸗k], [⸗f], ⸗f(ꞽ)
- PROPN: Wnꞽś, Ttꞽ, Ḥr.w, Wśr(.w), Rꜥw, Ppy, Śtẖ, Gbb, (Ꞽ)tm(.w), Č̣ḥw.tꞽ
- PROPN-Part: Pgꜣ, Wbꜣ, Wꜣč̣, Čhn
- VERB: č̣(ṭ), pr, fꜣ.t, [č̣(ṭ)], mr.y, č̣śr, č̣ṭ(.w), ꜥnḫ, ꞽ:rḫ(.w), ꞽr.w
- VERB-Fin: č̣ṭ(.w), ꞽ:rḫ(.w), ꞽr.n, pr(.w), rḫ(.w), ꜣḫ(.w), ḫꜥ(.w), ꞽꞽ.y, mś.n, č̣ṭ.n
- VERB-Inf: č̣(ṭ), fꜣ.t, [č̣(ṭ)], pr.t, ꞽr(i̯).t, ꞽw.t, nwr, rḫś, ḳṭ, [wč̣b]
- VERB-Part: pr, mr.y, č̣śr, wr, ꜥnḫ, ꞽr.w, wp, ḫbč̣, ꜥꜣ, ꞽꞽ
- Coll
- NOUN: pśč̣.t, ḥnmm.t, pꜥ.t, rmč, rḫ.(w)t, rnp.(w)t, ꜣw.t, ꞽs.t, ꞽt, [pśč̣.t]
- Dual
- ADJ: km.tꞽ, wr.t(ꞽ), wr.w(ꞽ), ḫnt.(ꞽ)w(ꞽ), ṭp.tꞽ, ṭšr(.tꞽ), ꜥꜣ.w(ꞽ)
- DET: ꞽpw, ꞽpf
- NOUN: ꜥw(.wꞽ), ꜥꜣ(.wꞽ), ꜥw.w(ꞽ), ꞽr.t(ꞽ), ꜣḫ.tꞽ, rṭ(.wꞽ), sḫn.w(ꞽ), pśč̣.t(ꞽ), tꜣ(.wꞽ), sꜣ.t(ꞽ)
- NOUN-Part: pr.tꞽ, t(w)[t].tꞽ, t(w)t.tꞽ
- PRON: ⸗čn(ꞽ), śn, ⸗śn(ꞽ), śn(ꞽ), čn(ꞽ), črꞽ, ⸗[č]n(ꞽ), ⸗nꞽ
- PROPN: Mꜣꜥ.tꞽ, Rw.tꞽ
- VERB-Fin: pśš.t(ꞽ), sꜣ(u̯), tm.tꞽ, ḥr.t(ꞽ), ḥꜣ.ty, ꞽ:bẖm.wy, ꞽ:ḫm.w(ꞽ)
- VERB-Part: šnm.tꞽ, mśi̯.tꞽ, mẖnm.tꞽ, wtč.tꞽ, wꜣč̣.wꞽ, ś:mn.t(ꞽ), ḳmꜣ.tꞽ, ẖnn.t(ꞽ), ꞽr.tꞽ, ꞽr.w
- Plur
- ADJ: ꞽm.(ꞽ)w, n.(ꞽ)w, nb.w, nb(.wt), mḥ.t(ꞽ)w, nb(.w), ḫnt.(ꞽ)w, ꞽmn.t(ꞽ)w, ꞽꜣb.t(ꞽ)w, rś.(ꞽ)w
- AUX-Part: ꞽ:tm.w
- DET: ꞽpw, ꞽpn, ꞽptw, ꞽpf, nw, pw
- NOUN: nčr(.w), mw, ꜣḫ(.w), kꜣ(.w), ꞽꜣr.w, ḳś(.w), ꞽm.(ꞽ)w, ꞽꜣ.(w)t, ꞽmn.t(ꞽ)w, ś.(w)t
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, ꞽ:ḫm, ꜥnḫ.w, ꞽ:rḫ.w, ꞽ:ḫmꜥ, ꞽ:ḳṭ.w
- NUM: ḫꜣ(.w), fṭ.w, fṭ(.w), fṭ.(w)t, šn.(w)t
- PRON: ⸗śn, ⸗čn, čn, śn, ntśn, ⸗n, [čn], [⸗čn], ⸗č(n), ⸗ś
- PROPN: Śḫ.(w)t, Ḥ<ḥ>.y, Ꜥf.t(ꞽ)w</li>
- VERB-Fin: m.y, gm.wn, hn.y, nhs.ꞽ, pr.w, pẖr, ḳb(.w), ꞽ:s.y, ꞽ:śšn.w, (w)ṭ.wn
- VERB-Part: ꜥnḫ.w, wr.w, štꜣ.w, bnꞽ.t, mꜣꜣ.w, prr.w, rnp.w, sn.(w)t, tḫtḫ, wr(.w) </ul> </li> </ul>
- Sing
- ADJ: n(.ꞽ), ꞽm(.ꞽ), wr, nb, n.t, wr.t, ꞽm.t, ḫnt(.ꞽ), nb(.t), ḥr(.ꞽ)
- ADJ-Part: ꞽ:ḫm, ꜥꜣ, wč̣ꜣ, wꜥb.t, čn
- AUX-Fin: (ꞽ)m, wn.t, tm.tn
- DET: pn, pw, pꞽ, pf, tn, tw, p(w), [pn], p(ꞽ),
f</li>
- NOUN: mṭw, ꞽr.t, p.t, rn, nčr, ꜥw, tꜣ, ꞽb, ḥr, sp
- NOUN-Part: ś:(w)ꜣ.tꞽ, ꞽr.tꞽ, mry.tꞽ, mḥ, nꜥꜥ, pgꜣ, psḥ, wꜣ, ꞽ:ꜥm
- NUM: ḫꜣ, 4, 3, 6, 7, 8, fṭ.nw, ḫmt.nw, ṭp(.ꞽ)
- PRON: ⸗k, ⸗f, čw, ⸗ś, ⸗č, św, (⸗ꞽ), ś(ꞽ), kw, čwt
- PROPN: Wr, Mr, Wr.t, Ḥw.t, sꜣ, Sḫn, Wp, Śmꜣ, Ḥr.w, Ṭp(.ꞽ)
- PROPN-Part: Pgꜣ, Wbꜣ, Wꜣč̣, Čhn
- VERB: m, pr, ꜥḥꜥ, ḥtm, ꞽn, ḫw, šsp, čs, mr.y, sꜣu̯
- VERB-Fin: m, ḥtm, ꜥḥꜥ, ḫw, ꞽn, šsp, sꜣu̯, čs, sbn, č̣ṭ(.w)
- VERB-Part: pr, mr.y, wr, č̣śr, ꜥnḫ, wp, nfr.t, ḫbč̣, ꜥꜣ, ꞽr.w </ul> </li> </ul>
- Abl
- ADP: m, ꞽm, (ꞽ)m, ḥr
- Acc
- ADP: n, ꞽm
- All
- ADP: r, ꞽr, n, (ꞽ)r
- Ben
- ADP: n, [n], [
], n{t}, ꞽr</li> - NOUN: ꞽr.t
</ul> </li> </ul>- Cau
- ADP: n, ḥr, ḫr, m, [n]
- Cmp
- ADP: ꞽr, r
- Com
- ADP: ḫr, ḥnꜥ, m, ꞽm.(w)t(ꞽ), ꞽm.wt(ꞽ), [ḫr], ꞽm.wtꞽ
- Dis
- ADJ: ꞽm.t
- ADP: m
- Ela
- ADP: m, ꞽm, ꞽr, [ꞽm], ḫr, [m]
- Equ
- ADJ: ꞽm(.ꞽ)
- ADP: m, ꞽś, mr, r,
, [m], ꞽm</li> - NOUN: ꞽm(.ꞽ)
</ul> </li> </ul>- Ess
- ADP: m
- Gen
- ADJ: n(.ꞽ), n.t, n.(ꞽ)w, [n(.ꞽ)], n.ꞽ, [n.t], n(.t)
- NOUN: n(.ꞽ), nčr, (ꞽ)m(.ꞽw), nw.ꞽ
- PROPN: P(ꞽ), Šmꜥ(.w), Wśr(.w), Śtẖ
- Ill
- ADP: ꞽr, r
- Ins
- ADP: m, ꞽm, ḥr, [m],
, [ꞽm], ˹m˺, ẖr</li> </ul> </li> </ul> - Lat
- ADP: ꞽr, n, r
- Loc
- ADJ: ꞽm(.ꞽ), ꞽm.t, ꞽm.(ꞽ)w, ḫnt(.ꞽ), ḫnt.(ꞽ)w, ṭp.t, (ꞽ)m.t, [ꞽm].t, ḥr(.ꞽ), ꞽm(.ꞽw)
- ADP: m, ꞽm, ḥr, [m], č̣r, ḫnt,
, r, ḫft, ꞽr</li> - NOUN: ꞽm(.ꞽ), ꞽm.(ꞽ)w, ꞽm.t, ḥr.t, ḥꜣ.(ꞽ)w, ḫnt(.ꞽ), ḫnt.ꞽ, ṭp(.ꞽw), (ꞽ)m(.ꞽ), Ꞽm(.ꞽ)
</ul> </li> </ul>- Sub
- ADJ: ẖr(.ꞽ), ẖr.ꞽ, [ẖr](.ꞽ), ẖr.t
- ADP: ẖr
- NOUN: ẖr(.ꞽ), ẖr.t, ṭp.t
- Sup
- ADJ: ṭp.(ꞽ)w
- ADP: ḥr, ṭp, [ḥr]
- NOUN: ḥr(.ꞽ), ṭp.(ꞽ)w
- Tem
- ADP: m
Degree and Polarity
- Neg
- ADJ: ꞽ:ḫm
- ADJ-Part: ꞽ:ḫm
- AUX-Fin: (ꞽ)m, ꞽm, tm, ꞽ:tm, [ꞽm], tm.tn
- AUX-Part: ꞽ:tm.w
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, ꞽ:ḫm, ꞽ:ḫmꜥ
- PRON: ꞽw.t(ꞽ), ꞽw.tꞽ
- VERB-Fin: ꞽm, ḫm.n, ꞽ:ḫm.w(ꞽ)
- VERB-Inf: ḫm(.w), rč̣
- VERB-Part: ꞽ:ḫm, ꞽ:ḫm.t
Verbal Features
- Hab
- AUX: ꞽw, ꞽ(w)
- Perf
- AUX: ꞽw, ꞽ(w)
- Cnd
- AUX-Fin: ꞽ:tm, tm.ḫr
- VERB-Fin: rč̣.kꜣ, sḫ.kꜣ, wṭ.kꜣ, ꞽr.kꜣ, ꞽw.kꜣ
- Imp
- AUX-Fin: (ꞽ)m
- VERB-Fin: m, ḥtm, ꜥḥꜥ, ḫw, ꞽn, šsp, sꜣu̯, čs, sbn, ꞽ:ḫr
- Ind
- AUX-Fin: wn.t, tm, tm.tn, wnn, ꞽm
- PROPN-Fin: Ꞽ:pꜣ
- VERB-Fin: ꞽꞽ, rč̣.n, ꞽw.n, ꞽꞽ.n, ꜥnḫ, pr, mꜣ.n, sꞽ, ꞽw, (w)ṭ.n
- Pot
- VERB-Fin: rč̣.n, śḫm.n, bꞽꜣ.n, swr.n, wnm.n, wp.n, šw.n, ẖn.n, ꞽw.n, ꞽč.n
- Sub
- AUX-Fin: ꞽm, tm, (ꞽ)m, [ꞽm]
- VERB-Fin: č̣i̯, č̣ꜣ, ꜥḥꜥ, ꞽ:nč̣, ꞽr, mꜣ, pr, ḥtp, ꜥnḫ, ꞽ:(n)č̣
- Fut
- AUX-Fin: wnn, ꞽm
- NOUN-Part: ś:(w)ꜣ.tꞽ, ꞽr.tꞽ, mry.tꞽ
- VERB-Fin: pr, ꜥnḫ, m(w)t, ḫśf, śk, č̣ꜣ, gꜣ.w, mr, nhp, pẖr
- VERB-Part: sp.t(ꞽ), ḫm.wt(ꞽ), mṭw.t(ꞽ), mꜣꜣ.t(ꞽ), mꜣꜣ.tꞽ, nḥb.t(ꞽ), pśč̣.wt(ꞽ), wṭ.t(ꞽ), śč̣m.t(ꞽ), ḥm.wt(ꞽ)
- Past
- ADJ-Part: čn
- AUX-Fin: wn.t, tm.tn
- NOUN-Part: pr.tꞽ
- VERB-Fin: ꞽꞽ, rč̣.n, ꞽw.n, ꞽꞽ.n, mꜣ.n, (w)ṭ.n, wn(.w), (w)ṭ(.w), mś.n, ꞽn<.n>
- VERB-Part: pr, mr.y, ꞽr.w, wp, ꞽꞽ, mś, pr.t, mś.t, pr.ꞽ, rč̣
- Pres
- ADJ-Part: ꞽ:ḫm, ꜥꜣ, wč̣ꜣ, wꜥb.t
- AUX-Fin: ꞽ:tm, tm, tm.ḫr
- AUX-Part: ꞽ:tm.w
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, mḥ, ꞽ:ḫm, nꜥꜥ, pgꜣ, t(w)[t].tꞽ, t(w)t.tꞽ, wꜣ, ꜥnḫ.w
- PROPN-Fin: Ꞽ:pꜣ
- PROPN-Part: Pgꜣ, Wbꜣ, Wꜣč̣, Čhn
- VERB-Fin: ꞽw, ꜥnḫ, ḫr, mꜣꜣ, č̣ṭ, wꜥb, č̣č̣, č̣ṭ(.w), ḥtp, ꜥḥꜥ
- VERB-Part: č̣śr, wr, ꜥnḫ, nfr.t, ḫbč̣, ꜥꜣ, wꜣš, ḥtp, ꜥnḫ.w, mꜣꜥ
- Act
- ADJ-Part: ꞽ:ḫm, ꜥꜣ, wč̣ꜣ, wꜥb.t, čn
- AUX-Fin: ꞽ:tm, tm, tm.ḫr, wnn
- AUX-Part: ꞽ:tm.w
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, ś:(w)ꜣ.tꞽ, ꞽr.tꞽ, mḥ, ꞽ:ḫm, nꜥꜥ, pgꜣ, pr.tꞽ, psḥ
- PROPN-Fin: Ꞽ:pꜣ
- PROPN-Part: Pgꜣ, Wbꜣ, Wꜣč̣, Čhn
- VERB-Fin: ꞽꞽ, rč̣.n, ꞽw.n, ꞽꞽ.n, ꜥnḫ, pr, mꜣ.n, sꞽ, ꞽw, (w)ṭ.n
- VERB-Part: pr, č̣śr, wr, ꞽr.w, ꜥnḫ, nfr.t, ḫbč̣, ꜥꜣ, pr.t, wp
- Pass
- NOUN-Part: mry.tꞽ
- VERB-Fin: wn(.w), (w)ṭ(.w), mś(.w), šn(.w), ꞽwr(.w), rč̣(.w), ḥtm(.w), ꞽ:sn(.w), ḥ(w.w), č(ꜣ)s(.w)
- VERB-Part: mr.y, wč̣, ꞽꞽ, hp.t, nkn.t, nč̣č̣, nḥm.t, ꞽr.yt, ꞽrr.w, ꞽšš.w
Pronouns, Determiners, Quantifiers
- Dem
- DET: pn, pw, pꞽ, pf, tn, ꞽpw, tw, p(w), ꞽpf, ꞽpn
- NOUN: nw, nn, pn, nw.ꞽ, [nn], [nw], pf
- Emp
- PRON: ⸗k, ⸗f, ⸗čn, ⸗ś
- Int
- ADV: čn(ꞽ), čn, čnꞽ
- DET: sy
- PRON: sy
- Prs
- PRON: ⸗k, ⸗f, ⸗śn, čw, ⸗ś, ⸗č, św, (⸗ꞽ), ⸗čn, ś(ꞽ)
- Rel
- PRON: ꞽw.t(ꞽ), ꞽw.tꞽ, n.t(ꞽ), n.tt, n.tꞽ
- Card
- NUM: 4, 2, 1, ḫꜣ, 5, fṭ.w, wꜥ, ḫꜣ(.w), [2], 3
- Ord
- NUM: ḫtm.nw, fṭ.nw, ḫmt.nw, ṭp(.ꞽ)
- Yes
- ADJ: n(.ꞽ), n.ꞽ, ꞽr.t
- ADP: ẖr
- NOUN: n(.ꞽ)
- PRON: ⸗k, ⸗f, ⸗śn, ⸗č, ⸗ś, (⸗ꞽ), [⸗k], ⸗čn, <⸗k>, <⸗f>
- Yes
- PRON: čw, św, kw, k(w), čn, ś(ꞽ), č(w), śn(ꞽ)
- 1
- PRON: (⸗ꞽ), ⸗ꞽ, ꞽnk, [(⸗ꞽ)], w(ꞽ), ⸗n, <⸗ꞽ>, wꞽ, śn, ⸗nꞽ
- VERB-Fin: nꞽś.k(w), ꞽ:rḫ.k(w)
- 2
- PRON: ⸗k, čw, ⸗č, ⸗čn, kw, čwt, <⸗k>, [⸗k], ⸗f, ⸗čn(ꞽ)
- VERB-Fin: bꜣ.tꞽ, č̣śr.t(ꞽ), śḫm.tꞽ, ḥr.t(ꞽ), wr.t(ꞽ), ꜥnḫ.t(ꞽ), rnpw.t(ꞽ), sꞽ.t(ꞽ), tm.tꞽ, wꜣš.tꞽ
- 3
- PRON: ⸗f, ⸗śn, ⸗ś, św, ś(ꞽ), [⸗f], śn, ⸗f(ꞽ), <⸗f>, śwt
- VERB-Fin: ꞽ:rḫ(.w), wč̣ꜣ.t(ꞽ), č̣ṭ(.w), pr(.w), rḫ(.w), ꜣḫ(.w), ḫꜥ(.w), ꞽ:rḫ.t(ꞽ), ꞽꞽ.y, kk.ꞽ
Other Features
- AdvType
- Deg
- ADV: wr
- Loc
- ADV: ꞽm, nn, [ꞽ]m, ꜥꜣ
- Man
- ADV: č̣w, [ꞽm], ꞽm
- NOUN: mwmw
- Mod
- ADV: ꞽm
- Tim
- ADV: mrn, ꜥn
- NOUN: č̣.t, rꜥw, hrw, grḥ, č̣.t{n}, ꜣ.t
- Deg
- ExtPos
- ADJ
- ADJ: ꞽm.(ꞽ)w
- ADP: r
- ADP
- ADP: m, ṭp, ḥr, r, ꞽr, [m], [n]
- ADV
- ADP: m, r, [ꞽr], n
- NOUN: č̣.t, čꜣs
- INTJ
- ADP: (ꞽ)r, ꞽr
- PART: m
- PRON
- DET: p(w), pw, tw
- NOUN: č̣ś, č̣.t, [č̣.t]
- SCONJ
- ADP: n, m
- ADJ
- Foreign
- Yes
- VERB-Fin: ꜣꜣꜣ
- X: ꜣꜣꜣ, hꞽ, kbb, phtꞽ, pčtꞽ, ꞽmḥw, bś, bꞽ, bꞽtꞽ, hnw
- Yes
- Nominal
- Yes
- ADJ: ṭšr(.tꞽ)
- AUX-Fin: wn.t
- VERB-Fin: ꜥnḫ.t, wnm.t, ꞽr.tn, mś.n, swr.t, gm.wn, gm.y, nnꞽ, wp.tn, wśr.t
- VERB-Part: mr.y, wr, č̣śr, ḫbč̣, ꜥꜣ, ꞽr.w, ꞽꞽ, nfr.t, sp.t(ꞽ), mꜣꜥ
- Yes
- PartType
- Emp
- PART: ꞽn, ꞽś, ḥm, wnn.t, m
- Int
- PART: ꞽn, [ꞽn]
- Mod
- PART: ꜣ, my, wn.t
- Neg
- PART: nꞽ, [nꞽ], w, nn, ny
- Emp
- Prefix
- Yodh
- ADJ: ꞽ:ḫm
- ADJ-Part: ꞽ:ḫm
- AUX-Fin: ꞽ:tm
- AUX-Part: ꞽ:tm.w
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, ꞽ:ḫm, ꞽ:rḫ.w, ꞽ:ḫmꜥ, ꞽ:ḳṭ.w, ꞽ:ꜥm
- PROPN-Fin: Ꞽ:pꜣ
- VERB-Fin: ꞽ:nč̣, ꞽ:wn, ꞽ:(n)č̣, ꞽ:ḫr, ꞽ:ṭr, ꞽ:šm, ꞽ:č̣ṭ, ꞽ:nn, ꞽ:sn(.w), ꞽ:fḫ
- VERB-Inf: ꞽ:nn.t
- VERB-Part: ꞽ:mḥ.y, ꞽ:b(ꞽ)ꜣ, ꞽ:nn.t, ꞽ:rḫ.w, ꞽ:ś:ḥč̣.t, ꞽ:śč̣, ꞽ:šm.w, ꞽ:ḫm, ꞽ:ṭr, ꞽ:ꜥm
- Yodh
- SubForm
- AbstRel
- AUX-Fin: ꞽ:tm, wnn
- PROPN-Fin: Ꞽ:pꜣ
- VERB-Fin: rč̣.n, ꞽw.n, ꞽꞽ.n, ꜥnḫ, ꞽw, (w)ṭ.n, (w)ṭ(.w), pr, wn(.w), mś(.w)
- Pred
- AUX-Fin: tm, ꞽm
- VERB-Fin: rč̣.n, ꞽ.n, mꜣ.n, č̣.n, ꞽn.n, ꞽč.n, pr, ꜥm.n, ꞽr, gꜣ.w
- RelForm
- AUX-Fin: wn.t, tm.tn
- VERB-Fin: ꞽr.n, wnm.t, ꜥnḫ.t, ꞽr.tn, mś.n, wnm.tn, č̣ṭ.n, ꞽtḥ.tn, prr.t, swr.t
- AbstRel
- Typo
- Yes
- X: [...], {nb}, {n}, {⸗ꞽ}, {r}, {k}, {t}, {tꜣ}, {č̣}, {ḥr}
- Yes
- VerbClass
- 2aeinf
- VERB-Fin: t(ꞽ), tꞽ.t
- 2aered
- AUX-Fin: wn.t, wnn
- VERB-Fin: mꜣ, mꜣ.n, mꜣꜣ, wn, ḥw(w), wnn, wr.t(ꞽ), pšš.n, tmm, wr
- VERB-Inf: mꜣ
- VERB-Part: wr, wr.w, mꜣꜣ.w, mꜣꜣ.ꞽ, wr(.w), śꜣꜣ.ꞽw, fḫḫ(.ꞽ), fḫḫ.ꞽ, mꜣꜣ, mꜣꜣ.t(ꞽ)
- 2lit
- ADJ: ꞽ:ḫm
- ADJ-Part: ꞽ:ḫm
- AUX-Fin: tm, ꞽ:tm, tm.tn, tm.ḫr
- AUX-Part: ꞽ:tm.w
- NOUN-Inf: č̣ꜣ.t
- NOUN-Part: ꞽ:ḫm(.w), ꞽ:ḫm.w, mḥ, ꞽ:ḫm, ꞽ:rḫ.w, ꞽ:ḳṭ.w, ꞽ:ꜥm
- VERB-Fin: wč̣, ꞽ:nč̣, ꞽp, sꞽ, wn(.w), ḫr, ꞽ:(n)č̣, ꞽ:wn, ꞽ:ḫr, rś
- VERB-Inf: č̣(ṭ), ḫm(.w), [č̣(ṭ)], ṭr, wn, č̣ṭ, śč̣, ḫm, ḳṭ, ꜥḳ
- VERB-Part: wč̣, ꞽ:ḫm.t, mr, nč̣č̣, rś, ḥč̣.ꞽt, ꞽ:mḥ.y, [rṭ], [wč̣], bṭ.t
- 3aeinf
- ADJ-Part: ꜥꜣ, čn
- NOUN-Inf: č̣ꜣu̯.t, mśw.t, č̣ni̯.t
- NOUN-Part: ꞽr.tꞽ, mry.tꞽ, nꜥꜥ, pr.tꞽ, wꜣ
- PROPN-Fin: Ꞽ:pꜣ
- VERB-Fin: pr, ꞽr, č̣ꜣ, (w)ṭ, ḫw, ꞽč, šm, (w)ṭ.n, wp, (w)ṭ(.w)
- VERB-Inf: śk(.w), fꜣ.t, ḥr(.w), ꞽr(.w), pr.t, gꜣ.w, rč̣, sš.w, ꜥš.w, ꞽr(i̯).t
- VERB-Part: pr, mr.y, ꞽr.w, wp, mś, pr.t, ꜥꜣ, mś.t, pr.ꞽ, ꞽr
- 3aered
- VERB-Fin: čḥnn, ꞽ:śšꜣ.w
- 3lit
- ADJ: wč̣ꜣ, wꜣč̣, wꜣč̣.t, wꜥb.t
- ADJ-Part: wč̣ꜣ, wꜥb.t
- NOUN: m(w)t.t, mwt.t, pgꜣ, psḥ, pẖr, wnb, ꜥmꜣ, ꞽ:ḫmꜥ
- NOUN-Inf: m(w)t.t, mwt.t, pẖr, ꜥmꜣ
- NOUN-Part: pgꜣ, psḥ, ꞽ:ḫmꜥ
- VERB-Fin: ꜥḥꜥ, ꜥnḫ, ḥtp, ḥtm, šsp, wꜥb, pẖr, wnm, śḫm, m(w)t
- VERB-Inf: nwr, rḫś, śč̣m, [wč̣b], m(w)t, pt(r), sꜣč, tkn.t, twr(.w), wčs
- VERB-Part: ꜥnḫ, nfr.t, č̣śr, ḫbč̣, wꜣš, śḫm, ḥtp, ꜥnḫ.w, mꜣꜥ, nfr
- 4aeinf
- VERB: ḥmś, nč̣r, mṭw, ḥmś.w, bꞽꜣ.n, nč̣r.n, nč̣r.w, rnp.w, sḫn.n, č̣św
- VERB-Fin: ḥmś, nč̣r, mṭw, bꞽꜣ.n, nč̣r.n, nč̣r.w, sḫn.n, č̣św, śšm, ḥmś.w
- VERB-Inf: mṭ(w).t, sḫn, ḥmś
- VERB-Part: rnp.w, ḥfṭ.w, [ꞽ:śšm], bꞽꜣ, mśč̣č̣, mśč̣č̣.w, mṭw(.w), mṭw.t(ꞽ), mṭw.w, ḥmś.w
- 4lit
- VERB-Fin: ꞽ:nn, wnwn, nmnm, (ꞽ)m(ꞽ)m, nnꞽ, śnčr, [śnśn], gbgb.n, mꜥḥꜣ, nn
- VERB-Inf: ꞽ:nn.t
- VERB-Part: tḫtḫ, [šbšb], śnśn, ꜣḥꜣḥ.ꞽ, ꞽ:nn.t
- 5aeinf
- VERB-Fin: nḫḫ, ḥꜥꜥ.t(ꞽ)
- VERB-Part: nḫḫ
- 5lit
- VERB-Fin: ḥbnbn, nbꜣbꜣ, nṭfṭf, nwꜣwꜣ, nč̣ṭnč̣ṭ, nšbšb.tn, nḥ
ḥr, nḫbḫb(.w), nḫrḫr, nṭbṭb</li> - VERB-Inf: ntktk, nwtwt.w
- VERB-Part: nhmhm(.w), nḫꜣḫꜣ.t
</ul> </li>- Anom
- VERB: č̣i̯, ꞽꞽ, rč̣.n, ꞽw.n, ꞽꞽ.n, ꞽn, ꞽw, ꞽn<.n>, ꞽn.n, č̣.n
- VERB-Fin: rč̣.n, ꞽꞽ, č̣i̯, ꞽw.n, ꞽꞽ.n, ꞽn, ꞽw, ꞽn<.n>, ꞽn.n, č̣.n
- VERB-Inf: ꞽw(.w), ꞽw.t, rč̣(.w), ꞽw, ꞽw(i̯).t, ꞽw.w
- VERB-Part: rč̣, ꞽn, ꞽw.w, ꞽꞽ, (r)č̣i̯, rč̣.yt, č̣(.y), č̣č̣.t, ꞽn(n).w, ꞽn.(y)t
- Caus2aered
- VERB-Fin: ś:fḫḫ(.w), ś:fḫḫ, ś:fḫḫ.w, ś:ḳbb, ś:ꜣḫ.w, šsp, ꞽ:ś:fkk.tn
- VERB-Inf: <ś:>fḫḫ(.w), ś:fḫḫ.w
- VERB-Part: ś:mꜣꜣ
- Caus2lit
- VERB-Fin: ś:č̣ꜣ, ś:mn, ś:ꜣḫ, ś:ꞽw, ś:sn(.w), ś:č̣ꜣ.n, ś:ḥč̣, ś:ꞽp, ś:bš, ś:fḫ
- VERB-Inf: ś:č̣ꜣ
- VERB-Part: ś:mn.t(ꞽ), ś:śn
- Caus3aeinf
- NOUN-Part: ś:(w)ꜣ.tꞽ
- VERB-Fin: ś:wꜣ, ś:ḫt, ś:pꜣ.n, ś:wꜣ.n, ś:ḥm.n, ś:ḳṭ, ś:ḳṭṭ.t, ś:wꜣꜣ, ś:č̣ꜣ.w, ś:šw
- VERB-Part: ś:wꜣ.w, ś:sꜣ.t, ś:ḳṭ
- Caus3lit
- VERB-Fin: ś:ꞽꜥ, ś:(w)ꜣč̣, ś:ḥtp, ś:nhṭ, ś:swn.tn, ś:wꜥb, ś:čꜣs.tꞽ, ś:štꜣ(.w), ś:ḥtm(.w), ś:ꜥnḫ.n
- VERB-Part: ś:škr.t, ś:ꜥnḫ, ś:ꜥḥꜥ.w, ś:ꞽꜥ, ꞽ:ś:ḥč̣.t
- Caus4aeinf
- VERB-Fin: ś:bꜣg.y
- VERB-Part: ś:bꜣg
- Def
</ul> </li> </ul>- AUX-Fin: (ꞽ)m, ꞽm, [ꞽm]
- VERB-Fin: m, ꞽ.n, ꞽm, m(ꞽ), m.y, ꞽ(.w), ꞽ.t(ꞽ)
- VerbType
- Aux
- AUX-Fin: (ꞽ)m, ꞽm, [ꞽm]
- Aux
Syntax
Auxiliary Verbs and Copula
- This corpus uses 2 lemmas as copulas (cop). Examples: pw, pꞽ.
- This corpus uses 4 lemmas as auxiliaries (aux). Examples: ꞽmi̯, ꞽw, tm, wnn.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).- nsubj
- VERB-Fin--NOUN (382)
- VERB-Fin--NOUN-Gen (1)
- VERB-Fin--NOUN-Loc (12)
- VERB-Fin--PRON (1300)
- VERB-Fin--PRON-ADP(m) (1)
- VERB-Inf--NOUN (6)
- VERB-Inf--PRON (35)
- VERB-Part--NOUN (12)
- VERB-Part--PRON (11)
- obj
- VERB-Fin--NOUN (759)
- VERB-Fin--NOUN-ADP(m) (1)
- VERB-Fin--NOUN-ADP(n) (1)
- VERB-Fin--NOUN-Ben (1)
- VERB-Fin--NOUN-Loc (3)
- VERB-Fin--NOUN-Sub (1)
- VERB-Fin--NOUN-Sup (2)
- VERB-Fin--PRON (423)
- VERB-Fin--PRON-ADP(m) (3)
- VERB-Fin--PRON-ADP(n) (6)
- VERB-Inf--NOUN (392)
- VERB-Inf--PRON (11)
- VERB-Part--NOUN (81)
- VERB-Part--NOUN-Loc (2)
- VERB-Part--PRON (16)
- VERB-Part--PRON-ADP(n) (1)
Reflexive Verbs
- This corpus contains 8 lemmas that occur at least once with an expl:pv child. Examples: sꜣu̯ čw, wni̯ čw, mr św, ms św, sꜣu̯ k(w), sꜣu̯ čn, wni̯ k(w), ḳfn śn(ꞽ)
Verbs with Reflexive Core Objects
- This corpus contains 37 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: ḥtm čw, čsi̯ čw, ṭr čw, pnꜥ čw, ms kw, pšš ś(ꞽ), pẖr čw, ꞽꜥr kw, mr św, nꞽnꞽ čw, smn čw, ḥtm kw, ꞽꜥi̯ św, mn św, ms k(w), nčr čw, nč̣ św, nꞽnꞽ kw, pč̣ čw, pšš čn, rḫ św, sḫi̯ čw, wč̣b čw, wꜥb św, čsi̯ św, ś:čꜣs čw, ḥtm k(w), ḥtm św, ḥww kw, ḥww č(w), ḥww čw, ṭrp čw, ꜥbꜣ čw, ꞽmn čn, ꞽmn čw, ꞽꜥi̯ kw, ꞽꜥi̯ čw
- Out of those, 1 lemmas occurred more than once, but never without a reflexive dependent. Examples: pšš
Relations Overview
- This corpus uses 10 relation subtypes: acl:relcl, csubj:outer, csubj:pass, expl:pv, flat:foreign, nmod:poss, nsubj:outer, nsubj:pass, obl:agent, obl:arg
- The following 4 relation types are not used in this corpus at all: iobj, clf, goeswith, reparandum
- VERB-Fin: ḥbnbn, nbꜣbꜣ, nṭfṭf, nwꜣwꜣ, nč̣ṭnč̣ṭ, nšbšb.tn, nḥ
- 2aeinf
- Lat
- ADP: m, ꞽm, ḥr, [m],
- ADP: n, [n], [