UD Hittite HitTB
Language: Hittite (code: hit)
Family: IE
This treebank has been part of Universal Dependencies since the UD v2.10 release.
The following people have contributed to making this treebank part of UD: Erik Andersen, Ben Rozonoyer.
Repository: UD_Hittite-HitTB
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 4.0
Genre: grammar-examples
Questions, comments? General annotation questions (either Hittite-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [erik • andersen411 (æt) gmail • com]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually, natively in UD style |
XPOS | annotated manually |
Features | annotated manually, natively in UD style |
Relations | annotated manually, natively in UD style |
Description
UD_Hittite-HitTB is a small Universal Dependencies treebank for Hittite, containing original sentences from Hoffner and Melchert’s tutorial to A Grammar of the Hittite Language.
This Hittite treebank contains 136 sentences, 1309 words, and 970 (whitespace-separated) tokens. The sentences span each of the major periods of Hittite linguistic history: Old Hittite, Middle Hittite, and New Hittite. All three periods are included in one treebank because some texts exist in copies that span more than one era.
Acknowledgments
We would like to extend our gratitude to David Wright, Professor of Bible and Ancient Near East at Brandeis University, for offering his excellent course on Hittite in Spring 2020.
References
- Harry A. Hoffner, Jr. and H. Craig Melchert. 2008. A Grammar of the Hittite Language. Part 2: Tutorial. Eisenbrauns.
Statistics of UD Hittite HitTB
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – SCONJ – VERB – X
Features
Aspect – Case – Definite – Gender – Language – Mood – Number – NumType – Person – Poss – PronType – Tense – VerbForm – Voice
Relations
acl – acl:relcl – advcl – advmod – advmod:emph – advmod:loc – amod – appos – aux – case – cc – ccomp – compound – conj – cop – csubj – dep – det – discourse – discourse:conn – dislocated – expl – expl:pass – flat – iobj – mark – nmod – nmod:det – nsubj – nummod – obj – obl – orphan – parataxis – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 136 sentences, 971 tokens and 1309 syntactic words.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 601 types of words that contain both letters and punctuation. Examples: ták-ku, A-NA, a-an, a-aš, Ú-UL, ar-ḫa, nu-u, wa-a, KÙ.BABBAR, ma-a-an, wa-r, š-ma-aš, LÚ.MEŠ, ku-it, ku-iš-ki, pa-a-i, pé-ra-an, a-at, an-da, kat-ta, le-e, ma-a, ma-aḫ-ḫa-an, ša-ra-a, UTU-uš, UTU-Š, a-ap-pa, da-a-i, ku-it-ma-an, n-na-aš, na-at-ta, ÉRIN.MEŠ, ú-ez-zi, š-ša-an, š-ši, A-BU, EGIR-pa, KUR-e, LÚ.U19.LU-an, a-a, a-aš-ta, kat-ta-an, ki-nu-n, ku-iš, me-na-aḫ-ḫa-an-da, na-aš-ma, nam-ma, pa-ra-a, ti-ya-zi, É-er
- This corpus contains 255 multi-word tokens. On average, one multi-word token consists of 2.33 syntactic words.
- There are 204 types of multi-word tokens. Examples: nu=kán, nu=mu, n=a-aš, d-UTU-uš, n=a-an, URU-Ḫa-at-tu-ši, d-UTU-Š=I, n=a-at=mu, nu-u=š-ša-an, A-BU=YA, BE-LÍ=NI=wa-a=n-na-aš, DUG-ḫar-ši-ya-al-li, GIŠ-BANŠUR, GIŠ-BANŠUR-i, LUGAL-š=a, LÚ-SANGA, URU-A-ri-ip-ša-a, URU-Ne-e-ša, URU-Za-a-al-pu-wa, d-U-aš, d-UTU, ke-e-et-t=a, ki-nu-n=a, n=a-an=kán, n=a-at, n=a-aš-ta, nu-u=š-ma-aš, nu=uš, nu=wa, nu=wa-a=n-na-aš=za, nu=wa-a=š-ma-aš, A-BI=ŠU, A-BU=ŠU, A.ŠÀ.ḪI.A=ŠU, BURU14=ma-a=z, DINGIR-LIM-ni=wa-a=t-ta, DINGIR=YA, DUMU.NAM.LÚ.U19.LU-aš, DUMU.NAM.LÚ.U19.LU=pát=kán, DUMU=KA, DUMU=ŠU, EN=YA, GAŠAN=YA, GEŠTIN=ya=kán, GIŠ-AB-ya, GIŠ-BANŠUR.GAL, GIŠ-TUKUL-aš, GIŠ-ŠÚ.A-ki, GIŠ-ḫa-tal-ki-iš-na-aš, GUD-un=aš-ta.
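The sentence, token, word and multi-word token counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, assuming the standard CoNLL-U conventions (tab-separated columns, range IDs such as `1-2` for multi-word tokens, decimal IDs for empty nodes). The names `conllu_counts`, `row` and `SAMPLE` are illustrative and not part of the treebank or its tooling, and the sample is a toy fragment assembled from forms listed above, not a sentence from the treebank.

```python
import re

def conllu_counts(text):
    """Count sentences, surface tokens, syntactic words and multi-word tokens."""
    sentences = words = mwts = mwt_words = 0
    for block in re.split(r"\n\s*\n", text.strip()):
        rows = [line.split("\t") for line in block.splitlines()
                if line.strip() and not line.startswith("#")]
        if not rows:
            continue
        sentences += 1
        for cols in rows:
            tok_id = cols[0]
            if "-" in tok_id:            # range ID, e.g. 1-2: a multi-word token
                start, end = map(int, tok_id.split("-"))
                mwts += 1
                mwt_words += end - start + 1
            elif "." not in tok_id:      # plain integer ID: a syntactic word
                words += 1
    # Each multi-word token replaces the words it covers by one surface token.
    tokens = words - mwt_words + mwts
    return {
        "sentences": sentences,
        "tokens": tokens,
        "words": words,
        "multiword_tokens": mwts,
        "words_per_mwt": mwt_words / mwts if mwts else 0.0,
    }

def row(tok_id, form):
    """Build a 10-column CoNLL-U row with only ID and FORM filled in."""
    return "\t".join([tok_id, form] + ["_"] * 8)

# Toy fragment (hypothetical, for illustration only).
SAMPLE = "\n".join([
    "# toy example, not a sentence from the treebank",
    row("1-2", "nu=kán"),
    row("1", "nu"),
    row("2", "kán"),
    row("3", "pa-iz-zi"),
]) + "\n"

print(conllu_counts(SAMPLE))
```

For the toy fragment this reports one sentence, two surface tokens, three syntactic words and one multi-word token covering two words.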
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, SCONJ, VERB, X
- This corpus does not use the following tags: SYM, PUNCT
- This corpus contains 28 word types tagged as particles (PART): [le-e, [nu, [za, a, a-a, a-aš-ta, aš-ta, kán, le-e, m]a, ma, ma-a, n, na-at-ta, nu, nu-u, nu-u-ma-an, pát, wa, wa-a, wa-r, z, za, Ú-UL, š-ša-[an, š-ša-an, š-šan, ša-an
- This corpus contains 27 lemmas tagged as pronouns (PRON): -at, -aš, -miš, -mu, -naš, -ta, -tiš, -šet, -ši, -šiš, -šmaš, -šmiš, -šummiš, KA, NI, apā-, kuit, kuiš, kuiški, kuwatta, kā-, uk, zik, ÌR-(n)ātar, šumēš, ūk, ḫūmant-
- This corpus contains 6 lemmas tagged as determiners (DET): apā-, aši, kuiš, kā-, mekki, ḫūmant-
- Out of the above, 4 lemmas occurred sometimes as PRON and sometimes as DET: apā-, kuiš, kā-, ḫūmant-
- This corpus contains 2 lemmas tagged as auxiliaries (AUX): ēš, ḫark
- There are 5 (de)verbal forms:
- Fin
- AUX: e-eš-du, e-eš-ta, e-eš, e-ešta, ḫar-mi, ḫar-ta, ḫar-ši
- VERB: pa-a-i, da-a-i, ú-ez-zi, ti-ya-zi, a-ki, i-ya-nu-un, pa-a-an-zi, pa-iz-zi, ti-it-ta-nu-ut, za-a-ḫi
- Inf
- VERB: a-ku-wa-an-na, ma-uš-šu-u-wa-an-zi, ša-a-ru-wa-u-wa-an-zi, ša-an-ḫu-u-wa-an-zi, šu-un-nu-ma-an-zi
- Part
- VERB: i-ya-an-za, tu-u-ri-ya-an, ḫar-ga-an-za, ak-kán-ta-aš, ak-kán-za, pa-ap-pár-ša-an-ta, ti-ya-an, šar-ni-in-kán, še-ek-kán-te-et
- Sup
- VERB: pa-iš-ga-u-wa-an, pé-eš-ke-u-an, pí-iš-ke-u-an
- Vnoun
- VERB: GUL-aḫ-ḫu-wa-ar
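The tag inventory and the PRON/DET overlap reported at the start of this list are simple set operations. The sketch below reproduces them from the values given above, assuming the standard 17-tag UD UPOS inventory; the variable names are illustrative only.

```python
# The 17 universal POS tags defined by Universal Dependencies.
UD_UPOS = {
    "ADJ", "ADP", "ADV", "AUX", "CCONJ", "DET", "INTJ", "NOUN", "NUM",
    "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "SYM", "VERB", "X",
}

# Tags reported as used in this corpus (copied from the list above).
USED_IN_HITTB = set(
    "ADJ ADP ADV AUX CCONJ DET INTJ NOUN NUM PART PRON PROPN SCONJ VERB X".split()
)
unused = UD_UPOS - USED_IN_HITTB

# Lemma lists copied from the PRON and DET bullets above.
PRON_LEMMAS = {
    "-at", "-aš", "-miš", "-mu", "-naš", "-ta", "-tiš", "-šet", "-ši",
    "-šiš", "-šmaš", "-šmiš", "-šummiš", "KA", "NI", "apā-", "kuit",
    "kuiš", "kuiški", "kuwatta", "kā-", "uk", "zik", "ÌR-(n)ātar",
    "šumēš", "ūk", "ḫūmant-",
}
DET_LEMMAS = {"apā-", "aši", "kuiš", "kā-", "mekki", "ḫūmant-"}

# Lemmas that occur under both tags.
pron_and_det = PRON_LEMMAS & DET_LEMMAS

print(sorted(unused))   # ['PUNCT', 'SYM']
print(len(PRON_LEMMAS), len(DET_LEMMAS), len(pron_and_det))
```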
Nominal Features
- Gender
- Com
- ADJ: i-da-a-lu-uš, TI-an-za, TUKU.TUKU-u-an-za, da-an-ku-wa-e-eš, da-an-na-at-t[a-aš, da-aš-šu-š, pár-ku-iš, ḫa-a]n-te-ez-zi-ya, ḫa-an-te-ez-zi-iš, ḫa-an-te-ez-zi-ya-a
- DET: ku-iš, ku-u-uš, ḫu-u-ma-an-te-eš
- NOUN: KÙ.BABBAR, DUMU, ÉRIN.MEŠ, LÚ, LÚ.MEŠ, A-BU, MUNUS, NINDA.GUR4.RA, LÚ.U19.LU-an, ḫal-ki-in
- NUM: 1-an, mi-e-wa-aš, mi-e-ya-wa-aš, mi-e-ú-uš, mi-u-wa-aš, te-ri-ya-aš
- PRON: a-aš, a-an, ku-iš-ki, a-at, an, a-pa-a-aš, a-pu-u-un, ku-in, ku-iš, uš
- VERB-Part: ak-kán-za, še-ek-kán-te-et
- Fem
- PRON: KA
- Masc
- PRON: ŠU, KA, KU-NU, SÚ
- Neut
- ADJ: iš-ḫar-wa-an-d[a
- DET: ke-e
- NOUN: KUR, KÙ.BABBAR, KUR-e, É-er, TI-an-na-aš, É, ŠA-ME-E, ḫar-ši-ya-al-li, ṬUP-PA-ḪI.A, AB-ya
- PRON: ki-i, ku-it, a-at, at, ku-e, mi-it, ÌR-an-ni, ša-aš, še-et, ši-it
- VERB-Part: ti-ya-an, šar-ni-in-kán
- Number
- Plur
- ADJ: IGI.NU.GÁL.ḪI.A, da-an-ku-wa-e-eš, da-an-na-at-t[a-aš, iš-ḫar-wa-an-d[a, ḫar-ga-eš
- DET: ke-e, ku-e-da-aš, a-pé-e-da-aš, ku-u-uš, ḫu-u-ma-an-te-eš
- NOUN: LÚ.MEŠ, ANŠE.KUR.RA.ḪI.A, BURU14.ḪI.A, GUD.ḪI.A, NAM.RA.ḪI.A, TÚG.NÍG.LÁM.MEŠ, UDU.ḪI.A, URU.DIDLI.ḪI.A, uk-tu-ri-ya-aš, ḫar-ši-ya-al-li
- PRON: š-ma-aš, a-at, n-na-aš, NI, a-aš, uš, KU-NU, ku-e, ku-i-uš, u-uš
- VERB-Fin: pa-a-an-zi, ú-e-er, ši-pa-an-da-an-zi, a-du-e-ni, a-ku-e-ni, a-ra-an-zi, a-ri-ya-u-e-ni, a-ú-e-er, ap-pa-an-da-at, ar-mi-iz-zi-ya-an-ta-ru
- VERB-Part: pa-ap-pár-ša-an-ta
- Sing
- ADJ: i-da-a-lu-uš, EL-LA-AM, EL-LAM, EL-LUM, TI-an-za, TUKU.TUKU-u-an-za, da-aš-šu-š, pár-ku-iš, ḫa-a]n-te-ez-zi-ya, ḫa-an-te-ez-zi-iš
- AUX-Fin: e-eš-du, e-eš-ta, e-eš, e-ešta, ḫar-mi, ḫar-ta, ḫar-ši
- DET: ku-e-da-ni, a-pé-e-da-ni, a-pí-ya, ku-iš, ḫu-u-ma-an-da-an
- NOUN: KUR, DUMU, KUR-e, LÚ.U19.LU-an, É-er, ḫal-ki-in, A-BU, BANŠUR, BANŠUR-i, BE-EL
- NUM: 1-an
- PRON: mu, a-an, YA, a-aš, ku-iš-ki, ŠU, š-ši, I, an, KA
- VERB-Fin: pa-a-i, da-a-i, ú-ez-zi, ti-ya-zi, a-ki, i-ya-nu-un, pa-iz-zi, ti-it-ta-nu-ut, za-a-ḫi, ú-wa-ši
- VERB-Part: ak-kán-ta-aš, ak-kán-za, še-ek-kán-te-et
- Case
- Abl
- NOUN: šu-uḫ-ḫa-az, SAG.DU-za, a-ru-na-az, a-ša-ú-na-az, an-na-az, ŠU-za, ŠÀ-az, ḫa-a-li-az
- PRON: ḫu-u-ma-an-da-az
- PROPN: Ne-e-ša, Ne-e-ša-az, URU-Aš-ta-ta-za, Z]a-a-al-pu-wa-az, Ḫa-at-tu-ša-za
- Abs
- NOUN: ḫar-ši-ya-al-li, a-še-eš-šar, lu-uz-zi, me-ḫur, ták-šu-ul, É
- Acc
- ADJ: EL-LA-AM, EL-LAM, iš-ḫar-wa-an-d[a
- DET: ke-e, ku-u-uš, ḫu-u-ma-an-da-an, ḫu-u-ma-an-te-eš
- NOUN: LÚ.U19.LU-an, ḫal-ki-in, ÉSAG-an, ši-ú, BANŠUR, GEŠTIN-an, GUD-un, G[I-an], KASKAL-an, KIR4
- NUM: 1-an
- PRON: a-an, n-na-aš, an, a-aš, a-pu-u-un, ku-in, uš, a-at, am-mu-uk, at
- PROPN: Ka-a-ra-aš-šu-wa-an, Kap-pé-e-ri-in, Ku-pa-an-ta-d-LAMMA-an, Ne-e-ša-an, Ḫa-an-ta-še-pa-an, Ḫur-na-an-n
- All
- NOUN: pár-na, ta-ak-na-a, É-na
- PROPN: Za-a-al-pu-wa
- Dat
- ADJ: da-an-na-at-t[a-aš, ḫa-a]n-te-ez-zi-ya
- DET: ku-e-da-aš, ku-e-da-ni, a-pé-e-da-aš, a-pé-e-da-ni, a-pí-ya
- NOUN: BANŠUR-i, KUR-e, LUGAL-i, UD-ti, uk-tu-ri-ya-aš, za-aḫ-ḫi-ya, A.ŠÀ-ni, AB-ya, DAM, DINGIR
- PRON: mu, š-ma-aš, š-ši, ku-e-da-ni-ik-ki, t-ta, tu-uq-q, ÌR-an-ni, š-še
- PROPN: Ḫa-at-tu-ši, KÙ.BABBAR-ši, NIN.TU-ni, UTU, UTU-i
- Gen
- NOUN: DINGIR-LIM, KUR, TI-an-na-aš, UD-aš, ŠA-ME-E, A-BI, AN-aš, DINGIR.MEŠ-aš, GIŠ.ḪI.A, GUDU12-aš
- NUM: mi-u-wa-aš, te-ri-ya-aš
- PRON: am-me-el, ku-e-el, tu-e-el, ša-aš
- PROPN: Ne-e-ša-aš
- VERB-Part: ak-kán-ta-aš
- Ins
- NOUN: Ì.DÙG.GA-it, IGI.ḪI.A-it, ZI-it, a-aš-ša-u-i-it, ki-iš-ta-an-ti-it, na-ak-ki-it, Ù-it, ḫal-ki-it
- VERB-Part: še-ek-kán-te-et
- Nom
- ADJ: i-da-a-lu-uš, ku-u-ru-ur, EL-LUM, TI-an-za, TUKU.TUKU-u-an-za, da-an-ku-wa-e-eš, da-aš-šu-š, pár-ku-iš, ḫa-an-te-ez-zi-iš, ḫa-an-te-ez-zi-ya-a
- DET: ku-iš
- NOUN: LÚ.MEŠ, A-BU, DINGIR-LUM, LUGAL, LUGAL-uš, MUNUS, iš-ḫa-a-aš, É-er, ÌR-aš, BE-EL
- NUM: mi-e-wa-aš, mi-e-ya-wa-aš
- PRON: a-aš, ku-iš-ki, a-at, ŠU, a-pa-a-aš, ku-iš, zi-g, [ú-g, am-mu-uk, aš
- PROPN: U-aš, A-la-lu-uš, A-ni-it-ta-aš, A-nu-uš, A-ri-ip-ša-aš, Ar-nu-wa-a]n-da-aš, Da-aš-mi-šu-uš-š, Ga-aš-ga-aš, IŠKUR-aš, IŠTAR-at-ti-iš
- VERB-Part: ak-kán-za
- Voc
- PROPN: Te-li-pí-nu-uš, UTU-uš
- Definite
- Cons
- NOUN: BE-EL
Degree and Polarity
Verbal Features
- Aspect
- Imp
- VERB-Fin: pí-iš-ker
- Mood
- Imp
- AUX-Fin: e-eš-du, e-eš
- VERB-Fin: ar-mi-iz-zi-ya-an-ta-ru, da-a, i-it, i-ya, i-ya-an-ni, ka-ri-ip-pa-an-du, ki-ša-ru, pa-a-an-du, pa-aḫ-ša-an-da-ru, pa-aḫ-ši
- Ind
- AUX-Fin: e-eš-ta, e-ešta, ḫar-mi, ḫar-ši
- VERB-Fin: pa-a-i, da-a-i, ti-ya-zi, ú-ez-zi, a-ki, i-ya-nu-un, pa-a-an-zi, pa-iz-zi, ti-it-ta-nu-ut, za-a-ḫi
- Tense
- Past
- AUX-Fin: e-eš-ta, e-ešta, ḫar-ta
- VERB-Fin: ti-it-ta-nu-ut, ú-e-er, GUL-aḫ-ḫu-un, IṢ-BAT, a-ni-ya-at, a-ú-e-er, ap-pa-an-da-at, da-a-aš, da-a-er, da-a-ir
- Pres
- AUX-Fin: ḫar-ši
- VERB-Fin: pa-a-i, da-a-i, ú-ez-zi, ti-ya-zi, a-ki, pa-a-an-zi, pa-iz-zi, za-a-ḫi, ú-wa-ši, šar-ni-ik-zi
- Voice
- Act
- AUX-Fin: e-eš-du, e-eš-ta, e-eš, e-ešta, ḫar-mi, ḫar-ta, ḫar-ši
- VERB-Fin: pa-a-i, da-a-i, ú-ez-zi, ti-ya-zi, a-ki, i-ya-nu-un, pa-a-an-zi, pa-iz-zi, ti-it-ta-nu-ut, za-a-ḫi
- Mid
- VERB-Fin: ap-pa-an-da-at, ar-mi-iz-zi-ya-an-ta-ru, ar-ta, du-uq-qa-a-ri, e-šu-wa-aš-ta, ki-it-ta-at, ki-it-ta-ri, ki-ša-at, ki-ša-ri, ki-ša-ru
Pronouns, Determiners, Quantifiers
- PronType
- Dem
- PRON: a-pa-a-aš, a-pu-u-un, ki-i
- Ind
- PRON: ku-iš-ki, ku-e-da-ni-ik-ki, ku-i]š-ki
- Int
- PRON: ku-wa-at-ta, ku-it
- Prs
- PRON: mu, a-an, YA, a-aš, š-ma-aš, ŠU, a-at, I, n-na-aš, š-ši
- Rel
- PRON: ku-e, ku-e-el, ku-i-uš, ku-i-š, ku-in, ku-iš
- Tot
- DET: ḫu-u-ma-an-te-eš
- PRON: ḫu-u-ma-an-da-az
- NumType
- Card
- NUM: mi-e-ú-uš, mi-u-wa-aš, te-ri-ya-aš
- Ord
- ADV: da-a-an
- NUM: ta-a-an
- Poss
- Yes
- PRON: YA, ŠU, I, KA, NI, KU-NU, SÚ, mi-it, te-eš, ti-iš
- Person
- 1
- AUX-Fin: ḫar-mi
- PRON: mu, YA, n-na-aš, I, NI, am-mu-uk, [ú-g, am-me-el, mi-it, mu-u
- VERB-Fin: i-ya-nu-un, GUL-aḫ-ḫu-un, a-du-e-ni, a-ku-e-ni, a-ri-ya-u-e-ni, da-aḫ-ḫi, da-aḫ-ḫu-un, e-šu-wa-aš-ta, i-da-la-u-wa-aḫ-ḫu-un, i-ya-an-na-aḫ-ḫé
- 2
- AUX-Fin: e-eš, ḫar-ši
- PRON: š-ma-aš, t-ta, zi-g, KA, KU-NU, te-eš, ti-iš, tu-e-el, tu-uk, tu-uq-q
- VERB-Fin: ú-wa-ši, da-a, e-ep-ta, e-ep-te-e-ni, i-it, i-ya, i-ya-an-ni, ku-en-ta, ma-ni-ya-aḫ-ti, na-aḫ-ti
- 3
- AUX-Fin: e-eš-du, e-eš-ta, e-ešta, ḫar-ta
- PRON: a-an, a-aš, ŠU, a-at, š-ma-aš, š-ši, an, uš, še-et, KA
- VERB-Fin: pa-a-i, da-a-i, ú-ez-zi, ti-ya-zi, a-ki, pa-a-an-zi, pa-iz-zi, ti-it-ta-nu-ut, za-a-ḫi, ú-e-er
Other Features
- Language
- Akk
- ADJ: EL-LAM
- Sum
- NOUN: DINGIR.MEŠ-aš, KIR4, LÚ.U19.LU-an
- Akk
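In CoNLL-U files, feature values such as those listed in the sections above are stored in the FEATS column as `Feature=Value` pairs separated by `|`, with `_` for an empty set. The sketch below shows one way to read such a string into a dictionary; the function name `parse_feats` and the example strings are illustrative and not taken from the treebank.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a {feature: value} dictionary."""
    if not feats or feats == "_":      # "_" marks an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS strings built from feature names and values listed above.
print(parse_feats("Case=Nom|Gender=Com|Number=Sing"))
print(parse_feats("Language=Akk"))
print(parse_feats("_"))
```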
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemma as a copula (cop): ēš.
- This corpus uses 2 lemmas as auxiliaries (aux). Examples: ḫark, ēš.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (1)
- VERB--NOUN-Nom (1)
- VERB-Fin--NOUN (6)
- VERB-Fin--NOUN-Abs (2)
- VERB-Fin--NOUN-Nom (30)
- VERB-Fin--PRON (1)
- VERB-Fin--PRON-Nom (24)
- obj
- VERB-Fin--NOUN (21)
- VERB-Fin--NOUN-Abs (4)
- VERB-Fin--NOUN-Acc (38)
- VERB-Fin--NOUN-Dat (2)
- VERB-Fin--NOUN-Nom (1)
- VERB-Fin--PRON (2)
- VERB-Fin--PRON-Acc (37)
- VERB-Fin--PRON-Dat (1)
- VERB-Fin--PRON-Dat-ADP(peran) (1)
- VERB-Inf--NOUN (1)
- VERB-Part--PRON-Acc (2)
- VERB-Sup--NOUN (1)
- VERB-Sup--NOUN-Acc (1)
- iobj
- VERB-Fin--NOUN (1)
- VERB-Fin--NOUN-Acc (1)
- VERB-Fin--NOUN-Dat (4)
- VERB-Fin--PRON-Dat (10)
- VERB-Part--NOUN-Dat (1)
- VERB-Sup--PRON-Dat (2)
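Pattern labels of the form `VERB-Fin--NOUN-Nom` combine the parent's UPOS and VerbForm with the child's UPOS and Case. The following is a rough sketch of how such counts could be derived from one sentence's CoNLL-U rows, assuming the standard column order (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The function names are hypothetical, adposition suffixes such as `-ADP(peran)` are not handled, and the rows are a toy example assembled from forms listed above rather than a sentence from the treebank.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a {feature: value} dictionary."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def label(upos, feats, key):
    """Append one feature value to the UPOS tag if the feature is present."""
    value = parse_feats(feats).get(key)
    return f"{upos}-{value}" if value else upos

def argument_patterns(rows, relations=("nsubj", "obj", "iobj")):
    """Count (relation, 'PARENT--CHILD') patterns for verb-headed NOUN/PRON children."""
    words = {cols[0]: cols for cols in rows if cols[0].isdigit()}
    counts = Counter()
    for cols in words.values():
        upos, feats, head, deprel = cols[3], cols[5], cols[6], cols[7]
        parent = words.get(head)       # None for the root (HEAD = 0)
        if (deprel in relations and upos in ("NOUN", "PRON")
                and parent is not None and parent[3] == "VERB"):
            pattern = (label(parent[3], parent[5], "VerbForm")
                       + "--" + label(upos, feats, "Case"))
            counts[(deprel, pattern)] += 1
    return counts

# Toy rows (hypothetical annotation, for illustration only).
ROWS = [
    ["1", "LUGAL-uš", "_", "NOUN", "_", "Case=Nom", "3", "nsubj", "_", "_"],
    ["2", "ḫal-ki-in", "_", "NOUN", "_", "Case=Acc", "3", "obj", "_", "_"],
    ["3", "pa-a-i", "_", "VERB", "_", "VerbForm=Fin", "0", "root", "_", "_"],
]

for (relation, pattern), n in sorted(argument_patterns(ROWS).items()):
    print(relation, pattern, n)
```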
Reflexive Passive
- This corpus contains 10 lemmas that occur at least once with an expl:pass child. Examples: iya-#1- za, wašše- z, MUNUS-n- za, antuḫša- za, dā- za, warp- za, zikke- z, ÌR-(n)aḫḫ- za, ēd- z, ḫaš(š)-#1- [za
Relations Overview
- This corpus uses 6 relation subtypes: acl:relcl, advmod:emph, advmod:loc, discourse:conn, expl:pass, nmod:det
- The following 6 relation types are not used in this corpus at all: clf, fixed, list, goeswith, reparandum, punct
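The two figures above follow directly from the relation list given under "Relations". The sketch below derives the subtypes and the unused relation types from that list, assuming the standard inventory of 37 universal UD relations; the variable names are illustrative only.

```python
# Relations used in this corpus, copied from the "Relations" list above.
RELATIONS_USED = (
    "acl acl:relcl advcl advmod advmod:emph advmod:loc amod appos aux case "
    "cc ccomp compound conj cop csubj dep det discourse discourse:conn "
    "dislocated expl expl:pass flat iobj mark nmod nmod:det nsubj nummod "
    "obj obl orphan parataxis root vocative xcomp"
).split()

# The 37 universal dependency relations defined by UD.
UD_RELATIONS = set(
    "acl advcl advmod amod appos aux case cc ccomp clf compound conj cop "
    "csubj dep det discourse dislocated expl fixed flat goeswith iobj list "
    "mark nmod nsubj nummod obj obl orphan parataxis punct reparandum root "
    "vocative xcomp".split()
)

# A subtype is any label of the form universal:extension.
subtypes = sorted(r for r in RELATIONS_USED if ":" in r)

# Universal relations that never occur, with or without a subtype.
base_used = {r.split(":")[0] for r in RELATIONS_USED}
unused = UD_RELATIONS - base_used

print(subtypes)
print(sorted(unused))
```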