UD Chintang CTNTB
Language: Chintang (code: ctn)
Family: Sino-Tibetan
This treebank has been part of Universal Dependencies since the UD v2.17 release.
The following people have contributed to making this treebank part of UD: Kira Tulchynska, Robert Schikowski, Alena Witzlack-Makarevich.
Repository: UD_Chintang-CTNTB
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY-NC-SA 4.0
Genre: grammar-examples
Questions, comments? General annotation questions (either Chintang-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [kira • tulchynska (æt) mail • huji • ac • il, witzlack (æt) gmail • com]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually in non-UD style, automatically converted to UD |
| UPOS | annotated manually in non-UD style, automatically converted to UD |
| XPOS | annotated manually in non-UD style, automatically converted to UD |
| Features | annotated manually in non-UD style, automatically converted to UD |
| Relations | annotated manually, natively in UD style |
Description
UD_Chintang-CTNTB is a Universal Dependencies (UD) treebank for the Chintang language. The annotation converted from glosses from “A Grammar of Chintang A Tibeto-Burman Language of Nepal” by Robert Schikowski.
The UD_Chintang-CTNTB treebank treebank contains 2k sentences (a total of 15k words) from the Kiranti language Chintang, spoken in eastern Nepal. All data are included in the training set.
The sentences originate from the examples in the (currently unpublished) A Grammar of Chintang: A Tibeto-Burman Language of Nepal by Robert Schikowski. Each example was converted from interlinear glossed text into CoNLL-U format using a custom conversion script. The syntactic relations were subsequently manually annotated according to the Universal Dependencies (UD) framework.
Acknowledgments
- This work was supported by the University of Zurich Global Strategy and Partnerships Funding Scheme (Project Fund Level 3) (https://www.global.uzh.ch).
- Kira Tulchynska gratefully acknowledges the financial support of the Jack, Joseph and Morton Mandel School MA Honors Program at the Hebrew University of Jerusalem.
References
- Schikowski, Robert. A Grammar of Chintang: A Tibeto-Burman Language of Nepal. (unpublished manuscript)
Statistics of UD Chintang CTNTB
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
AdvType – Animacy – Aspect – Case – Clusivity – Clusivity[p] – Clusivity[psor] – ConvType – Degree – Deixis – Evident – Foreign – InfStruct – Mood – Number – Number[p] – Number[psor] – NumType – Person – Person[p] – Person[psor] – Polarity – Poss – PronType – Reach – Red – Tense – VerbForm – Voice
Relations
acl – acl:nmlz – acl:relcl – advcl – advcl:cntf – advcl:coord – advcl:emph – advcl:purp – advcl:sim – advmod – advmod:cop – advmod:emph – advmod:nmlz – amod – amod:nmlz – appos – aux – case – cc – ccomp – compound:lvc – conj – csubj – det – det:nmlz – discourse – flat – flat:foreign – flat:name – flat:num – iobj – mark – nmod – nmod:nmlz – nmod:poss – nsubj – nsubj:outer – nummod – obj – obj:caus – obl – orphan – parataxis – punct – reparandum – root – vocative – xcomp – xcomp:desid
Tokenization and Word Segmentation
- This corpus contains 2289 sentences, 12758 tokens and 14631 syntactic words.
- This corpus contains 2864 tokens (22%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 65 types of words that contain both letters and punctuation. Examples: them-them, thitta-thitta, them-themce, aʔ-, maŋka-maŋka, miʔmuŋ-miʔmuŋ, paila-paila, thurum-harum, wahuʔluŋ-wateluŋbeʔ, Ambira-Legura, Appi-appi, Asdinda-, Ga-, Hiccibaŋ-hiccibaŋ, Jamma-jamma, Jata-jata, Kam-kam, Kok-kaya, Mai-, Maruni-Sarasati, Pakkuwa-poluwa, Pheri-pheri, Rato-rato, Sa-salo, Sariloŋma-kokuloŋma, Teĩ-teĩbe, Uthau-uthau, Wahuʔluŋ-wateluŋbeʔŋa, Wanda-chindayu, Warila-Kundala, age-age, anam-anam, apalo-apalo, aŋ-aŋ, baddhe-baddhe, bakhra-ghãsa, baraji-iswostibeʔ, beula-beulice, binti-sewa, chobou-chobou, cikhimtaŋma-puwaŋtaŋma, ciŋga-cala, damchama-epchama, e-, hakhi-hakhi, hokhi-hokhi, idamchama-epchamaŋa, iskul-iskulbe, katti-katti, khamawa-maciya
- This corpus contains 1708 multi-word tokens. On average, one multi-word token consists of 2.10 syntactic words.
- There are 1241 types of multi-word tokens. Examples: huŋgo, bago, huŋgoiʔ, migo, mogo, huŋgoiʔyã, thekha, akkayaŋ, appita, bata, huŋgoi, khalo, mikha, thego, hokkoiʔ, jogo, likhita, themkha, battakha, hokke, hokkoʔni, hunta, jammata, kinalo, linokha, temmakha, Bagote, aniyaŋ, huŋgoiʔta, huŋkhita, manchikha, pimamo, sattekha, Yogo, akadakha, akkata, bagoyaŋ, basaŋata, chaceyaŋ, ekdamai, hicceta, hokoi, hunlamta, huŋgoita, khaiʔmamo, khemsuŋkha, kinale, kinayaŋ, kokyaŋ, matamaimyokte.
Morphology
Tags
- This corpus uses 16 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: SYM
- This corpus contains 68 word types tagged as particles (PART): Manchiʔ, ai, aitira, aiŋa, au, aŋ, bhane, bhone, cahi, cahĩ, caine, caĩ, caĩne, chaĩ, e, ei, gone, gonei, goneĩ, haina, hoina, hol, hola, hou, i, isaŋa, khoi, konei, le, leʔ, leʔle, lo, loʔ, ma, maha, mahaʔ, mahã, mahãʔ, manche, mancheʔ, manchi, manchiʔŋa, mane, na, nai, naʔ, nchi, nchiʔ, ne, nei, ni, n̪i, o, one, oneĩ, phopheiʔ, rahecha, rahicha, raicha, raicho, ta, taʔ, te, teʔ, them, u, yaŋ, yoŋ
- This corpus contains 23 lemmas tagged as pronouns (PRON): akka, anaŋa, ancaŋa, anci, ani, arko, aru, ba, hana, hanci, hani, hok, hun, j, jammai, kun, mo, sa, taŋ, them, to, u, yo
- This corpus contains 20 lemmas tagged as determiners (DET): arko, asuk, ba, baddhe, bakŋa, hok, hun, jammai, jun, kun, miʔmuŋ, mo, mokŋa, piche, sabai, them, to, tokŋa, yo, yokŋa
- Out of the above, 10 lemmas occurred sometimes as PRON and sometimes as DET: arko, ba, hok, hun, jammai, kun, mo, them, to, yo
- This corpus contains 2 lemmas tagged as auxiliaries (AUX): lus, pho
- Out of the above, 1 lemmas occurred sometimes as AUX and sometimes as VERB: lus
- There are 4 (de)verbal forms:
- Conv
- VERB: casaŋa, garikana, numsaŋa, khaŋsaŋa, copsi, khaŋsi, kosi, pussi, yuŋsaŋa, asessi
- Fin
- AUX: alusace, lusiki, luce, lumettuce, lusikiyã, uluce, uluceke, ulucekenɨŋ, uluno, ulusaŋsace
- VERB: yuŋno, konno, lino, khade, konnoʔ, lise, khada, thaʔno, lisaŋse, uyuŋno
- Inf
- AUX: luma
- VERB: khice, khaiʔma, numma, cama, bane, khaŋma, pima, roke, hapma, seiʔma
- Part
- VERB: yuŋmayaŋ, hokmayaŋ, kakheiʔpa, kakɨpa, kaoiʔpadheĩpa, kapuiʔpa, porne, thukmayaŋ, Hoĩmayɨŋ, Kapuippãta
Nominal Features
- Hum
- NUM: hiccibaŋ, sumbhaŋ, Carjana, thɨkpaŋ, Hiccibaŋ-hiccibaŋ, panjana, pãcjana, tinjana
- Nhum
- ADV: Thitta-thitta
- DET: Askogeda, asukgeda
- NUM: thitta, hicce, hicci, sumce, thitta-thitta, sumci, Athgeda, Egharageda, Hihicce, Paceda
- Dual
- ADJ: Huncimakacɨkcɨk
- AUX-Fin: alusace, luce, uluce, uluceke, ulucekenɨŋ, ulusaŋsace
- PRON: anci, ancaŋa, Hanci, Ancabeʔyã, Hancibe, ancaŋaiʔyã
- VERB-Conv: huncichoŋsi
- VERB-Fin: ukhadace, khacce, ulaptace, athukceke, khadaca, khattakhaco, khoŋce, khoŋsaca, ukhadaŋsace, ulasaguŋsace
- VERB-Inf: hunikoma
- Plur
- ADV: anitapparaŋ
- AUX-Fin: lusiki, lusikiyã, uluno, uluse
- NOUN: chace, goce, wace, maʔmice, koce, nunuce, uchauce, bakhrace, chaceŋa, gorce
- PRON: ani, hunce, anaŋa, bace, hani, hunceŋa, aniŋa, baceŋa, ancaŋa, them-themce
- PROPN: Ramece, Canrecekoce, Khelece, Rameceŋa, Rosance, Rumaceŋa, Sapanice
- VERB-Conv: Ukeksã, uyuŋsaŋa
- VERB-Fin: uyuŋno, khadi, lisi, ukhade, ukhaʔno, ukonno, ulino, hidumnum, numdi, uliseʔ
- VERB-Inf: Ucamace, loiʔmaceŋa, utama
- VERB-Part: kahice, kakhaiʔpace, kakhuce, kakoppace, kalupace, kamainece, kameipace, kanumpace, kasecce, kaŋice
- Sing
- ADJ: Abyaktigat, Umakacɨkma, iseto
- ADV: itapparaŋ
- AUX-Fin: lumettuce
- NOUN: go, kha, ko, kok, goiʔ, cuwa, maʔmi, iskul, goi, meĩ
- NUM: thitta, duibe, Carjana, Hicce, Sumbhaŋ, noube, pãcjana, sumce
- PRON: akka, hana, ba, them, saŋa, huĩ, basaŋa, akko, huĩsaŋa, jo
- PROPN: Dhankuta, Rame, Rameŋa, Debi, Rameko, Debiŋa, Kathmandu, Kheme, Darke, Joge
- VERB-Conv: asessi, ilasi, Icopsi, Ikhaŋsi, Ulasi, uyuŋsaŋa
- VERB-Fin: yuŋno, konno, lino, khade, konnoʔ, khada, lise, thaʔno, lisaŋse, iʔno
- VERB-Inf: itɨŋma, uchumma, uchuŋma, ukhaiʔ, upaŋma
- VERB-Part: kakheiʔpa, kakɨpa, kaoiʔpadheĩpa, kapuiʔpa, porne, Kapuippãta, Lakkaluppabe, cokkhaune, kacama, kakhaiʔpa
- Abs
- AUX-Inf: luma
- NOUN: go, kha, ko, kok, cuwa, chace, maʔmi, iskul, meĩ, nunu
- NUM: hicce, sumce, thitta, sumbhaŋ, Carjana, pãcjana
- PRON: ba, huĩ, hunce, bace, salo, hun, To, yo, basako, huĩsako
- PROPN: Dhankuta, Rame, Debi, Rameko, Kathmandu, Kheme, Ramece, Darke, Joge, Pecce
- VERB-Inf: khaiʔma, numma, cama, khaŋma, pima, hapma, seiʔma, yuŋma, lima, luma
- VERB-Part: kakheiʔpa, kakɨpa, kaoiʔpadheĩpa, kapuiʔpa, porne, Kasihaiʔpa, cokkhaune, kacama, kahice, kakhaiʔpa
- Abs,Erg
- NOUN: akatha, anipaisa, samance
- PRON: akka, hana, them, ani, jamma, akko, anaŋa, jo, them-them, hani
- AbsErg
- PRON: saloŋa
- Cau
- ADV: appikolagi
- NOUN: ajibankolagi, camyaŋtoŋ, iphuwalaŋtoŋ
- PRON: Hanalaŋtĩ, akkolaŋtĩ
- VERB-Inf: wahumeiʔmalaŋtoŋ, khaiʔmayoŋtoŋ, nummakolaŋtĩ, nummalaŋtoŋ, thaĩmacelaŋtĩ, thukmalaŋtĩ
- Com
- NOUN: menuwanɨŋ, sencaknɨŋ, cuwanɨŋ, maʔmicenɨŋ, ucuwanɨŋ, Amanɨŋ, Bhalenɨŋ, Sailanɨŋ, Thulikonɨŋ, adhisori
- NUM: thittanɨŋ
- PRON: Sanɨŋ, aninɨŋ, banɨŋ, monɨŋ
- PROPN: Indranɨŋ, Puspanɨŋ, Ramenɨŋ, Debinɨŋ, Dhakmanenɨŋ, Lakhmannɨŋ, Lokendranɨŋ, Sitanɨŋ, Somenɨŋ, bisalenɨŋ
- VERB-Conv: heksinɨŋ
- VERB-Inf: tukmanɨŋ
- Erg
- NOUN: gosaŋa, chaŋa, appaŋa, menuwaŋa, bhaleŋa, ummaŋa, Kanchiŋa, chaceŋa, latthiŋa, uppaŋa
- NUM: thittasaŋa
- PART: aiŋa, isaŋa, manchiʔŋa
- PRON: saŋa, basaŋa, huĩsaŋa, hunceŋa, aniŋa, baceŋa, jammaŋa, josaŋa, hanaŋa, Akkaŋa
- PROPN: Rameŋa, Debiŋa, semeŋa, Kaliŋa, Kalpanaŋa, Kheleŋa, Manojŋa, gairiŋa, Anitaŋa, Bippanaã
- VERB-Inf: miʔmaŋa, kaiʔmaŋa, loiʔmaceŋa, nummaŋa, chiʔmaŋa, copmaŋa, komaŋa, yuŋmaŋa
- VERB-Part: Kapuippãta, kapuiʔpã, maikahiŋa
- Loc
- ADV: baiʔ, yoʔni, bai, bhaiʔni, moʔni, yoba, attu, toba, toʔni, bhandu
- NOUN: goiʔ, goi, koiʔ, khimbe, teĩbe, koi, ke, cuwabe, koʔni, aniteĩbe
- NUM: duibe, hajarbe, noube
- PART: aitira
- PRON: hanaiʔ, hanik, thembeʔ, Anabeʔ, Hancibe, Saik, anabe, them-thembeʔ, thembe
- PROPN: Chintaŋbe, Chintaŋbeʔ, Dhankutabe, Dãdagaũbandu, Pancakannebeʔ, Taŋkerabe
- VERB-Inf: camabeʔ, simabe, tommabe, yuŋmaiʔ
- VERB-Part: Lakkaluppabe
- LocCom
- NOUN: khãbobeʔnɨŋ, sontoloŋtaŋbeʔnɨŋ, cuwaiʔnɨŋ
- LocErg
- ADV: Baiʔya, Humbeʔyã, Humbeʔŋa, Yoʔniyã, bhandubaʔŋa, toʔpattiŋa
- NOUN: goiʔyã, geiʔyã, Cuwaiʔyã, Inisaiʔyã, Kallayã, Kukurbeʔyã, Rodbeyã, Wahuʔluŋ-wateluŋbeʔŋa, aŋnabeʔŋa, barsaiŋa
- PRON: Anaiʔŋa, Ancabeʔyã, Anibeyã, Jammaiʔya, ancaŋaiʔyã
- PROPN: Bhojpurbeʔŋa, Balaŋkhabeʔŋa, Cimbrahabamuʔŋa, Homboyoŋbeʔŋa, Jarmanbamuã, Maŋthanaiʔya, Samjhanabeʔyã, Saŋbhoŋteĩbayuŋa, Tumliŋbeʔŋa
- VERB-Conv: cemsiŋa, cemsiʔŋa, wassiʔyã
- LocLoc
- ADV: huŋkhaiʔ, Agadipattibe, bakhaiʔ, bhaipatti, yokhaiʔ, Bandupatti, Bhayupatti, Jãhãiʔ, bakhabe, bhamuba
- NOUN: khetabhamuʔni, khimbhamupatti
- PROPN: Ladimbamuʔni
- LocLocErg
- ADV: bhamubaʔŋa, topattiʔyã
- LocLocPer
- ADV: Baipattilam, bhamuʔnilam
- LocPer
- ADV: bayulam, Topattilam, Yolam, ayulam, bhaiʔlam, bhamulam
- PROPN: Saŋbhoŋteĩpattilam
- LocPerErg
- ADV: Bhaiʔyãpattilamma, bhamulamma, tolamma
- NOUN: umembeʔlamma
- Per
- NOUN: inarilam, macelam, patalam, umuŋlam, ɨcɨklam
- PRON: hunlam, Bhalam, yolam, balam
- PROPN: Budahaŋlam, Dharanlam, Nepalilam, Panirɨŋlam
- PerErg
- NOUN: saikalalamma, sakphagharilamma
- PRON: Hunlamma, tolamma
- PROPN: Budahaŋlamma
Degree and Polarity
- Dim
- NOUN: Sailuki
- PROPN: Asuki, Asukiko, Khumale, Khumaleŋa
- Neg
- AUX-Fin: ulucekenɨŋ
- CCONJ: Na
- INTJ: mahaʔ, Manchi, manche, ãhã
- PART: manchi, mahaʔ, manche, maha, mahãʔ, mancheʔ, mane, Manchiʔ, phopheiʔ, manchiʔŋa
- VERB-Conv: mapimace, Macekma, Maicekma, Maiceksaŋa, Maihaĩma, Manakma, mahisaŋa, maica, maiiʔmadheĩmace, mailima
- VERB-Fin: linɨʔnɨŋ, hidumnum, maipinɨʔnɨŋ, mamaimyokte, Maimettha, cainɨŋ, caŋanɨŋ, cokkonɨŋ, hapnɨʔnɨŋ, hidukumnɨŋ
- VERB-Inf: maiseiʔma, Mainima, Makhaiʔma, mahima, maipima, maithuʔma
- VERB-Part: Mallumayaŋ, kamainece, maikahiŋa, maikanece, maikanumpa, maithuʔmayɨŋ
- Pos
- INTJ: ei, ã, eʔ, Ho
Verbal Features
- ComplImp
- VERB-Conv: hekdheĩsi, khudheĩsi, lapdheĩsi, lɨgadi, maiiʔmadheĩmace, odheĩsaŋa
- VERB-Fin: sinahaiʔ, Sinaiʔ, linahaʔno, lisadiki, lɨknahaiʔ, uthendattake, Akɨnahaiʔ, Bopnaadhennaʔãce, Hepnahaʔno, Hoʔnahaiʔ
- VERB-Inf: simaiʔma, Camahaiʔmabimakhaŋma, Pheĩmadheĩmaphe, cemmadheĩma, lekmaiʔma, lukmadhẽima, rɨkmadheĩmace, supmadheĩma, thamahaiʔma, tɨkmadheima
- VERB-Part: kaoiʔpadheĩpa, Kasihaiʔpa, kammadheĩmayaŋ, kaphĩpadheĩpa
- ComplPerf
- VERB-Fin: lisadaŋse, nadanduŋsehẽ, ahomadaŋse, aktadaŋsa, alisadaŋsace, bhektadase, chobadaŋse, holadaŋseʔ, homadaŋse, kamnadhenaŋnace
- ComplPerfv
- VERB-Fin: siade, cohatte, lisade, mandade, kobucohatte, kɨrade, lɨgade, thiada, ucohatte, ulɨgade
- Imp
- AUX-Fin: alusace, lusiki, luce, lusikiyã, uluce, uluceke, ulucekenɨŋ, uluno
- AUX-Inf: luma
- VERB-Conv: casaŋa, garikana, numsaŋa, khaŋsaŋa, copsi, khaŋsi, kosi, pussi, yuŋsaŋa, asessi
- VERB-Fin: yuŋno, konno, lino, konnoʔ, thaʔno, uyuŋno, iʔno, khadi, rɨktoko, canoʔ
- VERB-Inf: khaiʔma, numma, cama, khaŋma, pima, hapma, seiʔma, yuŋma, lima, luma
- VERB-Part: yuŋmayaŋ, hokmayaŋ, kakheiʔpa, kakɨpa, kapuiʔpa, porne, thukmayaŋ, Hoĩmayɨŋ, Kapuippãta, Lakkaluppabe
- Perf
- AUX-Fin: ulusaŋsace
- VERB-Fin: lisaŋse, khadaŋse, lisaŋseʔ, khaoŋse, numdoŋse, utiaŋse, Kɨralɨgaŋse, chaptoŋse, chepmusaŋse, ciaŋse
- Perfv
- AUX-Fin: alusace, lumettuce, uluse
- VERB-Fin: khade, khada, lise, khatte, thaptokho, ukhade, kade, lapte, liseʔ, pide
- Imp
- VERB-Fin: khada, thaptokho, eba, thaptaʔ, Maimettha, cia, cohaʔ, coptokho, hiduca, katta
- Ind
- AUX-Fin: alusace, lusiki, lumettuce, lusikiyã, uluceke, ulucekenɨŋ, uluno, ulusaŋsace, uluse
- VERB-Fin: yuŋno, konno, lino, khade, konnoʔ, lise, thaʔno, lisaŋse, uyuŋno, iʔno
- Opt
- VERB-Fin: numnacane, Bhonu, Khaiʔyãne, Nacinnakhaŋne, Yuŋyakne, cekŋakhaŋŋane, imnacanɨŋne, khaccone, khatne, khattumne
- Sub
- AUX-Fin: luce, uluce
- VERB-Fin: khadi, lɨkŋa, lisi, khaiʔ, khemsuŋ, numdi, waiʔ, aca, akada, akhaiʔ
- Past
- AUX-Fin: alusace, lumettuce, ulusaŋsace, uluse
- VERB-Fin: khade, lise, lisaŋse, khatte, yuwakte, ukhade, kade, khadaŋse, lapte, liseʔ
- Pres
- AUX-Fin: lusiki, luce, lusikiyã, uluce, uluceke, ulucekenɨŋ, uluno
- VERB-Fin: yuŋno, konno, lino, konnoʔ, thaʔno, uyuŋno, iʔno, khadi, rɨktoko, canoʔ
- Act
- AUX-Fin: alusace, luce, lusiki, lusikiyã, uluce, uluceke, ulucekenɨŋ, uluno, ulusaŋsace, uluse
- AUX-Inf: luma
- VERB-Conv: casaŋa, garikana, numsaŋa, khaŋsaŋa, copsi, khaŋsi, kosi, pussi, yuŋsaŋa, asessi
- VERB-Fin: yuŋno, konno, lino, khade, konnoʔ, lise, khada, thaʔno, lisaŋse, uyuŋno
- VERB-Inf: khaiʔma, numma, cama, khaŋma, pima, hapma, seiʔma, yuŋma, lima, luma
- VERB-Part: yuŋmayaŋ, hokmayaŋ, kakheiʔpa, kakɨpa, kaoiʔpadheĩpa, kapuiʔpa, porne, thukmayaŋ, Hoĩmayɨŋ, Kapuippãta
- Cau
- AUX-Fin: lumettuce
- VERB-Fin: lapmettoko, Maĩkhaŋmettaʔ, aneamettukuce, ataĩmettukucum, hakmettuce, hapmettaʔ, hapmettoko, hapmettoŋse, haʔumeʔtoko, hekmettoko
- VERB-Inf: wahumeiʔma, wahumeiʔmalaŋtoŋ, chommeʔma, khaŋnummace, phemeiʔma
- VERB-Part: cokkhaune
- CauRcp
- VERB-Inf: Hapmeiʔkameiʔ
- CauRefl
- VERB-Fin: khaŋmetnace, Immetnaŋnace, Lapmeʔnacce, Luʔmeʔnace, khumeʔnaʔãce, mɨkseĩkhaŋmeiʔkameiʔ
- Rcp
- VERB-Inf: khasɨŋkasɨŋ, tɨŋkatɨŋ, Copkacop, Pamkapam, Pokkathaka, apkaap, caiʔkacaiʔ, khaŋkakhaŋ, lapkalap, lekale
- Refl
- VERB-Conv: tomcĩsaŋa, tomgoĩcĩsaŋa
- VERB-Fin: sɨknalɨknancĩyehẽ, Bopnaadhennaʔãce, Kaʔnace, Kɨnadhennaace, ahinaʔacenɨŋ, apamnaceʔ, ateknaʔãce, atɨŋnaʔãce, bhukŋaŋcɨŋ, cinnaʔace
- VERB-Inf: Tommancĩ
- Nfh
- AUX: pho, phe
Pronouns, Determiners, Quantifiers
- Dem
- ADV: baiʔ, utti, mo, yoʔni, batta, huŋkhi, bai, bhaiʔni, moʔni, attu
- DET: ba, huŋ, huĩ, mo, yo, to, bakhiya, hun, Baʔ, Yoʔŋa
- PRON: ba, huĩ, basaŋa, hunce, huĩsaŋa, bace, hunceŋa, hun, baceŋa, yo
- Ind
- ADV: jatti, baddhe, miʔmuŋ, jahã, joso, asuk, jahile, miʔmoŋ, miʔmu, miʔmuŋ-miʔmuŋ
- DET: arko, baddhe, miʔmuŋ, asuk, kun, assuk, badde, hok, jun, mipmoŋ
- PRON: jo, josaŋa, Kun, arkoce, jasto, Aruceŋa, Jasko, Je, arko, arkosã
- Int
- ADV: theke, aŋ, asuk, anɨŋ, hokhi, Thee, Themma, kaile, them, Katti
- DET: hok, asuk, ho, Asko, Asu, them, Askogeda, asukgeda, them-them
- PRON: them, saŋa, salo, them-them, Sanɨŋ, them-themce, thembeʔ, themce, themma, Ithemce
- Prs
- PRON: akka, hana, ani, akko, anaŋa, hani, ancaŋa, hanako, aniŋa, anci
- Rel
- ADV: jaha, jaso, jati
- Tot
- ADV: jamma
- DET: jamma, Jamm, picche, sabai
- PRON: jamma, jammaŋa, Jamm, Jamma-jamma, Jammaiʔya
- Card
- ADV: Thitta-thitta, Hiccipatti
- NUM: thitta, ek, hicce, car, dui, das, hiccibaŋ, hicci, sumce, pãc
- Yes
- ADV: asukko, pahilako, appikolagi
- NOUN: maʔmiko, Iskulako, Kanchako, barsako, ruppeko, saĩliceko, Ahisappako, Aŋrejiko, Cakletko, Chintaŋko
- PRON: akko, hanako, anako, basako, huĩsako, Huisako, aniko, Anakoce, Anikoce, Hanakosaŋa
- PROPN: Rameko, Asumako, Kappeko, Amerikako, Asukiko, Canrecekoce, Garako, Gɨŋgeko, Hilekoceyã, Indiako
- VERB-Inf: meiʔmako, nummako, nummakolaŋtĩ
- VERB-Part: kapuiʔpako
- 1
- ADJ: Abyaktigat
- ADV: anitapparaŋ
- AUX-Fin: lusiki, luce, lusikiyã
- NOUN: appaŋa, apa, akhim, aniteĩbe, appa, amma, anirek, anirɨŋ, apakku, athippaŋa
- PRON: akka, ani, akko, anaŋa, ancaŋa, aniŋa, anci, anako, Aka, Akkaŋa
- VERB-Conv: asessi
- VERB-Fin: khadi, lɨkŋa, lisi, hidumnum, khemsuŋ, miʔyaʔã, numdi, cekti, cekŋaʔã, hidukum
- 2
- ADJ: iseto
- ADV: itapparaŋ
- AUX-Fin: alusace
- NOUN: ilaŋ, inisa, ippa, Immai, Ipadumko, igol, ikamace, ikamce, immaŋa, imɨk
- NUM: iek
- PRON: hana, hani, hanako, hanaŋa, Hanci, hanaiʔ, hanik, Hanakosaŋa, Hanalaŋtĩ, Hancibe
- VERB-Conv: ilasi, Icopsi, Ikhaŋsi
- VERB-Fin: khada, thaptokho, akonno, akhaʔno, anisoko, akhade, alise, aca, achonnoʔ, akada
- VERB-Inf: itɨŋma
- 3
- ADJ: Huncimakacɨkcɨk, Umakacɨkma
- AUX-Fin: lumettuce, uluce, uluceke, ulucekenɨŋ, uluno, ulusaŋsace, uluse
- NOUN: uchau, unisa, unɨŋ, uppa, utopi, uchauce, ummaŋa, uphuwa, ubheĩbe, umma
- PRON: Utaŋbandu, usko
- VERB-Conv: uyuŋsaŋa, Ukeksã, Ulasi, huncichoŋsi
- VERB-Fin: yuŋno, konno, lino, khade, konnoʔ, lise, thaʔno, lisaŋse, uyuŋno, iʔno
- VERB-Inf: Nakhaiʔma, Ucamace, hunikoma, nanekma, nanumma, uchumma, uchuŋma, ukhaiʔ, upaŋma, upeĩma
- VERB-Part: pimayaŋce, soiʔmayaŋce
- Dual
- NOUN: Hancimaya, Huncijhani, ancimuk, hancichauce, hancimeĩ, huncicam, huncikhim, huncimau, huncimaube, huncipatte
- PRON: Anako, Anakoce, Anikoce, Anikoceŋa, anakko
- Plur
- NOUN: aniteĩbe, anirek, anirɨŋ, aniteĩ, huninari, Anacamace, Anamailace, Anapaube, Anapauŋa, Anathurum
- PROPN: Hilekoceyã
- VERB-Fin: hunciumiʔ
- Sing
- NOUN: uchau, appaŋa, unisa, unɨŋ, uppa, utopi, apa, uchauce, ummaŋa, uphuwa
- NUM: iek
- PRON: akko, Hanakosaŋa, Ithemce, Utaŋbandu, akkolaŋtĩ, hanako
- PROPN: Asukiko, Canrecekoce, Garako, Kappeko, Monsuko, Rameko, Rumako, Someko
- VERB-Inf: Ucamace
Other Features
- AdvType
- Ext
- ADV: batta, battata, utta, Motta, atta, totta, uttata
- Loc
- ADV: baiʔ, mo, yoʔni, bai, bhaiʔni, moʔni, attu, to, yo, yoba
- Man
- ADV: huŋkhi, hokhi, bakhi, bakhiʔnɨŋ, Huŋkhiʔnɨŋ, Esari, joso, bakhiʔ, hakhi-hakhi, hakhinɨŋ
- Qua
- ADV: utti, jatti, uti, etti, eti, jati, Josori, Katti, ettiti, jahã
- Ext
- Clusivity
- Ex
- AUX-Fin: lusikiyã
- PRON: anaŋa, ancaŋa, Anabeʔ, Anaiʔŋa, Ancabeʔyã, anabe, anako, ancaŋaiʔyã, anã, kanaŋa
- VERB-Fin: numdikiŋa, sɨknalɨknancĩyehẽ, caiehẽ, caikiyã, candumcummehẽ, choncilokcekeŋa, haknaancĩŋa, khadiŋa, khamsumcumme, khattumcum
- In
- ADV: anitapparaŋ
- AUX-Fin: lusiki, luce
- PRON: ani, aniŋa, anci, Anibeyã, aniko, aninɨŋ
- VERB-Fin: khadi, lisi, numdi, cekti, hidukum, hidum, khacce, lisiki, ahidumcumheʔ, cainɨŋ
- Ex
- Clusivity[p]
- Ex
- VERB-Fin: khaŋmameʔte, maapaŋsakteʔ, macopnadheĩ, mahiceke, maledase, mapi, mapidadisaʔ, mapide, mapondacia, mathokno
- In
- VERB-Fin: maipinɨʔnɨŋ, mainekno, Khapattaŋsa, Khaumaikhaŋyaktaŋseʔ, Maikhaŋnacano, Maipide, Mairecceke, khamaisɨŋ, maica, maichap
- Ex
- Clusivity[psor]
- Ex
- NOUN: Anacamace, Anamailace, Anapaube, Anapauŋa, Anathurum, anakhalampa, anapauce, anateibe, kanaphak
- PRON: Anako, Anakoce, anakko
- In
- NOUN: aniteĩbe, anirek, anirɨŋ, aniteĩ, Anichintaŋbe, Anicuwakhamce, Anidoŋdumce, Anidoŋdumma, Anikhim, Animaŋthana
- PRON: Anikoce, Anikoceŋa, aniko
- Ex
- ConvType
- Cntf
- VERB-Conv: Ebi, Negi, Pagi, khoŋsi, khɨndi, ludi, lusi, lɨgadi, pidi, sukti
- Coord
- VERB-Conv: casaŋa, numsaŋa, khaŋsaŋa, ceksaŋa, khoŋsaŋa, ochoksaŋa, pasaŋa, popsaŋa, rɨsaŋa, tomcĩsaŋa
- Purp
- VERB-Conv: copsi, khaŋsi, kosi, pussi, asessi, casi, choŋsi, ilasi, phasi, Icopsi
- Cntf
- Deixis
- Med
- ADV: huŋkhi, Huŋkhiʔnɨŋ, huŋkhaiʔ, huĩsaiʔ, utta, Humbeʔyã, Humbeʔŋa, Huĩ, huŋkhiʔ, huŋkhiʔni
- DET: huŋ, huĩ, hun, Hui, huŋkhiya
- PRON: huĩ, hunce, huĩsaŋa, hunceŋa, hun, hunlam, huĩsako, huŋkhiya, Huisako, hunsaŋa
- Prox
- ADV: baiʔ, batta, bai, bhaiʔni, bakhiʔnɨŋ, bakhi, bhandu, bhayu, etti, Esari
- DET: ba, bakhiya, Baʔ, bhaiʔŋa, ha
- PRON: ba, basaŋa, bace, baceŋa, basako, Bhalam, bakhiya, baceyã, bakhiyace, balam
- Remt
- ADV: utti, mo, yoʔni, moʔni, attu, to, yo, yoba, toba, uti
- DET: mo, yo, to, Yoʔŋa, huŋ, Bakhiya, Mokhiya, Toʔwa, huĩ, moʔ
- PRON: To, yo, mo, yolam, yosaŋa, Mosako, Toce, Tosaŋa, Utaŋbandu, Yoce
- Med
- Foreign
- Yes
- VERB: Bhaisi, kinuŋ
- X: ke, bhane, bhanedekhi, paubasi, sir
- Yes
- InfStruct
- Foc
- PART: ta, yaŋ, lo, le, taʔ, ai, leʔ, i, leʔle, aitira
- Top
- PART: na, caĩ, cahi, cahĩ, caine, bhane, bhone, caĩne, chaĩ, naʔ
- Uniq
- PART: te, teʔ
- Foc
- Number[p]
- Dual
- VERB-Fin: tennace, Mairecceke, Nanahottace, khaŋnamettace, lapceke, mahiceke, maledase, nacacce, nalapceke, nanumceke
- Plur
- AUX-Fin: lumettuce
- VERB-Conv: mapimace, maiiʔmadheĩmace
- VERB-Fin: maipinɨʔnɨŋ, pidukuce, ahidumcumheʔ, coptuŋcuhẽ, hiduca, hiduŋcuŋ, hiduŋcuŋnɨŋ, khapide, khaukuce, khaupidake
- VERB-Inf: pimace, chaʔmace, cĩmace, khemmace, lumace, miʔmace, chapmace, hummace, khaĩmace, khaŋmace
- Sing
- VERB-Fin: rɨktoko, kondoko, thaptokho, anisoko, mettoko, numdoko, yaŋsoko, hidumnum, khemsuŋ, puttoko
- VERB-Inf: nanekma
- Dual
- Person[p]
- 1
- VERB-Fin: maipinɨʔnɨŋ, khapide, khaupidake, labaŋbidahãʔ, mainekno, ucopmaʔã, ulaiʔyaʔãnɨŋ, uludaŋnɨhẽ, uphattehẽ, upidasehẽ
- 2
- VERB-Fin: chapnabina, nakhae, nakonno, nalatte, naphatte, napino, nateĩ, pina, tennace, Huʔnachokna
- VERB-Inf: Nakhaiʔma, nanekma, nanumma
- 3
- AUX-Fin: lumettuce
- VERB-Conv: mapimace, maiiʔmadheĩmace
- VERB-Fin: rɨktoko, kondoko, thaptokho, numdoko, yaŋsoko, anisoko, mettoko, hidumnum, khemsuŋ, puttoko
- VERB-Inf: pimace, chaʔmace, cĩmace, khemmace, lumace, miʔmace, chapmace, hummace, khaĩmace, khaŋmace
- 1
- Person[psor]
- 3
- VERB-Fin: hunciumiʔ
- 3
- Reach
- Access
- ADV: uttu, uyu, uyuba, aiyu, aiyuba, otuba, uipatti, umbu, umuba, uyoba
- Remote
- ADV: attu, ammu, aiyu, ambu, Ayyu, Uttuʔni, Yoayu, aiyuba, ayulam
- Access
- Red
- Yes
- ADJ: Thette, Miʔmiʔmi, Rato-rato, Thedde, tatotato, temma-temma, thupro-thupro, Huncimakacɨkcɨk
- ADV: maŋka-maŋka, paila-paila, Appi-appi, Ballaballa, Pheri-pheri, Thitta-thitta, age-age, anam-anam, battata, miʔmuŋ-miʔmuŋ
- DET: baddhe-baddhe, them-them
- INTJ: chobou-chobou
- NOUN: koi, Kam-kam, Teĩ-teĩbe, Teĩteĩbe, Uthau-uthau, apalo-apalo, iskul-iskulbe, kei, khakha, koiʔ
- NUM: thitta-thitta, Hiccibaŋ-hiccibaŋ, Hihicce, susumci
- PRON: them-them, them-themce, Jamma-jamma, Sa-salo, Yoyo, jojo, them-thembeʔ, them-themma, toto
- Yes
Syntax
Auxiliary Verbs and Copula
- This corpus does not contain copulas.
- This corpus uses 2 lemmas as auxiliaries (aux). Examples: pho, lus.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Conv--NOUN-Abs (1)
- VERB-Conv--NOUN-Erg (1)
- VERB-Conv--PRON-Abs,Erg (1)
- VERB-Conv--PRON-Erg (1)
- VERB-Fin--NOUN-Abs (401)
- VERB-Fin--NOUN-Com (6)
- VERB-Fin--NOUN-Erg (124)
- VERB-Fin--NOUN-Loc (2)
- VERB-Fin--PRON (1)
- VERB-Fin--PRON-Abs (43)
- VERB-Fin--PRON-Abs,Erg (190)
- VERB-Fin--PRON-AbsErg (1)
- VERB-Fin--PRON-Erg (97)
- VERB-Inf--NOUN-Abs (6)
- VERB-Inf--NOUN-Erg (3)
- VERB-Inf--PRON-Abs (2)
- VERB-Inf--PRON-Abs,Erg (11)
- VERB-Inf--PRON-Erg (10)
- VERB-Part--NOUN-Abs (1)
- VERB-Part--PRON-Abs,Erg (2)
- obj
- VERB-Conv--NOUN-Abs (55)
- VERB-Conv--PRON-Abs (2)
- VERB-Conv--PRON-Abs,Erg (5)
- VERB-Fin--NOUN (1)
- VERB-Fin--NOUN-Abs (558)
- VERB-Fin--NOUN-Abs,Erg (1)
- VERB-Fin--NOUN-Abs-ADP(likhi) (1)
- VERB-Fin--NOUN-Abs-ADP(mo) (1)
- VERB-Fin--NOUN-Com (4)
- VERB-Fin--NOUN-Erg (1)
- VERB-Fin--NOUN-Loc (5)
- VERB-Fin--PRON (1)
- VERB-Fin--PRON-Abs (29)
- VERB-Fin--PRON-Abs,Erg (44)
- VERB-Fin--PRON-Erg (1)
- VERB-Inf--NOUN-Abs (140)
- VERB-Inf--NOUN-Abs,Erg (2)
- VERB-Inf--PRON-Abs (7)
- VERB-Inf--PRON-Abs,Erg (17)
- VERB-Inf--PRON-Erg (1)
- VERB-Part--NOUN-Abs (20)
- iobj
- VERB--NOUN-Erg (1)
- VERB-Conv--NOUN-Abs (1)
- VERB-Conv--NOUN-Erg (3)
- VERB-Fin--NOUN-Abs (36)
- VERB-Fin--NOUN-Erg (34)
- VERB-Fin--NOUN-Erg-ADP(haŋ) (1)
- VERB-Fin--NOUN-Loc (2)
- VERB-Fin--PRON-Abs (3)
- VERB-Fin--PRON-Abs,Erg (13)
- VERB-Fin--PRON-Erg (2)
- VERB-Fin--PRON-LocErg (1)
- VERB-Inf--NOUN-Abs (14)
- VERB-Inf--NOUN-Erg (5)
- VERB-Inf--PRON-Abs (4)
- VERB-Inf--PRON-Abs,Erg (1)
Relations Overview
- This corpus uses 21 relation subtypes: acl:nmlz, acl:relcl, advcl:cntf, advcl:coord, advcl:emph, advcl:purp, advcl:sim, advmod:cop, advmod:emph, advmod:nmlz, amod:nmlz, compound:lvc, det:nmlz, flat:foreign, flat:name, flat:num, nmod:nmlz, nmod:poss, nsubj:outer, obj:caus, xcomp:desid
- The following 1 main types are not used alone, they are always subtyped: compound
- The following 8 relation types are not used in this corpus at all: expl, dislocated, cop, clf, fixed, list, goeswith, dep