UD Hausa SouthernAutogramm
Language: Hausa (code: ha)
Family: Afro-Asiatic
This treebank has been part of Universal Dependencies since the UD v2.14 release.
The following people have contributed to making this treebank part of UD: Bernard Caron.
Repository: UD_Hausa-SouthernAutogramm
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY-SA 4.0
Genre: spoken
Questions, comments? General annotation questions (either Hausa-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [bernard • l • caron (æt) gmail • com]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually |
| UPOS | annotated manually, natively in UD style |
| XPOS | not available |
| Features | annotated manually, natively in UD style |
| Relations | annotated manually, natively in UD style |
Description
This treebank contains data of Southern Autogramm, for the Zaria dialect of Nigeria (Southern Hausa).
The Zaria (Southern) Hausa, is a “modern” version of the language where the 3-way opposition (masculine / feminine / plural) has been abandoned in the noun system, and only the plurality feature is maintained, while the feminine gender is kept in the pronominal and TAM system.
The treebank contains 1,918 sentences and 14,585 tokens.
It is maintained in the SUD framework: SUD_Hausa-SouthernAutogramm and converted automatically in UD.
Acknowledgments
References
Caron, Bernard. 2015. Hausa Grammatical Sketch. In Amina Mettouchi, Martine Vanhove & Dominique Caubet (eds.), Corpus-based Studies of Lesser-described Languages. The CorpAfroAs corpus of spoken AfroAsiatic languages. Amsterdam-Philadelphia: John Benjamins. https://halshs.archives-ouvertes.fr/halshs-00647533.
Statistics of UD Hausa SouthernAutogramm
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
Aspect – Case – Definite – Deixis – ExtPos – Foreign – Gender – Number – PartType – Person – Polarity – PronType – Reflex – Tense – VerbForm – Voice
Relations
acl – acl:relcl – advcl – advcl:cleft – advmod – amod – appos – aux – case – cc – cc:preconj – ccomp – compound – compound:prt – compound:svc – conj – cop – csubj – dep – det – discourse – dislocated – flat – flat:foreign – flat:name – iobj – mark – nmod – nsubj – nummod – obj – obl – obl:arg – parataxis – punct – reparandum – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 1927 sentences and 14401 tokens.
- All tokens in this corpus are followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 46 types of words that contain both letters and punctuation. Examples: na'àm, sa'ànnan, zaː'à, baː'àː, sànaː'àn, du', mêː-mêː, Mà'aːzù, kalàː-kalàː, loːkàci:, sa'àn, zaːmàni:, Tudùn-Wàda, Tudùn-Wàdân, baƙa-baƙaː, baː'à, dakà:, es-ès, eː'àː, gar̃gajiya:, gishiri-gishiri, gàriː-ǹ, gìne-gìne, ha', hanyà:, hanyàː-n, haʔà:, irìː-irìː, irìː-n, jeːfì-jeːfì, ka:, koːwa:, lâifiː-n, mà'àːnaː, mêː-mêː-mêː, m̀:hm:, r̃uwa-r̃uwa, shùːke-shùːke, su:, sàbà'in, tsoːhoː-nsù, àl'amur̃àː, ƙaɓe-ƙàɓè, ƙuliː-ƙulin, ƴan'uwân, ɗai-ɗai
Morphology
Tags
- This corpus uses 16 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: SYM
- This corpus contains 26 word types tagged as particles (PART): ba, bà, bà~, bàː, bâ, bâː, dai, dà, fa, fâ, gàː, kar̃, kaɗà, koː, kuma, kàm, kèːnan, kòː, kùwa, kùwâː, maː, neː, nèː, ta, zâː, àkwai
- This corpus contains 76 lemmas tagged as pronouns (PRON): =kà, =shì, >, dukà, ita, ka, kai, keː, ki, koːmeː, koːwa:, koːwaː, ku, kuː, kâinaː, kânkà, kânmù, kânshì, kântà, kù, makà, manà, mashì, masà, masù, matà, mikì, mishì, mu, mukù, musù, muː, mâi, mèneːnèː, mèː, mèːneː, mèːneːnèː, mêː-mêː, mîn, mù, naːkà, naːkù, naːmù, naːshì, naːsù, naːtà, ni, niː, nàːwa, nàːwaː, shi, shiː, su, su:, suː, sù, ta, taːkù, tà, wancàn, wandà, wani, wannàn, wasu, waɗàndà, waɗànnan, waː, waːnè, wutaː, wànnan, wàː, wàːneː, wàːneːnèː, wânnan, ɗin, ʔàʔè
- This corpus contains 18 lemmas tagged as determiners (DET): dukà, nan, nàn, nân, su, the, wancàn, wancàː, wani, wannàn, wasu, wata, waɗànnan, wànneː, wànè, wânnan, ɗin, ɗîn
- Out of the above, 9 lemmas occurred sometimes as PRON and sometimes as DET: dukà, su, wancàn, wani, wannàn, wasu, waɗànnan, wânnan, ɗin
- This corpus contains 1 lemmas tagged as auxiliaries (AUX): _
- There are 2 (de)verbal forms:
- Part
- VERB: zàune, jìbge, kwànce
- Vnoun
- VERB: noːman, noːmaː, yîː, zuwàː, saːmùn, cîː, jîn, sôː, tunàːwaː, kiɗàː
Nominal Features
- Fem
- ADJ: ƴar̃
- AUX: ta, tà, taː, tanàː, kin, zaːtà, takàn, kikà, kì, bàtà
- DET: wata
- NOUN: suːnantà, wurintà, hankàlintà, iyàːyentà, mijìntà, ƴar̃, bàːƙuwaː, dàliːlìntà, dòːdannìyaː, gaːdòntà
- PRON: ita, tà, ta, matà, keː, naːtà, ki, wata, kântà, kì
- VERB-Vnoun: kiràntà, ràsuwantà, wor̃ɓantà, yîntà
- Masc
- ADP: ɗinshì, kàmankà
- AUX: ya, yaː, nèː, kaː, kà, yà, yanàː, ka, zâi, kanàː
- DET: wani
- NOUN: bàːbanshì, dàngàntakànshì, gidanshì, wàːsanshì, iyàːyenshì, mahallakanshì, wajenshì, wurinshì, yaːrònshì, yiwankà
- PRON: shiː, shi, shì, mishì, makà, kai, mashì, kà, wani, ka
- VERB-Vnoun: jînkà, yînshì, cînshì, duːkànshì, fànciyankà, gudùnkà, jiranshi, kirànshi, kirànshì, kwànciyanshì
- Plur
- ADJ: ƴan, tsòːfàffin
- ADP: màːsu, cikinsù
- AUX: nèː, sukà, mukà, mù, munàː, sun, sunàː, mukàn, mun, kukàn
- DET: wânnan, su, wasu, waɗànnan
- NOUN: mutàːneː, sauransù, shaːnuː, abuːbuwàn, dabboːbiː, abuːbuwàː, yâːraː, saiwoːyiː, gidanmù, gidankù
- PRON: wânnan, suː, manà, muː, mù, naːmù, su, mu, sù, masù
- PROPN: Fulàːniː, Filàːniː, Kanaːwaː, Katsinaːwaː, Tuːr̃aːwaː, Bàfilàːnin, Fulàːnîn, Sakkwataːwaː
- VERB: caccànzaː, masàyaː, ciccìkaː, daddàurè, r̃ar̃r̃àbaː, tattàːrà, tàttàfi, yanyànkà, yâːraː, duːkànmù
- VERB-Vnoun: duːkànmù, noːmanmù, saːmùnsù, ƙwàːranmù
- Sing
- ADP: mài
- AUX: naː, zân, zaːkà, inàː, na, ìn, zaːʼà, bàn, bân, zaːkì
- PRON: niː, mîn, ni, nì, nàːwa, kâinaː, nàːwaː, shiː
- Dat
- ADP: mà, wà
- PRON: mishì, mîn, makà, manà, mashì, matà, masù, mâi, musù, masà
- Gen
- ADP: cikinsù, ɗinshì, kàmankà
- NOUN: suːnantà, gidanmù, bàːbanshì, gidankù, gidanshì, kàːwuːnaː, wurintà, wàːsanshì, alloːlinsù, hankàlintà
- PRON: naːmù, naːtà, nàːwa, naːshì, naːsù, =kà, naːkà, naːkù, nàːwaː, taːkù
- VERB-Vnoun: ràsuwantà, cînshì, duːkànshì, fànciyankà, gudùnkà, kwànciyanshì, sanìnshì, sônshì, tàfiyànshì, tàmbayàːnaː
- Nom
- PRON: shiː, ita, niː, suː, muː, ni, kai, mu, shi, duy
- Cons
- ADJ: ainihin, farin, saːbon, baƙin, bàbban, kaurin, tsantsan, tsoːhon, yawàn, tsòːfàffin
- ADP: na, irìn, kân
- ADV: bana, kùr̃ùngùn, yànzûn
- DET: waɗànnan
- NOUN: àbin, àbîn, gidan, irìn, loːkàcin, gàrin, dàliːlìn, ruwan, suːnan, tsaːmiyan
- NUM: sìttin, tàlàːtin, àshìr̃in, ɗayan, ɗàr̃in, goːmàn
- PROPN: Ùngwan, Bàːsân, Gùndumàn, Fulàːniː, Bàtuːr̃èn, Maːlàn, Muːsa, Saːnin, Ɗan, Bàfilàːnin
- VERB: noːman, saːmùn, jîn, neːman, yîn, sôn, cîn, ganin, kiràntà, saːran
- VERB-Vnoun: noːman, saːmùn, jîn, neːman, yîn, sôn, cîn, ganin, kiràntà, saːran
- Def
- ADV: nan
- DET: wânnan, nan, ɗîn, waɗànnan
- NOUN: loːkàcîn, wân, àbîn, ƙanèn, daːjìn, irìn, gidân, maːlàmîn, wajên, àboːkîn
- PRON: wânnan, wànnan, waɗànnan
- PROPN: Bàːsân, Filàːnîn, Ìsìlàːmiyàn, Fulàːnîn
- Ind
- NOUN: gidaː, zaːmàniː, àbù, ruwaː, zoːmoː, gàriː, àmfàːniː, aikìː, daːjìː, loːkàcîn
- Spec
- DET: wani, wata, wasu
- PRON: wani, wasu
Degree and Polarity
- Neg
- AUX: bàkà, bàn, bài, bàmù, bàʼà, baː'àː, baːyà, baːkàː, baːmàː, bàsù
- INTJ: baːbù, ba, bâː
- PART: ba, bàː, bà, bâː, kar̃, kaɗà, bà~, bâ
Verbal Features
- Aor
- AUX: kà, à, yà, mù, tà, sù, ìn, kù, kì, shì
- Hab
- AUX: mukàn, kukàn, takàn, yakàn, akàn, sukàn
- Iter
- PART: ta
- Perf
- AUX: yaː, kaː, an, naː, sun, mun, taː, kin, kun, am
- PerfBkg
- AUX: ya, ta, akà, sukà, mukà, ka, na, kikà, kukà, kakèː
- PerfNeg
- AUX: bàkà, bàn, bài, bàmù, bàʼà, bàsù, bàtà, bàkì
- Prog
- AUX: anàː, yanàː, munàː, sunàː, inàː, kanàː, nàː, tanàː, kunàː, kinàː
- ProgBkg
- AUX: akèː, mukèː, kukèː, sukèː, yakèː, kèː, kakèː, kukà, takèː, kikèː
- ProgNeg
- AUX: baː'àː, baːkàː, baːmàː, baːyàː, bân, baːkà, baːtà, baːnàː, baːsàː, baːsù
- Fut
- AUX: zâi, zân, zaːkà, zaːsù, zaːʼà, zaːtà, zaː'à, zaːkì, zaːmù, zaːʔà
- Pred
- AUX: kyâː, kâː, mwâː, tâː, âː
- Cau
- VERB: sayar̃, s~
- Stat
- VERB-Part: zàune, jìbge, kwànce
Pronouns, Determiners, Quantifiers
- Dem
- ADV: nan, nân, can, cân
- DET: wânnan, nan, ɗîn, nàn, wannàn, wancàn, waɗànnan, nân
- PRON: wânnan, wannàn, wànnan, wancàn, waɗànnan
- Ind
- DET: wani, wasu, wata
- PRON: koːmeː, koːwaː, wani, koːmiː, koːwa:, waːnè, wasu
- Int
- ADV: ìnaː, yàːyàː, yàushèː, yàyàː, yàː
- DET: wànè, wànneː
- NUM: nawà
- PRON: mèː, mèːneː, mèːneːnèː, wàː, mèneːnèː, mêː-mêː, wàːneː, wàːneːnèː, wâː, wǎːi
- Prs
- PRON: shiː, shi, ita, shì, suː, mishì, tà, mîn, niː, makà
- Rel
- ADV: yandà, indà, yaddà
- PRON: wandà, waɗàndà
- Tot
- DET: dug, duk, dun, dus
- PRON: dukà, duk, dun, duy
- Yes
- PRON: kânmù, kânshì, kâinaː, kâmmù, kânkà, kântà
- 1
- AUX: mukà, naː, mù, zân, munàː, inàː, mukàn, mun, na, ìn
- NOUN: gidanmù, kàːwuːnaː, hanyànmù, kàːkaːnaː, màsàràutunmù, bàːbanaː, cinyànmù, goːnanmù, hannuːnaː, iyàːyenmù
- PRON: niː, mîn, manà, muː, mù, naːmù, ni, mu, nì, nàːwa
- VERB-Vnoun: duːkànmù, noːmanmù, tàmbayàːnaː, ƙwàːranmù
- 2
- ADP: kàmankà
- AUX: kaː, kà, ka, zaːkà, kanàː, kukàn, kin, bàkà, kukèː, kakèː
- NOUN: gidankù, yiwankà, hankàlinkì, kânkì, àlhakinkì, àmfàːninku, ƙar̃finkà
- PRON: suː, kà, kai, makà, ka, keː, ki, dukà, ku, mukù
- VERB-Vnoun: jînkà, fànciyankà, gudùnkà, ròːƙonkà
- 3
- ADP: cikinsù, ɗinshì
- AUX: ya, yaː, ta, sukà, yà, yanàː, zâi, sun, tà, sunàː
- NOUN: sauransù, suːnantà, bàːbanshì, dàngàntakànshì, gidanshì, wurintà, wàːsanshì, alloːlinsù, gàrinsù, hankàlintà
- PRON: shiː, shi, ita, shì, mishì, tà, ta, mashì, matà, su
- VERB-Vnoun: kiràntà, ràsuwantà, yînshì, cînshì, duːkànshì, jiranshi, kirànshi, kirànshì, kwànciyanshì, sanìnshi
- 4
- AUX: akà, à, an, anàː, akèː, zaː'à, akàn, bàʼà, kà, baː'àː
- PRON: makà, mâː
Other Features
- Deixis
- Prox
- ADV: nân
- DET: nàn, wannàn, nân
- PRON: wannàn
- Remt
- ADV: can, cân
- DET: wancàn
- PRON: wancàn
- Prox
- ExtPos
- ADP
- NOUN: kàmaːnaː
- PRON: dud
- ADV
- ADV: zàune
- VERB: zàune, kwànce
- VERB-Part: zàune, kwànce
- NOUN
- NOUN: har̃kàn, noːmân, tsoːhuwaː, girman, goːnan, gàːrin, gìne-gìne, niːsaː, noːman, sigàː
- NUM: goːmàn
- PROPN: Basaːwaː
- VERB: noːmaː, noːman, yîː, zuwàː, saːmùn, cîː, jîn, sôː, tunàːwaː, neːman
- VERB-Vnoun: noːman, noːmaː, yîː, zuwàː, saːmùn, cîː, jîn, sôː, tunàːwaː, neːman
- PRON
- PRON: wandà
- ADP
- Foreign
- Yes
- ADJ: fir̃aːmar̃i, jiːniyo, sakandir̃iː
- ADP: of
- ADV: especia~
- CCONJ: but
- DET: the
- INTJ: OK, lìllaːhì, àlhamdù, sòː
- NOUN: poison, police, kilaːs, sùkûːl, Eːbìːyù, TV, bitch, bìr̃êːk, chemistry, drinks
- PROPN: Feːdar̃al, Gor̃illas
- VERB: checking, escaping, pr̃etending
- Yes
- PartType
- Adv
- PART: ta
- Disc
- PART: koː
- Foc
- PART: nèː, kèːnan, neː
- Neg
- PART: ba, bàː, bà, bâː, kar̃, kaɗà, bà~, bâ
- Pred
- PART: gàː, àkwai, zâː, dà
- Top
- PART: maː, dai, kuma, fa, kòː, kùwa, kàm, koː, kùwâː, fâ
- Adv
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: _.
- This corpus uses 1 lemmas as auxiliaries (aux). Examples: _.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (85)
- VERB--NOUN-ADP(mài) (2)
- VERB--NOUN-Gen (6)
- VERB--NOUN-Gen-ADP(mài) (1)
- VERB--PRON (8)
- VERB--PRON-Nom (1)
- VERB-Part--NOUN (1)
- VERB-Vnoun--NOUN (5)
- obj
- VERB--NOUN (375)
- VERB--NOUN-ADP(dà) (1)
- VERB--NOUN-ADP(mài) (2)
- VERB--NOUN-ADP(na/ta) (3)
- VERB--NOUN-Gen (19)
- VERB--NOUN-Gen-ADP(kân) (1)
- VERB--NOUN-Gen-ADP(wai) (1)
- VERB--PRON (135)
- VERB--PRON-ADP(dà) (1)
- VERB--PRON-Nom (4)
- VERB--PRON-Nom-ADP(wai) (1)
- VERB-Vnoun--NOUN (15)
- VERB-Vnoun--NOUN-Gen (2)
- VERB-Vnoun--PRON (4)
- iobj
- VERB--NOUN (2)
- VERB--PRON (35)
- VERB--PRON-Dat (96)
- VERB--PRON-Nom (2)
- VERB-Vnoun--NOUN (1)
- VERB-Vnoun--PRON (1)
- VERB-Vnoun--PRON-Dat (1)
Verbs with Reflexive Core Objects
- This corpus contains 1 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: yi kânmù
Relations Overview
- This corpus uses 8 relation subtypes: acl:relcl, advcl:cleft, cc:preconj, compound:prt, compound:svc, flat:foreign, flat:name, obl:arg
- The following 6 relation types are not used in this corpus at all: expl, clf, fixed, list, orphan, goeswith