Treebank Statistics: UD_Hausa-NorthernAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
4462 tokens (29%) have a non-empty value of Gender.
847 types (47%) occur at least once with a non-empty value of Gender.
557 lemmas (51%) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: NOUN (1869; 12% of instances), AUX (1602; 10% of instances), PRON (436; 3% of instances), VERB (331; 2% of instances), DET (74; 0% of instances), ADJ (62; 0% of instances), PROPN (39; 0% of instances), PART (32; 0% of instances), ADP (13; 0% of instances), NUM (4; 0% of instances).
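The coverage figures above can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the helper names (`parse_feats`, `iter_tokens`, `gender_coverage`) are hypothetical, the sample sentence is a toy input rather than treebank data, and it assumes that only ordinary syntactic words (integer IDs) are counted.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Masc|Number=Sing' into a dict."""
    if feats in ("_", ""):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def iter_tokens(lines):
    """Yield (form, lemma, upos, feats_dict) for each syntactic word."""
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def gender_coverage(lines, feature="Gender"):
    """Return {upos: (tokens with the feature, all tokens)}."""
    total = Counter()
    with_feature = Counter()
    for _form, _lemma, upos, feats in iter_tokens(lines):
        total[upos] += 1
        if feature in feats:
            with_feature[upos] += 1
    return {upos: (with_feature[upos], total[upos]) for upos in total}

# Toy input for illustration only; the annotation is invented, not from the treebank.
SAMPLE = """\
# text = toy sentence
1\twata\twani\tDET\t_\tGender=Fem|PronType=Ind\t2\tdet\t_\t_
2\tsaːnìyaː\tsaːnìyaː\tNOUN\t_\tGender=Fem\t0\troot\t_\t_
3\tdà\tdà\tADP\t_\t_\t4\tcase\t_\t_
4\truwaː\truwaː\tNOUN\t_\t_\t2\tnmod\t_\t_
"""

for upos, (marked, total) in gender_coverage(SAMPLE.splitlines()).items():
    print(f"{upos}: {marked} of {total} tokens have a non-empty Gender")
```

To run it on a real file, pass an open file handle instead of `SAMPLE.splitlines()`; the per-tag pairs then correspond to the "N tokens (P% of all X tokens)" lines in the sections below.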
NOUN
1869 NOUN tokens (83% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (1869; 100%), Definite=EMPTY (1226; 66%).
NOUN tokens may have the following values of Gender:
Fem (429; 23% of non-empty Gender): hàukaː, kuːraː, kyàutaː, saːnìyaː, laːhiyàː, gàyyaː, mutuwàː, dàːmisàː, raggàː, duːkìyaː
Masc (1440; 77% of non-empty Gender): maulòː, yaːƙìː, gidaː, doːkìː, gàriː, yaːròː, doːkìːnai, sarkiː, mùtun, loːkàcîn
EMPTY (382): jàkkai, ruwaː, mutàːneː, shaːnuː, itàːceː, màːlàmmai, ƴan, baːyuː, hàukaː, yâːra
| Paradigm doːkìː | Masc | Fem |
|---|---|---|
| _ | doːkìː, doːkìːnai | doːkìːnaː |
| Definite=Cons | doːkìn, doːkìːnai, doːkìnku, doːkìmmu, doːkìnka | |
Gender seems to be a lexical feature of NOUN: 94% of lemmas (361) occur with only one value of Gender.
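The "lexical feature" judgement rests on how many lemmas of a given tag always carry the same Gender value. A minimal sketch of that check follows, assuming tokens are already available as `(lemma, upos, gender)` triples with `None` for an empty value. The function name `single_value_share` and the toy triples are hypothetical, and the page does not say which threshold the generator uses to call a feature lexical.

```python
from collections import defaultdict

def single_value_share(tokens, upos):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    values = defaultdict(set)
    for lemma, tag, gender in tokens:
        # Only tokens of the requested tag with a non-empty Gender are considered.
        if tag == upos and gender is not None:
            values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Toy triples for illustration only; they are not counts from the treebank.
TOKENS = [
    ("doːkìː", "NOUN", "Masc"),
    ("doːkìː", "NOUN", "Fem"),
    ("gidaː", "NOUN", "Masc"),
    ("gidaː", "NOUN", "Masc"),
    ("ruwaː", "NOUN", None),
    ("wani", "DET", "Fem"),
]

single, total = single_value_share(TOKENS, "NOUN")
print(f"{single} of {total} NOUN lemmas occur with only one value of Gender")
```

A high share suggests the value belongs to the lemma itself; a low share suggests the feature is inflectional, with the same lemma appearing in both genders.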
AUX
1602 AUX tokens (63% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=EMPTY (1602; 100%), Person=3 (1350; 84%), Mood=EMPTY (1343; 84%), Aspect=Perf (1131; 71%).
AUX tokens may have the following values of Gender:
Fem (304; 19% of non-empty Gender): tac, tà, taː, kì, tanàː, tay, tak, taz, ta’, tas
Masc (1298; 81% of non-empty Gender): yac, yaː, shì, shinàː, kà, yat, yay, yak, yaz, kaː
EMPTY (936): sunkà, ankà, naː, ìn, à, kù, nàː, sunàː, sun, akà
| Paradigm yaː | Masc | Fem |
|---|---|---|
| Person=2 | kaː, kaz, kac, kay, kash, kat, kaj, kak, kam, kas | kyaː, kig, ki', kis |
| Person=2|Polarity=Neg | bàkà | |
| Person=3 | yac, yaː, yat, yay, yak, yaz, yas, yah, yab, yag, ya', yash, yaɗ, yaj, yaƙ, yam, yar, ya, yas', yats, yaw, yaɓ, yad, yan | tac, taː, tay, tak, taz, ta', tas, tat, tag, tah, taɗ, tab, taj, taƙ, taɓ, tam, tar, tash, taw |
| Person=3|Polarity=Neg | bài | bàtà |
PRON
436 PRON tokens (56% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (436; 100%), PronType=Prs (377; 86%), Person=3 (281; 64%).
PRON tokens may have the following values of Gender:
Fem (78; 18% of non-empty Gender): ita, matà, tà, ta, keː, wàccan, naːkì, wàgga, maw, kânki
Masc (358; 82% of non-empty Gender): mai, shiː, shì, shi, kai, kà, ka, makà, naːshì, wandà
EMPTY (346): niː, sù, musù, indà, min, suː, ni, koːwaː, koːmiː, mukù
| Paradigm nân | Masc | Fem |
|---|---|---|
| | wànga, wannàn | wàgga |
Gender seems to be a lexical feature of PRON: 93% of lemmas (37) occur with only one value of Gender.
VERB
331 VERB tokens (13% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: ExtPos=NOUN (310; 94%), VerbForm=Vnoun (265; 80%).
VERB tokens may have the following values of Gender:
Fem (129; 39% of non-empty Gender): tàhiyàː, zakkùwaː, bìyash, gàmuwaː, bìɗaː, hwaːɗùwaː, cêːwaː, kankaryaː, sarɓaː, cèː
Masc (202; 61% of non-empty Gender): yîː, sôn, cîn, zuwàː, kwan, sôː, gudùː, zamaː, bugùn, ciːzòn
EMPTY (2190): cèː, yi, cêː, tàhi, zakà, ga, zoː, baː, kai, ji
| Paradigm tah- | Masc | Fem |
|---|---|---|
| Definite=Cons | tàhiyàkka | |
| Definite=Cons|ExtPos=NOUN | tàhiyàkka | |
| Definite=Cons|ExtPos=NOUN|VerbForm=Vnoun | tàhiyàːtai | tàhiyàkkà, tàhiyàkkù, tàhiyàːtai, tàhiyàːtaː |
| ExtPos=NOUN | tàhiyàː | |
| ExtPos=NOUN|VerbForm=Vnoun | tàhiyàːtai | tàhiyàː, tohoːwàː |
Gender seems to be a lexical feature of VERB: 94% of lemmas (91) occur with only one value of Gender.
DET
74 DET tokens (24% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (62; 84%), Definite=Spec (48; 65%), PronType=Ind (42; 57%).
DET tokens may have the following values of Gender:
Fem (26; 35% of non-empty Gender): wata, tan, wàccân, waccè, wacèː, wàcceː, wàttan
Masc (48; 65% of non-empty Gender): wani, wànga, wanèː, wàncân, wàndon, wânga
EMPTY (230): nan, ga, dukà, wasu, duk, dug, duy, su, du’, dus
| Paradigm wani | Masc | Fem |
|---|---|---|
| | wani | wata |
ADJ
62 ADJ tokens (62% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (62; 100%), Definite=Cons (49; 79%).
ADJ tokens may have the following values of Gender:
Fem (12; 19% of non-empty Gender): ‘yab, hàlle-hàllan, ƙaːtanyàː, ‘yak, ‘yash, ‘yat, kàrad, wacèː, ƴag, ƴak
Masc (50; 81% of non-empty Gender): ɗan, namijì, ɗam, baƙiː, ƙàramiː, hwarin, hwariː, wajjan, yànkakkeː, ƙaːtòn
EMPTY (38): hàlle-hàllan, daban, jaː, jàkkai, maːtaː, ‘yan, banzaː, hàlleː-hàllan, ‘yam, hwarhwarun
| Paradigm ɗan | Masc | Fem |
|---|---|---|
| _ | ɗan | |
| Definite=Cons | ɗan, ɗam | 'yab, 'yak, 'yash, 'yat |
PROPN
39 PROPN tokens (4% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (1; 3% of non-empty Gender): gaskiyaː
Masc (38; 97% of non-empty Gender): Ɗiɗìː, Tatìː, Garbaː, Buːzuː, Bàhaushèː, Garbà, Bàgawailèː, zaːrùmiː
EMPTY (889): Dikkò, Garbaː, Dandà, Garbà, Gilaːgè, Allàː, ‘YabBàraːya, Tatìː, Banìyoːgubà, Tudùː
Gender seems to be a lexical feature of PROPN: 100% of lemmas (10) occur with only one value of Gender.
PART
32 PART tokens (5% of all PART tokens) have a non-empty value of Gender.
The most frequent other feature values with which PART and Gender co-occurred: Number=EMPTY (32; 100%), Polarity=EMPTY (32; 100%).
PART tokens may have the following values of Gender:
Fem (22; 69% of non-empty Gender): ta, tàː, taː
Masc (10; 31% of non-empty Gender): mài, bàƙoː, naː, tôː, àkwai
EMPTY (640): ba, ta, nàː, gàː, màːsu, dai, mài, kuma, baːbù, bàː
| Paradigm neː | Masc | Fem |
|---|---|---|
| _ | tàː | |
| PartType=Foc | naː | tàː, taː |
ADP
13 ADP tokens (2% of all ADP tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADP and Gender co-occurred: Case=EMPTY (13; 100%).
ADP tokens may have the following values of Gender:
Fem (1; 8% of non-empty Gender): tsakaɗ
Masc (12; 92% of non-empty Gender): bàːkin, gàreː
EMPTY (544): dà, mà, gà, cikin, dan, gàreː, bisà, shâː, dàb, dàc
NUM
4 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (1; 25% of non-empty Gender): shiddà
Masc (3; 75% of non-empty Gender): buy
EMPTY (102): gùdaː, buy, goːmà, ukkù, ɗai, shiddà, ɗàriː, tàlàːtin, huɗu, bakwài
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (44; 80%),
NOUN –[compound]–> NOUN (24; 62%),
NOUN –[nmod:poss]–> PRON (15; 63%),
NOUN –[nmod]–> VERB (12; 60%),
PRON –[appos]–> NOUN (10; 83%),
NOUN –[acl:relcl]–> NOUN (9; 75%),
NOUN –[dislocated]–> NOUN (9; 69%),
VERB –[nmod]–> NOUN (8; 62%),
NOUN –[parataxis]–> NOUN (6; 75%),
VERB –[det]–> DET (6; 86%).
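Agreement counts of this kind can be sketched as a single pass over parent-child pairs. The example below is an illustration under stated assumptions, not the generator's actual logic: it assumes the percentage is taken over pairs in which both nodes carry a Gender value (the page does not state the denominator). The function name `agreement_counts`, the token dictionaries, and the toy sentence are hypothetical.

```python
from collections import Counter

def agreement_counts(sentences, feature="Gender"):
    """Return {(parent_upos, deprel, child_upos): (agreeing pairs, pairs with both values)}."""
    agree = Counter()
    both = Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root (head 0) or a head outside the sentence
            p_val = parent["feats"].get(feature)
            c_val = child["feats"].get(feature)
            if p_val is None or c_val is None:
                continue  # assumption: only pairs where both nodes have the feature count
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if p_val == c_val:
                agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}

# Toy sentence for illustration only; the annotation is invented.
SENTENCES = [[
    {"id": 1, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Gender": "Masc"}},
    {"id": 2, "head": 1, "upos": "ADJ", "deprel": "amod", "feats": {"Gender": "Masc"}},
    {"id": 3, "head": 1, "upos": "DET", "deprel": "det", "feats": {"Gender": "Fem"}},
    {"id": 4, "head": 1, "upos": "ADP", "deprel": "case", "feats": {}},
]]

for (p_upos, deprel, c_upos), (n_agree, n_both) in agreement_counts(SENTENCES).items():
    print(f"{p_upos} -[{deprel}]-> {c_upos}: {n_agree} of {n_both} pairs agree")
```

Sorting the result by the first number of each pair and keeping the top ten would yield a ranking shaped like the list above.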