Treebank Statistics: UD_Hausa-WesternAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
4873 tokens (35%) have a non-empty value of Gender.
922 types (56%) occur at least once with a non-empty value of Gender.
573 lemmas (57%) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags (count; share of all tokens in the treebank): NOUN (2205; 16%), AUX (1489; 11%), PRON (608; 4%), VERB (435; 3%), DET (71; 1%), ADJ (30; 0%), PART (15; 0%), ADP (10; 0%), PROPN (8; 0%), NUM (2; 0%).
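The counts above can in principle be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `read_tokens`, `feature_coverage`) are hypothetical, the `SAMPLE` sentence is a toy example invented for illustration rather than a sentence from the treebank, and details such as skipping multiword-token ranges or treating word forms case-sensitively are assumptions.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Person=3' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Assumption: skip multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_coverage(conllu_text, feature="Gender"):
    """Return (with_feature, total) pairs for tokens, types and lemmas."""
    tokens = list(read_tokens(conllu_text))
    with_feat = [t for t in tokens if feature in t[3]]
    return {
        "tokens": (len(with_feat), len(tokens)),
        "types": (len({t[0] for t in with_feat}), len({t[0] for t in tokens})),
        "lemmas": (len({t[1] for t in with_feat}), len({t[1] for t in tokens})),
        "by_upos": Counter(t[2] for t in with_feat),
    }


# Toy input invented for illustration (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = toy example",
    "1\tsarkiː\tsarkiː\tNOUN\t_\tGender=Masc\t3\tnsubj\t_\t_",
    "2\tyaː\tyaː\tAUX\t_\tGender=Masc|Person=3\t3\taux\t_\t_",
    "3\tzoː\tzoː\tVERB\t_\t_\t0\troot\t_\t_",
])

stats = feature_coverage(SAMPLE)
print(stats["tokens"], dict(stats["by_upos"]))
```

On a real treebank one would pass the concatenated text of the `.conllu` files; the percentages then follow by dividing the first number of each pair by the second.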
NOUN
2205 NOUN tokens (88% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Definite=EMPTY (1571; 71%).
NOUN tokens may have the following values of Gender:
Fem (747; 34% of non-empty Gender): bùdurwaː, màccè, s’oːhuwaː, dàudawaː, màːtaːtai, hiːr̃a, màːtam, kwalbaː, dùbaːr̃àː, gòːdiyaː
Masc (1458; 66% of non-empty Gender): sarkiː, mùtun, maːlàm, maːlàmiː, maːgàniː, gidaː, ƙarhèː, gàːriː, doːkìː, sarmàyiː
EMPTY (313): maːtaː, ruwaː, zuːgàl, sàmàːriː, bìkiː, hannuwàː, ruwan, ɗiyan, hannuː, mutàːneː
| Paradigm gidaː | Masc | Fem |
|---|---|---|
| _ | gidaː, gidaːnai | |
| Definite=Cons | gidam, gidan, gidansù, gidankà, gidaːnaː | gidantà |
Gender seems to be a lexical feature of NOUN: 93% of lemmas (382) occur with only one value of Gender.
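The "lexical feature" judgment rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows; the function name `single_value_share` is hypothetical, the input pairs are toy data loosely modelled on forms listed on this page, and the exact rule used to produce the statistics here is an assumption.

```python
from collections import defaultdict


def single_value_share(lemma_value_pairs):
    """Return (lemmas_with_one_value, lemmas_with_any_value).

    lemma_value_pairs: iterable of (lemma, value) where value is the
    feature value of one token, or None if the feature is empty.
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in lemma_value_pairs:
        if value is not None:  # tokens with an empty feature are ignored
            values_by_lemma[lemma].add(value)
    single = sum(1 for vals in values_by_lemma.values() if len(vals) == 1)
    return single, len(values_by_lemma)


# Toy data for illustration only: one lemma seen with both values,
# two lemmas seen with a single value, one token with no Gender.
pairs = [
    ("gidaː", "Masc"),
    ("gidaː", "Fem"),
    ("sarkiː", "Masc"),
    ("sarkiː", "Masc"),
    ("màccè", "Fem"),
    ("ruwaː", None),
]
single, total = single_value_share(pairs)
print(f"{single} of {total} lemmas occur with only one value")
```

A high share suggests the value is a property of the lemma rather than of the individual word form, which is the sense in which the page calls the feature lexical.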
AUX
1489 AUX tokens (60% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=EMPTY (1446; 97%), Person=3 (1298; 87%), Mood=EMPTY (1077; 72%), Aspect=Perf (763; 51%).
AUX tokens may have the following values of Gender:
Fem (489; 33% of non-empty Gender): tà, taː, tac, bâːta, tanàː, taz, tat, bàtà, tag, tay
Masc (1000; 67% of non-empty Gender): shì, yaː, yac, bâːshi, shinàː, yat, neː, yay, kà, yab
EMPTY (985): à, ankà, sù, sunkà, an, sunàː, akà, anàː, kà, nàː
| Paradigm yaː | Masc | Fem |
|---|---|---|
| Person=2 | kaː, kas, kaɗ, kah, kac, kam, kay, kad, kag, kaj, kak, kar, kash, kat, kaw, kaz, ka’ | kin, kic, kik, kib, kih, kish |
| Person=2\|Polarity=Neg | bàkà | |
| Person=3 | yaː, yac, yat, yay, yab, yak, yag, yaz, yas, yah, yash, yar, ya’, yam, yaɗ, yaw, yaj, yad, yaƙ, yal, yan, yaɓ | taː, tac, taz, tat, tag, tay, tah, tak, tab, tad, taɗ, ta’, taj, tas, tash, tam, taɓ, tal, taƙ, taw |
| Person=3\|Polarity=Neg | bài | bàtà |
PRON
608 PRON tokens (73% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (608; 100%), PronType=EMPTY (530; 87%), Person=3 (481; 79%).
PRON tokens may have the following values of Gender:
Fem (279; 46% of non-empty Gender): ita, ta, tà, mutà, waddà, kì, matà, wancè, mikì, taːshì
Masc (329; 54% of non-empty Gender): shiː, shi, wandà, shì, mai, kai, waːnè, kà, makà, naːtà
EMPTY (226): suː, su, indà, koːwaː, wa’àndà, sù, koːmiː, wandà, mun, musù
| Paradigm wandà | Masc | Fem |
|---|---|---|
| _ | wandà | wandà |
Gender seems to be a lexical feature of PRON: 91% of lemmas (31) occur with only one value of Gender.
VERB
435 VERB tokens (18% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: ExtPos=NOUN (422; 97%).
VERB tokens may have the following values of Gender:
Fem (118; 27% of non-empty Gender): tàhiyàː, ràbuwaː, shìgaː, bugàːwaː, huːdèːwaː, tàhiyàttà, yìwuwaː, ɗiːbàm, cêːwaː, ɗiːbàttà
Masc (317; 73% of non-empty Gender): kwaːnaː, sôː, yîn, yîː, sôn, tàhiyàːtai, sôːnai, zuwàː, cîː, kiràn
EMPTY (2015): yi, cêː, ajèː, zoː, tàhi, sâː, sàːmu, baː, ci, kai
| Paradigm yi | Masc | Fem |
|---|---|---|
| Definite=Cons | yîn, yîm | |
| Definite=Cons\|VerbForm=Vnoun | yîn | |
| _ | yîː, yîːnai | |
| VerbForm=Vnoun | yîː | yôwwaː |
Gender seems to be a lexical feature of VERB: 91% of lemmas (92) occur with only one value of Gender.
DET
71 DET tokens (35% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (52; 73%), Definite=EMPTY (38; 54%).
DET tokens may have the following values of Gender:
Fem (20; 28% of non-empty Gender): wata, koːwacè, koːwàcè, wani
Masc (51; 72% of non-empty Gender): wani, wânnam, koːdàwane, wânga, wânnan, wannàn
EMPTY (130): nan, wa’ànnan, du’, ga, wasu, dum, du’, duk, dul, duw
| Paradigm wani | Masc | Fem |
|---|---|---|
| _ | wani | wani, wata |
ADJ
30 ADJ tokens (48% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (30; 100%).
ADJ tokens may have the following values of Gender:
Fem (14; 47% of non-empty Gender): ‘yaƙ, màccè, kwikwiyàː, ƙaramaː, ƙàramaː, ’yak, ’yam, hwarab, hwaram, màccên
Masc (16; 53% of non-empty Gender): ɗanyen, baƙiː, hwarin, hwariː, jàː, mùlmùlalleː, sauran, saːboː, ɗanyeː, bàbbam
EMPTY (32): ’yam, ɗai, hwarhwarun, mayyaː, sauran, ƙanaːnàː, hwarhwaruː, jàd, kaɗai, saːbiː
Gender seems to be a lexical feature of ADJ: 100% of lemmas (17) occur with only one value of Gender.
PART
15 PART tokens (4% of all PART tokens) have a non-empty value of Gender.
The most frequent other feature values with which PART and Gender co-occurred: Case=Gen (15; 100%), Number=EMPTY (15; 100%), PartType=Case (15; 100%), Polarity=EMPTY (15; 100%).
PART tokens may have the following values of Gender:
Fem (15; 100% of non-empty Gender): ta
EMPTY (359): na, ba, kuma, kâu, mài, ta, dai, bâː, maː, àkwai
ADP
10 ADP tokens (2% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (3; 30% of non-empty Gender): s’àkaːnintà, wajentà, wurintà
Masc (7; 70% of non-empty Gender): wuriːnai, gàr̃ai, wurinkà, wuriːnaː, s’akaːninkà
EMPTY (622): dà, cikin, gà, mà, wurin, wurim, sai, cikim, bisà, bandà
PROPN
8 PROPN tokens (25% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=EMPTY (8; 100%).
PROPN tokens may have the following values of Gender:
Masc (8; 100% of non-empty Gender): bàhillaːcèː, ùbangijìː, bàhillaːcèn
EMPTY (24): allàː, hillàːniː, hàusàːwaː, abzinaːwaː, abzìn, bar̃ar̃oːjì, buːzàːyeː, bàhaushèː, kac’inaːwan, kyakkyataːwaː
NUM
2 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (1; 50% of non-empty Gender): dubuː
Masc (1; 50% of non-empty Gender): ɗàrîn
EMPTY (44): gùdaː, biyu, tar̃à, bakwài, ɗàriː, huɗu, shâː, ɗaya, biyun, goːmà
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[nmod]–> NOUN (163; 51%),
PRON –[appos]–> NOUN (57; 88%),
VERB –[nmod]–> NOUN (53; 50%),
NOUN –[cop]–> AUX (34; 65%),
NOUN –[amod]–> ADJ (17; 74%),
NOUN –[appos]–> NOUN (15; 71%),
NOUN –[reparandum]–> NOUN (12; 75%),
NOUN –[appos]–> PRON (11; 92%),
NOUN –[acl:relcl]–> NOUN (8; 89%),
AUX –[nsubj]–> NOUN (7; 78%).
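Agreement counts of this kind can be approximated by walking each dependency edge and comparing the Gender values of parent and child. The sketch below is illustrative only: `agreement_counts` is a hypothetical name, the token dictionaries are toy data, and the choice of denominator (edges where both nodes carry a non-empty Gender) is an assumption, since the page does not state how its percentages are defined.

```python
from collections import Counter


def agreement_counts(sentences, feature="Gender"):
    """Count agreeing edges per (parent UPOS, deprel, child UPOS).

    sentences: list of sentences, each a list of dicts with the keys
    'id', 'upos', 'feats' (dict), 'head' and 'deprel'.
    Returns (agree, both): edges where the values match, and edges
    where both nodes have a non-empty value for the feature.
    """
    agree, both = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:  # root (head 0) or a dangling head
                continue
            parent_val = parent["feats"].get(feature)
            child_val = child["feats"].get(feature)
            if parent_val is None or child_val is None:
                continue
            key = (parent["upos"], child["deprel"], child["upos"])
            both[key] += 1
            if parent_val == child_val:
                agree[key] += 1
    return agree, both


# Toy sentence for illustration only (not from the treebank).
toy = [[
    {"id": 1, "upos": "NOUN", "feats": {"Gender": "Fem"}, "head": 0, "deprel": "root"},
    {"id": 2, "upos": "ADJ", "feats": {"Gender": "Fem"}, "head": 1, "deprel": "amod"},
    {"id": 3, "upos": "NOUN", "feats": {"Gender": "Masc"}, "head": 1, "deprel": "nmod"},
]]
agree, both = agreement_counts(toy)
for key in both:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]}")
```

Sorting the keys of `agree` by count and taking the first ten would give a ranking in the same spirit as the list above.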