Treebank Statistics: UD_Hausa-WesternAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
4877 tokens (35%) have a non-empty value of Gender.
922 types (56%) occur at least once with a non-empty value of Gender.
664 lemmas (59%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags (token count; share of all tokens in the treebank): NOUN (2206; 16%), AUX (1479; 11%), PRON (606; 4%), VERB (435; 3%), DET (71; 1%), ADP (42; 0%), ADJ (28; 0%), PROPN (8; 0%), NUM (2; 0%).
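Counts of this kind can be reproduced from a CoNLL-U file by reading the UPOS and FEATS columns. The following is a minimal sketch, not the script that generated this page: the function names and the three-token sample are illustrative assumptions, and the sample annotations are invented for the example rather than copied from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Masc|Person=3' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Illustrative sample only: the annotation below is made up for this sketch.
SAMPLE_ROWS = [
    ("1", "mùtun", "mùtun", "NOUN", "_", "Gender=Masc", "3", "nsubj", "_", "_"),
    ("2", "yaː", "yaː", "AUX", "_", "Gender=Masc|Person=3", "3", "aux", "_", "_"),
    ("3", "zoː", "zoː", "VERB", "_", "_", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

if __name__ == "__main__":
    with_gender, total = gender_counts_by_upos(SAMPLE)
    for upos in total:
        print(upos, with_gender[upos], "of", total[upos])
```

Dividing each `with_gender` count by the grand total of tokens would give the per-tag shares listed above; dividing by the per-tag total would give the coverage figures reported in the sections for the individual tags.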
NOUN
2206 NOUN tokens (88% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (2166; 98%), Definite=Ind (1579; 72%).
NOUN tokens may have the following values of Gender:
Fem (748; 34% of non-empty Gender): bùdurwaː, màccè, s’oːhuwaː, dàudawaː, màːtam, màːtaːtai, jìkintà, hiːr̃a, kwalbaː, dùbaːr̃àː
Masc (1458; 66% of non-empty Gender): mùtun, sarkiː, maːlàm, maːlàmiː, maːgàniː, gidaː, maːgànîn, ƙarhèː, gàːriː, doːkìː
EMPTY (314): maːtaː, ruwaː, zuːgàl, sàmàːriː, bìkiː, hannuwàː, ruwan, ɗiyan, hannuː, mutàːneː
| Paradigm jiːkàː | Masc | Fem |
|---|---|---|
| | jiːkàn | jiːkàw |
Gender seems to be a lexical feature of NOUN: 99% of lemmas (483) occur with only one value of Gender.
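The "lexical feature" test can be read as the share of lemmas that are only ever seen with a single Gender value. Here is a small sketch of that calculation. It assumes the denominator is the set of lemmas with at least one non-empty Gender value, which this page does not state explicitly. The function name and the placeholder lemmas `lemma_a` and `lemma_b` are hypothetical; jiːkàː is included because its paradigm shows both a Masc and a Fem form.

```python
from collections import defaultdict


def single_value_share(observations):
    """Count lemmas seen with exactly one Gender value.

    observations: iterable of (lemma, gender) pairs, where gender is
    None for tokens with an empty Gender.
    Returns (lemmas with one value, lemmas with any non-empty value).
    """
    values_by_lemma = defaultdict(set)
    for lemma, gender in observations:
        if gender is not None:
            values_by_lemma[lemma].add(gender)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)


# Toy observations; lemma_a and lemma_b are placeholders, not treebank lemmas.
OBSERVATIONS = [
    ("lemma_a", "Masc"),
    ("lemma_a", "Masc"),
    ("lemma_b", "Fem"),
    ("lemma_b", None),
    ("jiːkàː", "Masc"),
    ("jiːkàː", "Fem"),
]

if __name__ == "__main__":
    single, total = single_value_share(OBSERVATIONS)
    print(f"{single} of {total} lemmas occur with only one Gender value")
```

A share close to 100% suggests the value belongs to the lemma itself rather than being chosen by inflection or agreement.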
AUX
1479 AUX tokens (60% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=EMPTY (1435; 97%), Person=3 (1287; 87%).
AUX tokens may have the following values of Gender:
Fem (485; 33% of non-empty Gender): tà, taː, tac, bâːta, tanàː, taz, tat, bàtà, tag, tay
Masc (994; 67% of non-empty Gender): shì, yaː, yac, bâːshi, shinàː, yat, neː, yay, kà, yab
EMPTY (997): à, ankà, sù, sunkà, an, sunàː, akà, anàː, kà, nàː
PRON
606 PRON tokens (73% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (604; 100%), PronType=EMPTY (546; 90%), Person=3 (492; 81%).
PRON tokens may have the following values of Gender:
Fem (280; 46% of non-empty Gender): ita, ta, tà, mutà, waddà, kì, matà, mikì, naːtà, wancè
Masc (326; 54% of non-empty Gender): shiː, shi, wandà, shì, mai, kai, kà, makà, wani, taːshì
EMPTY (227): suː, su, indà, koːwaː, wa’àndà, sù, koːmiː, wandà, mun, musù
| Paradigm wandà | Masc | Fem |
|---|---|---|
| | wandà | wandà |
Gender seems to be a lexical feature of PRON: 90% of lemmas (28) occur with only one value of Gender.
VERB
435 VERB tokens (18% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: ExtPos=NOUN (422; 97%).
VERB tokens may have the following values of Gender:
Fem (118; 27% of non-empty Gender): tàhiyàː, ràbuwaː, shìgaː, bugàːwaː, huːdèːwaː, tàhiyàttà, yìwuwaː, ɗiːbàm, cêːwaː, ɗiːbàttà
Masc (317; 73% of non-empty Gender): kwaːnaː, sôː, yîn, yîː, sôn, tàhiyàːtai, sôːnai, zuwàː, cîː, kiràn
EMPTY (1999): yi, cêː, ajèː, zoː, tàhi, sâː, sàːmu, baː, ci, kai
| Paradigm yi | Masc | Fem |
|---|---|---|
| Definite=Cons | yîn, yîm | |
| | yîː | yôwwaː |
Gender seems to be a lexical feature of VERB: 94% of lemmas (113) occur with only one value of Gender.
DET
71 DET tokens (35% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (52; 73%), PronType=EMPTY (52; 73%), Definite=EMPTY (39; 55%).
DET tokens may have the following values of Gender:
Fem (20; 28% of non-empty Gender): wata, koːwacè, koːwàcè, wani
Masc (51; 72% of non-empty Gender): wani, wânnam, koːdàwane, wânga, wânnan, wannàn
EMPTY (130): nan, wa’ànnan, du’, ga, wasu, dum, du’, duk, dul, duw
| Paradigm wani | Masc | Fem |
|---|---|---|
| _ | | wata |
| Definite=Spec | wani | wani |
ADP
42 ADP tokens (6% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (4; 10% of non-empty Gender): s’àkaːnintà, wajentà, wurinkì, wurintà
Masc (38; 90% of non-empty Gender): mài, gàr̃ai, wuriːnai, s’akaːninkà, wurinkà
EMPTY (708): dà, cikin, gà, na, mà, wurin, wurim, sai, cikim, bisà
ADJ
28 ADJ tokens (46% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (28; 100%), Definite=EMPTY (15; 54%).
ADJ tokens may have the following values of Gender:
Fem (14; 50% of non-empty Gender): ‘yaƙ, màccè, kwikwiyàː, ƙaramaː, ƙàramaː, ’yak, ’yam, hwarab, hwaram, màccên
Masc (14; 50% of non-empty Gender): ɗanyen, baƙiː, hwarin, hwariː, jàː, mùlmùlalleː, saːboː, ɗanyeː, bàbbam, saːbon
EMPTY (33): ’yam, sauran, ɗai, hwarhwarun, mayyaː, ƙanaːnàː, hwarhwaruː, jàd, kaɗai, saːbiː
Gender seems to be a lexical feature of ADJ: 100% of lemmas (16) occur with only one value of Gender.
PROPN
8 PROPN tokens (25% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=EMPTY (8; 100%).
PROPN tokens may have the following values of Gender:
Masc (8; 100% of non-empty Gender): bàhillaːcèː, ùbangijìː, bàhillaːcèn
EMPTY (24): allàː, hillàːniː, hàusàːwaː, abzinaːwaː, abzìn, bar̃ar̃oːjì, buːzàːyeː, bàhaushèː, kac’inaːwan, kyakkyataːwaː
NUM
2 NUM tokens (4% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (1; 50% of non-empty Gender): dubuː
Masc (1; 50% of non-empty Gender): ɗàrîn
EMPTY (44): gùdaː, biyu, tar̃à, bakwài, ɗàriː, huɗu, shâː, ɗaya, biyun, goːmà
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[nmod]–> NOUN (169; 51%),
PRON –[appos]–> NOUN (58; 87%),
VERB –[nmod]–> NOUN (55; 51%),
NOUN –[amod]–> ADJ (17; 77%),
NOUN –[appos]–> NOUN (14; 70%),
NOUN –[reparandum]–> NOUN (13; 76%),
NOUN –[appos]–> PRON (10; 91%),
NOUN –[nmod]–> VERB (10; 53%),
NOUN –[acl:relcl]–> NOUN (8; 89%),
NOUN –[parataxis]–> NOUN (8; 67%).
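The agreement counts above can be approximated by walking each dependency edge and comparing the Gender of the parent and the child. The sketch below is one possible reading, not the procedure used to build this page. It assumes the percentage is taken over edges where both nodes carry a non-empty Gender, which the page does not state. The function names and the four placeholder tokens (`w1` to `w4`) are hypothetical.

```python
from collections import Counter


def gender_of(feats):
    """Return the Gender value from a CoNLL-U FEATS string, or None."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Gender":
            return value
    return None


def agreement_counts(sentence_rows):
    """Count parent-child edges that agree in Gender within one sentence.

    sentence_rows: list of 10-column CoNLL-U token tuples.
    Returns (agreeing edges, comparable edges), both keyed by
    (parent UPOS, deprel, child UPOS).
    """
    by_id = {row[0]: row for row in sentence_rows}
    agree, comparable = Counter(), Counter()
    for row in sentence_rows:
        parent = by_id.get(row[6])  # HEAD column; "0" (root) has no parent row
        if parent is None:
            continue
        child_gender = gender_of(row[5])
        parent_gender = gender_of(parent[5])
        if child_gender is None or parent_gender is None:
            continue  # assumption: only edges with Gender on both ends count
        key = (parent[3], row[7], row[3])
        comparable[key] += 1
        if child_gender == parent_gender:
            agree[key] += 1
    return agree, comparable


# Placeholder sentence; forms and annotations are invented for illustration.
ROWS = [
    ("1", "w1", "w1", "NOUN", "_", "Gender=Fem", "0", "root", "_", "_"),
    ("2", "w2", "w2", "ADJ", "_", "Gender=Fem", "1", "amod", "_", "_"),
    ("3", "w3", "w3", "NOUN", "_", "Gender=Masc", "1", "nmod", "_", "_"),
    ("4", "w4", "w4", "NUM", "_", "_", "1", "nummod", "_", "_"),
]

if __name__ == "__main__":
    agree, comparable = agreement_counts(ROWS)
    for key, n in comparable.items():
        parent_upos, deprel, child_upos = key
        print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {n}")
```

Summing both counters over all sentences and dividing `agree` by `comparable` for each key would give figures of the same shape as the list above, under the stated assumption about the denominator.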