Treebank Statistics: UD_Hausa-EasternAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
3177 tokens (33%) have a non-empty value of Gender.
988 types (53%) occur at least once with a non-empty value of Gender.
733 lemmas (54%) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: NOUN (1731; 18% of instances), AUX (470; 5% of instances), VERB (371; 4% of instances), PRON (214; 2% of instances), DET (163; 2% of instances), PART (92; 1% of instances), ADJ (62; 1% of instances), PROPN (34; 0% of instances), NUM (33; 0% of instances), ADP (7; 0% of instances).
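Counts like the ones above can be reproduced from a CoNLL-U file by checking the FEATS column of each token. The following is a minimal sketch, not the script that generated this page: the helper names `parse_feats` and `gender_by_upos` are hypothetical, and the three sample rows are toy data built for illustration, not sentences from the treebank.

```python
from collections import Counter

# Toy CoNLL-U rows (hypothetical, for illustration only).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = (
    "1\twata\twani\tDET\t_\tGender=Fem|PronType=Ind\t2\tdet\t_\t_\n"
    "2\tƙasaː\tƙasaː\tNOUN\t_\tGender=Fem\t0\troot\t_\t_\n"
    "3\tmutàːneː\tmùtûm\tNOUN\t_\tNumber=Plur\t2\tnmod\t_\t_\n"
)


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_by_upos(conllu_text):
    """Return (tokens with Gender per UPOS, all tokens per UPOS)."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


with_gender, total = gender_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in each section below, for example "78% of all NOUN tokens".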
NOUN
1731 NOUN tokens (78% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (1688; 98%).
NOUN tokens may have the following values of Gender:
Fem (607; 35% of non-empty Gender): ƙasâr̃, duːniyàː, ƙasar̃, shèːkaràr̃, dòːkaː, gwamnatìn, ƙasaː, daːmaː, laːfiyàː, gwamnatì
Masc (1124; 65% of non-empty Gender): irìn, àmfàːniː, loːkàcîn, aikìː, bàːkiː, mùtûm, àbù, loːkàciː, suːnaː, sàndaː
EMPTY (484): ‘yan, mutàːneː, mùsùlmiː, ƙasàːshen, ƙwaːyoːyiː, ꞌyan, mutàːnên, maːtaː, shèːkàruː, ƙwaːyoːyin
| Paradigm gwamnatì | Masc | Fem |
|---|---|---|
| _ | gwamnatì | gwamnatì |
| Definite=Cons | gwamnatìn, gwamnatìnsà, gwamnatìnsù | |
| Definite=Def | gwamnatìn, gwamnatì |
Gender seems to be a lexical feature of NOUN. 96% of lemmas (533) occur with only one value of Gender.
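The "lexical feature" observation rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows, assuming the input is a list of (lemma, Gender) pairs for one part of speech; the function name `single_value_share` is hypothetical, the pairs are toy data, and the exact method used to produce the figure on this page is not stated in the source.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Count lemmas seen with exactly one Gender value.

    pairs: iterable of (lemma, gender), where gender is None for tokens
    without the feature. Returns (count of single-valued lemmas,
    percentage among lemmas that have any Gender value).
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        if gender:  # ignore tokens with an empty Gender
            values[lemma].add(gender)
    if not values:
        return 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, 100.0 * single / len(values)


# Toy input (hypothetical): one lemma occurs with both values.
toy_pairs = [
    ("ƙasaː", "Fem"),
    ("ƙasaː", "Fem"),
    ("aikìː", "Masc"),
    ("gwamnatì", "Masc"),
    ("gwamnatì", "Fem"),
    ("mùtûm", None),
]
single, share = single_value_share(toy_pairs)
print(single, round(share))
```

A share close to 100% suggests that Gender is fixed per lemma rather than inflectional for that part of speech.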
AUX
470 AUX tokens (44% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=EMPTY (469; 100%), Person=3 (443; 94%), Mood=EMPTY (426; 91%), Aspect=Perf (296; 63%).
AUX tokens may have the following values of Gender:
Fem (140; 30% of non-empty Gender): ta, taː, tanàː, tà, ceː, cèː, takèː, zaːtà, baːtàː, bàtà
Masc (330; 70% of non-empty Gender): ya, yaː, yanàː, yà, zâi, bài, yakè, yakèː, kaː, kà
EMPTY (595): sukà, akà, sun, kèː, an, nèː, à, sunàː, sù, akèː
| Paradigm yaː | Masc | Fem |
|---|---|---|
| Person=2 | kaː | |
| Person=3 | ya, yaː | ta, taː |
| Person=3\|Polarity=Neg | bài | bàtà |
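A paradigm table like the one above can be approximated by grouping the forms of one lemma by their remaining features and then by Gender. This is only a sketch: the function name `paradigm` and the token tuples are hypothetical, and the page itself appears to show only a subset of the other features as row labels, which this simplified version does not attempt to reproduce.

```python
from collections import defaultdict


def paradigm(tokens, lemma):
    """Group forms of `lemma` by (other features) and Gender value.

    tokens: iterable of (form, lemma, feats_dict).
    Returns {row_label: {gender: set_of_forms}}; the row label is '_'
    when Gender is the only feature on the token.
    """
    table = defaultdict(lambda: defaultdict(set))
    for form, tok_lemma, feats in tokens:
        if tok_lemma != lemma or "Gender" not in feats:
            continue
        rest = "|".join(
            f"{name}={value}"
            for name, value in sorted(feats.items())
            if name != "Gender"
        ) or "_"
        table[rest][feats["Gender"]].add(form)
    return table


# Toy tokens (hypothetical) modelled on the rows of the table above.
toy_tokens = [
    ("kaː", "yaː", {"Gender": "Masc", "Person": "2"}),
    ("yaː", "yaː", {"Gender": "Masc", "Person": "3"}),
    ("taː", "yaː", {"Gender": "Fem", "Person": "3"}),
    ("bài", "yaː", {"Gender": "Masc", "Person": "3", "Polarity": "Neg"}),
    ("bàtà", "yaː", {"Gender": "Fem", "Person": "3", "Polarity": "Neg"}),
]
table = paradigm(toy_tokens, "yaː")
for row in sorted(table):
    masc = ", ".join(sorted(table[row]["Masc"]))
    fem = ", ".join(sorted(table[row]["Fem"]))
    print(f"| {row} | {masc} | {fem} |")
```

When writing such rows into a pipe table, a `|` inside the row label has to be escaped, since combined features such as `Person=3|Polarity=Neg` would otherwise be split into separate cells.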
VERB
371 VERB tokens (29% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: ExtPos=NOUN (327; 88%), VerbForm=Vnoun (327; 88%), Definite=Cons (195; 53%).
VERB tokens may have the following values of Gender:
Fem (151; 41% of non-empty Gender): cèː, cêːwaː, mutuwàː, fàːruwaː, ɗaukàr̃, jìtuwar̃, shìga, taːràːwaː, amìncêwaː, bar̃
Masc (220; 59% of non-empty Gender): yîn, rashìn, yîː, ganin, jîn, neːman, saːmùn, sôn, cîː, goːyon
EMPTY (912): yi, cêː, kai, nuːnà, iyà, sâː, fi, ci, sàːmi, faːrà
| Paradigm yi | Masc | Fem |
|---|---|---|
| Definite=Cons | yîn, yînsà | |
| _ | yîː, yî: | yìwuwaː |
Gender seems to be a lexical feature of VERB. 96% of lemmas (109) occur with only one value of Gender.
PRON
214 PRON tokens (64% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (212; 99%), PronType=Prs (150; 70%), Person=3 (138; 64%).
PRON tokens may have the following values of Gender:
Fem (53; 25% of non-empty Gender): ita, waddà, wàddà, waːƙàː, matà, taːsù, ta, taːsà, wàccè, tà
Masc (161; 75% of non-empty Gender): shiː, wandà, shi, shì, masà, wani, wàndà, kânsà, koːwaː, koːwànneː
EMPTY (121): suː, waɗàndà, indà, musù, su, wannàn, wasu, duk, hakàn, kânsù
| Paradigm wandà | Masc | Fem |
|---|---|---|
| _ | wandà, wàndà | waddà, wàddà, wàccè |
Gender seems to be a lexical feature of PRON. 96% of lemmas (24) occur with only one value of Gender.
DET
163 DET tokens (58% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=EMPTY (160; 98%), Deixis=EMPTY (102; 63%), Definite=Spec (96; 59%), PronType=Ind (94; 58%).
DET tokens may have the following values of Gender:
Fem (57; 35% of non-empty Gender): wata, wannàn, koːwàcè, wàccan
Masc (106; 65% of non-empty Gender): wani, wannàn, koːwànè, dukkàn, wàncan, wànnan, yankìn
EMPTY (119): wasu, nan, ɗîn, duk, waɗànnân, wannàn, dukkàn, nàn, koːwàɗànnè, nân
| Paradigm wani | Masc | Fem |
|---|---|---|
| Number=Plur | wani | |
| _ | wani | wata |
PART
92 PART tokens (16% of all PART tokens) have a non-empty value of Gender.
The most frequent other feature values with which PART and Gender co-occurred: Number=EMPTY (92; 100%), Polarity=EMPTY (92; 100%), Case=Gen (89; 97%), PartType=Case (86; 93%).
PART tokens may have the following values of Gender:
Fem (55; 60% of non-empty Gender): ta, cèː
Masc (37; 40% of non-empty Gender): na
EMPTY (479): kuma, ba, mài, dai, neː, nèː, maː, na, màːsu, kùwa
| Paradigm na | Masc | Fem |
|---|---|---|
| _ | na | ta |
| Definite=Cons\|PartType=Case | na | |
| PartType=Case | na | ta |
ADJ
62 ADJ tokens (58% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=EMPTY (60; 97%), Definite=Cons (53; 85%).
ADJ tokens may have the following values of Gender:
Fem (18; 29% of non-empty Gender): bàbbar̃, ‘yar̃, hàɗaɗɗiyar̃, ƙwàːyaː, matuƙar̃, mayankar̃, muːgùwar̃, saːbuwar̃, shirgeːgìyar̃, ìsasshiyar̃
Masc (44; 71% of non-empty Gender): bàbban, irìː-irìː, yawàn, mùmmuːnan, tsoːhon, namijì, ìsasshen, ƙànƙanèː, ɗan, aman
EMPTY (45): miyàːgun, dàbam, maːtaː, mânyan, ‘yan, ƙanaːnàː, Kir̃istàː, ƙanaːnàn, ƙasàːshen, ꞌyan
| Paradigm bàbba | Masc | Fem |
|---|---|---|
| _ | bàbban | bàbbar̃ |
| Number=Plur | bàbban |
PROPN
34 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (17; 50% of non-empty Gender): Nìːjâr̃, Dòːkaː, Laːfiyàː, Dòːkâr̃, Mr, Nàːjeːr̃iyàr̃, Reinhard, Tèːkun, Zàyâr̃, Ƙasâr̃
Masc (17; 50% of non-empty Gender): r̃àhoːtòn, Gwamnàn, Abengourou, Japananciː, Shùːgàbaː, Àgustàː, Landàn, Ministàːn, Yaːƙìn, Yàr̃iːmàn
EMPTY (457): Nàːjeːr̃iyàː, Nàjeːr̃iyàː, Ìkko, Afir̃kà, Fela, Dikkò, Chief, Bìr̃taːniyà, Lasisi, AIDS
Gender seems to be a lexical feature of PROPN. 100% of lemmas (24) occur with only one value of Gender.
NUM
33 NUM tokens (12% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem (28; 85% of non-empty Gender): ɗàriː, miliyàn, ɗayantà
Masc (5; 15% of non-empty Gender): gùdaː, kashìː, sìttin, tàmàːnin
EMPTY (238): tar̃à, ɗaya, alìf, bìyar̃, biyu, ukù, tàmàːnin, shidà, goːmà, huɗu
ADP
7 ADP tokens (1% of all ADP tokens) have a non-empty value of Gender.
ADP tokens may have the following values of Gender:
Fem (3; 43% of non-empty Gender): cikinsà, jihàr̃
Masc (4; 57% of non-empty Gender): tsàkaːnin, kân
EMPTY (942): à, dà, cikin, dàgà, wà, gà, zuwàː, kân, har̃, kàmar̃
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (150; 74%),
NOUN –[amod]–> ADJ (43; 80%),
NOUN –[acl:relcl]–> PRON (34; 71%),
PRON –[appos]–> NOUN (17; 100%),
PRON –[nsubj]–> NOUN (11; 100%),
VERB –[compound]–> NOUN (11; 58%),
NOUN –[nmod:poss]–> PRON (10; 100%),
NOUN –[nmod:appos]–> NOUN (8; 80%),
PRON –[dislocated]–> NOUN (8; 89%),
NOUN –[appos]–> NOUN (5; 100%).
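The agreement figures above can be approximated by comparing the Gender of each token with the Gender of its head. The sketch below assumes that the percentage is the number of agreeing parent-child pairs divided by the pairs in which both nodes carry a Gender value; that denominator is an assumption, since the page does not state it. The function names and the sample rows are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def agreement_counts(sentence_rows):
    """Count Gender agreement per (parent UPOS, deprel, child UPOS).

    sentence_rows: the 10-column CoNLL-U token lines of ONE sentence.
    Returns (agree, both): pairs with equal Gender, and pairs where
    both parent and child have a Gender value.
    """
    tokens = {}
    for row in sentence_rows:
        cols = row.split("\t")
        if cols[0].isdigit():  # skip multiword ranges and empty nodes
            tokens[cols[0]] = cols
    agree, both = Counter(), Counter()
    for cols in tokens.values():
        head = tokens.get(cols[6])  # None for the root (HEAD = 0)
        if head is None:
            continue
        child_gender = parse_feats(cols[5]).get("Gender")
        head_gender = parse_feats(head[5]).get("Gender")
        if child_gender is None or head_gender is None:
            continue
        key = (head[3], cols[7], cols[3])
        both[key] += 1
        if child_gender == head_gender:
            agree[key] += 1
    return agree, both


# Toy sentence (hypothetical): one agreeing and one disagreeing child.
toy_rows = [
    "1\twata\twani\tDET\t_\tGender=Fem\t2\tdet\t_\t_",
    "2\tƙasaː\tƙasaː\tNOUN\t_\tGender=Fem\t0\troot\t_\t_",
    "3\tbàbban\tbàbba\tADJ\t_\tGender=Masc\t2\tamod\t_\t_",
]
agree, both = agreement_counts(toy_rows)
for (parent, deprel, child), n in both.items():
    print(f"{parent} -[{deprel}]-> {child}: {agree[(parent, deprel, child)]}/{n}")
```

Summing `agree` and `both` over all sentences and sorting by the agreeing count would give a ranking comparable to the list above.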