Treebank Statistics: UD_Hausa-NorthernAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
1167 tokens (28%) have a non-empty value of Gender.
357 types (43%) occur at least once with a non-empty value of Gender.
219 lemmas (41%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (427; 10% instances), AUX (423; 10% instances), VERB (141; 3% instances), PRON (131; 3% instances), ADJ (19; 0% instances), DET (17; 0% instances), PROPN (5; 0% instances), ADP (4; 0% instances).
NOUN
427 NOUN tokens (81% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (410; 96%), Case=EMPTY (367; 86%), Person=EMPTY (365; 85%), Definite=Ind (235; 55%).
NOUN tokens may have the following values of Gender:
Fem(113; 26% of non-emptyGender): dàːmisàː, kuːraː, gàyyaː, shìgattà, duːniyàː, rân, raːnakkà, raːnaː, uwattà, uwaːtaiMasc(314; 74% of non-emptyGender): kàreː, sarkin, gidaː, bàːkin, sâː, àbin, ɗan, yaːɗaː, kàram, mùtunEMPTY(98): ruwaː, ƴan, mutàːneː, ayaː, cinàn, giːwàːyeː, ruwan, zàːrùmmai, kuɗɗiː, làːbàːrûn
| Paradigm ɗaː | Masc | Fem |
|---|---|---|
| Case=Gen|Definite=Cons|Person=3 | ɗantà | |
| Definite=Cons | ɗan | ƴa' |
| Definite=Ind | ɗaː |
Gender seems to be lexical feature of NOUN. 98% lemmas (128) occur only with one value of Gender.
AUX
423 AUX tokens (59% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=EMPTY (423; 100%), Person=3 (354; 84%), Aspect=PerfBkg (216; 51%).
AUX tokens may have the following values of Gender:
Fem(123; 29% of non-emptyGender): tac, tanàː, tà, taː, tay, tas, ta’, tab, tak, takèMasc(300; 71% of non-emptyGender): yac, yaː, shinàː, kaː, shì, yaz, kà, yat, yay, yahEMPTY(295): ankà, sunkà, nàː, à, ìn, naː, anàː, sunàː, akà, sun
VERB
141 VERB tokens (19% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: ExtPos=NOUN (132; 94%), VerbForm=Vnoun (132; 94%).
VERB tokens may have the following values of Gender:
Fem(77; 55% of non-emptyGender): tàhiyàː, zakkùwaː, bìyash, gàmuwaː, gàyyaː., hwaːɗùwaː, kankaryaː, sarɓaː, bìɗaː, cêːwaːMasc(64; 45% of non-emptyGender): yîː, sôn, cîn, kwaːnaː, gudùː, sauraːreː, taːshìː, hwaɗìː, yîn, zamanEMPTY(586): cèː, yi, cêː, zakà, zoː, ga, tàhi, ji, taɓà, bìye
| Paradigm ci | Masc | Fem |
|---|---|---|
| Definite=Cons | cîn | |
| cîː | cânyeːwàː |
Gender seems to be lexical feature of VERB. 98% lemmas (44) occur only with one value of Gender.
PRON
131 PRON tokens (68% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=EMPTY (130; 99%), PronType=Prs (109; 83%), Person=3 (86; 66%), Case=EMPTY (66; 50%).
PRON tokens may have the following values of Gender:
Fem(43; 33% of non-emptyGender): ita, tà, wàccan, matà, ta, wàgga, keː, maw, naːtà, wataMasc(88; 67% of non-emptyGender): shiː, shì, shi, mai, kai, maː, ka, kà, wani, wànnanEMPTY(61): niː, suː, musù, sù, koːmiː, min, ni, miː, su, koːwaː
| Paradigm wani | Masc | Fem |
|---|---|---|
| Definite=Spec | wani | |
| wata |
Gender seems to be lexical feature of PRON. 96% lemmas (25) occur only with one value of Gender.
ADJ
19 ADJ tokens (63% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Definite=Cons (12; 63%).
ADJ tokens may have the following values of Gender:
Fem(3; 16% of non-emptyGender): wacèː, ƴag, ƴakMasc(16; 84% of non-emptyGender): ɗan, baƙiː, hwarin, hwariː, namijì, ƙàramiː, baƙin, hìyayyem, janEMPTY(11): daban, jaː, maːtaː, jan, kwànce-kwancèn
| Paradigm ɗaː | Masc | Fem |
|---|---|---|
| ɗan | ƴag, ƴak |
DET
17 DET tokens (20% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (15; 88%), Definite=EMPTY (11; 65%), PronType=Ind (10; 59%).
DET tokens may have the following values of Gender:
Fem(11; 65% of non-emptyGender): wata, tan, wàccân, waccèMasc(6; 35% of non-emptyGender): wani, wanèːEMPTY(69): nan, ga, wasu, du’, dug, su, dus, dut, duy, can
| Paradigm wani | Masc | Fem |
|---|---|---|
| Definite=Spec | wani | |
| wata |
PROPN
5 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Masc(5; 100% of non-emptyGender): Buːzuː, Bàhaushèː, BàgawailèːEMPTY(88): Tudùː, Galmaːwaː, Muːsà, Ìlleːlàː, Allàː, Bàːgai, Dòːdoː, Gidan, Ƙurƙìyaː, Tuːraːwaː
ADP
4 ADP tokens (3% of all ADP tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADP and Gender co-occurred: Case=EMPTY (4; 100%).
ADP tokens may have the following values of Gender:
Fem(4; 100% of non-emptyGender): taEMPTY(142): dà, cikin, mà, gà, mài, tun, dàc, dàn, na, bàːkin
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[compound]–> NOUN (23; 68%),
NOUN –[amod]–> ADJ (14; 82%),
VERB –[nmod]–> NOUN (13; 62%),
NOUN –[acl:relcl]–> NOUN (6; 67%),
NOUN –[conj]–> NOUN (6; 75%),
VERB –[det]–> DET (4; 80%),
AUX –[conj]–> AUX (2; 100%),
NOUN –[conj]–> VERB (2; 67%),
PRON –[conj]–> NOUN (2; 67%),
VERB –[advcl:cleft]–> VERB (2; 67%).