Treebank Statistics: UD_Hausa-NorthernAutogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
1098 tokens (28%) have a non-empty value of Gender
.
313 types (40%) occur at least once with a non-empty value of Gender
.
207 lemmas (39%) occur at least once with a non-empty value of Gender
.
The feature is used with 8 part-of-speech tags: AUX (394; 10% instances), NOUN (374; 10% instances), PRON (169; 4% instances), VERB (120; 3% instances), ADJ (19; 0% instances), DET (15; 0% instances), PROPN (5; 0% instances), PART (2; 0% instances).
AUX
394 AUX tokens (62% of all AUX
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which AUX
and Gender
co-occurred: Number=EMPTY (394; 100%), Person=3 (309; 78%), Aspect=PerfBkg (198; 50%).
AUX
tokens may have the following values of Gender
:
Fem
(112; 28% of non-emptyGender
): tac, tanàː, tà, taː, tas, tay, tab, tak, takè, tatMasc
(282; 72% of non-emptyGender
): yac, yaː, kaː, shinàː, shì, yaz, kà, yat, yah, yayEMPTY
(241): ankà, sunkà, à, ìn, naː, akà, sunàː, anàː, sun, inàː
NOUN
374 NOUN tokens (78% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: Number=EMPTY (373; 100%), Definite=EMPTY (220; 59%).
NOUN
tokens may have the following values of Gender
:
Fem
(84; 22% of non-emptyGender
): dàːmisàː, kuːraː, gàyyaː, duːniyàː, yâː, raːnaː, kwaːnaː, uwaː, rân, uwatMasc
(290; 78% of non-emptyGender
): kàreː, gidaː, sâː, yaːɗaː, zàkaràː, ɓiki, mùtun, àbin, mùzuːruː, bàːkinEMPTY
(103): ruwaː, ƴan, mutàːneː, ayaː, baːyuː, cinàn, giːwàːyeː, ruwan, zàːrùmmai, kuɗɗiː
Paradigm ɗaː | Masc | Fem |
---|---|---|
_ | ɗaː | |
Definite=Cons | ɗan | ƴa' |
Definite=Def | ɗan |
Gender
seems to be lexical feature of NOUN
. 99% lemmas (117) occur only with one value of Gender
.
PRON
169 PRON tokens (67% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=EMPTY (168; 99%), PronType=Prs (148; 88%), Person=3 (113; 67%), Case=EMPTY (106; 63%).
PRON
tokens may have the following values of Gender
:
Fem
(52; 31% of non-emptyGender
): ita, tà, =tà, =ta, wàccan, matà, =kì, wàgga, keː, mawMasc
(117; 69% of non-emptyGender
): shiː, shi, shì, =nai, =tai, mai, =ka, =kà, kà, maːEMPTY
(82): =sù, niː, suː, musù, sù, koːmiː, min, =na, ni, =taː
Paradigm wani | Masc | Fem |
---|---|---|
Definite=Spec | wani | |
wata |
Gender
seems to be lexical feature of PRON
. 97% lemmas (35) occur only with one value of Gender
.
VERB
120 VERB tokens (18% of all VERB
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which VERB
and Gender
co-occurred: VerbForm=Vnoun (118; 98%), ExtPos=NOUN (116; 97%).
VERB
tokens may have the following values of Gender
:
Fem
(63; 53% of non-emptyGender
): tàhiyàː, zakkùwaː, gàmuwaː, gàyyaː., hwaːɗùwaː, bìɗaː, cêːwaː, daɗèːwaː, kankaryaː, kaːwoːwàːMasc
(57; 48% of non-emptyGender
): yîː, sôn, cîn, gudùː, kwaːnaː, taːshìː, hwaɗìː, yîn, zaman, zamaːEMPTY
(550): cèː, yi, zakà, cêː, zoː, ga, tàhi, taɓà, ji, tohoː
Paradigm ci | Masc | Fem |
---|---|---|
Definite=Cons | cîn | |
cîː | cânyeːwàː |
Gender
seems to be lexical feature of VERB
. 97% lemmas (36) occur only with one value of Gender
.
ADJ
19 ADJ tokens (73% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Number=EMPTY (19; 100%), Definite=Cons (12; 63%).
ADJ
tokens may have the following values of Gender
:
Fem
(3; 16% of non-emptyGender
): wacèː, ƴag, ƴakMasc
(16; 84% of non-emptyGender
): ɗan, baƙiː, hwarin, hwariː, namijì, ƙàramiː, baƙin, hìyayyem, janEMPTY
(7): jaː, maːtaː, jan
Paradigm ɗaː | Masc | Fem |
---|---|---|
ɗan | ƴag, ƴak |
DET
15 DET tokens (19% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Deixis=EMPTY (13; 87%), Definite=EMPTY (12; 80%), PronType=Ind (9; 60%).
DET
tokens may have the following values of Gender
:
Fem
(9; 60% of non-emptyGender
): wata, wàccân, waccèMasc
(6; 40% of non-emptyGender
): wani, wanèːEMPTY
(65): nan, ga, wasu, du’, dug, su, dus, dut, can, dub
Paradigm wani | Masc | Fem |
---|---|---|
Definite=Spec | wani | |
wata |
PROPN
5 PROPN tokens (5% of all PROPN
tokens) have a non-empty value of Gender
.
PROPN
tokens may have the following values of Gender
:
Masc
(5; 100% of non-emptyGender
): Buːzuː, Bàhaushèː, BàgawailèːEMPTY
(88): Tudùː, Galmaːwaː, Muːsà, Ìlleːlàː, Allàː, Bàːgai, Dòːdoː, Gidan, Ƙurƙìyaː, Tuːraːwaː
PART
2 PART tokens (1% of all PART
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PART
and Gender
co-occurred: Aspect=EMPTY (2; 100%), Polarity=EMPTY (2; 100%).
PART
tokens may have the following values of Gender
:
Fem
(2; 100% of non-emptyGender
): taEMPTY
(171): nàː, ta, ba, baːbù, gàː, dai, àkwai, bâː, kòː, bàː
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[compound]–> NOUN (20; 65%),
NOUN –[amod]–> ADJ (14; 82%),
VERB –[conj]–> VERB (6; 60%),
NOUN –[acl:relcl]–> NOUN (4; 100%),
NOUN –[conj]–> NOUN (4; 67%),
NOUN –[nmod]–> VERB (4; 100%),
VERB –[det]–> DET (4; 80%),
AUX –[nsubj]–> NOUN (2; 100%),
NOUN –[conj]–> VERB (2; 67%),
PRON –[conj]–> NOUN (2; 67%).