Treebank Statistics: UD_Hausa-EasternAutogramm: Features: Definite
This feature is universal but the values Cons, Spec are language-specific.
It occurs with 4 different values: Cons, Def, Ind, Spec.
1795 tokens (19%) have a non-empty value of Definite.
672 types (36%) occur at least once with a non-empty value of Definite.
541 lemmas (40%) occur at least once with a non-empty value of Definite.
The feature is used with 8 part-of-speech tags: NOUN (1288; 14% instances), VERB (206; 2% instances), DET (156; 2% instances), ADJ (81; 1% instances), PROPN (35; 0% instances), PRON (16; 0% instances), NUM (10; 0% instances), PART (3; 0% instances).
NOUN
1288 NOUN tokens (58% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=EMPTY (1014; 79%), Gender=Masc (704; 55%).
NOUN tokens may have the following values of Definite:
Cons(909; 71% of non-emptyDefinite): ‘yan, irìn, ƙasar̃, shèːkaràr̃, bir̃nin, ƙasàːshen, ƙungìyar̃, gidan, yawàn, gwamnatìnDef(373; 29% of non-emptyDefinite): ƙasâr̃, loːkàcîn, àbîn, cùːtâr̃, mutàːnên, hùkuːmàr̃, mùtumìn, gwamnatìn, tafkìn, zàngà-zangàr̃Ind(6; 0% of non-emptyDefinite): hàr̃aːbàr̃, jàr̃iːdàː, laːfiyàː, mar̃tàniː, tsoːhuwaːEMPTY(927): mutàːneː, mùsùlmiː, duːniyàː, àmfàːniː, ƙwaːyoːyiː, aikìː, bàːkiː, daːmaː, mùtûm, àbù
| Paradigm ƙasaː | Def | Cons |
|---|---|---|
| _ | ƙasâr̃ | |
| Gender=Fem | ƙasâr̃, ƙasâr | ƙasar̃, ƙasar̃sà, ƙasar̃tà |
| Number=Plur | ƙasàːshen |
VERB
206 VERB tokens (16% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: ExtPos=NOUN (191; 93%), VerbForm=Vnoun (191; 93%), Gender=Masc (164; 80%).
VERB tokens may have the following values of Definite:
Cons(195; 95% of non-emptyDefinite): yîn, rashìn, ganin, jîn, neːman, saːmùn, sôn, goːyon, shirìn, cînDef(11; 5% of non-emptyDefinite): jiràn, kiràn, tsiːrân, ƙasâr̃, dafàn, hadìyêːwâr̃, jinîn, maːmàyêːwâr̃EMPTY(1077): yi, cêː, kai, cêːwaː, sâː, nuːnà, iyà, cèː, fi, ci
| Paradigm kiraː | Def | Cons |
|---|---|---|
| kiràn | kiràn |
Definite seems to be lexical feature of VERB. 96% lemmas (69) occur only with one value of Definite.
DET
156 DET tokens (55% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Deixis=EMPTY (156; 100%), PronType=Ind (128; 82%), Number=EMPTY (116; 74%).
DET tokens may have the following values of Definite:
Cons(1; 1% of non-emptyDefinite): dukkànDef(22; 14% of non-emptyDefinite): ɗîn, yankìnSpec(133; 85% of non-emptyDefinite): wani, wasu, wataEMPTY(126): wannàn, nan, duk, waɗànnân, dukkàn, nàn, koːwànè, koːwàcè, koːwàɗànnè, nân
ADJ
81 ADJ tokens (76% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Number=EMPTY (51; 63%).
ADJ tokens may have the following values of Definite:
Cons(81; 100% of non-emptyDefinite): bàbban, miyàːgun, yawàn, bàbbar̃, mânyan, ‘yan, mùmmuːnan, cìkakken, hàɗaɗɗiyar̃, ìsasshenEMPTY(26): dàbam, irìː-irìː, maːtaː, ƙanaːnàː, Kir̃istàː, ‘yar̃, dàfaffen, mânyaː, namijì, tsoːhon
Definite seems to be lexical feature of ADJ. 100% lemmas (29) occur only with one value of Definite.
PROPN
35 PROPN tokens (7% of all PROPN tokens) have a non-empty value of Definite.
PROPN tokens may have the following values of Definite:
Cons(24; 69% of non-emptyDefinite): Nìːjâr̃, BBC, Aːyoːyin, Dòːkâr̃, Gwamnàn, Ministàːn, Mr, Ogunbufunmi, Reinhard, SudânDef(11; 31% of non-emptyDefinite): r̃àhoːtòn, Baucîn, Kyanadàn, Landàn, Nàːjeːr̃iyàr̃, Yàr̃iːmàn, jàːmì’ân, Ƙasâr̃EMPTY(456): Nàːjeːr̃iyàː, Nàjeːr̃iyàː, Ìkko, Afir̃kà, Fela, Dikkò, Chief, Bìr̃taːniyà, Lasisi, AIDS
Definite seems to be lexical feature of PROPN. 100% lemmas (26) occur only with one value of Definite.
PRON
16 PRON tokens (5% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Case=EMPTY (14; 88%), Person=EMPTY (14; 88%), PronType=Ind (11; 69%), Number=EMPTY (10; 63%), Gender=Masc (9; 56%).
PRON tokens may have the following values of Definite:
Cons(3; 19% of non-emptyDefinite): dukkànsù, dukànsù, wasunsùDef(2; 13% of non-emptyDefinite): shînSpec(11; 69% of non-emptyDefinite): wani, wasuEMPTY(319): shiː, suː, waɗàndà, ita, wandà, indà, shi, shì, masà, waddà
NUM
10 NUM tokens (4% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: Gender=EMPTY (7; 70%).
NUM tokens may have the following values of Definite:
Cons(5; 50% of non-emptyDefinite): ɗayansà, ɗayansù, sìttin, tàmàːnin, ɗayantàDef(5; 50% of non-emptyDefinite): ukùn, biyûn, huɗûn, shidànEMPTY(261): ɗàriː, tar̃à, ɗaya, alìf, bìyar̃, biyu, ukù, tàmàːnin, shidà, goːmà
PART
3 PART tokens (1% of all PART tokens) have a non-empty value of Definite.
The most frequent other feature values with which PART and Definite co-occurred: Case=Gen (3; 100%), Number=EMPTY (3; 100%), PartType=Case (3; 100%), Polarity=EMPTY (3; 100%), Gender=Masc (2; 67%).
PART tokens may have the following values of Definite:
Cons(3; 100% of non-emptyDefinite): naEMPTY(568): kuma, na, ta, ba, mài, dai, neː, nèː, maː, màːsu
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
VERB –[nmod]–> VERB (4; 57%),
NOUN –[compound]–> ADJ (2; 100%),
VERB –[compound]–> VERB (2; 100%),
VERB –[fixed]–> NOUN (2; 100%),
NOUN –[dislocated]–> VERB (1; 100%),
NOUN –[obj]–> ADJ (1; 100%),
NOUN –[obj]–> VERB (1; 100%),
PART –[obl:mod]–> ADJ (1; 100%),
VERB –[vocative]–> NOUN (1; 100%).