Treebank Statistics: UD_Kadiweu-Unicamp: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
Some words have combined values of the feature; 1 combination has been observed: Fem|Masc.
This is a layered feature with the following layers: Gender, Gender[obj].
148 tokens (47%) have a non-empty value of Gender.
66 types (70%) occur at least once with a non-empty value of Gender.
49 lemmas (65%) occur at least once with a non-empty value of Gender.
The feature is used with 5 part-of-speech tags: NOUN (102; 32% of instances), DET (31; 10% of instances), PRON (11; 3% of instances), VERB (3; 1% of instances), PROPN (1; 0% of instances).
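Per-tag counts like the ones above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the function names `parse_feats` and `gender_by_upos` are hypothetical, and `SAMPLE` is a made-up three-token fragment rather than a sentence from the treebank.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def gender_by_upos(conllu_text):
    """Return (tokens with non-empty Gender per UPOS, all tokens per UPOS)."""
    with_gender, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment line
        cols = line.split("\t")
        # Keep only regular word lines; skip ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total

# Made-up sample (assumption): word forms are borrowed from this page,
# but the annotation and the sentence itself are illustrative only.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "ijo", "ijo", "DET", "_", "Gender=Masc|Number=Sing|PronType=Dem", "2", "det", "_", "_"],
    ["2", "libinienigi", "binie", "NOUN", "_", "Gender=Masc|Number=Sing", "3", "nsubj", "_", "_"],
    ["3", "idei", "idei", "VERB", "_", "_", "0", "root", "_", "_"],
])

with_gender, total = gender_by_upos(SAMPLE)
for upos in total:
    print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in each part-of-speech section (for example, 102 of 111 NOUN tokens is 92%).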
NOUN
102 NOUN tokens (92% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (95; 93%), Number[psor]=EMPTY (91; 89%), Degree=EMPTY (88; 86%).
NOUN tokens may have the following values of Gender:
Fem (26; 25% of non-empty Gender): niwatece, wetiGa, Etogo, Iwalo, libiniena, liwatece, lomigo, nigotaGa, GanigotGa, Iwalepodi
Fem,Masc (3; 3% of non-empty Gender): lodawa, Gadodawa
Masc (73; 72% of non-empty Gender): iGeladi, libinienigi, liGeladi, looligi, naigi, weiigi, niganaGacanajo, nioladi, LotaGa, eyodi
EMPTY (9): eyodi, libinienaGa, napioi, dineigi, iGonagi, lidi
| Paradigm binie | Masc | Fem |
|---|---|---|
| Number=Sing | libinienigi | libiniena |
| Number=Plur | libinienigipi | |
Gender seems to be a lexical feature of NOUN. 97% of lemmas (34) occur with only one value of Gender.
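The "lexical feature" judgement rests on how many lemmas are seen with a single Gender value. A minimal sketch of that count follows, assuming the input is a list of (lemma, Gender value) pairs; the function name `single_value_share` is hypothetical, and the observations are illustrative (only the lemma `binie` with both values is suggested by the paradigm table above; the other lemmas are assumptions).

```python
from collections import defaultdict

def single_value_share(observations):
    """Return (lemmas seen with exactly one Gender value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, gender in observations:
        values[lemma].add(gender)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Illustrative observations (assumption), not extracted from the treebank.
OBSERVATIONS = [
    ("binie", "Masc"),
    ("binie", "Fem"),
    ("Geladi", "Masc"),
    ("Geladi", "Masc"),
    ("watece", "Fem"),
]

single, total = single_value_share(OBSERVATIONS)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

With the figures reported on this page, 34 single-valued lemmas at 97% implies 35 NOUN lemmas with a non-empty Gender, one of which occurs with more than one value.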
DET
31 DET tokens (94% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (31; 100%), PronType=Dem (31; 100%).
DET tokens may have the following values of Gender:
Fem (12; 39% of non-empty Gender): ajo, NaGani, adi, NaGajo, naGana, naGani
Masc (19; 61% of non-empty Gender): ica, NiGida, ijo, NiGijo
EMPTY (2): NiGinoa, eliodi
| Paradigm ijo | Masc | Fem |
|---|---|---|
| | ijo | ajo |
PRON
11 PRON tokens (61% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (10; 91%), PronType=Dem (10; 91%).
PRON tokens may have the following values of Gender:
Fem (10; 91% of non-empty Gender): naGada, naGadi, Ada, Adi, Ani, NaGajo, NaGana, naGajo
Masc (1; 9% of non-empty Gender): ee
EMPTY (7): ane
VERB
3 VERB tokens (6% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Gender[obj]=EMPTY (3; 100%), Mood=EMPTY (3; 100%), Person=3 (3; 100%), Person[erg]=EMPTY (3; 100%), Person[obj]=EMPTY (3; 100%), VerbForm=EMPTY (3; 100%), Voice=EMPTY (3; 100%).
VERB tokens may have the following values of Gender:
Fem (3; 100% of non-empty Gender): etadi
EMPTY (45): iwaGadi, idei, ipegitegi, ninitibeci, dapiko, ipegetege, DapicoGo, Ninitibigiwaji, Te, dowediteloco
PROPN
1 PROPN token (25% of all PROPN tokens) has a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): João
EMPTY (3): Maria, Pedilo
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (30; 94%),
NOUN –[nsubj]–> NOUN (12; 75%),
NOUN –[nmod:poss]–> NOUN (6; 60%).
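Agreement counts of this kind can be sketched as follows. This is an illustration under stated assumptions, not the procedure behind this page: it assumes the percentage is the number of agreeing pairs divided by the number of parent-child pairs in which both nodes carry Gender, which the page does not state. The token dictionaries, the helper names `feats` and `agreement_counts`, and the sample sentence are all hypothetical.

```python
from collections import Counter

def feats(col):
    """Parse a CoNLL-U FEATS string into a dict ('_' means no features)."""
    return {} if col == "_" else dict(p.split("=", 1) for p in col.split("|"))

def agreement_counts(sentences, feature="Gender"):
    """Count agreeing pairs and all pairs where parent and child both have `feature`.

    Keys are (parent UPOS, relation, child UPOS) triples.
    """
    agree, both = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for tok in sent:
            parent = by_id.get(tok["head"])
            if parent is None:
                continue  # root token: head 0 has no node
            child_val = feats(tok["feats"]).get(feature)
            parent_val = feats(parent["feats"]).get(feature)
            if child_val is None or parent_val is None:
                continue  # only pairs where both nodes carry the feature
            key = (parent["upos"], tok["deprel"], tok["upos"])
            both[key] += 1
            if child_val == parent_val:
                agree[key] += 1
    return agree, both

# Made-up sentence (assumption) with one disagreeing nsubj pair.
SENTENCE = [
    {"id": 1, "upos": "DET", "feats": "Gender=Masc|Number=Sing", "head": 2, "deprel": "det"},
    {"id": 2, "upos": "NOUN", "feats": "Gender=Masc|Number=Sing", "head": 0, "deprel": "root"},
    {"id": 3, "upos": "DET", "feats": "Gender=Fem|Number=Sing", "head": 4, "deprel": "det"},
    {"id": 4, "upos": "NOUN", "feats": "Gender=Fem|Number=Sing", "head": 2, "deprel": "nsubj"},
]

agree, both = agreement_counts([SENTENCE])
for key in both:
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos}: {agree[key]} of {both[key]}")
```

Comparing raw value strings means a combined value such as `Fem,Masc` only agrees with an identical combined value; a looser comparison on overlapping values would be a different design choice.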