Treebank Statistics: UD_Beja-Autogramm: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
4528 tokens (38%) have a non-empty value of Gender.
935 types (49%) occur at least once with a non-empty value of Gender.
1 lemmas (0) occur at least once with a non-empty value of Gender.
The feature is used with 10 part-of-speech tags: DET (1732; 14% instances), NOUN (1395; 12% instances), VERB (1063; 9% instances), SCONJ (160; 1% instances), PRON (109; 1% instances), AUX (59; 0% instances), INTJ (5; 0% instances), ADJ (3; 0% instances), NUM (1; 0% instances), PART (1; 0% instances).
DET
1732 DET tokens (100% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Deixis=EMPTY (1453; 84%), PronType=EMPTY (1449; 84%), Definite=Def (1007; 58%), Case=EMPTY (999; 58%), Number=EMPTY (929; 54%).
DET tokens may have the following values of Gender:
Fem(696; 40% of non-emptyGender): =t, ti=, t=, toː=, tuː=, oːt, eːt, toːt, teː=, tuːtMasc(1036; 60% of non-emptyGender): i=, oː=, w=, uː=, oːn, uːn, j=, =b, eːn, aː=EMPTY(4): -a, -aː, =eː, mhasi
NOUN
1395 NOUN tokens (81% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=EMPTY (1163; 83%).
NOUN tokens may have the following values of Gender:
Fem(441; 32% of non-emptyGender): naː, lhaweː, na, takat, giɖʔa, sala, mʔari, ʃiha, ʔabaː, ʔabaMasc(954; 68% of non-emptyGender): tak, doːr, mhiːn, jhaːm, mijʔat, dhaj, kʷiːkʷʔaːj, gaw, haˈwaːd, ʔarEMPTY(324): kaːm, ʔar, ʔoːr, meːk, tʔiit, =na, =naː, na, kam, ʔaraːw
VERB
1063 VERB tokens (44% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Person=EMPTY (1037; 98%), Number=Sing (939; 88%), VerbClass=1 (676; 64%).
VERB tokens may have the following values of Gender:
Fem(247; 23% of non-emptyGender): tindi, tini, tidi, tiːd, tiːfi, akai, ʔeːta, tikati, ʔeːti, ʔabkinMasc(816; 77% of non-emptyGender): indi, ini, jʔi, iːfi, iːbri, ikati, jʔiːni, id, isni, iːktiEMPTY(1347): eːn, jʔeːn, jʔeːtiːt, akeːna, diːtiːt, ani, diːt, rhan, akajeː, difeː
SCONJ
160 SCONJ tokens (27% of all SCONJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which SCONJ and Gender co-occurred: PronType=Rel (123; 77%).
SCONJ tokens may have the following values of Gender:
Fem(56; 35% of non-emptyGender): =eːt, =jeːt, =t, ti=, t=Masc(104; 65% of non-emptyGender): =eːb, =jeːb, =b, ji=, wi=EMPTY(434): =hoːb, =ajt, =eːk, =aj, =it, =jeːk, =eː, =i, =jeː, =ji
PRON
109 PRON tokens (13% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (102; 94%), Number=Sing (65; 60%), Person=EMPTY (56; 51%).
PRON tokens may have the following values of Gender:
Fem(28; 26% of non-emptyGender): ti=, -t, t=, ambataː, =oːki, imbateː, ombatoːMasc(81; 74% of non-emptyGender): wi=, baruːk, umbaruːk, i=, w=, baroːk, baruː, baraː, barhi, barijoːkEMPTY(710): =heːb, =i, =oː, =eː, ani, =hoːk, kna, =oːk, =oːn, aneːb
AUX
59 AUX tokens (21% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (58; 98%), Person=EMPTY (51; 86%), VerbType=EMPTY (49; 83%).
AUX tokens may have the following values of Gender:
Fem(8; 14% of non-emptyGender): tiki, taki, tidʔi, tindi, tirib, tiːha, tkiMasc(51; 86% of non-emptyGender): =wa, iki, ihi, indi, hijaː, irib, iːkti, dʔijaːb, idi, iniEMPTY(225): =u, =i, =a, andi, akajeː, aki, nijad, dannʔi, ijajna, adi
INTJ
5 INTJ tokens (8% of all INTJ tokens) have a non-empty value of Gender.
INTJ tokens may have the following values of Gender:
Masc(5; 100% of non-emptyGender): jhaːEMPTY(61): iraːnaj, əəə, ahaː, mmm, iraːni, jaːbi, hawawawawa, jhaː, mmmmm, nʔalla
ADJ
3 ADJ tokens (2% of all ADJ tokens) have a non-empty value of Gender.
ADJ tokens may have the following values of Gender:
Fem(1; 33% of non-emptyGender): kʷaɖaːɖatMasc(2; 67% of non-emptyGender): nifri, sasuːbajaːbEMPTY(146): daːji, kass, malia, sagi, daːwri, koː, dabal, daːjiː, raw, hadal
NUM
1 NUM tokens (2% of all NUM tokens) have a non-empty value of Gender.
NUM tokens may have the following values of Gender:
Fem(1; 100% of non-emptyGender): gaːtEMPTY(58): mhaj, gaːl, mhall, alif, asarama, gali, mhali, gaːt, awwal, faɖig
PART
1 PART tokens (0% of all PART tokens) have a non-empty value of Gender.
The most frequent other feature values with which PART and Gender co-occurred: Aspect=EMPTY (1; 100%), Mood=EMPTY (1; 100%), Polarity=EMPTY (1; 100%).
PART tokens may have the following values of Gender:
Masc(1; 100% of non-emptyGender): ʔaʃajEMPTY(320): bak, ka=, han, ontʔa, ki=, baː=, bi=, jaː, bass, tʔa
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (1201; 82%),
NOUN –[acl:relcl]–> SCONJ (110; 74%),
VERB –[dislocated:subj]–> NOUN (15; 60%),
AUX –[compound:svc]–> VERB (7; 100%),
SCONJ –[fixed]–> DET (6; 100%),
NOUN –[discourse]–> DET (3; 60%),
VERB –[dep:redup]–> VERB (3; 100%),
VERB –[dep]–> PRON (3; 60%),
VERB –[dislocated:subj]–> PRON (3; 75%),
PART –[det]–> DET (2; 67%).