Treebank Statistics: UD_Ottoman_Turkish-DUDU: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
1794 tokens (8%) have a non-empty value of Gender.
1101 types (13%) occur at least once with a non-empty value of Gender.
707 lemmas (16%) occur at least once with a non-empty value of Gender.
The feature is used with 4 part-of-speech tags: NOUN (1175; 5% instances), PROPN (569; 3% instances), ADJ (47; 0% instances), PRON (3; 0% instances).
NOUN
1175 NOUN tokens (14% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (1173; 100%), Number[psor]=EMPTY (899; 77%), Person[psor]=EMPTY (899; 77%), Number=Sing (861; 73%), Case=Nom (766; 65%).
NOUN tokens may have the following values of Gender:
Fem(1162; 99% of non-emptyGender): riʿāyet, eyāleti, ʿavret, sene, es̱nāda, küffār-ı, müddet-i, senesinde, vilāyetine, şehādetMasc(13; 1% of non-emptyGender): beyt-i, baḥr-i, bevvāb, ism-i, kitābda, kitāblar, kitābları, mücahidīn, müşrikīnüñ, ʿilm-iEMPTY(7128): var, gün, paşa, üzerine, bin, efendi, melik, gice, oġlı, beglerbegisi
| Paradigm ʿilm | Masc | Fem |
|---|---|---|
| Case=Loc|Number=Plur | ʿulūmda | |
| Case=Nom|Number=Sing | ʿilm-i | |
| Case=Nom|Number=Plur | ʿulūm-i |
Gender seems to be lexical feature of NOUN. 100% lemmas (476) occur only with one value of Gender.
PROPN
569 PROPN tokens (35% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (569; 100%), Number=Sing (567; 100%), Case=Nom (411; 72%), NameType=Prs (392; 69%).
PROPN tokens may have the following values of Gender:
Fem(198; 35% of non-emptyGender): mıṣr, burusada, hācer, yemen, yehūd, şām, iskenderiyye, şāmda, ḥaleb, amāsiyyeMasc(371; 65% of non-emptyGender): ibrāhīm, nemrūd, aḥmed, züheyr, muṣṭafā, meḥemmed, ʿalī, muḥammed, kenʿān, maḥmūdEMPTY(1054): ʿanter, iskender, āẕer, allāh, ʿanterüñ, mālik, rūmili, şī, baġdād, islām
Gender seems to be lexical feature of PROPN. 100% lemmas (200) occur only with one value of Gender.
ADJ
47 ADJ tokens (4% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Person=EMPTY (40; 85%), Number=EMPTY (39; 83%), Case=EMPTY (27; 57%).
ADJ tokens may have the following values of Gender:
Fem(46; 98% of non-emptyGender): ʿaliyye, şerīfe, cemīle, mezbūre, aẓīme, bedīʿā, belīġa, celiyye, kes̱īre, kübrāMasc(1; 2% of non-emptyGender): şerīfiEMPTY(1256): niçe, maʿzūl, çoḳ, mezbūr, vāḳiʿ, dürlü, ẓāhir, mezbūruñ, muḳarrer, büyük
Gender seems to be lexical feature of ADJ. 100% lemmas (36) occur only with one value of Gender.
PRON
3 PRON tokens (0% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (3; 100%), PronType=Ind (3; 100%), Case=Nom (2; 67%), Number=Sing (2; 67%), Number[psor]=Sing (2; 67%), Person[psor]=3 (2; 67%).
PRON tokens may have the following values of Gender:
Fem(1; 33% of non-emptyGender): cümle-iMasc(2; 67% of non-emptyGender): mecmūʿsını, mecmūʿısıEMPTY(1003): andan, anuñ, bunuñ, ben, kendü, anda, benüm, bu, ol, anı
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
PROPN –[conj]–> PROPN (35; 66%),
NOUN –[nmod:poss]–> ADJ (6; 55%),
PROPN –[nmod:poss]–> PROPN (2; 67%).