Treebank Statistics: UD_Sindhi-Isra: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
This is a layered feature with the following layers: Gender, Gender[obj], Gender[subj].
45878 tokens (48%) have a non-empty value of Gender.
7625 types (78%) occur at least once with a non-empty value of Gender.
3882 lemmas (76%) occur at least once with a non-empty value of Gender.
The feature is used with 9 part-of-speech tags: NOUN (25335; 27% instances), ADP (5408; 6% instances), VERB (4488; 5% instances), PROPN (4279; 4% instances), ADJ (2469; 3% instances), AUX (1677; 2% instances), DET (1243; 1% instances), PRON (927; 1% instances), ADV (52; 0% instances).
NOUN
25335 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (21058; 83%), Case=Nom (16010; 63%).
NOUN tokens may have the following values of Gender:
Fem(9195; 36% of non-emptyGender): ڳالهه, حڪومت, پوليس, زال, دنيا, طرح, طور, خبر, دل, زمينMasc(16140; 64% of non-emptyGender): شينهن, ملڪ, گدڙ, وقت, ڏينهن, ماڻهو, پاڻي, وزير, بادشاهه, ماڻهنEMPTY(745): حاصل, شامل, متاثرن, عوام, پري, منهن, حوالي, طرف, ٻوڏ, ڪيس
ADP
5408 ADP tokens (38% of all ADP tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADP and Gender co-occurred: Number=Sing (4839; 89%), Case=Nom (2767; 51%).
ADP tokens may have the following values of Gender:
Fem(1953; 36% of non-emptyGender): جي, جون, واري, کي, جهڙي, وارين, واريون, بعد, سان, کانMasc(3455; 64% of non-emptyGender): جي, جو, جا, واري, وارو, وارا, وارن, جهڙو, جون, سانEMPTY(8946): ۾, کي, تي, سان, کان, لاءِ, مان, تائين, پوءِ, وٽ
| Paradigm جي | Masc | Fem |
|---|---|---|
| Case=Acc | جي | |
| Case=Acc|Number=Sing | جي | جي |
| Case=Acc|Number=Plur | جي | جي |
| Case=Nom|Number=Sing | جي | جي |
| Case=Nom|Number=Plur | جي | |
| Number=Sing | جي |
VERB
4488 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: VerbForm=EMPTY (4481; 100%), Voice=EMPTY (3773; 84%), Number=Sing (3455; 77%), Aspect=Perf (3408; 76%), Person=EMPTY (2442; 54%).
VERB tokens may have the following values of Gender:
Fem(1097; 24% of non-emptyGender): ڪئي, ويئي, آئي, ٿيندي, لڳي, ڏني, پيئي, ويون, هوندي, وئيMasc(3391; 76% of non-emptyGender): ويو, ڪيو, ويا, ڏنو, چيو, آيو, لڳو, ڇڏيو, رهيو, رهياEMPTY(8583): ڪري, ڪرڻ, چيو, وڃي, اچي, کڻي, وٺي, پيو, ڏسي, پئي
| Paradigm ڪر | Masc | Fem |
|---|---|---|
| Aspect=Imp|Number=Sing|Person=3|Voice=Act | ڪري | |
| Aspect=Imp|Number=Plur | ڪرڻيون | |
| Aspect=Imp|Number=Plur|Person=1 | ڪنداسون | |
| Aspect=Imp|Number=Plur|Person=3 | ڪريو | |
| Aspect=Imp|Number=Plur|Voice=Act | ڪرڻيون | |
| Aspect=Imp|Tense=Pres|Voice=Act | ڪرائڻ | |
| Aspect=Perf|Number=Sing|Person=3 | ڪريو | |
| Aspect=Perf|Number=Sing|Voice=Act | ڪريو | |
| Aspect=Perf|Number=Plur | ڪريا |
PROPN
4279 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Case=Nom (3961; 93%), Number=EMPTY (3803; 89%).
PROPN tokens may have the following values of Gender:
Fem(624; 15% of non-emptyGender): سنڌ, پ, ڪابيرو, ڪمپا, بي, شام, ماليڪٽ, پارٽي, نومبر, بينظيرMasc(3655; 85% of non-emptyGender): پاڪستان, علي, ڪراچي, محمد, آمريڪا, احمد, اسلام, الله, نواز, آبادEMPTY(18): وڳڻ, B.T, اعتبار, اورڪزئي, بدين, خيرپور, سپريم, شيخ, مالھي, محمد
Gender seems to be lexical feature of PROPN. 91% lemmas (545) occur only with one value of Gender.
ADJ
2469 ADJ tokens (40% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (2027; 82%), Case=Nom (1927; 78%), Degree=Pos (1547; 63%).
ADJ tokens may have the following values of Gender:
Fem(750; 30% of non-emptyGender): ڏاڍي, سڄي, اهڙيءَ, اهڙي, وڏي, هڪڙي, پوري, سهڻي, گهڻي, بيگاريMasc(1719; 70% of non-emptyGender): هڪڙي, وڏي, هڪڙو, نئين, وڏو, ڏاڍو, گهڻو, سٺو, ٿورو, ڪيترائيEMPTY(3651): ٻئي, وڌيڪ, ڪجهه, ٻين, سڀ, مختلف, خوش, ختم, خودڪش, سڀني
| Paradigm هڪڙو | Masc | Fem |
|---|---|---|
| Case=Acc | هڪڙي | هڪڙي, هڪڙيءَ |
| Case=Nom | هڪڙو | هڪڙي |
AUX
1677 AUX tokens (34% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Number=Sing (1434; 86%), Person=3 (918; 55%).
AUX tokens may have the following values of Gender:
Fem(444; 26% of non-emptyGender): ٿي, هئي, آهي, هيون, نٿي, ٿيون, ڪانهي, هوندي, سگهندي, سگهنديونMasc(1233; 74% of non-emptyGender): هو, ٿو, ٿيو, هئا, ٿي, ٿا, نٿو, سگهيو, آهي, نٿاEMPTY(3241): آهي, آهن, ٿي, ٿا, ناهي, سگهي, هجي, آهيون, آھي, آهيان
| Paradigm آهي | Masc | Fem |
|---|---|---|
| _ | ٿي | |
| Aspect=Imp|Number=Sing | هوندو | |
| Aspect=Imp|Number=Sing|Person=3 | هوندو, هوندي | هوندي |
| Aspect=Imp|Number=Sing|Person=3|Tense=Fut | هوندو | هوندي |
| Aspect=Imp|Number=Sing|Tense=Fut | هوندي | |
| Aspect=Imp|Number=Plur | هوندا | |
| Aspect=Perf|Number=Sing|Person=3 | ٿيو | |
| Aspect=Perf|Number=Sing|Person=3|Tense=Pres | ٿيو | |
| Aspect=Perf|Number=Plur | ٿيا | هيون |
| Mood=Sub|Number=Sing|Tense=Past | ها | |
| Number=Sing | ٿي, ٿو | ٿي |
| Number=Sing|Person=1|Polarity=Neg | نٿي | |
| Number=Sing|Person=1|Tense=Past | هوس | |
| Number=Sing|Person=1|Tense=Pres | آهيان | آهيان, ٿي |
| Number=Sing|Person=2|Tense=Past | هئين | |
| Number=Sing|Person=3 | ٿيو | ٿي |
| Number=Sing|Person=3|Polarity=Neg|Tense=Pres | ناهي | |
| Number=Sing|Person=3|Tense=Fut | هوندو | هوندي |
| Number=Sing|Person=3|Tense=Past | هو | هئي |
| Number=Sing|Person=3|Tense=Pres | ٿو, آهي, ٿي, آ | آهي, ٿي, آ |
| Number=Sing|Polarity=Neg | نٿو, نٿي | نٿي |
| Number=Sing|Polarity=Neg|Tense=Pres | نٿو, نٿي | نٿي, ڪانهي |
| Number=Sing|Tense=Past | هئس | هئي |
| Number=Sing|Tense=Pres | ٿو, ٿي, آهيان, آهي, ٿيون | ٿي, آهي |
| Number=Plur | ٿا | |
| Number=Plur|Person=2|Tense=Past | هئا | |
| Number=Plur|Person=3 | ٿيو | |
| Number=Plur|Person=3|Tense=Fut | هوندا | |
| Number=Plur|Person=3|Tense=Past | هئا, ھئا | هيون |
| Number=Plur|Person=3|Tense=Pres | ٿا, آهن, آهيون, ٿيا | ٿيون |
| Number=Plur|Polarity=Neg|Tense=Pres | نٿا | |
| Number=Plur|Tense=Past | هيون | |
| Number=Plur|Tense=Pres | ٿا, ٿيون, ٿي, آهيون | ٿيون |
| Person=3|Tense=Pres | ٿي |
DET
1243 DET tokens (27% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Dem (1243; 100%), Person=EMPTY (1225; 99%), Number=Sing (1192; 96%), Case=Nom (1173; 94%).
DET tokens may have the following values of Gender:
Fem(404; 33% of non-emptyGender): اها, ڪا, هوءَ, جيڪا, هيءَ, ڪهڙي, اهڙي, اهڙيءَ, اِها, جهڙيMasc(839; 67% of non-emptyGender): اهو, ڪو, جيڪو, هو, هُو, اهڙو, ڪي, هي, ڪنهن, اُهيEMPTY(3311): هن, ان, اسان, انهن, جنهن, ڪنهن, انهيءَ, هر, اهي, هو
| Paradigm هن | Masc | Fem |
|---|---|---|
| هن | هن |
PRON
927 PRON tokens (40% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: PronType=EMPTY (860; 93%), Case=Gen (813; 88%), Number=Sing (786; 85%), Person=1 (667; 72%).
PRON tokens may have the following values of Gender:
Fem(314; 34% of non-emptyGender): پنهنجي, منهنجي, تنهنجي, پنهنجيءَ, پنهنجون, مونکي, پنهنجيون, اسانجي, انجو, انجيMasc(613; 66% of non-emptyGender): پنهنجي, سندس, پنهنجو, سو, سندن, پنهنجا, پنهنجن, منهنجو, تنهنجو, منهنجيEMPTY(1414): مون, پاڻ, تون, ڇا, آءٌ, اوهان, جن, ائين, توکي, توهان
| Paradigm پنهنجي | Masc | Fem |
|---|---|---|
| Case=Acc|Person=1 | پنهنجي | پنهنجي |
| Case=Gen | پنهنجي | پنهنجي |
| Case=Gen|Person=1 | پنهنجي | پنهنجي |
ADV
52 ADV tokens (2% of all ADV tokens) have a non-empty value of Gender.
ADV tokens may have the following values of Gender:
Fem(29; 56% of non-emptyGender): تيئن, جيئن, آهستي, ايئن, هڪدم, ڪالهه, ڪڏهن, گھوري, اڄ, اڳتيMasc(23; 44% of non-emptyGender): اڄ, جڏهن, جهٽ, سڀاڻي, اصل, ايتري, ايتريتائين, اڄڪلھ, نافذ, هوڏانهنEMPTY(3100): جڏهن, وري, اتي, هاڻي, پوءِ, اڄ, جيئن, جاري, ڏانهن, تمام
| Paradigm جڏهن | Masc | Fem |
|---|---|---|
| Case=Acc | جڏهن | |
| Case=Nom | جڏهن | جڏهن |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[nmod]–> NOUN (3544; 57%),
NOUN –[conj]–> NOUN (770; 66%),
NOUN –[compound]–> PROPN (635; 94%),
NOUN –[nmod]–> PRON (623; 70%),
NOUN –[nmod]–> PROPN (616; 62%),
PROPN –[flat]–> PROPN (454; 94%),
PROPN –[nmod]–> NOUN (449; 82%),
PROPN –[compound]–> PROPN (282; 90%),
NOUN –[compound]–> NOUN (225; 72%),
NOUN –[nsubj]–> NOUN (222; 56%).