Treebank Statistics: UD_Pashto-Sikaram: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
1740 tokens (43%) have a non-empty value of Gender.
821 types (76%) occur at least once with a non-empty value of Gender.
653 lemmas (78%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (870; 21% instances), ADJ (434; 11% instances), PROPN (147; 4% instances), AUX (86; 2% instances), VERB (78; 2% instances), DET (71; 2% instances), NUM (41; 1% instances), PRON (13; 0% instances).
NOUN
870 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (614; 71%), Case=Nom (477; 55%).
NOUN tokens may have the following values of Gender:
Fem(380; 44% of non-emptyGender): ژبه, ژبې, ژباړې, ژباړه, ژبو, توګه, خوا, مانا, برخه, خبرېMasc(490; 56% of non-emptyGender): کار, خلکو, کتابونه, خلک, ډول, کتابونو, کسان, وخت, ژوند, ارزښت
Gender seems to be lexical feature of NOUN. 99% lemmas (382) occur only with one value of Gender.
ADJ
434 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (301; 69%), Number=Sing (270; 62%).
ADJ tokens may have the following values of Gender:
Fem(172; 40% of non-emptyGender): سمه, نورو, ناسمه, ښه, اصلي, زياتې, بله, نورې, زياته, سترهMasc(262; 60% of non-emptyGender): زيات, نورو, زده, ټولنیز, اړوند, جوړ, نور, ټولنیزو, پوهنیزو, اسلاميEMPTY(1): خپور
| Paradigm نور | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Plur | نورو | نورو |
| Case=Loc|Number=Plur | نورو | نورو |
| Case=Nom|Number=Sing | نور | |
| Case=Nom|Number=Plur | نور | نورې |
PROPN
147 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (131; 89%).
PROPN tokens may have the following values of Gender:
Fem(79; 54% of non-emptyGender): پښتو, اردو, پاړسي, عربي, مریم, امريکا, انګرېزۍ, انګرېزي, ايینې, براونMasc(68; 46% of non-emptyGender): پښتانه, احمد, وحید, پښتنو, پیتر, پنج, کتاب, افغان, ايران, ايرانیان
Gender seems to be lexical feature of PROPN. 100% lemmas (53) occur only with one value of Gender.
AUX
86 AUX tokens (40% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=EMPTY (84; 98%), Number=Sing (76; 88%), Mood=Ind (72; 84%), Person=3 (72; 84%), VerbForm=Fin (72; 84%), Tense=Pres (60; 70%).
AUX tokens may have the following values of Gender:
Fem(46; 53% of non-emptyGender): ده, وه, دي, شوېMasc(40; 47% of non-emptyGender): دی, شوی, شوي, دﺉ, و, وو, شو, کېدلEMPTY(130): شي, به, وي, کېږي, دي, ونه, شو, شوای, شې, وای
| Paradigm ول | Masc | Fem |
|---|---|---|
| Number=Sing|Tense=Past | و | وه |
| Number=Sing|Tense=Pres | دی, دﺉ | ده |
| Number=Plur|Tense=Past | وو | |
| Number=Plur|Tense=Pres | دي |
VERB
78 VERB tokens (24% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Tense=Past (77; 99%), Number=Sing (55; 71%), Mood=Ind (42; 54%), Person=3 (42; 54%), VerbForm=Fin (42; 54%), Case=EMPTY (40; 51%).
VERB tokens may have the following values of Gender:
Fem(23; 29% of non-emptyGender): کړه, کړې, شوه, شوې, درلوده, رسولې, شویو, لوېدلې, ورکړې, وهلهMasc(55; 71% of non-emptyGender): کړی, شو, شوي, شوی, وواهه, کړل, کړي, کښل, تړلي, راوغزولEMPTY(249): کوي, لري, کړي, شي, کېږي, ژباړل, کولای, ورکوي, شته, وايي
| Paradigm کول | Masc | Fem |
|---|---|---|
| Aspect=Imp|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin | کاوه | |
| Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Variant=Long|VerbForm=Fin | وکړ | |
| Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Variant=Short|VerbForm=Fin | کړه, وکړه | |
| Aspect=Perf|Mood=Ind|Number=Sing|Person=3|VerbForm=Fin | کړ | |
| Aspect=Perf|Mood=Ind|Number=Plur|Person=3|VerbForm=Fin | کړل | |
| Case=Nom|Number=Sing|VerbForm=Part | کړی | کړې |
| Case=Nom|Number=Plur|VerbForm=Part | کړي |
DET
71 DET tokens (45% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Variant=EMPTY (63; 89%), Deixis=EMPTY (54; 76%), Number=Sing (49; 69%), Poss=EMPTY (49; 69%), Reflex=EMPTY (49; 69%).
DET tokens may have the following values of Gender:
Fem(37; 52% of non-emptyGender): دې, خپله, هرې, هره, ټوله, ټولې, کومه, کومې, هغه, همدېMasc(34; 48% of non-emptyGender): خپل, هر, کوم, ټول, خپلو, دغه, دې, هغه, هماغهEMPTY(87): دغه, هغه, داسې, ځینې, څو, دې, همدغه, دغسې, هماغه, ځینو
| Paradigm خپل | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Sing | خپل | |
| Case=Acc|Number=Plur | خپلو | |
| Case=Loc|Number=Sing | خپل | خپله |
| Case=Nom|Number=Sing | خپل | خپله |
| Case=Nom|Number=Plur | خپل |
NUM
41 NUM tokens (89% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (41; 100%), Case=Nom (28; 68%).
NUM tokens may have the following values of Gender:
Fem(21; 51% of non-emptyGender): یوه, یوې, دوېMasc(20; 49% of non-emptyGender): یو, یوه, دوهEMPTY(5): دوو, 1, 30, 40
| Paradigm یو | Masc | Fem |
|---|---|---|
| Case=Abl | یوې | |
| Case=Acc | یوه | یوې |
| Case=Loc | یوه | |
| Case=Nom | یو | یوه |
PRON
13 PRON tokens (7% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (13; 100%), Variant=EMPTY (13; 100%), Number=Sing (12; 92%), Person=EMPTY (7; 54%).
PRON tokens may have the following values of Gender:
Fem(8; 62% of non-emptyGender): دې, هغې, داMasc(5; 38% of non-emptyGender): هغۀ, ټولو, څوکEMPTY(183): يې, چې, ور, دا, دوی, دې, هغوی, یې, ځان, زموږ
| Paradigm هغه | Masc | Fem |
|---|---|---|
| Case=Acc | هغۀ | هغې |
| Case=Loc | هغې |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (253; 100%),
NOUN –[conj]–> NOUN (51; 62%),
NOUN –[nummod]–> NUM (38; 93%),
NOUN –[nmod]–> PROPN (26; 60%),
ADJ –[conj]–> ADJ (25; 96%),
ADJ –[nsubj]–> NOUN (22; 88%),
ADJ –[cop]–> AUX (20; 63%),
VERB –[aux:perf]–> AUX (12; 60%),
NOUN –[compound]–> NOUN (10; 100%),
ADJ –[appos]–> ADJ (7; 100%).