Treebank Statistics: UD_Pashto-Sikaram: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
2379 tokens (44%) have a non-empty value of Gender.
1071 types (78%) occur at least once with a non-empty value of Gender.
823 lemmas (79%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (1156; 21% instances), ADJ (578; 11% instances), PROPN (217; 4% instances), VERB (137; 3% instances), AUX (123; 2% instances), DET (88; 2% instances), NUM (52; 1% instances), PRON (28; 1% instances).
NOUN
1156 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (824; 71%), Case=Nom (602; 52%).
NOUN tokens may have the following values of Gender:
Fem(498; 43% of non-emptyGender): ژبه, ژبې, خوا, ژباړې, ژباړه, ژبو, توګه, خبرې, برخه, ستونزهMasc(658; 57% of non-emptyGender): کار, کتابونه, وخت, خلکو, ډول, خلک, کتابونو, کسان, ډګر, دود
Gender seems to be lexical feature of NOUN. 99% lemmas (481) occur only with one value of Gender.
ADJ
578 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Case=Nom (389; 67%), Number=Sing (366; 63%).
ADJ tokens may have the following values of Gender:
Fem(227; 39% of non-emptyGender): سمه, نورو, بله, ناسمه, ښه, اصلي, زياتې, زياته, ستره, نورېMasc(351; 61% of non-emptyGender): زده, زيات, نورو, ټولنیز, پوهنیزو, اړ, جوړ, نور, اړوند, لږEMPTY(8): ق, هـ, خپور, م
| Paradigm نور | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Plur | نورو | نورو |
| Case=Loc|Number=Plur | نورو | نورو |
| Case=Nom|Number=Sing | نور | نوره |
| Case=Nom|Number=Plur | نور | نورې |
PROPN
217 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (192; 88%).
PROPN tokens may have the following values of Gender:
Fem(108; 50% of non-emptyGender): پښتو, اردو, پاړسي, عربي, دري, مریم, امريکا, انګرېزۍ, کوټه, انګرېزيMasc(109; 50% of non-emptyGender): پښتانه, افغانستان, پښتنو, کابل, احمد, بابا, وحید, پیتر, خوشال, پنج
Gender seems to be lexical feature of PROPN. 99% lemmas (67) occur only with one value of Gender.
VERB
137 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Tense=Past (135; 99%), Number=Sing (99; 72%), Mood=Ind (79; 58%), Person=3 (79; 58%), VerbForm=Fin (79; 58%), Case=EMPTY (77; 56%).
VERB tokens may have the following values of Gender:
Fem(51; 37% of non-emptyGender): کړې, کړه, شوه, شوې, موندلې, وه, وکړه, تدريسوله, خوړلې, درلودهMasc(86; 63% of non-emptyGender): شو, شوي, شوی, کړی, وواهه, ويلي, کاوه, وکړ, کول, کړEMPTY(311): لري, کوي, کړي, کېږي, شي, کولای, شته, ورکوي, ژباړل, راځي
| Paradigm کول | Masc | Fem |
|---|---|---|
| Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin | کاوه | |
| Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin | کول | |
| Aspect=Imp|Mood=Ind|Number=Plur|Person=3|Tense=Pres|Typo=Yes|VerbForm=Fin | کوو | |
| Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past|Variant=Long|VerbForm=Fin | وکړ | |
| Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past|Variant=Short|VerbForm=Fin | کړه, وکړه | |
| Aspect=Perf|Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin | کړ | |
| Aspect=Perf|Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin | کړل | |
| Case=Nom|Number=Sing|Tense=Past|VerbForm=Part | کړی | کړې |
| Case=Nom|Number=Plur|Tense=Past|VerbForm=Part | کړي |
AUX
123 AUX tokens (45% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Aspect=EMPTY (118; 96%), Mood=Ind (103; 84%), Number=Sing (103; 84%), Person=3 (103; 84%), VerbForm=Fin (103; 84%), Tense=Pres (72; 59%).
AUX tokens may have the following values of Gender:
Fem(65; 53% of non-emptyGender): ده, وه, شوې, دي, شوه, شوي, وېMasc(58; 47% of non-emptyGender): دی, شوی, شوي, و, وو, دﺉ, شول, شو, کېدلEMPTY(148): شي, به, دي, وي, کېږي, شو, شوای, شې, وای, وکولای
| Paradigm ول | Masc | Fem |
|---|---|---|
| Number=Sing|Tense=Past | و | وه |
| Number=Sing|Tense=Pres | دی, دﺉ | ده |
| Number=Plur|Tense=Past | وو | وې |
| Number=Plur|Tense=Pres | دي |
DET
88 DET tokens (45% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Variant=EMPTY (78; 89%), Deixis=EMPTY (64; 73%), Number=Sing (62; 70%), Poss=EMPTY (62; 70%), Reflex=EMPTY (62; 70%).
DET tokens may have the following values of Gender:
Fem(47; 53% of non-emptyGender): دې, کومه, خپله, هرې, هره, هغه, همدې, ټوله, ټولې, کومېMasc(41; 47% of non-emptyGender): خپل, هر, کوم, خپلو, ټول, دغه, دې, هغه, هماغهEMPTY(108): دغه, هغه, دې, ځینې, څو, داسې, همدغه, دا, دغسې, هماغه
| Paradigm دا | Masc | Fem |
|---|---|---|
| Case=Abl|Number=Sing | دې | |
| Case=Abl|Number=Sing|Variant=Short | دې | |
| Case=Loc|Number=Sing | دې | دې |
| Case=Loc|Number=Sing|Variant=Short | دې | |
| Case=Loc|Number=Plur | دې |
NUM
52 NUM tokens (75% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (52; 100%), Case=Nom (36; 69%).
NUM tokens may have the following values of Gender:
Fem(30; 58% of non-emptyGender): یوه, یوې, دوېMasc(22; 42% of non-emptyGender): یو, یوه, دوهEMPTY(17): دوو, 0053, 1, 1032, 1044, 1075, 1100, 1106, 1525, 1858
| Paradigm یو | Masc | Fem |
|---|---|---|
| Case=Abl | یوې | |
| Case=Acc | یوه | یوې |
| Case=Loc | یوه | |
| Case=Nom | یو | یوه |
PRON
28 PRON tokens (13% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Variant=EMPTY (28; 100%), Number=Sing (27; 96%), Poss=EMPTY (24; 86%), PronType=Prs (20; 71%), Person=3 (16; 57%), Case=Acc (15; 54%).
PRON tokens may have the following values of Gender:
Fem(14; 50% of non-emptyGender): خپله, هغې, دا, دېMasc(14; 50% of non-emptyGender): ده, هغۀ, هغه, دی, ټولو, څوکEMPTY(188): يې, ور, دا, یې, دوی, دې, هغوی, ځان, څه, زموږ
| Paradigm هغه | Masc | Fem |
|---|---|---|
| Case=Acc | هغۀ, هغه | هغې |
| Case=Loc | هغې |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (329; 99%),
NOUN –[conj]–> NOUN (65; 61%),
NOUN –[nummod]–> NUM (48; 91%),
NOUN –[nmod]–> PROPN (44; 62%),
ADJ –[conj]–> ADJ (35; 97%),
ADJ –[nsubj]–> NOUN (28; 93%),
VERB –[conj]–> VERB (27; 52%),
ADJ –[cop]–> AUX (26; 63%),
VERB –[aux:perf]–> AUX (26; 70%),
PROPN –[flat:name]–> PROPN (16; 100%).