Treebank Statistics: UD_Pashto-Sikaram: Features: Case
This feature is universal.
It occurs with 6 different values: Abl, Acc, Gen, Loc, Nom, Voc.
3254 tokens (60%) have a non-empty value of Case.
1110 types (81%) occur at least once with a non-empty value of Case.
866 lemmas (84%) occur at least once with a non-empty value of Case.
The feature is used with 10 part-of-speech tags: NOUN (1156; 21% instances), ADP (838; 15% instances), ADJ (578; 11% instances), PROPN (217; 4% instances), DET (168; 3% instances), VERB (139; 3% instances), PRON (78; 1% instances), NUM (56; 1% instances), AUX (20; 0% instances), ADV (4; 0% instances).
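As an illustration of how per-tag counts like these could be derived, here is a minimal sketch that assumes the treebank is available as CoNLL-U text (the standard UD format, with UPOS in column 4 and FEATS in column 6). The sample sentence, the helper names `parse_feats` and `case_counts`, and the choice of all tokens as the percentage denominator are assumptions made for this example; it is not the script that produced this page.

```python
from collections import Counter

# Hypothetical three-token CoNLL-U fragment (placeholder forms, not treebank data).
SAMPLE = """\
1\tkitab\tkitab\tNOUN\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tnsubj\t_\t_
2\tdey\tyam\tAUX\t_\tNumber=Sing|Person=3\t0\troot\t_\t_
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""

def parse_feats(feats):
    """Turn 'A=x|B=y' into {'A': 'x', 'B': 'y'}; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def case_counts(conllu_text):
    """Return (tokens with a Case value per UPOS, total token count)."""
    per_upos = Counter()
    total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        total += 1
        if "Case" in parse_feats(cols[5]):
            per_upos[cols[3]] += 1
    return per_upos, total

per_upos, total = case_counts(SAMPLE)
for upos, n in per_upos.most_common():
    # Assumed denominator: all tokens, which matches the figures listed above.
    print(f"{upos} ({n}; {round(100 * n / total)}% instances)")
```

Dividing by all tokens is consistent with the numbers on this page: 3254 tokens at 60% implies roughly 5400 tokens in total, and 1156 NOUN tokens are about 21% of that.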
NOUN
1156 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (824; 71%), Gender=Masc (658; 57%).
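A co-occurrence line of this kind could be computed roughly as follows. This is a sketch under stated assumptions: the token list is hypothetical, `cooccurring` is an illustrative helper, the percentage is taken over tokens of the given tag that have a Case value (824 of 1156 is 71%), and the "more than half" cut-off (`min_share`) is a guess based on the values shown on this page. The `EMPTY` pseudo-value that appears for other tags is not modelled.

```python
from collections import Counter

# Hypothetical pre-parsed tokens: (UPOS, feature dict). Not real treebank data.
TOKENS = [
    ("NOUN", {"Case": "Nom", "Gender": "Fem", "Number": "Sing"}),
    ("NOUN", {"Case": "Acc", "Gender": "Masc", "Number": "Sing"}),
    ("NOUN", {"Case": "Loc", "Gender": "Masc", "Number": "Plur"}),
    ("VERB", {"Number": "Sing"}),
]

def cooccurring(tokens, upos, feature="Case", min_share=0.5):
    """List (feature=value, count, percent) pairs seen alongside `feature`."""
    # Keep only tokens of the requested tag that carry the feature at all.
    selected = [feats for tag, feats in tokens if tag == upos and feature in feats]
    counts = Counter(
        f"{name}={value}"
        for feats in selected
        for name, value in feats.items()
        if name != feature
    )
    return [
        (pair, n, round(100 * n / len(selected)))
        for pair, n in counts.most_common()
        if n / len(selected) > min_share  # assumed threshold
    ]

result = cooccurring(TOKENS, "NOUN")
print(", ".join(f"{pair} ({n}; {pct}%)" for pair, n, pct in result))
```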
NOUN tokens may have the following values of Case:
Abl (49; 4% of non-empty Case): خوا, کبله, مخې, مرغه, اړخه, لاسه, منځه, اره, اصولو, امله
Acc (340; 29% of non-empty Case): ژبې, ژباړې, کتابونو, خلکو, ژبو, ساري, هېوادونو, پرمختګ, کار, ارزښتونو
Loc (165; 14% of non-empty Case): ژبه, توګه, برخه, وخت, ډګر, سیمه, ټکي, ژبو, بڼه, جمله
Nom (602; 52% of non-empty Case): ژبه, کتابونه, خبرې, ژباړه, خلک, ستونزه, ډول, کسان, اړتیا, خبره
| Paradigm ژباړه | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| | ژباړه | ژباړې | ژباړه | ژباړې |
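A paradigm row like the one above can be thought of as a lookup from Case value to word form for a single lemma. The sketch below uses the four forms from that row as input; the helper name `paradigm`, the one-observation-per-cell input, and the rule of keeping only the most frequent form per cell are assumptions for illustration, not a description of how this page was generated.

```python
from collections import Counter, defaultdict

CASES = ["Nom", "Acc", "Loc", "Abl"]

# (form, lemma, Case) observations; forms copied from the paradigm row above,
# frequencies are not reproduced (each form is listed once).
OBSERVATIONS = [
    ("ژباړه", "ژباړه", "Nom"),
    ("ژباړې", "ژباړه", "Acc"),
    ("ژباړه", "ژباړه", "Loc"),
    ("ژباړې", "ژباړه", "Abl"),
]

def paradigm(observations, lemma):
    """Map each Case value to the most frequent form of `lemma` (or '')."""
    cells = defaultdict(Counter)
    for form, obs_lemma, case in observations:
        if obs_lemma == lemma:
            cells[case][form] += 1
    return {
        case: cells[case].most_common(1)[0][0] if cells[case] else ""
        for case in CASES
    }

row = paradigm(OBSERVATIONS, OBSERVATIONS[0][1])
# Emit one table row with an empty label cell, as in the table above.
print("| | " + " | ".join(row[case] for case in CASES) + " |")
```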
ADP
838 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.
ADP tokens may have the following values of Case:
Abl (66; 8% of non-empty Case): له, تر, پرته, پورې, سره
Acc (430; 51% of non-empty Case): د, ته, له, لپاره, څخه, سره, تر, لاندې, ترڅنګ, وروسته
Loc (342; 41% of non-empty Case): په, کې, پر, باندې, پۀ, پسې, پورې
| Paradigm له | Acc | Abl |
|---|---|---|
| | له | له |
ADJ
578 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (366; 63%), Gender=Masc (351; 61%).
ADJ tokens may have the following values of Case:
Abl (9; 2% of non-empty Case): بده, بلې, سمه, لږه, نړیوالو, ډېره
Acc (117; 20% of non-empty Case): نورو, پوهنیزو, ټولنیزو, اغېزمن, متاثر, پوهنیز, اسلامي, بل, کلتوري, اداري
Loc (63; 11% of non-empty Case): نورو, ټولنیز, وروستیو, دري, لره, هنري, پوهنیز, اخرو, ادبي, اسلامي
Nom (389; 67% of non-empty Case): زده, زيات, ښه, سمه, اړ, جوړ, نور, ناسمه, اصلي, اړوند
EMPTY (8): ق, هـ, خپور, م
| Paradigm بل | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Masc | بل | بل | ||
| Gender=Fem | بله | بله | بلې |
PROPN
217 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (192; 88%), Gender=Masc (109; 50%).
PROPN tokens may have the following values of Case:
Abl (1; 0% of non-empty Case): پېښوره
Acc (79; 36% of non-empty Case): پښتو, پښتنو, افغانستان, پیتر, کابل, اردو, بابا, خوشال, ايران, مریم
Loc (48; 22% of non-empty Case): پښتو, اردو, پاړسي, کابل, انګرېزۍ, ږوب, کوټه, افغانستان, امريکا, جرمني
Nom (88; 41% of non-empty Case): پښتو, پښتانه, احمد, وحید, عربي, پنج, ايرانیان, بابا, دري, پاړسي
Voc (1; 0% of non-empty Case): سامه
| Paradigm پښتو | Nom | Acc | Loc |
|---|---|---|---|
| | پښتو | پښتو | پښتو |
DET
168 DET tokens (86% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (142; 85%), Reflex=EMPTY (142; 85%), Variant=EMPTY (124; 74%), PronType=Dem (93; 55%), Number=EMPTY (85; 51%).
DET tokens may have the following values of Case:
Abl (9; 5% of non-empty Case): دې, همدې, هغه
Acc (29; 17% of non-empty Case): خپل, خپلو, هرې, دې, ځینو, کوم, دغه, دغو, دغې, هغو
Loc (35; 21% of non-empty Case): دې, خپل, خپله, هغه, ټوله, هره, هماغه, دغو, هر, ځینو
Nom (95; 57% of non-empty Case): دغه, هغه, ځینې, خپل, هر, همدغه, کومه, دا, کوم, ټول
EMPTY (28): څو, داسې, دغسې, یوشمېر, هېڅ, هماغسې, همدغسې, څه, څۀ
| Paradigm دا | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Masc\|Number=Sing | دې | | | |
| Gender=Fem\|Number=Sing | دې | دې | | |
| Gender=Fem\|Number=Sing\|Variant=Short | دې | دې | | |
| Gender=Fem\|Number=Plur | دې | | | |
| | دا | دې | دې | دې |
| Variant=Short | دا | دې |
VERB
139 VERB tokens (31% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Mood=EMPTY (137; 99%), Person=EMPTY (137; 99%), Gender=EMPTY (79; 57%), Number=EMPTY (79; 57%), Tense=EMPTY (79; 57%), VerbForm=Inf (79; 57%).
VERB tokens may have the following values of Case:
Acc (12; 9% of non-empty Case): کولو, شویو, رسېدو, ځلولو, څښلو, څکولو, ړنګېدو
Loc (4; 3% of non-empty Case): تېرېدو, لوستلو, ويلو, کارولو
Nom (123; 88% of non-empty Case): کړې, ژباړل, شوي, شوی, کارول, کړی, لیکل, شوې, وهل, ويلي
EMPTY (309): لري, کوي, کړي, کېږي, شي, کولای, کړه, شته, شو, ورکوي
| Paradigm کول | Nom | Acc |
|---|---|---|
| Aspect=Imp\|VerbForm=Inf | کول | کولو |
| Gender=Masc\|Number=Sing\|Tense=Past\|VerbForm=Part | کړی | |
| Gender=Masc\|Number=Plur\|Tense=Past\|VerbForm=Part | کړي | |
| Gender=Fem\|Number=Sing\|Tense=Past\|VerbForm=Part | کړې | |
PRON
78 PRON tokens (36% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Variant=EMPTY (77; 99%), Poss=EMPTY (74; 95%), Gender=EMPTY (50; 64%), PronType=Prs (43; 55%), Person=EMPTY (40; 51%).
PRON tokens may have the following values of Case:
Abl (3; 4% of non-empty Case): دې, ټولو
Acc (29; 37% of non-empty Case): ده, دې, هغوی, هغۀ, هغې, چا, دوی, ما, هغه
Gen (6; 8% of non-empty Case): زموږ, ستا, زما
Loc (11; 14% of non-empty Case): خپله, دې, دوی, ما, هغوی, هغې
Nom (29; 37% of non-empty Case): دا, دوی, هغه, همدا, څوک, دی, موږ, هرڅوک, همدغه, هیڅوک
EMPTY (138): يې, ور, یې, څه, ځان, هرڅه, داسې, در, را, یوبل
| Paradigm دا | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Fem\|Number=Sing | دا | دې | دې | |
| | دا | دې | دې | دې |
NUM
56 NUM tokens (81% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (56; 100%), Gender=Fem (30; 54%).
NUM tokens may have the following values of Case:
Abl (1; 2% of non-empty Case): یوې
Acc (12; 21% of non-empty Case): یوې, یوه, دوو
Loc (6; 11% of non-empty Case): یوه, درېیو, دوو
Nom (37; 66% of non-empty Case): یوه, یو, دوه, دوې, پنځه
EMPTY (13): 0053, 1, 1032, 1044, 1075, 1100, 1106, 1525, 1858, 2
| Paradigm یو | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Masc | یو | یوه | ||
| Gender=Fem | یوه | یوې | یوه | یوې |
AUX
20 AUX tokens (7% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Aspect=EMPTY (20; 100%), Mood=EMPTY (20; 100%), Person=EMPTY (20; 100%), Tense=Past (20; 100%), VerbForm=Part (20; 100%), Gender=Masc (17; 85%), Number=Sing (12; 60%).
AUX tokens may have the following values of Case:
Acc (1; 5% of non-empty Case): شوي
Nom (19; 95% of non-empty Case): شوی, شوي, شوې
EMPTY (251): ده, شي, به, دي, وي, کېږي, دی, وه, و, وو
| Paradigm کېدل | Nom | Acc |
|---|---|---|
| Gender=Masc\|Number=Sing | شوی | شوي |
| Gender=Masc\|Number=Plur | شوي | |
| Gender=Fem\|Number=Sing | شوې | |
| Gender=Fem\|Number=Plur\|Typo=Yes | شوي | |
ADV
4 ADV tokens (2% of all ADV tokens) have a non-empty value of Case.
ADV tokens may have the following values of Case:
Abl (4; 100% of non-empty Case): اوسه
EMPTY (221): هم, نو, اوس, بیا, کله, یوازې, چېرې, دومره, لا, همدا
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[case]–> ADP (572; 98%),
NOUN –[amod]–> ADJ (331; 100%),
NOUN –[det]–> DET (159; 85%),
PROPN –[case]–> ADP (120; 99%),
NOUN –[conj]–> NOUN (103; 97%),
NOUN –[nummod]–> NUM (51; 96%),
PRON –[case]–> ADP (41; 53%),
ADJ –[conj]–> ADJ (36; 100%),
VERB –[nsubj:pass]–> NOUN (34; 100%),
ADJ –[nsubj]–> NOUN (30; 100%).
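The agreement figures above could be approximated as in the following sketch. It assumes that a relation is counted only when both the parent and the child carry a Case value, and that the percentage is the share of such pairs whose values match; both points are assumptions, as are the hypothetical sentence and the helper name `case_agreement`.

```python
from collections import Counter

# Hypothetical sentence: token id -> (UPOS, head id, deprel, Case or None).
SENTENCE = {
    1: ("ADP", 2, "case", "Loc"),
    2: ("NOUN", 4, "obl", "Loc"),
    3: ("ADJ", 2, "amod", "Nom"),
    4: ("VERB", 0, "root", None),
}

def case_agreement(sentence):
    """Return {relation: (agreeing pairs, percent of pairs where both have Case)}."""
    agree, both = Counter(), Counter()
    for upos, head, deprel, case in sentence.values():
        if head == 0 or case is None:
            continue  # the root has no parent; a child without Case cannot agree
        parent_upos, _, _, parent_case = sentence[head]
        if parent_case is None:
            continue  # assumed: pairs where the parent lacks Case are not counted
        key = f"{parent_upos} -[{deprel}]-> {upos}"
        both[key] += 1
        if case == parent_case:
            agree[key] += 1
    return {key: (agree[key], round(100 * agree[key] / both[key])) for key in both}

stats = case_agreement(SENTENCE)
for relation, (n, pct) in stats.items():
    print(f"{relation} ({n}; {pct}%)")
```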