Treebank Statistics: UD_Pashto-Sikaram: Features: Case
This feature is universal.
It occurs with 6 different values: Abl, Acc, Gen, Loc, Nom, Voc.
2381 tokens (59%) have a non-empty value of Case.
860 types (80%) occur at least once with a non-empty value of Case.
693 lemmas (83%) occur at least once with a non-empty value of Case.
The feature is used with 10 part-of-speech tags: NOUN (870; 21% instances), ADP (588; 14% instances), ADJ (434; 11% instances), PROPN (147; 4% instances), DET (131; 3% instances), VERB (95; 2% instances), PRON (58; 1% instances), NUM (43; 1% instances), AUX (14; 0% instances), ADV (1; 0% instances).
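Per-tag counts like those above can be derived from the FEATS column of a CoNLL-U file. The following is a minimal sketch, not the script that produced this page: the function names `parse_feats` and `case_counts` and the toy sentence (placeholder forms `w1`..`w3`) are illustrative assumptions, and only the standard ten-column CoNLL-U layout is relied on.

```python
from collections import Counter

# Hypothetical toy sentence with placeholder forms; NOT taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\tw1\tl1\tNOUN\t_\tCase=Nom|Number=Sing\t3\tnsubj\t_\t_",
    "2\tw2\tl2\tADJ\t_\tCase=Nom\t1\tamod\t_\t_",
    "3\tw3\tl3\tVERB\t_\t_\t0\troot\t_\t_",
    "",
])

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def case_counts(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens that carry a Case value."""
    total = Counter()
    with_case = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword token ranges and empty nodes
        upos = cols[3]
        total[upos] += 1
        if "Case" in parse_feats(cols[5]):
            with_case[upos] += 1
    return total, with_case

total, with_case = case_counts(SAMPLE)
for upos in sorted(total):
    share = 100 * with_case[upos] / total[upos]
    print(f"{upos}: {with_case[upos]} of {total[upos]} tokens ({share:.0f}%) have Case")
```

Whether the page counts syntactic words or surface tokens is not stated here; the sketch assumes syntactic words and therefore skips multiword token ranges.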
NOUN
870 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (614; 71%), Gender=Masc (490; 56%).
NOUN tokens may have the following values of Case:
Abl (30; 3% of non-empty Case): خوا, مخې, کبله, اړخه, اصولو, امله, اړخونو, خلکو, دمه, دوده
Acc (240; 28% of non-empty Case): ژباړې, ژبې, خلکو, ژبو, کتابونو, ساري, هېوادونو, ارزښتونو, خبرو, دودونو
Loc (123; 14% of non-empty Case): ژبه, توګه, برخه, وخت, ټکي, بڼه, جمله, سیمه, ډګر, ژبو
Nom (477; 55% of non-empty Case): ژبه, ژباړه, کتابونه, خلک, ډول, کسان, خبرې, ستونزه, مانا, اړتیا
| Paradigm ژباړه | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| | ژباړه | ژباړې | ژباړه | ژباړې |
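The NOUN percentages reported above can be checked directly from the listed counts. The short sketch below uses only the four counts given for NOUN; the variable names are illustrative, and rounding to the nearest whole percent is an assumption about how the page was generated.

```python
# Counts of Case values on NOUN tokens, copied from the list above.
noun_case_counts = {"Abl": 30, "Acc": 240, "Loc": 123, "Nom": 477}

# The values should add up to the number of NOUN tokens with a non-empty Case.
noun_total = sum(noun_case_counts.values())

# Share of each value, rounded to a whole percent (assumed rounding rule).
noun_shares = {value: round(100 * n / noun_total)
               for value, n in noun_case_counts.items()}

print(noun_total)   # 870
print(noun_shares)  # {'Abl': 3, 'Acc': 28, 'Loc': 14, 'Nom': 55}
```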
ADP
588 ADP tokens (100% of all ADP tokens) have a non-empty value of Case.
ADP tokens may have the following values of Case:
Abl (40; 7% of non-empty Case): له, تر, پرته, پورې
Acc (289; 49% of non-empty Case): د, ته, له, سره, لپاره, څخه, تر, لاندې, ترڅنګ, پسې
Loc (259; 44% of non-empty Case): په, کې, پر, باندې, پۀ, پورې
| Paradigm له | Acc | Abl |
|---|---|---|
| | له | له |
ADJ
434 ADJ tokens (100% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (270; 62%), Gender=Masc (262; 60%).
ADJ tokens may have the following values of Case:
Abl (4; 1% of non-empty Case): بده, بلې, لږه, نړیوالو
Acc (88; 20% of non-empty Case): نورو, ټولنیزو, پوهنیزو, اغېزمن, متاثر, اسلامي, بل, کلتوري, ادبي, اغېزناک
Loc (41; 9% of non-empty Case): نورو, ټولنیز, وروستیو, لره, اخرو, اسلامي, اوښتې, ايراني, ايرانۍ, اړوند
Nom (301; 69% of non-empty Case): زيات, ښه, زده, سمه, جوړ, ناسمه, نور, اصلي, اړ, اړوند
EMPTY (1): خپور
| Paradigm بل | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Masc | بل | بل | | |
| Gender=Fem | بله | بله | بلې |
PROPN
147 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=Sing (131; 89%), Gender=Fem (79; 54%).
PROPN tokens may have the following values of Case:
Abl (1; 1% of non-empty Case): پېښوره
Acc (44; 30% of non-empty Case): پښتو, پښتنو, پیتر, اردو, ايران, مریم, پاړسي, اسامه, افغان, امريکا
Loc (40; 27% of non-empty Case): پښتو, اردو, پاړسي, انګرېزۍ, ږوب, افغانستان, امريکا, جرمني, لورلايي, هند
Nom (61; 41% of non-empty Case): پښتو, پښتانه, احمد, وحید, عربي, پنج, ايرانیان, پاړسي, کتاب, اردو
Voc (1; 1% of non-empty Case): سامه
| Paradigm پښتو | Nom | Acc | Loc |
|---|---|---|---|
| | پښتو | پښتو | پښتو |
DET
131 DET tokens (83% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: Poss=EMPTY (109; 83%), Reflex=EMPTY (109; 83%), Variant=EMPTY (95; 73%), PronType=Dem (68; 52%), Number=EMPTY (66; 50%).
DET tokens may have the following values of Case:
Abl (5; 4% of non-empty Case): دې, هغه, همدې
Acc (19; 15% of non-empty Case): خپل, هرې, خپلو, ځینو, دغو, هماغه, همدغو, ټولې, کوم, کومې
Loc (29; 22% of non-empty Case): دې, خپل, خپله, ټوله, هره, هماغه, هغه, ځینو
Nom (78; 60% of non-empty Case): دغه, هغه, خپل, هر, ځینې, همدغه, ټول, کوم, کومه, خپله
EMPTY (27): داسې, څو, دغسې, چې, یوشمېر, هماغسې, هېڅ, څه, څۀ
| Paradigm خپل | Nom | Acc | Loc |
|---|---|---|---|
| Gender=Masc\|Number=Sing | خپل | خپل | خپل |
| Gender=Masc\|Number=Plur | خپل | خپلو | |
| Gender=Fem\|Number=Sing | خپله | | خپله |
VERB
95 VERB tokens (29% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Mood=EMPTY (93; 98%), Person=EMPTY (93; 98%), Gender=EMPTY (57; 60%), Number=EMPTY (57; 60%), Tense=EMPTY (57; 60%), VerbForm=Inf (57; 60%).
VERB tokens may have the following values of Case:
Acc (8; 8% of non-empty Case): شویو, کولو, ځلولو, څښلو, څکولو, ړنګېدو
Nom (87; 92% of non-empty Case): ژباړل, کارول, کړی, شوي, شوی, کړې, لیکل, ګڼل, رااخیستل, راژباړل
EMPTY (232): کوي, لري, کړي, شي, کېږي, کولای, ورکوي, شته, وايي, راځي
| Paradigm کول | Nom | Acc |
|---|---|---|
| Aspect=Imp\|VerbForm=Inf | کول | کولو |
| Gender=Masc\|Number=Sing\|Tense=Past\|VerbForm=Part | کړی | |
| Gender=Masc\|Number=Plur\|Tense=Past\|VerbForm=Part | کړي | |
| Gender=Fem\|Number=Sing\|Tense=Past\|VerbForm=Part | کړې | |
Case seems to be a lexical feature of VERB: 94% of lemmas (34) occur with only one value of Case.
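One way to arrive at a figure like this is to group the observed Case values by lemma and count the lemmas that show exactly one value. The sketch below is an illustration under assumptions: the exact criterion used to generate this page is not stated, tokens without Case are ignored here, and the function name `single_value_lemmas` and the placeholder lemmas `l1`..`l4` are hypothetical.

```python
from collections import defaultdict

def single_value_lemmas(observations):
    """Return (lemmas with exactly one Case value, lemmas with any Case value).

    `observations` is an iterable of (lemma, case) pairs, where `case` is
    None for tokens that carry no Case feature (these are ignored).
    """
    values_by_lemma = defaultdict(set)
    for lemma, case in observations:
        if case is not None:
            values_by_lemma[lemma].add(case)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, len(values_by_lemma)

# Hypothetical toy observations with placeholder lemmas; not treebank data.
toy = [
    ("l1", "Nom"), ("l1", "Nom"),   # one value only
    ("l2", "Nom"), ("l2", "Acc"),   # two values
    ("l3", None),                   # never has Case, so not counted at all
    ("l4", "Acc"),                  # one value only
]

single, with_case = single_value_lemmas(toy)
print(f"{single} of {with_case} lemmas occur with only one value of Case")
```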
PRON
58 PRON tokens (30% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Poss=EMPTY (58; 100%), Variant=EMPTY (57; 98%), Person=EMPTY (33; 57%).
PRON tokens may have the following values of Case:
Abl (2; 3% of non-empty Case): دې, ټولو
Acc (19; 33% of non-empty Case): دې, هغوی, هغۀ, چا, ما, هغې, دوی
Gen (6; 10% of non-empty Case): زموږ, ستا, زما
Loc (6; 10% of non-empty Case): دې, دوی, هغوی, هغې
Nom (25; 43% of non-empty Case): دا, دوی, همدا, څوک, هغه, موږ, هرڅوک, همدغه, هیڅوک, ځان
EMPTY (138): يې, چې, ور, یې, ځان, څه, هرڅه, داسې, یوبل
| Paradigm دا | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Fem\|Number=Sing | دا | دې | دې | |
| | دا | دې | دې | دې |
NUM
43 NUM tokens (93% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=Card (43; 100%).
NUM tokens may have the following values of Case:
Abl (1; 2% of non-empty Case): یوې
Acc (11; 26% of non-empty Case): یوه, یوې, دوو
Loc (3; 7% of non-empty Case): یوه, دوو
Nom (28; 65% of non-empty Case): یو, یوه, دوه, دوې
EMPTY (3): 1, 30, 40
| Paradigm یو | Nom | Acc | Loc | Abl |
|---|---|---|---|---|
| Gender=Masc | یو | یوه | | |
| Gender=Fem | یوه | یوې | یوه | یوې |
AUX
14 AUX tokens (6% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Aspect=EMPTY (14; 100%), Mood=EMPTY (14; 100%), Person=EMPTY (14; 100%), Tense=Past (14; 100%), VerbForm=Part (14; 100%), Gender=Masc (13; 93%), Number=Sing (8; 57%).
AUX tokens may have the following values of Case:
Nom (14; 100% of non-empty Case): شوی, شوي, شوې
EMPTY (202): ده, شي, به, وي, کېږي, دي, دی, دﺉ, شو, و
ADV
1 ADV token (1% of all ADV tokens) has a non-empty value of Case.
ADV tokens may have the following values of Case:
Abl (1; 100% of non-empty Case): اوسه
EMPTY (174): هم, نو, اوس, کله, چېرې, بیا, یوازې, دومره, لا, وروسته
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[case]–> ADP (404; 99%),
NOUN –[amod]–> ADJ (254; 100%),
NOUN –[det]–> DET (122; 81%),
PROPN –[case]–> ADP (85; 99%),
NOUN –[conj]–> NOUN (79; 96%),
NOUN –[nummod]–> NUM (40; 98%),
ADJ –[conj]–> ADJ (26; 100%),
VERB –[nsubj:pass]–> NOUN (26; 100%),
ADJ –[nsubj]–> NOUN (25; 100%),
ADJ –[case]–> ADP (18; 95%).
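Agreement figures of this kind can be approximated by walking the dependency edges and comparing the Case value of each child with that of its parent. The sketch below is a rough illustration, not the procedure behind this page: it assumes the percentage is the share of agreeing pairs among pairs in which both nodes carry Case, and the function name `case_agreement`, the row layout, and the toy sentence are hypothetical.

```python
from collections import Counter

def case_agreement(rows):
    """Count Case agreement per (parent UPOS, relation, child UPOS).

    `rows` is one sentence as a list of (id, upos, feats, head, deprel),
    where `feats` is a dict and `head` is 0 for the root.
    Returns (agree, both): agreeing pairs and pairs where both nodes have Case.
    """
    by_id = {row[0]: row for row in rows}
    agree, both = Counter(), Counter()
    for _tid, upos, feats, head, deprel in rows:
        if head == 0:
            continue  # the root has no parent to agree with
        _pid, parent_upos, parent_feats, _phead, _pdeprel = by_id[head]
        child_case = feats.get("Case")
        parent_case = parent_feats.get("Case")
        if child_case is None or parent_case is None:
            continue  # only pairs where both nodes carry Case are compared
        key = (parent_upos, deprel, upos)
        both[key] += 1
        if child_case == parent_case:
            agree[key] += 1
    return agree, both

# Hypothetical toy sentence; the annotation is invented for illustration.
toy_sentence = [
    (1, "NOUN", {"Case": "Loc"}, 3, "obl"),
    (2, "ADP", {"Case": "Loc"}, 1, "case"),
    (3, "VERB", {}, 0, "root"),
    (4, "ADJ", {"Case": "Nom"}, 1, "amod"),
]

agree, both = case_agreement(toy_sentence)
for (parent, rel, child), n in both.items():
    share = 100 * agree[(parent, rel, child)] / n
    print(f"{parent} -[{rel}]-> {child} ({agree[(parent, rel, child)]}; {share:.0f}%)")
```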