Treebank Statistics: UD_Pashto-Sikaram: Features: Case
This feature is universal.
It occurs with 5 different values: Abl
, Acc
, Loc
, Nom
, Voc
.
597 tokens (60%) have a non-empty value of Case
.
326 types (74%) occur at least once with a non-empty value of Case
.
286 lemmas (78%) occur at least once with a non-empty value of Case
.
The feature is used with 9 part-of-speech tags: NOUN (232; 23% instances), ADP (146; 15% instances), ADJ (101; 10% instances), DET (41; 4% instances), PROPN (28; 3% instances), VERB (25; 3% instances), PRON (15; 2% instances), NUM (6; 1% instances), AUX (3; 0% instances).
NOUN
232 NOUN tokens (100% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Number=Sing (177; 76%), Gender=Masc (131; 56%).
NOUN
tokens may have the following values of Case
:
Abl
(10; 4% of non-emptyCase
): مخې, امله, خوا, دمه, لاسه, مرغه, پولو, پیله, ژباړېAcc
(72; 31% of non-emptyCase
): ژبې, ژباړې, اترو, انسانانو, خبرو, خلکو, سیمې, مینې, نړۍ, پېړۍLoc
(33; 14% of non-emptyCase
): ژبه, وخت, کچه, ادب, انځورونه, بڼه, توګه, دېوالونو, شرق, شمېرNom
(117; 50% of non-emptyCase
): ژبه, اثر, ارزښت, دود, برخې, خبرې, زر, شمېر, لامل, لیک
Paradigm ژبه | Nom | Acc | Loc |
---|---|---|---|
Number=Sing | ژبه | ژبې | ژبه |
Number=Plur | ژبې | ژبو |
ADP
146 ADP tokens (100% of all ADP
tokens) have a non-empty value of Case
.
ADP
tokens may have the following values of Case
:
Abl
(15; 10% of non-emptyCase
): له, تر, پرتهAcc
(74; 51% of non-emptyCase
): د, ته, له, لپاره, څخه, سره, ترمنځLoc
(57; 39% of non-emptyCase
): په, کې, پۀ, پر
Paradigm له | Acc | Abl |
---|---|---|
له | له |
Case
seems to be lexical feature of ADP
. 92% lemmas (11) occur only with one value of Case
.
ADJ
101 ADJ tokens (100% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Number=Sing (75; 74%), Gender=Masc (65; 64%).
ADJ
tokens may have the following values of Case
:
Abl
(2; 2% of non-emptyCase
): بده, نړيوالوAcc
(19; 19% of non-emptyCase
): ايرانۍ, بېسارو, بېل, زياتو, سترې, شلمې, فرهنګیانو, لرغونو, لیکنۍ, مرستیالېLoc
(13; 13% of non-emptyCase
): ايراني, بنګالۍ, خلیجي, فرهنګي, نورو, نړيواله, وال, وروستیو, ټولنیز, پاړسيNom
(67; 66% of non-emptyCase
): هنري, ادبي, جوړ, راټول, شهکار, لږ, نور, ژوندۍ, ښه, ګڼ
Paradigm نړيوال | Nom | Acc | Loc | Abl |
---|---|---|---|---|
Gender=Masc|Number=Plur | نړيوالو | |||
Gender=Fem|Number=Sing | نړيواله | نړيواله | ||
Gender=Fem|Number=Plur | نړيوالو |
DET
41 DET tokens (80% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: Poss=EMPTY (30; 73%), Reflex=EMPTY (30; 73%), Deixis=EMPTY (25; 61%), Number=EMPTY (21; 51%).
DET
tokens may have the following values of Case
:
Abl
(1; 2% of non-emptyCase
): دېAcc
(11; 27% of non-emptyCase
): هرې, خپل, خپلو, همدغو, ځینو, کومLoc
(5; 12% of non-emptyCase
): هماغه, خپله, هره, ټولهNom
(24; 59% of non-emptyCase
): خپل, هغه, دغه, هر, دا, هره, هماغه, همدغه, ټولې, ځینېEMPTY
(10): داسې, ستا, څو, دې, زما, هماغسې, څۀ
Paradigm خپل | Nom | Acc | Loc |
---|---|---|---|
Gender=Masc|Number=Sing | خپل | خپل | |
Gender=Masc|Number=Plur | خپل | خپلو | |
Gender=Fem|Number=Sing | خپله |
PROPN
28 PROPN tokens (100% of all PROPN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PROPN
and Case
co-occurred: Number=Sing (27; 96%), Gender=Masc (17; 61%).
PROPN
tokens may have the following values of Case
:
Acc
(13; 46% of non-emptyCase
): پیتر, اردو, مریم, ايرانیانو, جانې, فرانسې, نوبل, ټیګورLoc
(4; 14% of non-emptyCase
): اردو, امريکا, انګرېزۍ, پاریسNom
(10; 36% of non-emptyCase
): افغان, ایګوازو, براون, حبیبي, سمیس, طلوع, مریم, ټیګور, پیتر, ګیتانجليVoc
(1; 4% of non-emptyCase
): سامه
Paradigm پیتر | Nom | Acc |
---|---|---|
پیتر | پیتر |
VERB
25 VERB tokens (28% of all VERB
tokens) have a non-empty value of Case
.
The most frequent other feature values with which VERB
and Case
co-occurred: Mood=EMPTY (25; 100%), Person=EMPTY (25; 100%), VerbForm=Part (15; 60%), Aspect=Imp (14; 56%), Tense=Past (14; 56%).
VERB
tokens may have the following values of Case
:
Acc
(3; 12% of non-emptyCase
): روزونکې, څښلو, څکولوNom
(22; 88% of non-emptyCase
): شوې, ګڼل, راتلی, رسولې, رسېدلی, شوى, شوي, لوستى, لیکل, نیولیEMPTY
(65): لري, کړه, شو, وي, کوي, کړل, کښل, اواروي, برېښي, خپروي
Case
seems to be lexical feature of VERB
. 100% lemmas (18) occur only with one value of Case
.
PRON
15 PRON tokens (26% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Poss=EMPTY (15; 100%), Variant=EMPTY (13; 87%), PronType=Prs (9; 60%), Gender=EMPTY (8; 53%), Number=Sing (8; 53%).
PRON
tokens may have the following values of Case
:
Abl
(2; 13% of non-emptyCase
): دې, ټولوAcc
(9; 60% of non-emptyCase
): هغۀ, ما, هغې, هغوى, چاLoc
(1; 7% of non-emptyCase
): دویNom
(3; 20% of non-emptyCase
): دا, همدغه, هیڅوکEMPTY
(42): يې, چې, ور, څه, یې, ځان, یوبل
Case
seems to be lexical feature of PRON
. 100% lemmas (10) occur only with one value of Case
.
NUM
6 NUM tokens (100% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumType=Card (6; 100%), Gender=Fem (4; 67%).
NUM
tokens may have the following values of Case
:
Acc
(4; 67% of non-emptyCase
): يوې, يوهNom
(2; 33% of non-emptyCase
): يوه, یو
Paradigm یو | Nom | Acc |
---|---|---|
Gender=Masc | یو | يوه |
Gender=Fem | يوه | يوې |
AUX
3 AUX tokens (7% of all AUX
tokens) have a non-empty value of Case
.
The most frequent other feature values with which AUX
and Case
co-occurred: Gender=Masc (3; 100%), Mood=EMPTY (3; 100%), Number=Sing (3; 100%), Person=EMPTY (3; 100%), Tense=Past (3; 100%), VerbForm=Part (3; 100%), Aspect=Perf (2; 67%).
AUX
tokens may have the following values of Case
:
Nom
(3; 100% of non-emptyCase
): شوى, کولیEMPTY
(43): به, وي, ده, دى, دی, کېږي, شوای, شې, وه, شو
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[case]–> ADP (111; 100%),
NOUN –[amod]–> ADJ (57; 100%),
NOUN –[det]–> DET (40; 82%),
ADJ –[conj]–> ADJ (14; 100%),
NOUN –[conj]–> NOUN (13; 93%),
PROPN –[case]–> ADP (12; 100%),
NOUN –[nsubj]–> NOUN (6; 75%),
ADJ –[nsubj]–> NOUN (5; 100%),
NOUN –[fixed]–> NOUN (5; 100%),
NOUN –[nummod]–> NUM (5; 100%).