Treebank Statistics: UD_Sindhi-Isra: Features: Case
This feature is universal.
It occurs with 5 different values: Abl, Acc, Gen, Nom, Voc.
48806 tokens (51%) have a non-empty value of Case.
7456 types (76%) occur at least once with a non-empty value of Case.
4080 lemmas (80%) occur at least once with a non-empty value of Case.
The feature is used with 9 part-of-speech tags: NOUN (26071; 27% of instances), ADJ (6035; 6% of instances), ADP (5571; 6% of instances), DET (4376; 5% of instances), PROPN (4289; 5% of instances), PRON (2319; 2% of instances), NUM (66; 0% of instances), ADV (61; 0% of instances), VERB (18; 0% of instances).
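The counts above can be approximated directly from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `iter_tokens`, `feature_coverage`) and the toy sample are invented for illustration, and it assumes that only regular tokens are counted, skipping multiword ranges and empty nodes.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|") if "=" in pair)


def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats) for every regular token line."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or comment line
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # assumption: skip ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def feature_coverage(conllu_text, feature="Case"):
    """Count tokens, types and lemmas that carry a non-empty value of `feature`."""
    tokens = tokens_with = 0
    types, types_with = set(), set()
    lemmas, lemmas_with = set(), set()
    per_upos = Counter()
    for form, lemma, upos, feats in iter_tokens(conllu_text):
        tokens += 1
        types.add(form)
        lemmas.add(lemma)
        if feature in feats:
            tokens_with += 1
            types_with.add(form)
            lemmas_with.add(lemma)
            per_upos[upos] += 1
    return {
        "tokens": tokens,
        "tokens_with": tokens_with,
        "types": len(types),
        "types_with": len(types_with),
        "lemmas": len(lemmas),
        "lemmas_with": len(lemmas_with),
        "per_upos": per_upos,
    }


def _row(idx, form, lemma, upos, feats, head, deprel):
    """Build one tab-separated CoNLL-U line (XPOS, DEPS and MISC left empty)."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, str(head), deprel, "_", "_"])


# Toy sample with placeholder forms; it is not taken from UD_Sindhi-Isra.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    _row(1, "w1", "l1", "NOUN", "Case=Nom|Number=Sing", 3, "nsubj"),
    _row(2, "w2", "l2", "ADP", "_", 1, "case"),
    _row(3, "w3", "l3", "VERB", "_", 0, "root"),
    "",
    "# sent_id = toy-2",
    "\t".join(["1-2", "w45"] + ["_"] * 8),
    _row(1, "w4", "l4", "ADJ", "Case=Acc|Degree=Pos", 2, "amod"),
    _row(2, "w5", "l5", "NOUN", "Case=Acc", 0, "root"),
    "",
])

if __name__ == "__main__":
    stats = feature_coverage(SAMPLE)
    share = 100 * stats["tokens_with"] / stats["tokens"]
    print(f"{stats['tokens_with']} tokens ({share:.0f}%) have a non-empty value of Case.")
    for upos, count in stats["per_upos"].most_common():
        print(upos, count)
```

To run it on the real treebank, read a `.conllu` file into a string and pass it to `feature_coverage`. Whether types and lemmas are compared case-sensitively is a further assumption of this sketch.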
NOUN
26071 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Number=Sing (21482; 82%), Gender=Masc (16139; 62%).
NOUN tokens may have the following values of Case:
Abl (9; 0% of non-empty Case): اندران, هٿان, گهران, مائيءَ, نقصانن, هنڌان, گھران
Acc (9553; 37% of non-empty Case): ملڪ, ماڻهن, دنيا, شينهن, گهر, گدڙ, بئراج, ڏينهن, حڪومت, زمين
Nom (16491; 63% of non-empty Case): ڳالهه, ماڻهو, وقت, شينهن, گدڙ, وزير, پاڻي, ڏينهن, ڪم, حڪومت
Voc (18; 0% of non-empty Case): سائين, ڀائو, ڌيءَ, بابا, بادشاهه, سلامت, مائي, مالڪ, ميان, پلونگڙا
EMPTY (8): بيان, مشڪل, ٻڌڻي, ٽين, پورا, پيا, پڇ, ڏکڻ
ADJ
6035 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Degree=Pos (4282; 71%), Gender=EMPTY (3603; 60%), Number=Sing (3275; 54%).
ADJ tokens may have the following values of Case:
Acc (817; 14% of non-empty Case): هڪڙي, ٻين, ٻئي, وڏي, سڀني, ٻنهي, ڪيترن, نئين, پراڻي, ننڍي
Nom (5217; 86% of non-empty Case): وڌيڪ, ڪجهه, مختلف, سڀ, صرف, هڪڙو, موجود, خوش, سڄي, ڏاڍي
Voc (1; 0% of non-empty Case): چنڊا
EMPTY (83): مالياتي, داخل, واقع, آئرش, بلڪل, تمام, تکو, شهيد, صرف, ناردرن
Case seems to be a lexical feature of ADJ. 93% of lemmas (933) occur with only one value of Case.
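The "lexical feature" observation can be checked with a small per-lemma tally. This is a sketch under stated assumptions: the helper name `single_value_share` is hypothetical, the input is a list of `(lemma, upos, case)` triples that a reader would extract from the treebank, and it is assumed that every lemma with at least one non-empty Case value enters the denominator. The actual threshold used by the statistics generator is not stated on this page.

```python
from collections import defaultdict


def single_value_share(observations, upos):
    """Return (single, total, share) for lemmas of `upos` that show only one Case value.

    `observations` is an iterable of (lemma, upos, case) triples, where `case`
    is None when the token has no Case feature.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, case in observations:
        if tag == upos and case is not None:
            values_by_lemma[lemma].add(case)
    total = len(values_by_lemma)
    if total == 0:
        return 0, 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, total, single / total


# Toy observations with placeholder lemmas; not real treebank data.
TOY = [
    ("l1", "ADJ", "Nom"),
    ("l1", "ADJ", "Nom"),
    ("l2", "ADJ", "Nom"),
    ("l2", "ADJ", "Acc"),   # l2 alternates, so Case is inflectional for it
    ("l3", "ADJ", None),    # no Case value: ignored
    ("l4", "NOUN", "Acc"),  # different part of speech: ignored
]

if __name__ == "__main__":
    single, total, share = single_value_share(TOY, "ADJ")
    print(f"{share:.0%} of lemmas ({single} of {total}) occur with only one value of Case.")
```

A high share suggests that Case behaves like a fixed property of the lemma rather than an inflectional category for that part of speech, although rare lemmas that are seen only once inflate the figure.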
ADP
5571 ADP tokens (39% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: Number=Sing (4913; 88%), Gender=Masc (3415; 61%).
ADP tokens may have the following values of Case:
Acc (2759; 50% of non-empty Case): جي, واري, کيس, وارن, کي, کين, سان, کان, وارين, کانئس
Nom (2812; 50% of non-empty Case): جو, جي, جا, جون, وارو, وارا, واري, واريون, دا, کي
EMPTY (8734): ۾, کي, تي, سان, کان, لاءِ, مان, تائين, پوءِ, وٽ
| Paradigm جي | Nom | Acc |
|---|---|---|
| _ | جي | جي |
| Gender=Masc | جي | |
| Gender=Masc\|Number=Sing | جي | جي |
| Gender=Masc\|Number=Plur | جي | جي |
| Gender=Fem\|Number=Sing | جي | جي |
| Gender=Fem\|Number=Plur | جي | |
| Number=Sing | جي | جي |
DET
4376 DET tokens (99% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: PronType=Dem (4376; 100%), Person=EMPTY (3655; 84%), Number=Sing (3524; 81%), Gender=EMPTY (3221; 74%).
DET tokens may have the following values of Case:
Acc (2755; 63% of non-empty Case): هن, ان, اسان, جنهن, انهن, ڪنهن, انهيءَ, هنن, ھن, انهي
Gen (2; 0% of non-empty Case): پنهنجي
Nom (1619; 37% of non-empty Case): اهو, هو, اها, اهي, هر, ڪو, جيڪو, ڪا, هي, جيڪا
EMPTY (23): هر, جو, هو, اهڙو, ڪنهن, ڪهڙي, ان, اها, اهڙيءَ, نه
| Paradigm ان | Nom | Acc |
|---|---|---|
| Gender=Masc\|Number=Sing\|Person=3 | ان | |
| Gender=Masc\|Number=Sing | ان | |
| Gender=Masc\|Number=Plur | انهن | |
| Gender=Fem\|Number=Sing | ان | |
| Number=Sing\|Person=1 | انهيءَ | |
| Number=Sing\|Person=3 | ان, انهي, انهيءَ | |
| Number=Sing | ان | ان, انهيءَ, انهي, انهيءِ, انھي |
| Number=Plur\|Person=3 | انهن | |
| Number=Plur | انهن, انھن | |
| ان |
PROPN
4289 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Number=EMPTY (3814; 89%), Gender=Masc (3647; 85%).
PROPN tokens may have the following values of Case:
Abl (1; 0% of non-empty Case): احمد
Acc (320; 7% of non-empty Case): پاڪستان, ڪابيرو, سنڌ, ماليڪٽ, ڪراچي, پارٽي, شريف, پ, آباد, کام
Nom (3967; 92% of non-empty Case): سنڌ, علي, پاڪستان, ڪراچي, محمد, پ, آمريڪا, احمد, اسلام, نواز
Voc (1; 0% of non-empty Case): ماليڪٽ
EMPTY (1): ملتان
| Paradigm ماليڪٽ | Nom | Acc | Voc |
|---|---|---|---|
| Gender=Masc | ماليڪٽ | ماليڪٽ | ماليڪٽ |
| Gender=Fem | ماليڪٽ | ماليڪٽ |
PRON
2319 PRON tokens (91% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Number=Sing (1595; 69%), Gender=EMPTY (1309; 56%), Person=1 (1256; 54%).
PRON tokens may have the following values of Case:
Acc (720; 31% of non-empty Case): مون, پاڻ, اوهان, جن, توهان, انهن, تنهن, ڇا, تو, سندن
Gen (857; 37% of non-empty Case): پنهنجي, منهنجي, سندس, پنهنجو, تنهنجي, سندن, پنهنجا, پنهنجن, منهنجو, پنهنجيءَ
Nom (742; 32% of non-empty Case): تون, آءٌ, جيڪي, سو, توکي, مان, ڪير, آئون, اسين, جهڙو
EMPTY (226): ڇا, ائين, ڇو, ڪيئن, پاڻ, انهيءُ, اوهين, اُهي, تنهن, جهڙو
| Paradigm اسين | Nom | Acc | Gen |
|---|---|---|---|
| Gender=Masc\|Number=Sing | اسانجي, اسانکي | | |
| Gender=Fem\|Number=Sing | اسانجي | | |
| Number=Sing | اسانجي | | |
| Number=Plur | اسين | اسين |
NUM
66 NUM tokens (5% of all NUM tokens) have a non-empty value of Case.
NUM tokens may have the following values of Case:
Acc (36; 55% of non-empty Case): ٻن, هزار, لک, هزارن, لکن, ٽن
Nom (30; 45% of non-empty Case): هزار, هزارين, لک, ارب, ملين, سؤ, چار
EMPTY (1284): هڪ, 2, 3, 5, 4, ٽن, ٻه, 10, 2009ع, 20
ADV
61 ADV tokens (2% of all ADV tokens) have a non-empty value of Case.
ADV tokens may have the following values of Case:
Acc (31; 51% of non-empty Case): جڏهن, تيئن, جيئن, اڳتي, آهستي, ايئن, اڄ, ايتريتائين, اڄڪلھ, بس
Nom (30; 49% of non-empty Case): اڄ, جڏهن, اڳتي, جلد, ڪالهه, ڪڏهن, ھتي, اصل, بلڪل, جلدي
EMPTY (3057): جڏهن, وري, اتي, هاڻي, پوءِ, اڄ, جاري, جيئن, ڏانهن, تمام
| Paradigm جڏهن | Nom | Acc |
|---|---|---|
| _ | جڏهن | |
| Gender=Masc\|Number=Sing | جڏهن | جڏهن |
| Gender=Fem\|Number=Sing | جڏهن | |
| Number=Sing | جڏهن |
VERB
18 VERB tokens (0% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: VerbForm=EMPTY (18; 100%), Gender=EMPTY (17; 94%), Aspect=Perf (10; 56%), Voice=Act (10; 56%).
VERB tokens may have the following values of Case:
Acc (15; 83% of non-empty Case): چيس, چين, اچيـَو, لڳين, ويندس, ٻڌايس, پڇيس, پڇين, ٿيم, ڏينم
Nom (3; 17% of non-empty Case): ادا, بيٺس, ڏٺم
EMPTY (13075): ڪري, چيو, ڪرڻ, ويو, ڪيو, وڃي, ڪئي, اچي, ويا, پيو
Case seems to be a lexical feature of VERB. 100% of lemmas (16) occur with only one value of Case.
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[amod]–> ADJ (2538; 64%),
NOUN –[det]–> DET (1365; 83%),
NOUN –[conj]–> NOUN (749; 64%),
NOUN –[nmod]–> PROPN (527; 53%),
PROPN –[nmod]–> NOUN (486; 89%),
NOUN –[compound]–> PROPN (444; 65%),
PROPN –[flat]–> PROPN (394; 84%),
NOUN –[nsubj]–> NOUN (313; 78%),
PROPN –[compound]–> PROPN (296; 93%),
ADJ –[nsubj]–> NOUN (256; 96%).
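
A tally like the one above can be sketched by walking each dependency edge and comparing the Case values of parent and child. The code below is illustrative only: `agreement_counts` is a hypothetical helper, tokens are represented as plain dictionaries, and it assumes that the percentage is the share of all edges of a given parent/relation/child pattern in which both nodes carry the same Case value. The denominator actually used by the statistics generator is not documented on this page.

```python
from collections import Counter


def agreement_counts(sentences, feature="Case"):
    """Count, per (parent UPOS, deprel, child UPOS), edges that agree in `feature`.

    Each sentence is a list of token dicts with the keys
    'id', 'upos', 'head', 'deprel' and 'feats' (a dict of feature values).
    Returns two Counters: agreeing edges and all edges seen.
    """
    agree = Counter()
    seen = Counter()
    for sentence in sentences:
        by_id = {token["id"]: token for token in sentence}
        for child in sentence:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # root token (head 0) or dangling head
            key = (parent["upos"], child["deprel"], child["upos"])
            seen[key] += 1
            parent_value = parent["feats"].get(feature)
            child_value = child["feats"].get(feature)
            if parent_value is not None and parent_value == child_value:
                agree[key] += 1
    return agree, seen


# Toy sentence with invented annotation; not taken from UD_Sindhi-Isra.
TOY_SENTENCES = [[
    {"id": 1, "upos": "ADJ", "head": 2, "deprel": "amod", "feats": {"Case": "Nom"}},
    {"id": 2, "upos": "NOUN", "head": 3, "deprel": "nsubj", "feats": {"Case": "Nom"}},
    {"id": 3, "upos": "VERB", "head": 0, "deprel": "root", "feats": {}},
    {"id": 4, "upos": "ADJ", "head": 5, "deprel": "amod", "feats": {"Case": "Nom"}},
    {"id": 5, "upos": "NOUN", "head": 3, "deprel": "obj", "feats": {"Case": "Acc"}},
]]

if __name__ == "__main__":
    agree, seen = agreement_counts(TOY_SENTENCES)
    for (parent, deprel, child), count in agree.most_common(10):
        share = 100 * count / seen[(parent, deprel, child)]
        print(f"{parent} -[{deprel}]-> {child} ({count}; {share:.0f}%)")
```

Restricting the denominator to edges where both nodes have a non-empty Case value would give higher percentages, so the choice should be stated whenever such figures are compared across treebanks.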