Treebank Statistics: UD_Bhojpuri-BHTB: Features: Case
This feature is universal.
It occurs with 5 different values: Acc, Dat, Erg, Gen, Nom.
Some words have combined values of the feature; 3 combinations have been observed: Acc|Dat, Acc|Erg, Acc|Gen.
4043 tokens (61%) have a non-empty value of Case.
1361 types (81%) occur at least once with a non-empty value of Case.
1335 lemmas (82%) occur at least once with a non-empty value of Case.
The feature is used with 13 part-of-speech tags: NOUN (1689; 25% instances), ADP (570; 9% instances), PROPN (399; 6% instances), VERB (329; 5% instances), PRON (250; 4% instances), DET (229; 3% instances), ADJ (173; 3% instances), AUX (163; 2% instances), PART (104; 2% instances), NUM (96; 1% instances), CCONJ (30; 0% instances), ADV (7; 0% instances), INTJ (4; 0% instances).
NOUN
1689 NOUN tokens (91% of all NOUN tokens) have a non-empty value of Case.
The most frequent other feature values with which NOUN and Case co-occurred: Person=3 (1562; 92%), Number=Sing (1490; 88%), Gender=Masc (1215; 72%).
NOUN tokens may have the following values of Case:
Acc(686; 41% of non-emptyCase): आजु, लोगन, जी, बात, साहित्य, सभ, घंटा, परिवार, बिआह, कार्यक्रमNom(1003; 59% of non-emptyCase): लोग, जब, रंग, गवनई, बेर, शुरू, सन, काम, निबंध, काल्हुEMPTY(166): पहिले, बिआह, कथा, डा., दिसाईं, पहिलहीं, अलगे, आंचलिक, आरे, काहेंके
| Paradigm बिआह | Nom | Acc |
|---|---|---|
| Gender=Masc|Number=Sing | बिआह | बिआह |
| Gender=Masc|Number=Plur | बिआह | |
| Gender=Fem|Number=Sing | बिआह | बिआह |
ADP
570 ADP tokens (58% of all ADP tokens) have a non-empty value of Case.
The most frequent other feature values with which ADP and Case co-occurred: Gender=Masc (563; 99%), AdpType=Post (553; 97%), Number=Sing (473; 83%).
ADP tokens may have the following values of Case:
Acc(347; 61% of non-emptyCase): के, खड़े, जाके, पटे, खातिर, डाले, पढ़े, पहिलहीं, मोड, वालेNom(223; 39% of non-emptyCase): के, का, खातिर, वाला, ओके, लेके, उठाके, साथे, हमनीके, ओकराकेEMPTY(419): में, से, पर, के, ले, तबे, अतने, का, अपने, उहाँका
| Paradigm का | Nom | Acc |
|---|---|---|
| Gender=Masc|Number=Sing | का, के | के |
| Gender=Masc|Number=Sing|Person=3|Polite=Form | के | |
| Gender=Masc|Number=Plur | के | के |
| Gender=Fem|Number=Sing | के | |
| Number=Plur | के |
PROPN
399 PROPN tokens (95% of all PROPN tokens) have a non-empty value of Case.
The most frequent other feature values with which PROPN and Case co-occurred: Person=3 (392; 98%), Number=Sing (390; 98%), Gender=Masc (225; 56%).
PROPN tokens may have the following values of Case:
Acc(114; 29% of non-emptyCase): भोजपुरी, जी, द्विवेदी, खाली, चोपड़ा, दिल्ली, शाही, सिंह, पाती, प्रियंकाAcc,Dat(10; 3% of non-emptyCase): लेखको, ऑडियो, कादो, केनियो, परिचर्चो, प्रियंको, भाषणोNom(275; 69% of non-emptyCase): भोजपुरी, डॉ., पाती, प्रियंका, राय, सिंह, तिवारी, प्रसाद, आंजनेय, उदयEMPTY(22): पाती, डा॰, अंचल, टिप्पणिए, प्रो॰, 2012, 24, इटलिए, चोपड़ा, पूर्वांचल
| Paradigm भोजपुरी | Nom | Acc |
|---|---|---|
| Gender=Masc | भोजपुरी | |
| Gender=Fem | भोजपुरी | भोजपुरी |
Case seems to be lexical feature of PROPN. 92% lemmas (169) occur only with one value of Case.
VERB
329 VERB tokens (43% of all VERB tokens) have a non-empty value of Case.
The most frequent other feature values with which VERB and Case co-occurred: Aspect=EMPTY (329; 100%), Voice=EMPTY (329; 100%), VerbForm=EMPTY (322; 98%), Person=3 (293; 89%), Number=Sing (283; 86%), Gender=Masc (235; 71%).
VERB tokens may have the following values of Case:
Acc(96; 29% of non-emptyCase): क, होखे, अद्भुत, छोड़त, लिखे, अंतर्दृष्टि, अतने, अनगिनत, अबीरे, उघरतNom(233; 71% of non-emptyCase): भइल, बा, आइल, कहल, ह, कइल, दिहल, खुजली, चल, पड़लEMPTY(438): हो, बा, चाहीं, करे, होखे, होई, कर, होला, कहले, ना
| Paradigm कर | Nom | Acc |
|---|---|---|
| Gender=Masc | कइल | कइला, करे |
| Gender=Fem | कइल | कइला |
Case seems to be lexical feature of VERB. 94% lemmas (151) occur only with one value of Case.
PRON
250 PRON tokens (75% of all PRON tokens) have a non-empty value of Case.
The most frequent other feature values with which PRON and Case co-occurred: Aspect=EMPTY (250; 100%), VerbForm=EMPTY (250; 100%), Number=Sing (179; 72%), Person=3 (134; 54%), PronType=EMPTY (131; 52%).
PRON tokens may have the following values of Case:
Acc(15; 6% of non-emptyCase): हमनी, केहूँ, पत्नी, फलाना, रउवा, उहाँसे, एहसे, ओ, जवना, हमराAcc,Dat(12; 5% of non-emptyCase): कइसे, अइसे, हइसे, ओकरा, हमरा, हमारAcc,Erg(6; 2% of non-emptyCase): ओतने, माने, आपने, हमAcc,Gen(7; 3% of non-emptyCase): हमरा, इनका, ओकर, हमनीके, हमारNom(210; 84% of non-emptyCase): ऊ, अपना, हम, हमरा, आपन, रउरा, हमार, बिना, हमनी, केहूEMPTY(85): ओकरा, काहे, एकरा, ईहे, ऊहो, उनुकर, एकर, कहीं, कबो, का
| Paradigm हमर | Acc,Gen | Nom | Acc |
|---|---|---|---|
| Gender=Masc|Number=Sing|Person=3 | हमरा | ||
| Gender=Masc|Person=1|Poss=Yes|PronType=Prs | हमरा | ||
| Gender=Fem|Number=Sing|Person=3 | हमरा |
DET
229 DET tokens (65% of all DET tokens) have a non-empty value of Case.
The most frequent other feature values with which DET and Case co-occurred: NumType=EMPTY (229; 100%), Person=3 (207; 90%), Number=Sing (196; 86%), PronType=EMPTY (191; 83%), Gender=Masc (125; 55%).
DET tokens may have the following values of Case:
Acc(26; 11% of non-emptyCase): जवना, एह, सभे, कवनो, एही, ओह, कवनाAcc,Dat(6; 3% of non-emptyCase): एहमें, ओहमें, काहेंAcc,Gen(1; 0% of non-emptyCase): जेकराNom(196; 86% of non-emptyCase): ई, कवनो, एह, अइसन, जवन, अब, एही, ओह, कुछु, केहूँEMPTY(124): एह, कुछ, ओकर, ओह, हर, आजु, ई, कतना, फेरु, ईहो
| Paradigm एह | Nom | Acc |
|---|---|---|
| Gender=Masc | एह | एह |
| PronType=Dem | एह |
ADJ
173 ADJ tokens (69% of all ADJ tokens) have a non-empty value of Case.
The most frequent other feature values with which ADJ and Case co-occurred: Number=Sing (105; 61%), Person=3 (98; 57%), Gender=Masc (93; 54%).
ADJ tokens may have the following values of Case:
Acc(38; 22% of non-emptyCase): सांस्कृतिक, खास, सहज, तरह, निश्चित, रोज, स्थित, अपार, इलेक्ट्रॉनिक, कानूनीAcc,Dat(1; 1% of non-emptyCase): हिंदियोNom(134; 77% of non-emptyCase): पूरा, आखिरी, काव्य, बड़, अतिशय, चुनरी, छोट, तथाकथित, बनतारी, रायEMPTY(76): प, चुपचाप, तथाकथित, आसान, जरूरी, ग्रुप, छाप, ठीक, दर्पण, नीक
| Paradigm पूरा | Nom | Acc |
|---|---|---|
| पूरा | पूरा |
Case seems to be lexical feature of ADJ. 94% lemmas (98) occur only with one value of Case.
AUX
163 AUX tokens (46% of all AUX tokens) have a non-empty value of Case.
The most frequent other feature values with which AUX and Case co-occurred: Aspect=EMPTY (163; 100%), VerbForm=EMPTY (163; 100%), Voice=EMPTY (163; 100%), Polite=EMPTY (161; 99%), Person=3 (159; 98%), Number=Sing (158; 97%).
AUX tokens may have the following values of Case:
Acc(5; 3% of non-emptyCase): जात, कइला, जाव, सकता, होखीNom(158; 97% of non-emptyCase): बा, गइल, रहल, जा, जात, जाव, बानी, जाला, रहुवे, सकेलाEMPTY(192): रहे, जाई, बा, गइल, जा, रहीं, बाड़न, सकेला, रहल, हो
| Paradigm जा | Nom | Acc |
|---|---|---|
| _ | जात | जात |
| Gender=Masc|Number=Sing|Person=3 | जा, जात, गइल, जाला, जाउ | जाव |
| Gender=Masc|Number=Plur|Person=3 | जात | |
| Gender=Fem|Number=Sing|Person=3 | गइल, जाई | |
| Number=Sing|Person=3 | जाव |
PART
104 PART tokens (54% of all PART tokens) have a non-empty value of Case.
The most frequent other feature values with which PART and Case co-occurred: Person=3 (92; 88%), Number=Sing (81; 78%), Gender=Masc (78; 75%).
PART tokens may have the following values of Case:
Acc(8; 8% of non-emptyCase): त, नइखे, ना, जादा, सबसे, सूखाड़Nom(96; 92% of non-emptyCase): त, नइखे, ना, बहुते, बस, गमगमावे, घटना, सँ, अतना, अलावेEMPTY(87): ना, नइखे, त, भर, ढेर, तनिको, नाहीं, बनवले, बिना, भी
| Paradigm त | Nom | Acc |
|---|---|---|
| _ | त | त |
| Gender=Masc|Number=Sing|Person=3 | त | |
| Gender=Masc|Number=Plur|Person=3 | त | |
| Gender=Fem|Number=Sing|Person=3 | त |
NUM
96 NUM tokens (64% of all NUM tokens) have a non-empty value of Case.
The most frequent other feature values with which NUM and Case co-occurred: NumType=EMPTY (96; 100%), Person=3 (91; 95%), Gender=Masc (74; 77%), Number=Sing (72; 75%).
NUM tokens may have the following values of Case:
Acc(6; 6% of non-emptyCase): पहिला, सिलसिला, २०१२Nom(90; 94% of non-emptyCase): एगो, लोग, गो, दू, कुछ, 5, कलिग, छठवां, दोसर, दोसराEMPTY(53): एक, अनकस, बाकि, 12, 120, 2011, 75, आठ, एगो, चार
Case seems to be lexical feature of NUM. 100% lemmas (28) occur only with one value of Case.
CCONJ
30 CCONJ tokens (20% of all CCONJ tokens) have a non-empty value of Case.
The most frequent other feature values with which CCONJ and Case co-occurred: Gender=Masc (30; 100%), Number=Sing (30; 100%), Person=3 (30; 100%).
CCONJ tokens may have the following values of Case:
Nom(30; 100% of non-emptyCase): बाकिर, अउर, भा, राउर, आखिर, खम्भा, आउरEMPTY(121): आ, फगुआ, बाकिर, अउर, आउर, खैर, बलुक, रउँआ, सचहूं
ADV
7 ADV tokens (23% of all ADV tokens) have a non-empty value of Case.
The most frequent other feature values with which ADV and Case co-occurred: Gender=EMPTY (4; 57%), Number=EMPTY (4; 57%).
ADV tokens may have the following values of Case:
Acc(4; 57% of non-emptyCase): ललित, सम्मानित, जल्दीNom(3; 43% of non-emptyCase): तेज, आजुओ, शुरूEMPTY(24): जइसे, हिन्दी, गद्य, सभ्य, आनन्द, आसानी, जरूर, जल्दी, जसहीं, जारी
INTJ
4 INTJ tokens (80% of all INTJ tokens) have a non-empty value of Case.
The most frequent other feature values with which INTJ and Case co-occurred: Gender=Masc (4; 100%), Number=Sing (4; 100%).
INTJ tokens may have the following values of Case:
Acc(4; 100% of non-emptyCase): गहरे, अरे, दोसरेEMPTY(1): अजी
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case:
NOUN –[nmod]–> NOUN (159; 54%),
VERB –[aux]–> AUX (80; 51%),
PROPN –[compound]–> PROPN (75; 51%),
NOUN –[amod]–> ADJ (62; 67%),
PROPN –[case]–> ADP (44; 52%),
NOUN –[amod]–> NOUN (16; 80%),
NOUN –[conj]–> NOUN (16; 62%),
PROPN –[conj]–> PROPN (15; 56%),
PROPN –[nmod]–> NOUN (13; 52%),
DET –[compound]–> NOUN (10; 71%).