Treebank Statistics: UD_Bhojpuri-BHTB: Features: Case
This feature is universal.
It occurs with 5 different values: Acc
, Dat
, Erg
, Gen
, Nom
.
Some words have combined values of the feature; 3 combinations have been observed: Acc|Dat
, Acc|Erg
, Acc|Gen
.
4043 tokens (61%) have a non-empty value of Case
.
1361 types (81%) occur at least once with a non-empty value of Case
.
1335 lemmas (82%) occur at least once with a non-empty value of Case
.
The feature is used with 13 part-of-speech tags: NOUN (1688; 25% instances), ADP (570; 9% instances), PROPN (399; 6% instances), VERB (329; 5% instances), PRON (250; 4% instances), DET (229; 3% instances), ADJ (173; 3% instances), AUX (163; 2% instances), PART (105; 2% instances), NUM (96; 1% instances), CCONJ (30; 0% instances), ADV (7; 0% instances), INTJ (4; 0% instances).
NOUN
1688 NOUN tokens (91% of all NOUN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NOUN
and Case
co-occurred: Person=3 (1561; 92%), Number=Sing (1489; 88%), Gender=Masc (1214; 72%).
NOUN
tokens may have the following values of Case
:
Acc
(685; 41% of non-emptyCase
): आजु, लोगन, जी, बात, साहित्य, सभ, घंटा, परिवार, बिआह, कार्यक्रमNom
(1003; 59% of non-emptyCase
): लोग, जब, रंग, गवनई, बेर, शुरू, सन, काम, निबंध, काल्हुEMPTY
(166): पहिले, बिआह, कथा, डा., दिसाईं, पहिलहीं, अलगे, आंचलिक, आरे, काहेंके
Paradigm बिआह | Nom | Acc |
---|---|---|
Gender=Masc|Number=Sing | बिआह | बिआह |
Gender=Masc|Number=Plur | बिआह | |
Gender=Fem|Number=Sing | बिआह | बिआह |
ADP
570 ADP tokens (58% of all ADP
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADP
and Case
co-occurred: Gender=Masc (563; 99%), AdpType=Post (553; 97%), Number=Sing (473; 83%).
ADP
tokens may have the following values of Case
:
Acc
(347; 61% of non-emptyCase
): के, खड़े, जाके, पटे, खातिर, डाले, पढ़े, पहिलहीं, मोड, वालेNom
(223; 39% of non-emptyCase
): के, का, खातिर, वाला, ओके, लेके, उठाके, साथे, हमनीके, ओकराकेEMPTY
(419): में, से, पर, के, ले, तबे, अतने, का, अपने, उहाँका
Paradigm का | Nom | Acc |
---|---|---|
Gender=Masc|Number=Sing | का, के | के |
Gender=Masc|Number=Sing|Person=3|Polite=Form | के | |
Gender=Masc|Number=Plur | के | के |
Gender=Fem|Number=Sing | के | |
Number=Plur | के |
PROPN
399 PROPN tokens (95% of all PROPN
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PROPN
and Case
co-occurred: Person=3 (392; 98%), Number=Sing (390; 98%), Gender=Masc (225; 56%).
PROPN
tokens may have the following values of Case
:
Acc
(114; 29% of non-emptyCase
): भोजपुरी, जी, द्विवेदी, खाली, चोपड़ा, दिल्ली, शाही, सिंह, पाती, प्रियंकाAcc,Dat
(10; 3% of non-emptyCase
): लेखको, ऑडियो, कादो, केनियो, परिचर्चो, प्रियंको, भाषणोNom
(275; 69% of non-emptyCase
): भोजपुरी, डॉ., पाती, प्रियंका, राय, सिंह, तिवारी, प्रसाद, आंजनेय, उदयEMPTY
(22): पाती, डा॰, अंचल, टिप्पणिए, प्रो॰, 2012, 24, इटलिए, चोपड़ा, पूर्वांचल
Paradigm भोजपुरी | Nom | Acc |
---|---|---|
Gender=Masc | भोजपुरी | |
Gender=Fem | भोजपुरी | भोजपुरी |
Case
seems to be lexical feature of PROPN
. 92% lemmas (169) occur only with one value of Case
.
VERB
329 VERB tokens (43% of all VERB
tokens) have a non-empty value of Case
.
The most frequent other feature values with which VERB
and Case
co-occurred: Aspect=EMPTY (329; 100%), Voice=EMPTY (329; 100%), VerbForm=EMPTY (322; 98%), Person=3 (293; 89%), Number=Sing (283; 86%), Gender=Masc (235; 71%).
VERB
tokens may have the following values of Case
:
Acc
(96; 29% of non-emptyCase
): क, होखे, अद्भुत, छोड़त, लिखे, अंतर्दृष्टि, अतने, अनगिनत, अबीरे, उघरतNom
(233; 71% of non-emptyCase
): भइल, बा, आइल, कहल, ह, कइल, दिहल, खुजली, चल, पड़लEMPTY
(438): हो, बा, चाहीं, करे, होखे, होई, कर, होला, कहले, ना
Paradigm कर | Nom | Acc |
---|---|---|
Gender=Masc | कइल | कइला, करे |
Gender=Fem | कइल | कइला |
Case
seems to be lexical feature of VERB
. 94% lemmas (151) occur only with one value of Case
.
PRON
250 PRON tokens (75% of all PRON
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PRON
and Case
co-occurred: Aspect=EMPTY (250; 100%), VerbForm=EMPTY (250; 100%), Number=Sing (179; 72%), Person=3 (134; 54%), PronType=EMPTY (131; 52%).
PRON
tokens may have the following values of Case
:
Acc
(15; 6% of non-emptyCase
): हमनी, केहूँ, पत्नी, फलाना, रउवा, उहाँसे, एहसे, ओ, जवना, हमराAcc,Dat
(12; 5% of non-emptyCase
): कइसे, अइसे, हइसे, ओकरा, हमरा, हमारAcc,Erg
(6; 2% of non-emptyCase
): ओतने, माने, आपने, हमAcc,Gen
(7; 3% of non-emptyCase
): हमरा, इनका, ओकर, हमनीके, हमारNom
(210; 84% of non-emptyCase
): ऊ, अपना, हम, हमरा, आपन, रउरा, हमार, बिना, हमनी, केहूEMPTY
(85): ओकरा, काहे, एकरा, ईहे, ऊहो, उनुकर, एकर, कहीं, कबो, का
Paradigm हमर | Acc,Gen | Nom | Acc |
---|---|---|---|
Gender=Masc|Number=Sing|Person=3 | हमरा | ||
Gender=Masc|Person=1|Poss=Yes|PronType=Prs | हमरा | ||
Gender=Fem|Number=Sing|Person=3 | हमरा |
DET
229 DET tokens (65% of all DET
tokens) have a non-empty value of Case
.
The most frequent other feature values with which DET
and Case
co-occurred: NumType=EMPTY (229; 100%), Person=3 (207; 90%), Number=Sing (196; 86%), PronType=EMPTY (191; 83%), Gender=Masc (125; 55%).
DET
tokens may have the following values of Case
:
Acc
(26; 11% of non-emptyCase
): जवना, एह, सभे, कवनो, एही, ओह, कवनाAcc,Dat
(6; 3% of non-emptyCase
): एहमें, ओहमें, काहेंAcc,Gen
(1; 0% of non-emptyCase
): जेकराNom
(196; 86% of non-emptyCase
): ई, कवनो, एह, अइसन, जवन, अब, एही, ओह, कुछु, केहूँEMPTY
(124): एह, कुछ, ओकर, ओह, हर, आजु, ई, कतना, फेरु, ईहो
Paradigm एह | Nom | Acc |
---|---|---|
Gender=Masc | एह | एह |
PronType=Dem | एह |
ADJ
173 ADJ tokens (69% of all ADJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADJ
and Case
co-occurred: Number=Sing (105; 61%), Person=3 (98; 57%), Gender=Masc (93; 54%).
ADJ
tokens may have the following values of Case
:
Acc
(38; 22% of non-emptyCase
): सांस्कृतिक, खास, सहज, तरह, निश्चित, रोज, स्थित, अपार, इलेक्ट्रॉनिक, कानूनीAcc,Dat
(1; 1% of non-emptyCase
): हिंदियोNom
(134; 77% of non-emptyCase
): पूरा, आखिरी, काव्य, बड़, अतिशय, चुनरी, छोट, तथाकथित, बनतारी, रायEMPTY
(76): प, चुपचाप, तथाकथित, आसान, जरूरी, ग्रुप, छाप, ठीक, दर्पण, नीक
Paradigm पूरा | Nom | Acc |
---|---|---|
पूरा | पूरा |
Case
seems to be lexical feature of ADJ
. 94% lemmas (98) occur only with one value of Case
.
AUX
163 AUX tokens (46% of all AUX
tokens) have a non-empty value of Case
.
The most frequent other feature values with which AUX
and Case
co-occurred: Aspect=EMPTY (163; 100%), VerbForm=EMPTY (163; 100%), Voice=EMPTY (163; 100%), Polite=EMPTY (161; 99%), Person=3 (159; 98%), Number=Sing (158; 97%).
AUX
tokens may have the following values of Case
:
Acc
(5; 3% of non-emptyCase
): जात, कइला, जाव, सकता, होखीNom
(158; 97% of non-emptyCase
): बा, गइल, रहल, जा, जात, जाव, बानी, जाला, रहुवे, सकेलाEMPTY
(192): रहे, जाई, बा, गइल, जा, रहीं, बाड़न, सकेला, रहल, हो
Paradigm जा | Nom | Acc |
---|---|---|
_ | जात | जात |
Gender=Masc|Number=Sing|Person=3 | जा, जात, गइल, जाला, जाउ | जाव |
Gender=Masc|Number=Plur|Person=3 | जात | |
Gender=Fem|Number=Sing|Person=3 | गइल, जाई | |
Number=Sing|Person=3 | जाव |
PART
105 PART tokens (55% of all PART
tokens) have a non-empty value of Case
.
The most frequent other feature values with which PART
and Case
co-occurred: Person=3 (93; 89%), Number=Sing (82; 78%), Gender=Masc (79; 75%).
PART
tokens may have the following values of Case
:
Acc
(9; 9% of non-emptyCase
): त, नइखे, ना, जादा, विस्तार, सबसे, सूखाड़Nom
(96; 91% of non-emptyCase
): त, नइखे, ना, बहुते, बस, गमगमावे, घटना, सँ, अतना, अलावेEMPTY
(87): ना, नइखे, त, भर, ढेर, तनिको, नाहीं, बनवले, बिना, भी
Paradigm त | Nom | Acc |
---|---|---|
_ | त | त |
Gender=Masc|Number=Sing|Person=3 | त | |
Gender=Masc|Number=Plur|Person=3 | त | |
Gender=Fem|Number=Sing|Person=3 | त |
NUM
96 NUM tokens (64% of all NUM
tokens) have a non-empty value of Case
.
The most frequent other feature values with which NUM
and Case
co-occurred: NumType=EMPTY (96; 100%), Person=3 (91; 95%), Gender=Masc (74; 77%), Number=Sing (72; 75%).
NUM
tokens may have the following values of Case
:
Acc
(6; 6% of non-emptyCase
): पहिला, सिलसिला, २०१२Nom
(90; 94% of non-emptyCase
): एगो, लोग, गो, दू, कुछ, 5, कलिग, छठवां, दोसर, दोसराEMPTY
(53): एक, अनकस, बाकि, 12, 120, 2011, 75, आठ, एगो, चार
Case
seems to be lexical feature of NUM
. 100% lemmas (28) occur only with one value of Case
.
CCONJ
30 CCONJ tokens (20% of all CCONJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which CCONJ
and Case
co-occurred: Gender=Masc (30; 100%), Number=Sing (30; 100%), Person=3 (30; 100%).
CCONJ
tokens may have the following values of Case
:
Nom
(30; 100% of non-emptyCase
): बाकिर, अउर, भा, राउर, आखिर, खम्भा, आउरEMPTY
(121): आ, फगुआ, बाकिर, अउर, आउर, खैर, बलुक, रउँआ, सचहूं
ADV
7 ADV tokens (23% of all ADV
tokens) have a non-empty value of Case
.
The most frequent other feature values with which ADV
and Case
co-occurred: Gender=EMPTY (4; 57%), Number=EMPTY (4; 57%).
ADV
tokens may have the following values of Case
:
Acc
(4; 57% of non-emptyCase
): ललित, सम्मानित, जल्दीNom
(3; 43% of non-emptyCase
): तेज, आजुओ, शुरूEMPTY
(24): जइसे, हिन्दी, गद्य, सभ्य, आनन्द, आसानी, जरूर, जल्दी, जसहीं, जारी
INTJ
4 INTJ tokens (80% of all INTJ
tokens) have a non-empty value of Case
.
The most frequent other feature values with which INTJ
and Case
co-occurred: Gender=Masc (4; 100%), Number=Sing (4; 100%).
INTJ
tokens may have the following values of Case
:
Acc
(4; 100% of non-emptyCase
): गहरे, अरे, दोसरेEMPTY
(1): अजी
Relations with Agreement in Case
The 10 most frequent relations where parent and child node agree in Case
:
NOUN –[nmod]–> NOUN (158; 54%),
VERB –[aux]–> AUX (80; 51%),
PROPN –[compound]–> PROPN (75; 51%),
NOUN –[amod]–> ADJ (62; 67%),
PROPN –[case]–> ADP (44; 52%),
NOUN –[conj]–> NOUN (16; 62%),
NOUN –[amod]–> NOUN (15; 79%),
PROPN –[conj]–> PROPN (15; 56%),
PROPN –[nmod]–> NOUN (13; 52%),
DET –[compound]–> NOUN (10; 71%).