Treebank Statistics: UD_Bhojpuri-BHTB: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
4276 tokens (64%) have a non-empty value of Number.
1395 types (83%) occur at least once with a non-empty value of Number.
1341 lemmas (82%) occur at least once with a non-empty value of Number.
The feature is used with 14 part-of-speech tags: NOUN (1629; 24% instances), ADP (579; 9% instances), VERB (546; 8% instances), PROPN (395; 6% instances), AUX (277; 4% instances), PRON (276; 4% instances), DET (222; 3% instances), ADJ (111; 2% instances), PART (98; 1% instances), NUM (93; 1% instances), CCONJ (41; 1% instances), ADV (4; 0% instances), INTJ (4; 0% instances), SCONJ (1; 0% instances).
NOUN
1629 NOUN tokens (88% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Person=3 (1569; 96%), Gender=Masc (1228; 75%), Case=Nom (944; 58%).
NOUN tokens may have the following values of Number:
Plur(115; 7% of non-emptyNumber): लोग, गायक, जानकारी, दिसाईं, पहिले, आदमी, आनंद, आर्थिक, ओने, कबीरSing(1514; 93% of non-emptyNumber): जी, रंग, देश, बिआह, भाषा, आजु, साल, बात, लोगन, साहित्यEMPTY(226): जब, बिआह, तब, अब, पहिले, उहाँ, कथा, गवनई, चीफ, जहाँ
| Paradigm बिआह | Sing | Plur |
|---|---|---|
| Case=Acc|Gender=Masc | बिआह | |
| Case=Acc|Gender=Fem | बिआह | |
| Case=Nom|Gender=Masc | बिआह | बिआह |
| Case=Nom|Gender=Fem | बिआह | |
| Mood=Sub|VerbForm=Fin|Voice=Act | बिआहे |
Number seems to be lexical feature of NOUN. 97% lemmas (758) occur only with one value of Number.
ADP
579 ADP tokens (59% of all ADP tokens) have a non-empty value of Number.
The most frequent other feature values with which ADP and Number co-occurred: Gender=Masc (566; 98%), AdpType=Post (556; 96%), Case=Acc (344; 59%).
ADP tokens may have the following values of Number:
Plur(97; 17% of non-emptyNumber): के, हमनीके, उठाके, लेके, करेके, पढ़िके, लगावेके, लगे, ले, लोगSing(482; 83% of non-emptyNumber): के, का, ले, वाला, खातिर, ओके, जाके, साथे, ओकराके, खड़ेEMPTY(410): में, से, पर, के, खातिर, तबे, अतने, का, अपने, उहाँका
| Paradigm का | Sing | Plur |
|---|---|---|
| Case=Acc|Gender=Masc | के | के |
| Case=Nom | के | |
| Case=Nom|Gender=Masc | का, के | के |
| Case=Nom|Gender=Masc|Person=3|Polite=Form | के | |
| Case=Nom|Gender=Fem | के | |
| Gender=Masc|Person=3 | के |
VERB
546 VERB tokens (71% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Person=3 (412; 75%), Aspect=EMPTY (368; 67%), Gender=Masc (367; 67%), Voice=EMPTY (340; 62%), VerbForm=EMPTY (332; 61%).
VERB tokens may have the following values of Number:
Plur(106; 19% of non-emptyNumber): चाहीं, होखे, होई, आवे, जाए, जाता, देबे, बा, मए, लागेSing(440; 81% of non-emptyNumber): बा, भइल, आइल, करे, क, कहल, ह, होखे, कइल, कतहींEMPTY(221): हो, कर, ना, लागल, ह, करत, चलि, धीरे, पड़ल, भइला
| Paradigm हो | Sing | Plur |
|---|---|---|
| Aspect=Perf|Gender=Masc|VerbForm=Part|Voice=Act | होखे | |
| Aspect=Perf|Gender=Fem|VerbForm=Part | होखी | होई |
| Aspect=Perf|Gender=Fem|VerbForm=Part|Voice=Act | होखी | |
| Case=Acc|Gender=Masc|Person=3 | होखे | |
| Case=Acc|Gender=Fem | होखी | |
| Gender=Masc|Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act | होई | |
| Gender=Masc|Person=3|VerbForm=Inf|Voice=Act | होखे | |
| Gender=Masc|Person=3|Voice=Act | हो | हो |
| Gender=Masc|Voice=Act | हो | |
| Person=3|Voice=Act | हो |
Number seems to be lexical feature of VERB. 95% lemmas (215) occur only with one value of Number.
PROPN
395 PROPN tokens (94% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Person=3 (392; 99%), Case=Nom (271; 69%), Gender=Masc (225; 57%).
PROPN tokens may have the following values of Number:
Plur(3; 1% of non-emptyNumber): हिन्दुस्तान, 25Sing(392; 99% of non-emptyNumber): भोजपुरी, सिंह, प्रियंका, राय, जी, डॉ., पाण्डेय, तिवारी, दिल्ली, द्विवेदीEMPTY(26): पाती, डा॰, अंचल, टिप्पणिए, प्रो॰, 2012, 24, इटलिए, चोपड़ा, पूर्वांचल
| Paradigm हिन्दुस्तान | Sing | Plur |
|---|---|---|
| Case=Acc | हिन्दुस्तान | |
| Case=Nom | हिन्दुस्तान |
Number seems to be lexical feature of PROPN. 99% lemmas (181) occur only with one value of Number.
AUX
277 AUX tokens (78% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Voice=EMPTY (235; 85%), Polite=EMPTY (234; 84%), Person=3 (218; 79%), Aspect=EMPTY (195; 70%), VerbForm=EMPTY (168; 61%), Case=Nom (156; 56%).
AUX tokens may have the following values of Number:
Plur(15; 5% of non-emptyNumber): बाड़न, रहलीं, गइल, चाहीं, जाए, जात, दीहें, देले, बाड़े, मारींSing(262; 95% of non-emptyNumber): बा, रहे, गइल, रहल, जाई, जाव, जा, रहीं, बानी, सकेलाEMPTY(78): जा, गइल, बा, हो, बाड़न, दिहलसि, लागल, आइल, कइले, करी
| Paradigm बा | Sing | Plur |
|---|---|---|
| Aspect=Perf|Gender=Masc|VerbForm=Part | बाड़े | |
| Case=Nom|Gender=Masc|Person=3 | बा, बाड़न, बाड़ | |
| Case=Nom|Gender=Fem|Person=3 | बा, बाड़ी | |
| Case=Nom|Person=3 | बा, बानी | |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | बा | बाड़न |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | बा, बाटे | |
| Mood=Sub|Person=3|VerbForm=Fin|Voice=Act | बाड़ें | |
| Person=3|Polite=Form | बाड़ |
PRON
276 PRON tokens (82% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Aspect=EMPTY (230; 83%), VerbForm=EMPTY (228; 83%), Case=Nom (184; 67%), PronType=EMPTY (178; 64%), Person=3 (142; 51%).
PRON tokens may have the following values of Number:
Plur(77; 28% of non-emptyNumber): ओकरा, हम, आम, एकरा, एकदम, कलम, तमाम, दरपन, बनाम, हमनीSing(199; 72% of non-emptyNumber): अपना, हमरा, ऊ, आपन, हमनी, रउरा, हमार, बिना, उनुकर, उनुकाEMPTY(59): ऊ, केहूँ, काहे, कहीं, एकर, कहाँ, ईहे, उहाँ, एकहूँ, कबो
| Paradigm हम | Sing | Plur |
|---|---|---|
| Case=Acc,Erg | हम | |
| Case=Nom | हम | हम, हमनी |
DET
222 DET tokens (63% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: NumType=EMPTY (222; 100%), Person=3 (207; 93%), PronType=EMPTY (191; 86%), Case=Nom (183; 82%), Gender=Masc (130; 59%).
DET tokens may have the following values of Number:
Plur(21; 9% of non-emptyNumber): कुछु, आजु, सब, सबले, असहिष्णु, ओकरा, ओहु, जासु, जेकरा, पढ़सुSing(201; 91% of non-emptyNumber): ई, कवनो, एह, अइसन, जवन, जवना, एही, ओह, सभे, कतनाEMPTY(131): एह, कुछ, ओकर, ओह, हर, अब, आजु, ई, फेरु, ईहो
| Paradigm आजु | Sing | Plur |
|---|---|---|
| आजुओ | आजु |
Number seems to be lexical feature of DET. 94% lemmas (49) occur only with one value of Number.
ADJ
111 ADJ tokens (45% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Person=3 (98; 88%), Case=Nom (96; 86%), Gender=Masc (95; 86%).
ADJ tokens may have the following values of Number:
Plur(3; 3% of non-emptyNumber): छोट, चोटSing(108; 97% of non-emptyNumber): पूरा, बड़, छोट, तरह, नवका, बड़हन, भोजपुरिया, अढ़ाई, अश्लील, आधाEMPTY(138): सांस्कृतिक, तथाकथित, प, खास, चुपचाप, जरूरी, आखिरी, आसान, काव्य, सहज
| Paradigm छोट | Sing | Plur |
|---|---|---|
| Case=Acc | छोट | |
| Case=Nom | छोट | छोट |
Number seems to be lexical feature of ADJ. 99% lemmas (69) occur only with one value of Number.
PART
98 PART tokens (51% of all PART tokens) have a non-empty value of Number.
The most frequent other feature values with which PART and Number co-occurred: Person=3 (93; 95%), Case=Nom (89; 91%), Gender=Masc (78; 80%).
PART tokens may have the following values of Number:
Plur(17; 17% of non-emptyNumber): नइखे, बहुते, त, तिकवते, नाहींSing(81; 83% of non-emptyNumber): त, ना, बस, गमगमावे, घटना, नइखे, अतना, अलावे, केहू, खालीEMPTY(93): ना, त, नइखे, भर, ढेर, तनिको, बनवले, बिना, भी, सँ
| Paradigm त | Sing | Plur |
|---|---|---|
| Gender=Masc | त | त |
| Gender=Fem | त |
NUM
93 NUM tokens (62% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=EMPTY (93; 100%), Person=3 (88; 95%), Case=Nom (87; 94%), Gender=Masc (75; 81%).
NUM tokens may have the following values of Number:
Plur(21; 23% of non-emptyNumber): लोग, कलिग, उमंग, सन, सभSing(72; 77% of non-emptyNumber): एगो, गो, दू, 5, छठवां, दोसर, दोसरा, दोसरो, सिलसिला, २०१२EMPTY(56): एक, कुछ, अनकस, बाकि, 12, 120, 2011, 75, आठ, एगो
| Paradigm सभ | Sing | Plur |
|---|---|---|
| Gender=Masc | सभ | |
| PronType=Prs | सभ |
Number seems to be lexical feature of NUM. 96% lemmas (27) occur only with one value of Number.
CCONJ
41 CCONJ tokens (27% of all CCONJ tokens) have a non-empty value of Number.
The most frequent other feature values with which CCONJ and Number co-occurred: Case=Nom (30; 73%), Gender=Masc (30; 73%), Person=3 (30; 73%).
CCONJ tokens may have the following values of Number:
Plur(4; 10% of non-emptyNumber): आ, फगुआ, रउँआSing(37; 90% of non-emptyNumber): बाकिर, अउर, फगुआ, भा, राउर, आखिर, खम्भा, आ, आउरEMPTY(110): आ, बाकिर, अउर, आउर, खैर, बलुक, सचहूं
| Paradigm आ | Sing | Plur |
|---|---|---|
| Gender=Fem | आ | |
| आ | आ |
ADV
4 ADV tokens (13% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: Gender=Masc (3; 75%).
ADV tokens may have the following values of Number:
Plur(1; 25% of non-emptyNumber): नाहिंएSing(3; 75% of non-emptyNumber): आजुओ, जल्दी, शुरूEMPTY(27): जइसे, हिन्दी, गद्य, ललित, सभ्य, आनन्द, आसानी, जरूर, जल्दी, जसहीं
INTJ
4 INTJ tokens (80% of all INTJ tokens) have a non-empty value of Number.
The most frequent other feature values with which INTJ and Number co-occurred: Case=Acc (4; 100%), Gender=Masc (4; 100%).
INTJ tokens may have the following values of Number:
Sing(4; 100% of non-emptyNumber): गहरे, अरे, दोसरेEMPTY(1): अजी
SCONJ
1 SCONJ tokens (1% of all SCONJ tokens) have a non-empty value of Number.
SCONJ tokens may have the following values of Number:
Plur(1; 100% of non-emptyNumber): तकलेEMPTY(117): कि, त, काहेंकि, निकलि, बाकि, लपकि, आँखि, कोच्चि, प्रवृत्ति
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[nmod]–> NOUN (217; 74%),
NOUN –[compound]–> NOUN (188; 77%),
VERB –[compound]–> NOUN (182; 54%),
PROPN –[compound]–> PROPN (137; 93%),
VERB –[aux]–> AUX (117; 51%),
VERB –[nmod]–> NOUN (96; 54%),
NOUN –[compound]–> DET (74; 76%),
PROPN –[case]–> ADP (47; 55%),
NOUN –[nmod]–> PROPN (46; 73%),
NOUN –[compound]–> PROPN (41; 89%).