Treebank Statistics: UD_Bulgarian-BTB: Features: Number
This feature is universal but the values Count are language-specific.
It occurs with 4 different values: Count, Plur, Ptan, Sing.
87764 tokens (56%) have a non-empty value of Number.
27713 types (105%) occur at least once with a non-empty value of Number.
14050 lemmas (94%) occur at least once with a non-empty value of Number.
The feature is used with 10 part-of-speech tags: NOUN (33927; 22% instances), VERB (16828; 11% instances), ADJ (13504; 9% instances), PROPN (8363; 5% instances), PRON (5369; 3% instances), AUX (4739; 3% instances), DET (2432; 2% instances), NUM (2102; 1% instances), ADV (499; 0% instances), ADP (1; 0% instances).
NOUN
33927 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Definite=Ind (20766; 61%).
NOUN tokens may have the following values of Number:
Count(888; 3% of non-emptyNumber): %, лв., млн., $, месеца, дни, лева, млрд., пъти, долараPlur(8792; 26% of non-emptyNumber): г., години, пари, страни, проблеми, представители, сили, промени, фирми, паритеPtan(325; 1% of non-emptyNumber): хората, хора, души, преговори, преговорите, финансите, боеприпаси, книжа, книжата, белезнициSing(23922; 71% of non-emptyNumber): г., време, година, част, президентът, страната, събрание, път, страна, краяEMPTY(225): глава, собственост, партия, президент, интерес, въпрос, съюз, училище, въстание, изказване
| Paradigm лев | Sing | Plur | Count |
|---|---|---|---|
| Definite=Def | лева, Левът | ||
| Definite=Ind | лев | левове | |
| лв., лева |
VERB
16828 VERB tokens (100% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Voice=Act (15495; 92%), Gender=EMPTY (15006; 89%), Definite=EMPTY (14164; 84%), VerbForm=Fin (14164; 84%), Mood=Ind (13916; 83%), Person=3 (11480; 68%), Tense=Pres (9528; 57%), Aspect=Imp (8679; 52%).
VERB tokens may have the following values of Number:
Plur(5102; 30% of non-emptyNumber): могат, имат, съобщиха, можем, имаме, работят, искат, правят, вземат, искамеSing(11726; 70% of non-emptyNumber): има, няма, може, трябва, каза, съобщи, заяви, стана, обяви, направи
| Paradigm мога | Sing | Plur |
|---|---|---|
| Definite=Ind|Gender=Masc|Tense=Imp|VerbForm=Part | можел | |
| Definite=Ind|Gender=Masc|Tense=Past|VerbForm=Part | могъл | |
| Definite=Ind|Gender=Fem|Tense=Imp|VerbForm=Part | можела | |
| Definite=Ind|Gender=Fem|Tense=Past|VerbForm=Part | могла | |
| Definite=Ind|Gender=Neut|Tense=Imp|VerbForm=Part | можело | |
| Definite=Ind|Gender=Neut|Tense=Past|VerbForm=Part | могло | |
| Definite=Ind|Tense=Imp|VerbForm=Part | можели | |
| Definite=Ind|Tense=Past|VerbForm=Part | могли | |
| Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin | можех | Можехме |
| Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | можах | |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | мога | можем |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | можеш | можете |
| Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | можеше | можеха |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | можа | можаха |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | може | могат |
ADJ
13504 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (12793; 95%), Aspect=EMPTY (12018; 89%), VerbForm=EMPTY (12018; 89%), Voice=EMPTY (12018; 89%), Definite=Ind (7442; 55%).
ADJ tokens may have the following values of Number:
Plur(3947; 29% of non-emptyNumber): други, другите, последните, нови, новите, първите, различни, българските, големи, въоръженитеSing(9557; 71% of non-emptyNumber): народното, българската, нова, европейската, 2001, друг, цялата, 2000, голяма, новияEMPTY(87): т.нар., US, жп, държавен, народна, политически, важен, военноморско, електронна, избирателна
| Paradigm нов | Sing | Plur |
|---|---|---|
| Case=Voc|Degree=Pos|Gender=Masc | Нови | |
| Definite=Def|Degree=Pos | новите | |
| Definite=Def|Degree=Pos|Gender=Masc | новия, новият | |
| Definite=Def|Degree=Pos|Gender=Fem | новата | |
| Definite=Def|Degree=Pos|Gender=Neut | новото | |
| Definite=Def|Degree=Sup|Gender=Masc | най-новият | |
| Definite=Def|Degree=Sup|Gender=Fem | най-новата | |
| Definite=Def|Degree=Sup|Gender=Neut | Най-новото | |
| Definite=Ind|Degree=Pos | нови | |
| Definite=Ind|Degree=Pos|Gender=Masc | нов | |
| Definite=Ind|Degree=Pos|Gender=Fem | нова | |
| Definite=Ind|Degree=Pos|Gender=Neut | ново | |
| Definite=Ind|Degree=Sup | най-нови |
PROPN
8363 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Definite=Ind (8078; 97%), Gender=Masc (5216; 62%).
PROPN tokens may have the following values of Number:
Plur(142; 2% of non-emptyNumber): САЩ, Балканите, БДЖ, ОДС, DM, Балкани, Гласове, Полимери, РМД-та, АлпиPtan(8; 0% of non-emptyNumber): Кремиковци, ОАЕ, Брадвари, ДрагалевциSing(8213; 98% of non-emptyNumber): България, София, Иван, ЕС, Европа, СДС, Петър, Стоянов, Костов, ГеоргиEMPTY(72): де, Р-300, ван, -, 2000, ал, ди, дела, дьо, 173
| Paradigm сдс | Sing | Plur |
|---|---|---|
| Definite=Def|Gender=Masc | СДС | |
| Definite=Def|Gender=Neut | СДС-та | |
| Definite=Ind|Gender=Masc | СДС |
Number seems to be lexical feature of PROPN. 100% lemmas (2907) occur only with one value of Number.
PRON
5369 PRON tokens (53% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Poss=EMPTY (5369; 100%), Reflex=EMPTY (5369; 100%), PronType=Prs (3398; 63%), Case=Nom (3312; 62%).
PRON tokens may have the following values of Number:
Plur(1422; 26% of non-emptyNumber): които, те, ги, тях, нас, ни, ние, им, всички, виSing(3947; 74% of non-emptyNumber): това, той, го, който, тя, която, му, което, него, азEMPTY(4726): се, си, му, ни, й, им, ми, себе, ви, ти
| Paradigm аз | Sing | Plur |
|---|---|---|
| Case=Acc|Gender=Masc|Person=3 | го, него | |
| Case=Acc|Gender=Fem|Person=3 | я, нея | |
| Case=Acc|Gender=Neut|Person=3 | го, него | |
| Case=Acc|Person=1 | ме, мен, мене | нас, ни |
| Case=Acc|Person=2 | те, тебе, ви, вас, теб | вас, ви |
| Case=Acc|Person=3 | ги, тях | |
| Case=Dat|Gender=Masc|Person=3 | му, нему | |
| Case=Dat|Gender=Fem|Person=3 | й | |
| Case=Dat|Gender=Neut|Person=3 | му | |
| Case=Dat|Person=1 | ми, мен, мене | ни |
| Case=Dat|Person=2 | ти, ви | ви |
| Case=Dat|Person=3 | им, тям | |
| Case=Nom|Gender=Masc|Person=3 | той | |
| Case=Nom|Gender=Fem|Person=3 | тя | |
| Case=Nom|Gender=Neut|Person=3 | то | |
| Case=Nom|Person=1 | аз | ние, ний |
| Case=Nom|Person=2 | ти, вие | вие |
| Case=Nom|Person=3 | те | |
| Gender=Fem|Person=3 | й | |
| Person=1 | ми | ни |
| Person=2 | ти, ви | ви |
| Person=3 | им |
AUX
4739 AUX tokens (52% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Mood=Ind (4600; 97%), Voice=Act (4600; 97%), VerbForm=Fin (4493; 95%), Aspect=Imp (4353; 92%), Person=3 (4059; 86%), Tense=Pres (3679; 78%).
AUX tokens may have the following values of Number:
Plur(1309; 28% of non-emptyNumber): са, бяха, бъдат, сме, били, сте, бъдем, биха, бихте, бяхтеSing(3430; 72% of non-emptyNumber): е, бе, бъде, беше, съм, би, бил, си, била, бихEMPTY(4395): да, ще, е, са, бъдат, беше, би, бъде, съм
| Paradigm съм | Sing | Plur |
|---|---|---|
| Definite=Ind|Gender=Masc|Mood=Ind|VerbForm=Part|Voice=Act | бил | |
| Definite=Ind|Gender=Fem|Mood=Ind|VerbForm=Part|Voice=Act | била | |
| Definite=Ind|Gender=Neut|Mood=Ind|VerbForm=Part|Voice=Act | било | |
| Definite=Ind|Mood=Ind|VerbForm=Part|Voice=Act | били | |
| Mood=Cnd|Person=1|VerbForm=Fin | бих | бихме |
| Mood=Cnd|Person=2|VerbForm=Fin | Би | бихте |
| Mood=Cnd|Person=3|Tense=Past|VerbForm=Fin | би | |
| Mood=Cnd|Person=3|VerbForm=Fin | биха | |
| Mood=Ind|Person=1|Tense=Past|VerbForm=Fin|Voice=Act | бях | бяхме |
| Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act | съм | сме |
| Mood=Ind|Person=2|Tense=Past|VerbForm=Fin|Voice=Act | беше | |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act | си | сте |
| Mood=Ind|Person=2|VerbForm=Fin|Voice=Act | бяхте | |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Act | бе, беше | бяха |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act | е | са |
DET
2432 DET tokens (100% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Person=EMPTY (2043; 84%), Poss=EMPTY (1920; 79%), Definite=EMPTY (1640; 67%), Case=EMPTY (1534; 63%).
DET tokens may have the following values of Number:
Plur(712; 29% of non-emptyNumber): тези, всички, нашите, някои, какви, своите, такива, техните, наши, тияSing(1720; 71% of non-emptyNumber): тази, този, това, един, какво, една, всеки, всяка, едно, своя
| Paradigm този | Sing | Plur |
|---|---|---|
| Case=Nom|Gender=Fem | тази, тая, онази, тeзи | |
| Case=Nom|Gender=Neut | това, онова, туй | |
| Gender=Masc | този, тоя, оня, онзи | |
| тези, тия, онези, ония |
NUM
2102 NUM tokens (100% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (2102; 100%), Definite=Ind (1957; 93%), Gender=EMPTY (1587; 75%).
NUM tokens may have the following values of Number:
Plur(1863; 89% of non-emptyNumber): две, два, 2, 3, три, 10, двамата, 20, двете, 000Sing(239; 11% of non-emptyNumber): един, една, 1, едно, половин, 0, Единият, едното, 0,1, 0.00EMPTY(3): 02, 08, 2000
Number seems to be lexical feature of NUM. 100% lemmas (414) occur only with one value of Number.
ADV
499 ADV tokens (8% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: PronType=EMPTY (499; 100%), Degree=Pos (463; 93%).
ADV tokens may have the following values of Number:
Plur(499; 100% of non-emptyNumber): много, повече, малко, повечето, по-малко, най-много, най-малко, малкото, Многая, Най-малкотоEMPTY(6059): още, вчера, само, вече, когато, защото, обаче, сега, как, така
ADP
1 ADP tokens (0% of all ADP tokens) have a non-empty value of Number.
ADP tokens may have the following values of Number:
Sing(1; 100% of non-emptyNumber): сравнениеEMPTY(22095): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (11160; 97%),
NOUN –[nmod]–> NOUN (5930; 61%),
VERB –[nsubj]–> NOUN (4234; 93%),
VERB –[obj]–> NOUN (2782; 58%),
NOUN –[nmod]–> PROPN (2663; 82%),
VERB –[obl]–> NOUN (2631; 58%),
NOUN –[det]–> DET (1946; 98%),
VERB –[nsubj]–> PRON (1889; 98%),
NOUN –[conj]–> NOUN (1720; 78%),
PROPN –[flat]–> PROPN (1539; 96%).