
This page pertains to UD version 2.

Treebank Statistics: UD_Bulgarian-BTB: Features: Number

This feature is universal, but the value Count is language-specific. It occurs with 4 different values: Count, Plur, Ptan, Sing.

87763 tokens (56%) have a non-empty value of Number. 27713 types (105%) occur at least once with a non-empty value of Number. 14048 lemmas (94%) occur at least once with a non-empty value of Number. The feature is used with 10 part-of-speech tags: NOUN (33927; 22% instances), VERB (17185; 11% instances), ADJ (13504; 9% instances), PROPN (8363; 5% instances), PRON (5368; 3% instances), AUX (4382; 3% instances), DET (2433; 2% instances), NUM (2101; 1% instances), ADV (499; 0% instances), ADP (1; 0% instances).
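Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function names and the four-token sample are made up for illustration, and it assumes that percentages are taken over all regular tokens (multiword-token ranges and empty nodes are skipped).

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Definite=Ind|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def count_number(conllu_text):
    """Return (total tokens, Counter of tokens with Number per UPOS tag)."""
    total = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank line between sentences or a comment
        cols = line.split("\t")
        # Keep only regular tokens: IDs like "1-2" or "1.1" are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total += 1
        if "Number" in parse_feats(cols[5]):
            by_upos[cols[3]] += 1
    return total, by_upos

# Hypothetical sample (not taken from the treebank), 10 columns per token.
ROWS = [
    ("1", "Новите", "нов", "ADJ", "_", "Definite=Def|Degree=Pos|Number=Plur", "2", "amod", "_", "_"),
    ("2", "левове", "лев", "NOUN", "_", "Definite=Ind|Number=Plur", "3", "nsubj", "_", "_"),
    ("3", "могат", "мога", "VERB", "_", "Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin", "0", "root", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

total, by_upos = count_number(SAMPLE)
share = 100 * sum(by_upos.values()) / total
print(f"{sum(by_upos.values())} of {total} tokens ({share:.0f}%) have Number")
for upos, n in by_upos.most_common():
    print(f"{upos}: {n} ({100 * n / total:.0f}% of all tokens)")
```

On the real treebank, the same function would be applied to the concatenated text of the train, dev and test files.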

NOUN

33927 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Definite=Ind (20766; 61%).

NOUN tokens may have the following values of Number:

Paradigm лев (values: Sing, Plur, Count)
Definite=Def: Sing = лева, Левът
Definite=Ind: Sing = лев; Plur = левове
(no other features): Count = лв., лева

VERB

17185 VERB tokens (100% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: Voice=Act (15819; 92%), Gender=EMPTY (15349; 89%), Definite=EMPTY (14505; 84%), VerbForm=Fin (14505; 84%), Mood=Ind (14240; 83%), Person=3 (11803; 69%), Tense=Pres (9800; 57%), Aspect=Imp (9003; 52%).

VERB tokens may have the following values of Number:

Paradigm мога (values: Sing, Plur)
Definite=Ind|Gender=Masc|Tense=Imp|VerbForm=Part: Sing = можел
Definite=Ind|Gender=Masc|Tense=Past|VerbForm=Part: Sing = могъл
Definite=Ind|Gender=Fem|Tense=Imp|VerbForm=Part: Sing = можела
Definite=Ind|Gender=Fem|Tense=Past|VerbForm=Part: Sing = могла
Definite=Ind|Gender=Neut|Tense=Imp|VerbForm=Part: Sing = можело
Definite=Ind|Gender=Neut|Tense=Past|VerbForm=Part: Sing = могло
Definite=Ind|Tense=Imp|VerbForm=Part: Plur = можели
Definite=Ind|Tense=Past|VerbForm=Part: Plur = могли
Mood=Ind|Person=1|Tense=Imp|VerbForm=Fin: Sing = можех; Plur = Можехме
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin: Sing = можах
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin: Sing = мога; Plur = можем
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin: Sing = можеш; Plur = можете
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin: Sing = можеше; Plur = можеха
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin: Sing = можа; Plur = можаха
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin: Sing = може; Plur = могат

ADJ

13504 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (12793; 95%), Aspect=EMPTY (12018; 89%), VerbForm=EMPTY (12018; 89%), Voice=EMPTY (12018; 89%), Definite=Ind (7442; 55%).

ADJ tokens may have the following values of Number:

Paradigm нов (values: Sing, Plur)
Case=Voc|Degree=Pos|Gender=Masc: Sing = Нови
Definite=Def|Degree=Pos: Plur = новите
Definite=Def|Degree=Pos|Gender=Masc: Sing = новия, новият
Definite=Def|Degree=Pos|Gender=Fem: Sing = новата
Definite=Def|Degree=Pos|Gender=Neut: Sing = новото
Definite=Def|Degree=Sup|Gender=Masc: Sing = най-новият
Definite=Def|Degree=Sup|Gender=Fem: Sing = най-новата
Definite=Def|Degree=Sup|Gender=Neut: Sing = Най-новото
Definite=Ind|Degree=Pos: Plur = нови
Definite=Ind|Degree=Pos|Gender=Masc: Sing = нов
Definite=Ind|Degree=Pos|Gender=Fem: Sing = нова
Definite=Ind|Degree=Pos|Gender=Neut: Sing = ново
Definite=Ind|Degree=Sup: Plur = най-нови

PROPN

8363 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Definite=Ind (8078; 97%), Gender=Masc (5216; 62%).

PROPN tokens may have the following values of Number:

Paradigm сдс (values: Sing, Plur)
Definite=Def|Gender=Masc: Sing = СДС
Definite=Def|Gender=Neut: Plur = СДС-та
Definite=Ind|Gender=Masc: Sing = СДС

Number seems to be a lexical feature of PROPN: 100% of lemmas (2907) occur with only one value of Number.
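The "lexical feature" test can be pictured as follows. This is a sketch under assumptions rather than the page generator's actual code: the function name and the observation tuples are hypothetical, and the input is taken to be (UPOS, lemma, Number value) triples already extracted from the treebank. Because the reported percentage is rounded, a figure of 100% may still leave room for a few lemmas, such as the сдс paradigm above, that occur with more than one value.

```python
from collections import defaultdict

def single_value_share(observations, upos):
    """Count lemmas of the given UPOS that occur with exactly one Number value.

    observations: iterable of (upos, lemma, number) triples; number is None
    when the token has no Number feature.
    Returns (single-valued lemma count, lemma count, percentage).
    """
    values = defaultdict(set)
    for tok_upos, lemma, number in observations:
        if tok_upos == upos and number is not None:
            values[lemma].add(number)
    if not values:
        return 0, 0, 0.0
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values), 100.0 * single / len(values)

# Hypothetical observations, for illustration only.
OBSERVATIONS = [
    ("PROPN", "сдс", "Sing"),
    ("PROPN", "сдс", "Plur"),
    ("PROPN", "софия", "Sing"),
    ("PROPN", "софия", "Sing"),
    ("PROPN", "българия", "Sing"),
    ("NOUN", "лев", "Count"),
]

single, lemmas, pct = single_value_share(OBSERVATIONS, "PROPN")
print(f"{pct:.0f}% of lemmas ({single} of {lemmas}) occur with only one value of Number")
```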

PRON

5368 PRON tokens (53% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Poss=EMPTY (5368; 100%), Reflex=EMPTY (5368; 100%), PronType=Prs (3398; 63%), Case=Nom (3311; 62%).

PRON tokens may have the following values of Number:

Paradigm аз (values: Sing, Plur)
Case=Acc|Gender=Masc|Person=3: Sing = го, него
Case=Acc|Gender=Fem|Person=3: Sing = я, нея
Case=Acc|Gender=Neut|Person=3: Sing = го, него
Case=Acc|Person=1: Sing = ме, мен, мене; Plur = нас, ни
Case=Acc|Person=2: Sing = те, тебе, ви, вас, теб; Plur = вас, ви
Case=Acc|Person=3: Plur = ги, тях
Case=Dat|Gender=Masc|Person=3: Sing = му, нему
Case=Dat|Gender=Fem|Person=3: Sing = й
Case=Dat|Gender=Neut|Person=3: Sing = му
Case=Dat|Person=1: Sing = ми, мен, мене; Plur = ни
Case=Dat|Person=2: Sing = ти, ви; Plur = ви
Case=Dat|Person=3: Plur = им, тям
Case=Nom|Gender=Masc|Person=3: Sing = той
Case=Nom|Gender=Fem|Person=3: Sing = тя
Case=Nom|Gender=Neut|Person=3: Sing = то
Case=Nom|Person=1: Sing = аз; Plur = ние, ний
Case=Nom|Person=2: Sing = ти, вие; Plur = вие
Case=Nom|Person=3: Plur = те
Gender=Fem|Person=3: Sing = й
Person=1: Sing = ми; Plur = ни
Person=2: Sing = ти, ви; Plur = ви
Person=3: Plur = им

AUX

4382 AUX tokens (50% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: Mood=Ind (4276; 98%), Voice=Act (4276; 98%), VerbForm=Fin (4152; 95%), Aspect=Imp (4029; 92%), Person=3 (3736; 85%), Tense=Pres (3407; 78%).

AUX tokens may have the following values of Number:

Paradigm съм (values: Sing, Plur)
Definite=Ind|Gender=Masc|Mood=Ind|VerbForm=Part|Voice=Act: Sing = бил
Definite=Ind|Gender=Fem|Mood=Ind|VerbForm=Part|Voice=Act: Sing = била
Definite=Ind|Gender=Neut|Mood=Ind|VerbForm=Part|Voice=Act: Sing = било
Definite=Ind|Mood=Ind|VerbForm=Part|Voice=Act: Plur = били
Mood=Cnd|Person=1|VerbForm=Fin: Sing = бих; Plur = бихме
Mood=Cnd|Person=2|VerbForm=Fin: Sing = Би; Plur = бихте
Mood=Cnd|Person=3|Tense=Past|VerbForm=Fin: Sing = би
Mood=Cnd|Person=3|VerbForm=Fin: Plur = биха
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin|Voice=Act: Sing = бях; Plur = бяхме
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act: Sing = съм; Plur = сме
Mood=Ind|Person=2|Tense=Past|VerbForm=Fin|Voice=Act: Sing = беше
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin|Voice=Act: Sing = си; Plur = сте
Mood=Ind|Person=2|VerbForm=Fin|Voice=Act: Plur = бяхте
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Act: Sing = бе, беше; Plur = бяха
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act: Sing = е; Plur = са

DET

2433 DET tokens (100% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Person=EMPTY (2044; 84%), Poss=EMPTY (1921; 79%), Definite=EMPTY (1640; 67%), Case=EMPTY (1534; 63%).

DET tokens may have the following values of Number:

Paradigm този (values: Sing, Plur)
Case=Nom|Gender=Fem: Sing = тази, тая, онази, тeзи
Case=Nom|Gender=Neut: Sing = това, онова, туй
Gender=Masc: Sing = този, тоя, оня, онзи
(no other features): Plur = тези, тия, онези, ония

NUM

2101 NUM tokens (100% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (2101; 100%), Definite=Ind (1956; 93%), Gender=EMPTY (1586; 75%).

NUM tokens may have the following values of Number:

Number seems to be a lexical feature of NUM: 100% of lemmas (412) occur with only one value of Number.

ADV

499 ADV tokens (8% of all ADV tokens) have a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: PronType=EMPTY (499; 100%), Degree=Pos (463; 93%).

ADV tokens may have the following values of Number:

ADP

1 ADP token (0% of all ADP tokens) has a non-empty value of Number.

ADP tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (11159; 97%), NOUN –[nmod]–> NOUN (5931; 61%), VERB –[nsubj]–> NOUN (4386; 94%), VERB –[obj]–> NOUN (2813; 58%), NOUN –[nmod]–> PROPN (2663; 82%), VERB –[obl]–> NOUN (2654; 58%), NOUN –[det]–> DET (1948; 98%), VERB –[nsubj]–> PRON (1924; 98%), NOUN –[conj]–> NOUN (1719; 78%), PROPN –[flat]–> PROPN (1539; 96%).
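Agreement figures like these can be approximated by comparing the Number value of each node with that of its parent. The sketch below is an illustration only: the function name, the token tuples and the sample sentence are hypothetical, and the denominator is an assumption, since the page does not state it. Here the percentage is taken over parent-child pairs in which both nodes carry Number.

```python
from collections import Counter

def agreement_counts(sentences, feature="Number"):
    """Count parent-child pairs that agree in the given feature.

    sentences: list of sentences; each sentence is a list of tuples
    (id, upos, feats_dict, head_id, deprel), with head_id 0 for the root.
    Returns (agree, both): Counters keyed by (parent_upos, deprel, child_upos).
    """
    agree, both = Counter(), Counter()
    for sent in sentences:
        by_id = {tok[0]: tok for tok in sent}
        for _tid, upos, feats, head, deprel in sent:
            parent = by_id.get(head)
            if parent is None:
                continue  # root node, or head outside the sentence
            child_val = feats.get(feature)
            parent_val = parent[2].get(feature)
            if child_val is None or parent_val is None:
                continue  # assumption: only pairs where both have the feature
            key = (parent[1], deprel, upos)
            both[key] += 1
            if child_val == parent_val:
                agree[key] += 1
    return agree, both

# Hypothetical sentence, for illustration only.
SENTENCES = [[
    (1, "ADJ", {"Number": "Plur"}, 2, "amod"),
    (2, "NOUN", {"Number": "Plur"}, 3, "nsubj"),
    (3, "VERB", {"Number": "Plur"}, 0, "root"),
    (4, "NOUN", {"Number": "Sing"}, 3, "obj"),
]]

agree, both = agreement_counts(SENTENCES)
for key, n in both.most_common():
    parent_upos, deprel, child_upos = key
    pct = 100 * agree[key] / n
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {pct:.0f}%)")
```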