Treebank Statistics: UD_Bulgarian-BTB: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
61280 tokens (39%) have a non-empty value of Definite.
22259 types (84%) occur at least once with a non-empty value of Definite.
12399 lemmas (83%) occur at least once with a non-empty value of Definite.
The feature is used with 10 part-of-speech tags: NOUN (33023; 21% instances), ADJ (13480; 9% instances), PROPN (8357; 5% instances), VERB (2664; 2% instances), NUM (2102; 1% instances), DET (792; 1% instances), ADV (499; 0% instances), AUX (246; 0% instances), PRON (116; 0% instances), ADP (1; 0% instances).
NOUN
33023 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (23906; 72%).
NOUN tokens may have the following values of Definite:
Def(12257; 37% of non-emptyDefinite): хората, президентът, страната, края, държавата, правителството, закона, отбраната, шефът, началотоInd(20766; 63% of non-emptyDefinite): г., време, година, години, част, събрание, хора, път, страна, съветEMPTY(1129): %, лв., млн., $, месеца, дни, лева, млрд., пъти, долара
| Paradigm година | Ind | Def |
|---|---|---|
| Number=Sing | г., година | годината |
| Number=Plur | г., години, г | годините |
ADJ
13480 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (12769; 95%), Aspect=EMPTY (11994; 89%), VerbForm=EMPTY (11994; 89%), Voice=EMPTY (11994; 89%), Number=Sing (9533; 71%).
ADJ tokens may have the following values of Definite:
Def(6038; 45% of non-emptyDefinite): народното, българската, другите, европейската, последните, цялата, новия, миналата, националната, новатаInd(7442; 55% of non-emptyDefinite): други, нова, 2001, друг, нови, 2000, голяма, български, различни, 1EMPTY(111): т.нар., US, Нови, уважаеми, жп, държавен, народна, политически, важен, военноморско
| Paradigm нов | Ind | Def |
|---|---|---|
| Degree=Pos|Gender=Masc|Number=Sing | нов | новия, новият |
| Degree=Pos|Gender=Fem|Number=Sing | нова | новата |
| Degree=Pos|Gender=Neut|Number=Sing | ново | новото |
| Degree=Pos|Number=Plur | нови | новите |
| Degree=Sup|Gender=Masc|Number=Sing | най-новият | |
| Degree=Sup|Gender=Fem|Number=Sing | най-новата | |
| Degree=Sup|Gender=Neut|Number=Sing | Най-новото | |
| Degree=Sup|Number=Plur | най-нови |
PROPN
8357 PROPN tokens (99% of all PROPN tokens) have a non-empty value of Definite.
The most frequent other feature values with which PROPN and Definite co-occurred: Number=Sing (8207; 98%), Gender=Masc (5211; 62%).
PROPN tokens may have the following values of Definite:
Def(279; 3% of non-emptyDefinite): Балканите, ЕС, СДС, Конституцията, НИС, Дяволът, Евросъюза, ЦСКА, Евролевицата, САЩInd(8078; 97% of non-emptyDefinite): България, София, Иван, Европа, Петър, ЕС, Стоянов, СДС, Костов, ГеоргиEMPTY(78): де, Р-300, ван, -, 2000, ал, ди, дела, дьо, 173
| Paradigm европа | Ind | Def |
|---|---|---|
| Европа | Европата |
Definite seems to be lexical feature of PROPN. 98% lemmas (2863) occur only with one value of Definite.
VERB
2664 VERB tokens (16% of all VERB tokens) have a non-empty value of Definite.
The most frequent other feature values with which VERB and Definite co-occurred: Mood=EMPTY (2664; 100%), Person=EMPTY (2664; 100%), VerbForm=Part (2664; 100%), Aspect=Perf (2037; 76%), Number=Sing (1822; 68%), Voice=Act (1579; 59%), Tense=Past (1350; 51%).
VERB tokens may have the following values of Definite:
Def(5; 0% of non-emptyDefinite): останалите, Останалото, Уволнените, отличенитеInd(2659; 100% of non-emptyDefinite): имало, направил, дал, трябвало, заминал, направено, искал, казал, направили, дошълEMPTY(14164): има, няма, може, трябва, каза, могат, съобщи, заяви, стана, имат
| Paradigm остана | Ind | Def |
|---|---|---|
| Degree=Pos|Number=Plur | останалите | |
| Gender=Masc|Number=Sing | останал | |
| Gender=Fem|Number=Sing | останала | |
| Gender=Neut|Number=Sing | Останалото | |
| Number=Plur | останали | ОСТАНАЛИТЕ |
Definite seems to be lexical feature of VERB. 100% lemmas (994) occur only with one value of Definite.
NUM
2102 NUM tokens (100% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: NumType=Card (2102; 100%), Number=Plur (1863; 89%), Gender=EMPTY (1587; 75%).
NUM tokens may have the following values of Definite:
Def(145; 7% of non-emptyDefinite): двамата, двете, двата, тримата, трите, четиримата, Единият, четирите, десетте, еднотоInd(1957; 93% of non-emptyDefinite): две, един, два, 2, една, 1, 3, три, 10, едноEMPTY(3): 02, 08, 2000
| Paradigm два | Ind | Def |
|---|---|---|
| Gender=Masc | два, 2 | двата |
| Gender=Fem | две, 2 | двете |
| Gender=Neut | две | двете |
| 2, две | двете |
Definite seems to be lexical feature of NUM. 97% lemmas (403) occur only with one value of Definite.
DET
792 DET tokens (33% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=EMPTY (615; 78%), Number=Sing (599; 76%), Poss=Yes (512; 65%), PronType=Prs (512; 65%), Person=EMPTY (403; 51%).
DET tokens may have the following values of Definite:
Def(393; 50% of non-emptyDefinite): нашите, своите, нашата, своя, техните, неговата, своята, своето, тяхното, нейнитеInd(399; 50% of non-emptyDefinite): един, една, едно, наши, негово, своя, нищо, нещо, свой, своиEMPTY(1640): тази, този, тези, това, всички, какво, всеки, всяка, някои, какви
| Paradigm един | Ind | Def |
|---|---|---|
| Case=Nom|Gender=Masc|Number=Sing | един | единият |
| Case=Nom|Gender=Fem|Number=Sing | едната | |
| Case=Nom|Gender=Neut|Number=Sing | едно | |
| Case=Nom|Number=Plur | едните | |
| Gender=Masc|Number=Sing | единия | |
| Gender=Fem|Number=Sing | една, един | |
| Gender=Neut|Number=Sing | едното | |
| Number=Plur | едни |
ADV
499 ADV tokens (8% of all ADV tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADV and Definite co-occurred: PronType=EMPTY (499; 100%), Degree=Pos (463; 93%).
ADV tokens may have the following values of Definite:
Def(47; 9% of non-emptyDefinite): повечето, малкото, Най-малкото, многото, по-малкотоInd(452; 91% of non-emptyDefinite): много, повече, малко, по-малко, най-много, най-малко, Многая, немалко, ПовечкоEMPTY(6059): още, вчера, само, вече, когато, защото, обаче, сега, как, така
| Paradigm много | Ind | Def |
|---|---|---|
| Degree=Pos | много, Многая | многото |
| Degree=Sup | най-много |
AUX
246 AUX tokens (3% of all AUX tokens) have a non-empty value of Definite.
The most frequent other feature values with which AUX and Definite co-occurred: Aspect=Imp (246; 100%), Mood=Ind (246; 100%), Person=EMPTY (246; 100%), Tense=EMPTY (246; 100%), VerbForm=Part (246; 100%), Voice=Act (246; 100%), Number=Sing (179; 73%).
AUX tokens may have the following values of Definite:
Ind(246; 100% of non-emptyDefinite): бил, били, била, билоEMPTY(8888): да, е, ще, са, бе, бъде, беше, бяха, съм, бъдат
PRON
116 PRON tokens (1% of all PRON tokens) have a non-empty value of Definite.
The most frequent other feature values with which PRON and Definite co-occurred: Person=EMPTY (116; 100%), Poss=EMPTY (116; 100%), Reflex=EMPTY (116; 100%), Case=Nom (107; 92%), Number=Sing (87; 75%), Gender=Neut (83; 72%), PronType=Ind (72; 62%).
PRON tokens may have the following values of Definite:
Def(27; 23% of non-emptyDefinite): нещата, всичкото, Единият, Едната, нищотоInd(89; 77% of non-emptyDefinite): нищо, нещо, неколцина, няколко, един, едноEMPTY(9979): се, си, това, той, му, които, го, ни, те, който
| Paradigm никой | Ind | Def |
|---|---|---|
| нищо | нищото |
ADP
1 ADP tokens (0% of all ADP tokens) have a non-empty value of Definite.
ADP tokens may have the following values of Definite:
Ind(1; 100% of non-emptyDefinite): сравнениеEMPTY(22095): на, в, за, от, с, по, до, след, като, през
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[amod]–> ADJ (5775; 50%),
NOUN –[nmod]–> NOUN (4929; 51%),
NOUN –[conj]–> NOUN (1822; 83%),
NOUN –[nmod]–> PROPN (1744; 54%),
PROPN –[flat]–> PROPN (1538; 96%),
PROPN –[conj]–> PROPN (576; 99%),
ADJ –[obl]–> NOUN (377; 54%),
PROPN –[nmod]–> PROPN (333; 97%),
ADJ –[conj]–> ADJ (330; 91%),
PROPN –[nmod]–> NOUN (301; 89%).