VerbForm
: form of verb or deverbative
Even though the name of the feature seems to suggest that it is used
exclusively with verbs, it is not the case. Some verb
forms in some languages actually form a gray zone between verbs and
other parts of speech (nouns, adjectives
and adverbs). For instance, participles may be either
classified as verbs or as adjectives, depending on language and
context. In both cases VerbForm=Part
may be used to separate them
from other verb forms or other types of adjectives.
Bulgarian does not have an infinitive. It distinguishes: finite verbs and non-finite verbs (participles and transgressives).
Fin
: finite verb
Rule of thumb: if it has non-empty Mood, it is finite.
This features is encoded in the following values as second position in verbal tags: Vp#
(personal verb); Vn#
(impersonal verb); Vx#
, Vy#
and Vi#
(auxiliary verbs).
Examples
- Аз съм, ти си / Az sam, ti si “I am, you are”
- Трябва да дойдеш /Tryabva da doydesh “You must come”
- Прочетох книгата / Prochetoh knigata “I read the book”
Part
: participle
Participle is a non-finite verb form that shares properties of verbs
and adjectives. The participle in Bulgarian is encoded as c
in fifth position of the tag: V#c#
.
In Bulgarian there are four types of participles: present active, past perfective active, past imperfective active, past passive. The present active one can be used only adjectively; the past imperfective one can be used only in evidential verb forms; the other have the two usages. The present active can be derived only from imperfective verbs.
Examples
- виждащ / vizhdasht “seeing” (present active). BulTreeBank tag:
V#car#
- видял / vidyal “seen” (past perfective active). BulTreeBank tag:
V#cao#
- видел / videl “seen” (past imperfective active). BulTreeBank tag:
V#cam#
- видян / vidyan “seen” (past passive). BulTreeBank tag:
V#cv#
Trans
: transgressive
The transgressive, also called adverbial participle, is a non-finite verb form that shares properties of verbs and adverbs. It appears e.g. in Slavic and Indo-Aryan languages.
In Bulgarian it can be derived only from imperfective verbs.
Examples
- Виждайки това, той се разстрои / Vizhdayki tova, toy se razstroi “Having seen this, he became upset”. BulTreebang tag:
V#g
Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.
Treebank Statistics (UD_Bulgarian)
This feature is universal.
It occurs with 2 different values: Fin
, Part
.
22856 tokens (15%) have a non-empty value of VerbForm
.
7944 types (30%) occur at least once with a non-empty value of VerbForm
.
2968 lemmas (20%) occur at least once with a non-empty value of VerbForm
.
The feature is used with 3 part-of-speech tags: bg-pos/VERB (19376; 12% instances), bg-pos/AUX (2008; 1% instances), bg-pos/ADJ (1472; 1% instances).
VERB
19376 bg-pos/VERB tokens (99% of all VERB
tokens) have a non-empty value of VerbForm
.
The most frequent other feature values with which VERB
and VerbForm
co-occurred: Voice=Act (18025; 93%), Definite=EMPTY (16611; 86%), Mood=Ind (16458; 85%), Person=3 (13729; 71%), Number=Sing (13660; 70%), Tense=Pres (11733; 61%), Aspect=Imp (11178; 58%).
VERB
tokens may have the following values of VerbForm
:
Fin
(16611; 86% of non-emptyVerbForm
): е, са, има, няма, може, трябва, беше, каза, могат, съобщиPart
(2765; 14% of non-emptyVerbForm
): била, бил, имало, били, било, направил, дал, трябвало, заминал, направеноEMPTY
(176): е, каза, няма, откри, поздрави, посочи, беше, заяви, са, благодари
Paradigm съм | Fin | Part |
---|---|---|
Definite=Ind|Gender=Masc|Mood=Ind|Number=Sing|Voice=Act | бил | |
Definite=Ind|Gender=Fem|Mood=Ind|Number=Sing|Voice=Act | била | |
Definite=Ind|Gender=Neut|Mood=Ind|Number=Sing|Voice=Act | било | |
Definite=Ind|Mood=Ind|Number=Plur|Voice=Act | били | |
Mood=Cnd|Number=Sing|Person=3|Tense=Past | би | |
Mood=Ind|Number=Sing|Person=1|Tense=Pres|Voice=Act | съм | |
Mood=Ind|Number=Sing|Person=1|Voice=Act | бях | |
Mood=Ind|Number=Sing|Person=2|Tense=Pres|Voice=Act | си | |
Mood=Ind|Number=Sing|Person=3|Tense=Pres|Voice=Act | е | |
Mood=Ind|Number=Sing|Person=3|Voice=Act | беше, бе | |
Mood=Ind|Number=Plur|Person=1|Tense=Pres|Voice=Act | сме | |
Mood=Ind|Number=Plur|Person=2|Tense=Pres|Voice=Act | сте | |
Mood=Ind|Number=Plur|Person=2|Voice=Act | бяхте | |
Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act | са | |
Mood=Ind|Number=Plur|Person=3|Voice=Act | бяха |
AUX
2008 bg-pos/AUX tokens (99% of all AUX
tokens) have a non-empty value of VerbForm
.
The most frequent other feature values with which AUX
and VerbForm
co-occurred: Voice=Act (1906; 95%), Mood=Ind (1787; 89%), Aspect=Imp (1760; 88%), Person=3 (1667; 83%), Tense=Pres (1384; 69%), Number=Sing (1363; 68%).
AUX
tokens may have the following values of VerbForm
:
Fin
(1889; 94% of non-emptyVerbForm
): е, са, бе, бъде, бяха, бъдат, беше, съм, би, смеPart
(119; 6% of non-emptyVerbForm
): бил, били, била, билоEMPTY
(23): е, са, би, бъдат, съм, беше, бъде, сме
Paradigm съм | Fin | Part |
---|---|---|
Definite=Ind|Gender=Masc|Number=Sing|Tense=Past|Voice=Act | бил | |
Definite=Ind|Gender=Fem|Number=Sing|Tense=Past|Voice=Act | била | |
Definite=Ind|Gender=Neut|Number=Sing|Tense=Past|Voice=Act | било | |
Definite=Ind|Number=Plur|Tense=Past|Voice=Act | били | |
Mood=Cnd|Number=Sing|Person=1 | бих | |
Mood=Cnd|Number=Sing|Person=2 | Би | |
Mood=Cnd|Number=Sing|Person=3 | би | |
Mood=Cnd|Number=Plur|Person=1 | бихме | |
Mood=Cnd|Number=Plur|Person=2 | бихте | |
Mood=Cnd|Number=Plur|Person=3 | биха | |
Mood=Ind|Number=Sing|Person=1|Tense=Past|Voice=Act | бях | |
Mood=Ind|Number=Sing|Person=1|Tense=Pres|Voice=Act | съм | |
Mood=Ind|Number=Sing|Person=2|Tense=Past|Voice=Act | беше | |
Mood=Ind|Number=Sing|Person=2|Tense=Pres|Voice=Act | си | |
Mood=Ind|Number=Sing|Person=3|Tense=Past|Voice=Act | бе, беше | |
Mood=Ind|Number=Sing|Person=3|Tense=Pres|Voice=Act | е | |
Mood=Ind|Number=Plur|Person=1|Tense=Past|Voice=Act | бяхме | |
Mood=Ind|Number=Plur|Person=1|Tense=Pres|Voice=Act | сме | |
Mood=Ind|Number=Plur|Person=2|Tense=Pres|Voice=Act | сте | |
Mood=Ind|Number=Plur|Person=3|Tense=Past|Voice=Act | бяха | |
Mood=Ind|Number=Plur|Person=3|Tense=Pres|Voice=Act | са |
ADJ
1472 bg-pos/ADJ tokens (11% of all ADJ
tokens) have a non-empty value of VerbForm
.
The most frequent other feature values with which ADJ
and VerbForm
co-occurred: Degree=EMPTY (1113; 76%), Aspect=Perf (1047; 71%), Voice=Pass (953; 65%), Number=Sing (839; 57%), Definite=Ind (808; 55%).
ADJ
tokens may have the following values of VerbForm
:
Part
(1472; 100% of non-emptyVerbForm
): миналата, следващата, въоръжените, останалите, свързани, миналия, управляващите, определени, цитиран, определенEMPTY
(12117): други, народното, българската, нова, другите, нови, европейската, последните, 2001, друг
VerbForm
seems to be lexical feature of ADJ
. 100% lemmas (672) occur only with one value of VerbForm
.
Relations with Agreement in VerbForm
The 10 most frequent relations where parent and child node agree in VerbForm
:
VERB –[ccomp]–> VERB (1745; 77%),
VERB –[conj]–> VERB (1526; 88%),
VERB –[advcl]–> VERB (1085; 84%),
VERB –[xcomp]–> VERB (396; 87%),
VERB –[csubj]–> VERB (181; 84%),
VERB –[dobj]–> VERB (167; 83%),
VERB –[nmod]–> VERB (67; 83%),
VERB –[csubjpass]–> VERB (36; 60%),
VERB –[mwe]–> VERB (28; 100%),
VERB –[nsubj]–> VERB (3; 100%).
VerbForm in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]