VERB
: verb
Definition
A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause. Verbs are often associated with grammatical categories like tense, mood, aspect and voice, which can either be expressed inflectionally or using auxilliary verbs or particles.
The BulTreeBank annotation scheme provides the following mappings here: main verbs, copulas and modal verbs.
Note that modal verbs do not have special labels in our annotation scheme.
Participles and gerund are considered also VERB
. Below the specific labels that map to VERB
are given.
Examples
- Vp# (finite verb): тичам / ticham “run”
- Vn# (impersonal verb): вали, трябва / vali, tryabva “It rains, must”
- Vx# (the copula to be): съм / sam “to be”
- Vy# (the copula to be): бъда / bada “to be”
- Vi# (the copula to be): бивам / bivam “to be”
- V#cv# (past passive participle): намерен / nameren “found”. It is also mapped to ADJ in its attributive usages.
- V#cam# (past imperfective participle): четял / chetyal “He was reading”
- V#cao# (past perfective participle): дошъл / doshal “He has come”. It is also mapped to ADJ in its attributive usages.
- V#g (gerund): Идвайки / idvayki “Coming”
Note that the present active participle V#car# is mapped only to ADJ.
Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.
Treebank Statistics (UD_Bulgarian)
There are 2780 VERB
lemmas (18%), 6570 VERB
types (24%) and 19552 VERB
tokens (13%).
Out of 16 observed tags, the rank of VERB
is: 4 in number of lemmas, 2 in number of types and 4 in number of tokens.
The 10 most frequent VERB
lemmas: съм, мога, имам, нямам, кажа, трябва, има, искам, съобщя, стана
The 10 most frequent VERB
types: е, са, има, няма, може, трябва, беше, каза, могат, съобщи
The 10 most frequent ambiguous lemmas: съм (VERB 2583, AUX 1778), мога (VERB 396, ADJ 1), имам (VERB 363, ADJ 1), кажа (VERB 237, ADJ 4), искам (VERB 178, ADJ 3), стана (VERB 144, ADJ 3), бъда (AUX 251, VERB 139), направя-(се) (VERB 129, ADJ 7), дам-(се) (VERB 101, ADJ 4), видя-(се) (VERB 95, ADJ 1)
The 10 most frequent ambiguous types: е (VERB 1521, AUX 655), са (VERB 468, AUX 339), беше (VERB 115, AUX 82), бъде (AUX 148, VERB 89), бе (AUX 199, VERB 67, PART 2), съм (AUX 66, VERB 59), бяха (AUX 96, VERB 56), иска (VERB 46, NOUN 1), сме (VERB 50, AUX 43), била (VERB 40, AUX 19)
- е
- са
- беше
- бъде
- бе
- съм
- бяха
- иска
- VERB 46: Той иска съвет , към кого да се обърне .
- NOUN 1: В петък следобед в съда в канадския град Ванкувър и американския Саут Бент са внесени два граждански иска срещу България в лицето на Министерство на финансите , бившите Главна прокуратура и Национална следствена служба и Националния център по заразни и паразитни болести .
- сме
- била
Morphology
The form / lemma ratio of VERB
is 2.363309 (the average of all parts of speech is 1.728233).
The 1st highest number of forms (21) was observed with the lemma “мога”: Можехме, мога, могат, могла, могли, могло, могъл, можа, можах, можаха, може, можел, можела, можели, можело, можем, можете, можех, можеха, можеш, можеше.
The 2nd highest number of forms (17) was observed with the lemma “взема”: взе, взел, взела, взели, взело, взема, вземат, вземе, вземем, вземете, вземеш, вземи, взета, взети, взето, взех, взеха.
The 3rd highest number of forms (17) was observed with the lemma “намеря-(се)”: Намерете, намерен, намерена, намерени, намерено, намери, намерил, намерила, намерили, намерим, намерите, намерих, намериха, намерихме, намериш, намеря, намерят.
VERB
occurs with 10 features: bg-feat/Aspect (19376; 99% instances), bg-feat/Number (19376; 99% instances), bg-feat/VerbForm (19376; 99% instances), bg-feat/Voice (19096; 98% instances), bg-feat/Tense (17666; 90% instances), bg-feat/Mood (16738; 86% instances), bg-feat/Person (16611; 85% instances), bg-feat/Definite (2765; 14% instances), bg-feat/Gender (1909; 10% instances), bg-feat/Degree (3; 0% instances)
VERB
occurs with 24 feature-value pairs: Aspect=Imp
, Aspect=Perf
, Definite=Def
, Definite=Ind
, Degree=Cmp
, Degree=Pos
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Tense=Imp
, Tense=Past
, Tense=Pres
, VerbForm=Fin
, VerbForm=Part
, Voice=Act
, Voice=Pass
VERB
occurs with 71 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act
(5328 tokens).
Examples: е, има, няма, може, трябва, става, иска, дава, работи, разбира
Relations
VERB
nodes are attached to their parents using 20 different relations: bg-dep/root (9136; 47% instances), bg-dep/ccomp (2339; 12% instances), bg-dep/cop (1997; 10% instances), bg-dep/conj (1906; 10% instances), bg-dep/acl (1482; 8% instances), bg-dep/advcl (1430; 7% instances), bg-dep/xcomp (458; 2% instances), bg-dep/csubj (385; 2% instances), bg-dep/dobj (208; 1% instances), bg-dep/nmod (91; 0% instances), bg-dep/csubjpass (60; 0% instances), bg-dep/mwe (39; 0% instances), bg-dep/auxpass (6; 0% instances), bg-dep/det (6; 0% instances), bg-dep/nsubj (3; 0% instances), bg-dep/parataxis (2; 0% instances), bg-dep/aux (1; 0% instances), bg-dep/discourse (1; 0% instances), bg-dep/iobj (1; 0% instances), bg-dep/nsubjpass (1; 0% instances)
Parents of VERB
nodes belong to 12 different parts of speech: ROOT (9136; 47% instances), VERB (6399; 33% instances), NOUN (2509; 13% instances), ADJ (733; 4% instances), ADV (411; 2% instances), DET (124; 1% instances), PROPN (89; 0% instances), PRON (85; 0% instances), PART (49; 0% instances), NUM (12; 0% instances), CONJ (3; 0% instances), ADP (2; 0% instances)
2145 (11%) VERB
nodes are leaves.
750 (4%) VERB
nodes have one child.
2646 (14%) VERB
nodes have two children.
14011 (72%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 13.
Children of VERB
nodes are attached using 27 different relations: bg-dep/punct (13611; 21% instances), bg-dep/nsubj (8668; 13% instances), bg-dep/dobj (7428; 11% instances), bg-dep/aux (5573; 9% instances), bg-dep/nmod (5514; 9% instances), bg-dep/advmod (3634; 6% instances), bg-dep/iobj (3496; 5% instances), bg-dep/expl (3418; 5% instances), bg-dep/ccomp (2549; 4% instances), bg-dep/conj (1827; 3% instances), bg-dep/cc (1685; 3% instances), bg-dep/mark (1616; 2% instances), bg-dep/advcl (1402; 2% instances), bg-dep/nsubjpass (1238; 2% instances), bg-dep/neg (1176; 2% instances), bg-dep/discourse (497; 1% instances), bg-dep/xcomp (497; 1% instances), bg-dep/auxpass (451; 1% instances), bg-dep/csubj (234; 0% instances), bg-dep/case (124; 0% instances), bg-dep/csubjpass (76; 0% instances), bg-dep/mwe (29; 0% instances), bg-dep/vocative (27; 0% instances), bg-dep/amod (9; 0% instances), bg-dep/cop (7; 0% instances), bg-dep/acl (1; 0% instances), bg-dep/appos (1; 0% instances)
Children of VERB
nodes belong to 16 different parts of speech: NOUN (18355; 28% instances), PUNCT (13782; 21% instances), PRON (7735; 12% instances), VERB (6399; 10% instances), PART (4478; 7% instances), ADV (4071; 6% instances), PROPN (2109; 3% instances), AUX (1980; 3% instances), CONJ (1680; 3% instances), SCONJ (1300; 2% instances), INTJ (1235; 2% instances), ADJ (738; 1% instances), ADP (426; 1% instances), DET (344; 1% instances), NUM (155; 0% instances), X (1; 0% instances)
VERB in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]