home bg/pos edit page issue tracker

This page still pertains to UD version 1.

VERB: verb

Definition

A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause. Verbs are often associated with grammatical categories like tense, mood, aspect and voice, which can either be expressed inflectionally or using auxilliary verbs or particles.

The BulTreeBank annotation scheme provides the following mappings here: main verbs, copulas and modal verbs. Note that modal verbs do not have special labels in our annotation scheme. Participles and gerund are considered also VERB. Below the specific labels that map to VERB are given.

Examples

Note that the present active participle V#car# is mapped only to ADJ.

Note that the symbol `#’, used in the Universal POS section indicates a holder for arbitrary number of features, suppressed in the respective tag as irrelevant in the BulTreeBank tagset, when mapped to the Universal one.


Treebank Statistics (UD_Bulgarian)

There are 2780 VERB lemmas (18%), 6570 VERB types (24%) and 19552 VERB tokens (13%). Out of 16 observed tags, the rank of VERB is: 4 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: съм, мога, имам, нямам, кажа, трябва, има, искам, съобщя, стана

The 10 most frequent VERB types: е, са, има, няма, може, трябва, беше, каза, могат, съобщи

The 10 most frequent ambiguous lemmas: съм (VERB 2583, AUX 1778), мога (VERB 396, ADJ 1), имам (VERB 363, ADJ 1), кажа (VERB 237, ADJ 4), искам (VERB 178, ADJ 3), стана (VERB 144, ADJ 3), бъда (AUX 251, VERB 139), направя-(се) (VERB 129, ADJ 7), дам-(се) (VERB 101, ADJ 4), видя-(се) (VERB 95, ADJ 1)

The 10 most frequent ambiguous types: е (VERB 1521, AUX 655), са (VERB 468, AUX 339), беше (VERB 115, AUX 82), бъде (AUX 148, VERB 89), бе (AUX 199, VERB 67, PART 2), съм (AUX 66, VERB 59), бяха (AUX 96, VERB 56), иска (VERB 46, NOUN 1), сме (VERB 50, AUX 43), била (VERB 40, AUX 19)

Morphology

The form / lemma ratio of VERB is 2.363309 (the average of all parts of speech is 1.728233).

The 1st highest number of forms (21) was observed with the lemma “мога”: Можехме, мога, могат, могла, могли, могло, могъл, можа, можах, можаха, може, можел, можела, можели, можело, можем, можете, можех, можеха, можеш, можеше.

The 2nd highest number of forms (17) was observed with the lemma “взема”: взе, взел, взела, взели, взело, взема, вземат, вземе, вземем, вземете, вземеш, вземи, взета, взети, взето, взех, взеха.

The 3rd highest number of forms (17) was observed with the lemma “намеря-(се)”: Намерете, намерен, намерена, намерени, намерено, намери, намерил, намерила, намерили, намерим, намерите, намерих, намериха, намерихме, намериш, намеря, намерят.

VERB occurs with 10 features: bg-feat/Aspect (19376; 99% instances), bg-feat/Number (19376; 99% instances), bg-feat/VerbForm (19376; 99% instances), bg-feat/Voice (19096; 98% instances), bg-feat/Tense (17666; 90% instances), bg-feat/Mood (16738; 86% instances), bg-feat/Person (16611; 85% instances), bg-feat/Definite (2765; 14% instances), bg-feat/Gender (1909; 10% instances), bg-feat/Degree (3; 0% instances)

VERB occurs with 24 feature-value pairs: Aspect=Imp, Aspect=Perf, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Part, Voice=Act, Voice=Pass

VERB occurs with 71 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (5328 tokens). Examples: е, има, няма, може, трябва, става, иска, дава, работи, разбира

Relations

VERB nodes are attached to their parents using 20 different relations: bg-dep/root (9136; 47% instances), bg-dep/ccomp (2339; 12% instances), bg-dep/cop (1997; 10% instances), bg-dep/conj (1906; 10% instances), bg-dep/acl (1482; 8% instances), bg-dep/advcl (1430; 7% instances), bg-dep/xcomp (458; 2% instances), bg-dep/csubj (385; 2% instances), bg-dep/dobj (208; 1% instances), bg-dep/nmod (91; 0% instances), bg-dep/csubjpass (60; 0% instances), bg-dep/mwe (39; 0% instances), bg-dep/auxpass (6; 0% instances), bg-dep/det (6; 0% instances), bg-dep/nsubj (3; 0% instances), bg-dep/parataxis (2; 0% instances), bg-dep/aux (1; 0% instances), bg-dep/discourse (1; 0% instances), bg-dep/iobj (1; 0% instances), bg-dep/nsubjpass (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: ROOT (9136; 47% instances), VERB (6399; 33% instances), NOUN (2509; 13% instances), ADJ (733; 4% instances), ADV (411; 2% instances), DET (124; 1% instances), PROPN (89; 0% instances), PRON (85; 0% instances), PART (49; 0% instances), NUM (12; 0% instances), CONJ (3; 0% instances), ADP (2; 0% instances)

2145 (11%) VERB nodes are leaves.

750 (4%) VERB nodes have one child.

2646 (14%) VERB nodes have two children.

14011 (72%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.

Children of VERB nodes are attached using 27 different relations: bg-dep/punct (13611; 21% instances), bg-dep/nsubj (8668; 13% instances), bg-dep/dobj (7428; 11% instances), bg-dep/aux (5573; 9% instances), bg-dep/nmod (5514; 9% instances), bg-dep/advmod (3634; 6% instances), bg-dep/iobj (3496; 5% instances), bg-dep/expl (3418; 5% instances), bg-dep/ccomp (2549; 4% instances), bg-dep/conj (1827; 3% instances), bg-dep/cc (1685; 3% instances), bg-dep/mark (1616; 2% instances), bg-dep/advcl (1402; 2% instances), bg-dep/nsubjpass (1238; 2% instances), bg-dep/neg (1176; 2% instances), bg-dep/discourse (497; 1% instances), bg-dep/xcomp (497; 1% instances), bg-dep/auxpass (451; 1% instances), bg-dep/csubj (234; 0% instances), bg-dep/case (124; 0% instances), bg-dep/csubjpass (76; 0% instances), bg-dep/mwe (29; 0% instances), bg-dep/vocative (27; 0% instances), bg-dep/amod (9; 0% instances), bg-dep/cop (7; 0% instances), bg-dep/acl (1; 0% instances), bg-dep/appos (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (18355; 28% instances), PUNCT (13782; 21% instances), PRON (7735; 12% instances), VERB (6399; 10% instances), PART (4478; 7% instances), ADV (4071; 6% instances), PROPN (2109; 3% instances), AUX (1980; 3% instances), CONJ (1680; 3% instances), SCONJ (1300; 2% instances), INTJ (1235; 2% instances), ADJ (738; 1% instances), ADP (426; 1% instances), DET (344; 1% instances), NUM (155; 0% instances), X (1; 0% instances)


VERB in other languages: [am] [ar] [bg] [bxr] [ca] [ckb] [cop] [cs] [cu] [da] [de] [el] [en] [es] [et] [eu] [fa] [fi] [fo] [fr] [ga] [gl] [got] [grc] [he] [hi] [hr] [hu] [id] [it] [ja] [kk] [kmr] [ko] [la] [lv] [mr] [nl] [no] [pl] [pt] [ro] [ru] [sa] [sk] [sla] [sl] [so] [sr] [sv] [swl] [ta] [tr] [ug] [uk] [u] [urj] [ur] [vi] [yue] [zh]