This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home cs/pos issue tracker

VERB: verb

Definition

A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause.

Note that the VERB tag covers main verbs (content verbs), modal verbs and copulas but it does not cover auxiliary verbs, for which there is the AUX tag. (Czech modal verbs are not considered auxiliary.) See the description of AUX for more information on the borderline between VERB and AUX.

Czech verbs can take the following morphological forms:

There are participial forms that are tagged as adjectives (ADJ) rather than verbs. See below for examples.

A verbal noun can be derived productively from almost every verb (e.g. dělat  “to do” → dělání  “doing”). While in other languages a corresponding form may be called gerund and tagged VERB, in Czech it is tagged NOUN. It has always the neuter cs-feat/Gender and it inflects for cs-feat/Number and cs-feat/Case.

Examples

Border cases

There are passive participles as verb forms (VERB) and participial adjectives (ADJ). For example:

Their meaning is almost identical but the usage slightly varies. Both groups can be used in nominal predication with copula. Only true participles (verbs) can be used to form the passive voice (but it may be sometimes difficult to distinguish from copula constructions, see AUX). On the other hand, the participial adjectives inflect for case and thus can modify nouns.

There is an analogy with some adjectives that preserved so called nominal (short) forms. And these adjectives are not derived from verbs. Example:

Here both groups are ADJ. The nominal forms are used in predication, the standard forms both in predication and to modify nouns.

References


Treebank Statistics (UD_Czech)

There are 5926 VERB lemmas (10%), 24444 VERB types (19%) and 165634 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: být, mít, moci, muset, říci, stát, chtít, jít, lze, dát

The 10 most frequent VERB types: je, jsou, má, není, byl, být, může, bylo, řekl, měl

The 10 most frequent ambiguous lemmas: být (VERB 25647, AUX 20737), stát (VERB 1542, NOUN 1446), bývat (VERB 154, AUX 58), růst (NOUN 353, VERB 149), vzrůst (VERB 139, NOUN 13), jet (VERB 129, PROPN 6, NOUN 3), hledět (VERB 39, ADP 1), škodit (VERB 18, NOUN 1), rozlišit (VERB 13, NOUN 1), drát (NOUN 25, VERB 4)

The 10 most frequent ambiguous types: je (VERB 11424, PRON 887, AUX 713), jsou (VERB 2884, AUX 371), (VERB 2171, DET 15, PRON 1), není (VERB 1489, AUX 57), byl (VERB 1246, AUX 913), být (VERB 1317, AUX 745), bylo (VERB 1045, AUX 611), bude (AUX 1843, VERB 864), byla (VERB 765, AUX 733), byly (AUX 462, VERB 373)

Morphology

The form / lemma ratio of VERB is 4.124873 (the average of all parts of speech is 2.195930).

The 1st highest number of forms (52) was observed with the lemma “být”: Buďme, Nebuďte, bolo, bude, budeme, budete, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bysme, být, býti, j, je, jest, jsa, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, nésó, si.

The 2nd highest number of forms (36) was observed with the lemma “stát”: nestal, nestala, nestali, nestalo, nestaly, nestane, nestanou, nestojí, nestojíme, nestojíte, nestál, nestála, nestáli, nestálo, nestály, stal, stala, stali, stalo, staly, stane, stanete, stanou, stanu, stoje, stojí, stojím, stojíme, stál, stála, stáli, stálo, stály, stát, státi, stůj.

The 3rd highest number of forms (34) was observed with the lemma “dát”: Dej, Nedejte, Nedám, dají, dal, dala, dali, dalo, daly, dejme, dejte, dá, dám, dáme, dán, dána, dáno, dánu, dány, dát, dáte, dáti, nedají, nedal, nedala, nedali, nedalo, nedaly, nedat, nedej, nedejme, nedá, nedáme, nedáš.

VERB occurs with 15 features: cs-feat/VerbForm (165634; 100% instances), cs-feat/Negative (165619; 100% instances), cs-feat/Number (140097; 85% instances), cs-feat/Voice (139146; 84% instances), cs-feat/Tense (129620; 78% instances), cs-feat/Aspect (87377; 53% instances), cs-feat/Mood (76695; 46% instances), cs-feat/Person (76683; 46% instances), cs-feat/Gender (63395; 38% instances), cs-feat/Animacy (15642; 9% instances), cs-feat/Style (127; 0% instances), cs-feat/Foreign (120; 0% instances), cs-feat/Abbr (22; 0% instances), cs-feat/Case (20; 0% instances), cs-feat/NameType (13; 0% instances)

VERB occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, NameType=Com, NameType=Oth, NameType=Pro, Negative=Neg, Negative=Pos, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Style=Arch, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Trans, Voice=Act, Voice=Pass

VERB occurs with 199 feature combinations. The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (23257 tokens). Examples: je, má, může, jde, musí, lze, chce, zdá, platí, stojí

Relations

VERB nodes are attached to their parents using 19 different relations: cs-dep/root (63239; 38% instances), cs-dep/acl (20974; 13% instances), cs-dep/cop (20598; 12% instances), cs-dep/conj (19611; 12% instances), cs-dep/xcomp (14366; 9% instances), cs-dep/ccomp (9358; 6% instances), cs-dep/advcl (8333; 5% instances), cs-dep/csubj (5494; 3% instances), cs-dep/parataxis (1628; 1% instances), cs-dep/appos (780; 0% instances), cs-dep/dep (564; 0% instances), cs-dep/csubjpass (422; 0% instances), cs-dep/cc (133; 0% instances), cs-dep/foreign (69; 0% instances), cs-dep/advmod (29; 0% instances), cs-dep/case (28; 0% instances), cs-dep/mwe (5; 0% instances), cs-dep/mark (2; 0% instances), cs-dep/nmod (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: ROOT (63239; 38% instances), VERB (53469; 32% instances), NOUN (26838; 16% instances), ADJ (14141; 9% instances), PRON (3984; 2% instances), PROPN (2061; 1% instances), ADV (926; 1% instances), NUM (716; 0% instances), DET (114; 0% instances), PART (96; 0% instances), CONJ (17; 0% instances), SCONJ (12; 0% instances), SYM (10; 0% instances), INTJ (6; 0% instances), ADP (3; 0% instances), PUNCT (2; 0% instances)

23242 (14%) VERB nodes are leaves.

13507 (8%) VERB nodes have one child.

18612 (11%) VERB nodes have two children.

110273 (67%) VERB nodes have three or more children.

The highest child degree of a VERB node is 28.

Children of VERB nodes are attached using 32 different relations: cs-dep/punct (126081; 23% instances), cs-dep/dobj (75225; 13% instances), cs-dep/nsubj (73746; 13% instances), cs-dep/nmod (68241; 12% instances), cs-dep/advmod (50834; 9% instances), cs-dep/conj (22859; 4% instances), cs-dep/cc (19911; 4% instances), cs-dep/mark (19669; 4% instances), cs-dep/expl (16638; 3% instances), cs-dep/xcomp (15347; 3% instances), cs-dep/aux (13895; 2% instances), cs-dep/ccomp (10903; 2% instances), cs-dep/iobj (8496; 2% instances), cs-dep/advcl (7748; 1% instances), cs-dep/nsubjpass (7698; 1% instances), cs-dep/auxpass (6068; 1% instances), cs-dep/auxpass:reflex (4897; 1% instances), cs-dep/csubj (2858; 1% instances), cs-dep/dep (2610; 0% instances), cs-dep/cop (2419; 0% instances), cs-dep/advmod:emph (1663; 0% instances), cs-dep/parataxis (1229; 0% instances), cs-dep/csubjpass (475; 0% instances), cs-dep/appos (337; 0% instances), cs-dep/discourse (299; 0% instances), cs-dep/foreign (65; 0% instances), cs-dep/vocative (65; 0% instances), cs-dep/amod (26; 0% instances), cs-dep/acl (21; 0% instances), cs-dep/neg (16; 0% instances), cs-dep/nummod (10; 0% instances), cs-dep/mwe (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (176681; 32% instances), PUNCT (126089; 23% instances), PRON (60614; 11% instances), VERB (53469; 10% instances), ADV (47876; 9% instances), PROPN (23727; 4% instances), CONJ (20074; 4% instances), AUX (19963; 4% instances), SCONJ (19054; 3% instances), ADJ (6642; 1% instances), NUM (3763; 1% instances), PART (2170; 0% instances), SYM (100; 0% instances), ADP (95; 0% instances), INTJ (33; 0% instances)


Treebank Statistics (UD_Czech-CAC)

There are 3921 VERB lemmas (14%), 12659 VERB types (20%) and 52943 VERB tokens (11%). Out of 16 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: být, mít, moci, muset, jít, lze, stát, dát, chtít, pracovat

The 10 most frequent VERB types: je, jsou, má, není, být, mají, musí, může, bylo, byl

The 10 most frequent ambiguous lemmas: být (VERB 9840, AUX 6133), stát (VERB 348, NOUN 169), znát (VERB 111, ADJ 1), bývat (VERB 89, AUX 24), růst (NOUN 104, VERB 58), vzrůst (VERB 32, NOUN 13), opravit (VERB 8, NOUN 4), budit (CONJ 9, VERB 5), stavit (NOUN 5, VERB 2), data (NOUN 61, VERB 1)

The 10 most frequent ambiguous types: je (VERB 4673, AUX 482, PRON 356), jsou (VERB 1382, AUX 266), (VERB 728, DET 1), není (VERB 456, AUX 38), být (VERB 428, AUX 239), bylo (VERB 342, AUX 293), byl (VERB 343, AUX 285), byla (AUX 283, VERB 268), bude (AUX 406, VERB 236), byly (AUX 268, VERB 174)

Morphology

The form / lemma ratio of VERB is 3.228513 (the average of all parts of speech is 2.206260).

The 1st highest number of forms (42) was observed with the lemma “být”: Nebuď, bude, budeme, budete, budiž, budou, budu, buď, buďme, buďte, byl, byla, byli, bylo, byly, být, býti, je, jest, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, seš.

The 2nd highest number of forms (28) was observed with the lemma “dát”: Dej, dají, dal, dala, dali, dalo, daly, dejme, dejte, dá, dám, dáme, dán, dána, dáno, dány, dát, dáte, dáti, dáš, nedají, nedal, nedala, nedali, nedalo, nedá, nedám, nedáme.

The 3rd highest number of forms (26) was observed with the lemma “moci”: moci, mohl, mohla, mohli, mohlo, mohly, mohou, mohu, může, můžeme, můžete, můžeš, můžu, nemohl, nemohla, nemohli, nemohlo, nemohly, nemohou, nemohu, nemůže, nemůžeme, nemůžete, nemůžeš, nemůžou, nemůžu.

VERB occurs with 13 features: cs-feat/Negative (52943; 100% instances), cs-feat/VerbForm (52943; 100% instances), cs-feat/Number (44945; 85% instances), cs-feat/Voice (44479; 84% instances), cs-feat/Tense (40213; 76% instances), cs-feat/Mood (28970; 55% instances), cs-feat/Person (28970; 55% instances), cs-feat/Aspect (27478; 52% instances), cs-feat/Gender (15966; 30% instances), cs-feat/Animacy (4740; 9% instances), cs-feat/Style (76; 0% instances), cs-feat/Foreign (8; 0% instances), cs-feat/Case (5; 0% instances)

VERB occurs with 32 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Negative=Neg, Negative=Pos, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Style=Arch, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Trans, Voice=Act, Voice=Pass

VERB occurs with 141 feature combinations. The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (8687 tokens). Examples: je, má, může, jde, lze, musí, platí, vede, stojí, působí

Relations

VERB nodes are attached to their parents using 21 different relations: cs-dep/root (18981; 36% instances), cs-dep/cop (8087; 15% instances), cs-dep/conj (7752; 15% instances), cs-dep/acl (6620; 13% instances), cs-dep/xcomp (3848; 7% instances), cs-dep/advcl (2746; 5% instances), cs-dep/csubj (2248; 4% instances), cs-dep/ccomp (1659; 3% instances), cs-dep/parataxis (580; 1% instances), cs-dep/dep (152; 0% instances), cs-dep/csubjpass (141; 0% instances), cs-dep/appos (75; 0% instances), cs-dep/cc (22; 0% instances), cs-dep/advmod (12; 0% instances), cs-dep/case (11; 0% instances), cs-dep/nmod (3; 0% instances), cs-dep/aux (2; 0% instances), cs-dep/advmod:emph (1; 0% instances), cs-dep/foreign (1; 0% instances), cs-dep/mark (1; 0% instances), cs-dep/nsubj (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: ROOT (18981; 36% instances), VERB (16849; 32% instances), NOUN (8844; 17% instances), ADJ (5304; 10% instances), PRON (1419; 3% instances), ADV (894; 2% instances), NUM (211; 0% instances), PROPN (186; 0% instances), SYM (120; 0% instances), DET (43; 0% instances), SCONJ (42; 0% instances), PART (33; 0% instances), INTJ (7; 0% instances), AUX (6; 0% instances), CONJ (2; 0% instances), PUNCT (2; 0% instances)

9018 (17%) VERB nodes are leaves.

3978 (8%) VERB nodes have one child.

6135 (12%) VERB nodes have two children.

33812 (64%) VERB nodes have three or more children.

The highest child degree of a VERB node is 22.

Children of VERB nodes are attached using 31 different relations: cs-dep/punct (37539; 22% instances), cs-dep/dobj (23081; 13% instances), cs-dep/nmod (21504; 12% instances), cs-dep/nsubj (19310; 11% instances), cs-dep/advmod (16508; 10% instances), cs-dep/conj (9363; 5% instances), cs-dep/cc (7120; 4% instances), cs-dep/expl (5780; 3% instances), cs-dep/mark (5622; 3% instances), cs-dep/xcomp (4168; 2% instances), cs-dep/nsubjpass (3450; 2% instances), cs-dep/aux (3332; 2% instances), cs-dep/advcl (2572; 1% instances), cs-dep/auxpass (2539; 1% instances), cs-dep/iobj (2128; 1% instances), cs-dep/auxpass:reflex (2084; 1% instances), cs-dep/ccomp (1947; 1% instances), cs-dep/cop (1190; 1% instances), cs-dep/csubj (967; 1% instances), cs-dep/dep (885; 1% instances), cs-dep/parataxis (382; 0% instances), cs-dep/advmod:emph (326; 0% instances), cs-dep/csubjpass (171; 0% instances), cs-dep/discourse (77; 0% instances), cs-dep/appos (67; 0% instances), cs-dep/vocative (51; 0% instances), cs-dep/acl (25; 0% instances), cs-dep/amod (14; 0% instances), cs-dep/nummod (6; 0% instances), cs-dep/foreign (4; 0% instances), cs-dep/neg (4; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (57829; 34% instances), PUNCT (37540; 22% instances), PRON (19560; 11% instances), VERB (16849; 10% instances), ADV (15243; 9% instances), CONJ (6815; 4% instances), AUX (5859; 3% instances), SCONJ (5474; 3% instances), PROPN (2445; 1% instances), ADJ (2152; 1% instances), PART (951; 1% instances), NUM (816; 0% instances), SYM (658; 0% instances), ADP (21; 0% instances), INTJ (4; 0% instances)


Treebank Statistics (UD_Czech-CLTT)

There are 316 VERB lemmas (12%), 664 VERB types (14%) and 2517 VERB tokens (7%). Out of 15 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: být, obsahovat, použít, moci, účtovat, uvést, mít, stanovit, rozumět, vést

The 10 most frequent VERB types: je, jsou, obsahuje, rozumí, může, uvede, mohou, není, nejsou, použijí

The 10 most frequent ambiguous lemmas: být (VERB 416, AUX 170), stát (NOUN 40, VERB 7)

The 10 most frequent ambiguous types: je (VERB 185, AUX 14, PRON 11), jsou (VERB 130, AUX 11), není (VERB 39, AUX 2), nejsou (VERB 37, AUX 13), být (AUX 35, VERB 13), bylo (AUX 5, VERB 5), delší (ADJ 15, VERB 4), bude (AUX 10, VERB 3), koupí (NOUN 2, VERB 2), nebyla (VERB 2, AUX 1)

Morphology

The form / lemma ratio of VERB is 2.101266 (the average of all parts of speech is 1.764161).

The 1st highest number of forms (11) was observed with the lemma “účtovat”: neúčtovala, neúčtovat, neúčtuje, neúčtují, účtovala, účtovat, účtována, účtováno, účtovány, účtuje, účtují.

The 2nd highest number of forms (10) was observed with the lemma “být”: bude, budou, bylo, byly, být, je, jsou, nebyla, nejsou, není.

The 3rd highest number of forms (9) was observed with the lemma “použít”: nepoužije, nepoužijí, použije, použijí, použila, použita, použito, použity, použít.

VERB occurs with 12 features: cs-feat/Negative (2517; 100% instances), cs-feat/VerbForm (2517; 100% instances), cs-feat/Number (2190; 87% instances), cs-feat/Voice (2190; 87% instances), cs-feat/Tense (1930; 77% instances), cs-feat/Mood (1806; 72% instances), cs-feat/Person (1806; 72% instances), cs-feat/Gender (384; 15% instances), cs-feat/Animacy (145; 6% instances), cs-feat/Case (3; 0% instances), cs-feat/Aspect (1; 0% instances), cs-feat/Style (1; 0% instances)

VERB occurs with 26 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Case=Acc, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Ind, Negative=Neg, Negative=Pos, Number=Plur, Number=Plur,Sing, Number=Sing, Person=3, Style=Arch, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Trans, Voice=Act, Voice=Pass

VERB occurs with 25 feature combinations. The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (1043 tokens). Examples: je, obsahuje, rozumí, může, uvede, není, stanoví, účtuje, lze, musí

Relations

VERB nodes are attached to their parents using 16 different relations: cs-dep/root (744; 30% instances), cs-dep/acl (523; 21% instances), cs-dep/cop (400; 16% instances), cs-dep/conj (287; 11% instances), cs-dep/xcomp (207; 8% instances), cs-dep/advcl (138; 5% instances), cs-dep/parataxis (60; 2% instances), cs-dep/csubj (57; 2% instances), cs-dep/ccomp (55; 2% instances), cs-dep/dep (29; 1% instances), cs-dep/advmod (8; 0% instances), cs-dep/aux (3; 0% instances), cs-dep/cc (2; 0% instances), cs-dep/nmod (2; 0% instances), cs-dep/appos (1; 0% instances), cs-dep/csubjpass (1; 0% instances)

Parents of VERB nodes belong to 9 different parts of speech: ROOT (744; 30% instances), NOUN (730; 29% instances), VERB (691; 27% instances), ADJ (311; 12% instances), PRON (22; 1% instances), X (13; 1% instances), ADV (4; 0% instances), DET (1; 0% instances), NUM (1; 0% instances)

426 (17%) VERB nodes are leaves.

205 (8%) VERB nodes have one child.

244 (10%) VERB nodes have two children.

1642 (65%) VERB nodes have three or more children.

The highest child degree of a VERB node is 26.

Children of VERB nodes are attached using 26 different relations: cs-dep/punct (2157; 26% instances), cs-dep/dobj (1251; 15% instances), cs-dep/nmod (1235; 15% instances), cs-dep/nsubj (960; 11% instances), cs-dep/advmod (530; 6% instances), cs-dep/nsubjpass (383; 5% instances), cs-dep/auxpass:reflex (355; 4% instances), cs-dep/conj (309; 4% instances), cs-dep/cc (210; 2% instances), cs-dep/mark (200; 2% instances), cs-dep/xcomp (141; 2% instances), cs-dep/auxpass (133; 2% instances), cs-dep/advcl (131; 2% instances), cs-dep/cop (99; 1% instances), cs-dep/expl (80; 1% instances), cs-dep/ccomp (58; 1% instances), cs-dep/csubj (38; 0% instances), cs-dep/iobj (38; 0% instances), cs-dep/aux (37; 0% instances), cs-dep/parataxis (33; 0% instances), cs-dep/dep (27; 0% instances), cs-dep/advmod:emph (5; 0% instances), cs-dep/csubjpass (3; 0% instances), cs-dep/amod (2; 0% instances), cs-dep/appos (2; 0% instances), cs-dep/acl (1; 0% instances)

Children of VERB nodes belong to 12 different parts of speech: NOUN (3404; 40% instances), PUNCT (2157; 26% instances), PRON (952; 11% instances), VERB (691; 8% instances), ADV (334; 4% instances), CONJ (205; 2% instances), SCONJ (200; 2% instances), X (191; 2% instances), AUX (166; 2% instances), ADJ (75; 1% instances), NUM (37; 0% instances), ADP (6; 0% instances)


VERB in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]