
VERB: verb

Definition

A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause.

Note that the VERB tag covers main verbs (content verbs), modal verbs and copulas, but it does not cover auxiliary verbs, which receive the AUX tag. (Czech modal verbs are not considered auxiliary.) See the description of AUX for more information on the borderline between VERB and AUX.

Czech verbs can take the following morphological forms:

There are participial forms that are tagged as adjectives (ADJ) rather than verbs. See below for examples.

A verbal noun can be derived productively from almost every verb (e.g. dělat “to do” → dělání “doing”). While in other languages a corresponding form may be called a gerund and tagged VERB, in Czech it is tagged NOUN. It always has neuter cs-feat/Gender and inflects for cs-feat/Number and cs-feat/Case.

Examples

Border cases

Passive participles exist both as verb forms (VERB) and as participial adjectives (ADJ). For example:

Their meaning is almost identical, but their usage varies slightly. Both groups can be used in nominal predication with a copula. Only true participles (verbs) can be used to form the passive voice (although the passive may sometimes be difficult to distinguish from copula constructions, see AUX). The participial adjectives, on the other hand, inflect for case and can therefore modify nouns.

There is an analogy with some adjectives that have preserved so-called nominal (short) forms, even though these adjectives are not derived from verbs. Example:

Here both groups are ADJ. The nominal forms are used in predication; the standard forms are used both in predication and to modify nouns.


Treebank Statistics (UD_Czech)

There are 5926 VERB lemmas (10%), 24444 VERB types (19%) and 165634 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.
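Counts of this kind can be reproduced from a treebank in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the function name `count_upos` and the four-token `SAMPLE` are hypothetical, and it assumes that "types" means distinct, case-sensitive word forms.

```python
def count_upos(conllu_text, upos="VERB"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID
        # (this skips multiword-token ranges such as "1-2" and empty nodes).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            lemmas.add(cols[2])   # LEMMA column
            types.add(cols[1])    # FORM column, case-sensitive (assumption)
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical sample for illustration only; not taken from UD_Czech.
SAMPLE = "\n".join([
    "# hypothetical four-token sample",
    "1\tje\tbýt\tVERB\t_\t_\t0\troot\t_\t_",
    "2\tbyl\tbýt\tVERB\t_\t_\t1\tconj\t_\t_",
    "3\tmá\tmít\tVERB\t_\t_\t1\tconj\t_\t_",
    "4\tstát\tstát\tNOUN\t_\t_\t3\tdobj\t_\t_",
])

print(count_upos(SAMPLE))  # (2, 3, 3)
```

Applied to the sample, the function finds two VERB lemmas (být, mít), three VERB types and three VERB tokens; the NOUN reading of stát is not counted.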

The 10 most frequent VERB lemmas: být, mít, moci, muset, říci, stát, chtít, jít, lze, dát

The 10 most frequent VERB types: je, jsou, má, není, byl, být, může, bylo, řekl, měl

The 10 most frequent ambiguous lemmas: být (VERB 25647, AUX 20737), stát (VERB 1542, NOUN 1446), bývat (VERB 154, AUX 58), růst (NOUN 353, VERB 149), vzrůst (VERB 139, NOUN 13), jet (VERB 129, PROPN 6, NOUN 3), hledět (VERB 39, ADP 1), škodit (VERB 18, NOUN 1), rozlišit (VERB 13, NOUN 1), drát (NOUN 25, VERB 4)

The 10 most frequent ambiguous types: je (VERB 11424, PRON 887, AUX 713), jsou (VERB 2884, AUX 371), (VERB 2171, DET 15, PRON 1), není (VERB 1489, AUX 57), byl (VERB 1246, AUX 913), být (VERB 1317, AUX 745), bylo (VERB 1045, AUX 611), bude (AUX 1843, VERB 864), byla (VERB 765, AUX 733), byly (AUX 462, VERB 373)

Morphology

The form / lemma ratio of VERB is 4.124873 (the average of all parts of speech is 2.195970).
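The ratio is consistent with dividing the number of VERB types by the number of VERB lemmas reported above. The short check below assumes that "forms" here corresponds to the type count; the variable names are illustrative.

```python
# Figures taken from the treebank statistics on this page.
verb_types = 24444
verb_lemmas = 5926

# Assumption: the form / lemma ratio is types divided by lemmas.
ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 4.124873
```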

The 1st highest number of forms (52) was observed with the lemma “být”: Buďme, Nebuďte, bolo, bude, budeme, budete, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bysme, být, býti, j, je, jest, jsa, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, nésó, si

The 2nd highest number of forms (36) was observed with the lemma “stát”: nestal, nestala, nestali, nestalo, nestaly, nestane, nestanou, nestojí, nestojíme, nestojíte, nestál, nestála, nestáli, nestálo, nestály, stal, stala, stali, stalo, staly, stane, stanete, stanou, stanu, stoje, stojí, stojím, stojíme, stál, stála, stáli, stálo, stály, stát, státi, stůj

The 3rd highest number of forms (34) was observed with the lemma “dát”: Dej, Nedejte, Nedám, dají, dal, dala, dali, dalo, daly, dejme, dejte, dá, dám, dáme, dán, dána, dáno, dánu, dány, dát, dáte, dáti, nedají, nedal, nedala, nedali, nedalo, nedaly, nedat, nedej, nedejme, nedá, nedáme, nedáš

VERB occurs with 15 features: cs-feat/VerbForm (165634; 100% instances), cs-feat/Negative (165619; 100% instances), cs-feat/Number (140097; 85% instances), cs-feat/Voice (139146; 84% instances), cs-feat/Tense (129620; 78% instances), cs-feat/Aspect (87377; 53% instances), cs-feat/Mood (76695; 46% instances), cs-feat/Person (76683; 46% instances), cs-feat/Gender (63395; 38% instances), cs-feat/Animacy (15642; 9% instances), cs-feat/Style (127; 0% instances), cs-feat/Foreign (120; 0% instances), cs-feat/Abbr (22; 0% instances), cs-feat/Case (20; 0% instances), cs-feat/NameType (13; 0% instances)

VERB occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, NameType=Com, NameType=Oth, NameType=Pro, Negative=Neg, Negative=Pos, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Style=Arch, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Trans, Voice=Act, Voice=Pass

VERB occurs with 199 feature combinations. The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (23257 tokens). Examples: je, má, může, jde, musí, lze, chce, zdá, platí, stojí
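A feature combination is written as `Feature=Value` pairs joined by `|`, as in the FEATS column of CoNLL-U. The sketch below shows one way such a string could be split into a dictionary; the helper name `parse_feats` is hypothetical, and multi-valued features such as `Gender=Fem,Masc` are simply kept as a single string.

```python
def parse_feats(feats):
    """Split a FEATS string such as 'Mood=Ind|Person=3' into a dict."""
    if feats == "_":  # CoNLL-U uses "_" for an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent VERB feature combination reported on this page.
COMBO = ("Mood=Ind|Negative=Pos|Number=Sing|Person=3|"
         "Tense=Pres|VerbForm=Fin|Voice=Act")

features = parse_feats(COMBO)
print(len(features))          # 7
print(features["VerbForm"])   # Fin
```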

Relations

VERB nodes are attached to their parents using 18 different relations: cs-dep/root (63219; 38% instances), cs-dep/acl (20964; 13% instances), cs-dep/cop (20695; 12% instances), cs-dep/conj (19607; 12% instances), cs-dep/xcomp (14366; 9% instances), cs-dep/ccomp (9270; 6% instances), cs-dep/advcl (8330; 5% instances), cs-dep/csubj (5899; 4% instances), cs-dep/parataxis (1631; 1% instances), cs-dep/appos (785; 0% instances), cs-dep/dep (564; 0% instances), cs-dep/cc (177; 0% instances), cs-dep/foreign (69; 0% instances), cs-dep/case (30; 0% instances), cs-dep/csubjpass (17; 0% instances), cs-dep/advmod:emph (8; 0% instances), cs-dep/mwe (2; 0% instances), cs-dep/nmod (1; 0% instances)

Parents of VERB nodes belong to 16 different parts of speech: ROOT (63219; 38% instances), VERB (53456; 32% instances), NOUN (26831; 16% instances), ADJ (14143; 9% instances), PRON (3983; 2% instances), PROPN (2061; 1% instances), ADV (940; 1% instances), NUM (715; 0% instances), DET (114; 0% instances), PART (110; 0% instances), CONJ (26; 0% instances), SCONJ (13; 0% instances), SYM (10; 0% instances), INTJ (6; 0% instances), ADP (5; 0% instances), PUNCT (2; 0% instances)

23306 (14%) VERB nodes are leaves.

13471 (8%) VERB nodes have one child.

18592 (11%) VERB nodes have two children.

110265 (67%) VERB nodes have three or more children.

The highest child degree of a VERB node is 28.
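The four child-degree counts above add up to the total number of VERB tokens, and the percentages follow by rounding each share. A small check, with illustrative variable names:

```python
# Child-degree counts for VERB nodes, as reported on this page.
degree_counts = {
    "leaves": 23306,
    "one child": 13471,
    "two children": 18592,
    "three or more children": 110265,
}

total = sum(degree_counts.values())
print(total)  # 165634, the number of VERB tokens

# Share of each group, rounded to whole percent.
shares = {name: round(100 * n / total) for name, n in degree_counts.items()}
print(shares)
```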

Children of VERB nodes are attached using 31 different relations: cs-dep/punct (126034; 22% instances), cs-dep/nsubj (77181; 14% instances), cs-dep/dobj (75224; 13% instances), cs-dep/nmod (68215; 12% instances), cs-dep/advmod (46847; 8% instances), cs-dep/cc (24269; 4% instances), cs-dep/conj (22837; 4% instances), cs-dep/mark (19660; 4% instances), cs-dep/expl (16638; 3% instances), cs-dep/xcomp (15348; 3% instances), cs-dep/aux (13933; 2% instances), cs-dep/ccomp (10811; 2% instances), cs-dep/iobj (8495; 2% instances), cs-dep/advcl (7745; 1% instances), cs-dep/auxpass (6065; 1% instances), cs-dep/auxpass:reflex (4897; 1% instances), cs-dep/nsubjpass (4257; 1% instances), cs-dep/csubj (3316; 1% instances), cs-dep/dep (2604; 0% instances), cs-dep/cop (2509; 0% instances), cs-dep/advmod:emph (1657; 0% instances), cs-dep/parataxis (1228; 0% instances), cs-dep/appos (333; 0% instances), cs-dep/discourse (299; 0% instances), cs-dep/foreign (65; 0% instances), cs-dep/vocative (65; 0% instances), cs-dep/amod (38; 0% instances), cs-dep/acl (22; 0% instances), cs-dep/csubjpass (17; 0% instances), cs-dep/neg (16; 0% instances), cs-dep/nummod (10; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (176628; 32% instances), PUNCT (126034; 22% instances), PRON (60721; 11% instances), VERB (53456; 10% instances), ADV (47807; 9% instances), PROPN (23717; 4% instances), CONJ (20059; 4% instances), AUX (19998; 4% instances), SCONJ (19378; 3% instances), ADJ (6653; 1% instances), NUM (3765; 1% instances), PART (2163; 0% instances), ADP (125; 0% instances), SYM (98; 0% instances), INTJ (33; 0% instances)

