VERB: verb
Definition
A verb is a member of the syntactic class of words that typically signal events and actions, can constitute a minimal predicate in a clause, and govern the number and types of other constituents which may occur in the clause.
Note that the VERB tag covers main verbs (content verbs), modal verbs and copulas, but it does not cover auxiliary verbs, for which there is the AUX tag. (Czech modal verbs are not considered auxiliary.) See the description of AUX for more information on the borderline between VERB and AUX.
Czech verbs can take the following morphological forms:
- Infinitive (this is the citation form)
- Finite verb (indicative and imperative forms; conditional is constructed periphrastically)
- Past participle (used to construct past and conditional)
- Passive participle (used to construct passive voice; also used separately as an adjective)
- Transgressive (also called adverbial participle)
There are participial forms that are tagged as adjectives (ADJ) rather than verbs. See below for examples.
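The form categories listed above can be related to feature bundles. The following is a minimal sketch, not the treebank's own conversion: the feature names and values (VerbForm, Voice) appear in the statistics on this page, but the table FORM_FEATURES, the helper format_feats, and the pairing of each category with its features are assumptions made for illustration.

```python
# Hypothetical lookup table: Czech verb form category -> feature bundle.
# The feature values occur in the treebank statistics on this page;
# the pairing with each category is an illustrative assumption.
FORM_FEATURES = {
    "infinitive": {"VerbForm": "Inf"},
    "finite": {"VerbForm": "Fin"},
    "past participle": {"VerbForm": "Part", "Voice": "Act"},
    "passive participle": {"VerbForm": "Part", "Voice": "Pass"},
    "transgressive": {"VerbForm": "Trans"},
}


def format_feats(feats):
    """Serialize a feature dict as Name=Value pairs joined by '|', sorted by name."""
    return "|".join(f"{name}={value}" for name, value in sorted(feats.items()))


print(format_feats(FORM_FEATURES["passive participle"]))  # VerbForm=Part|Voice=Pass
```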
A verbal noun can be derived productively from almost every verb (e.g. dělat “to do” → dělání “doing”). While in other languages a corresponding form may be called a gerund and tagged VERB, in Czech it is tagged NOUN. It always has neuter cs-feat/Gender and inflects for cs-feat/Number and cs-feat/Case.
Examples
- nést “to carry”
- nesu, neseš, nese, neseme, nesete, nesou “I carry, you carry, he/she/it carries, we carry, you carry, they carry”
- nes, nesme, neste “carry” (imperative in different persons and numbers)
- nesl, nesla, neslo, nesli, nesly “carried” (past participle in different genders and numbers)
- nesen, nesena, neseno, neseni, neseny “carried” (passive participle in different genders and numbers)
- nesa, nesouc, nesouce “carrying” (present transgressive in different genders and numbers)
Border cases
Passive participles occur both as verb forms (VERB) and as participial adjectives (ADJ). For example:
- Verb: nesen, nesena, neseno, neseni, neseny “carried”
- Adjective: nesený, nesená, nesené, nesení, nesené “carried”
Their meanings are almost identical, but their usage differs slightly. Both groups can be used in nominal predication with a copula. Only true participles (verbs) can be used to form the passive voice (although the passive may sometimes be difficult to distinguish from copula constructions; see AUX). The participial adjectives, on the other hand, inflect for case and can thus modify nouns.
There is an analogy with some adjectives that have preserved so-called nominal (short) forms, even though these adjectives are not derived from verbs. Example:
- Short (nominal) forms: stár, stára, stáro “old”
- Normal (pronominal) forms: starý, stará, staré “old”
Here both groups are ADJ. The nominal forms are used in predication, the pronominal forms both in predication and to modify nouns.
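The contrast between short passive participles (VERB) and long participial adjectives (ADJ) can be sketched with a toy ending check. This is only an illustration under strong assumptions: the function participle_tag is hypothetical, it expects a form already known to belong to a passive participle paradigm, and it covers only the nominative forms listed on this page, not the full case paradigm of the adjectives.

```python
# Long (pronominal) nominative endings seen in the examples on this page.
LONG_ENDINGS = ("ý", "á", "é", "í")


def participle_tag(form):
    """Guess the tag of a passive participle form from its ending.

    Assumes the input is a nominative form of a passive participle or of
    the participial adjective derived from it; anything else is out of scope.
    """
    return "ADJ" if form.endswith(LONG_ENDINGS) else "VERB"


for form in ["nesen", "nesena", "neseni", "nesený", "nesená", "nesení"]:
    print(form, participle_tag(form))
```

A real tagger would rely on the morphological lexicon and on context rather than on endings alone.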
Treebank Statistics (UD_Czech)
There are 5926 VERB lemmas (10%), 24444 VERB types (19%) and 165634 VERB tokens (11%).
Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.
The 10 most frequent VERB lemmas: být, mít, moci, muset, říci, stát, chtít, jít, lze, dát
The 10 most frequent VERB types: je, jsou, má, není, byl, být, může, bylo, řekl, měl
The 10 most frequent ambiguous lemmas: být (VERB 25647, AUX 20737), stát (VERB 1542, NOUN 1446), bývat (VERB 154, AUX 58), růst (NOUN 353, VERB 149), vzrůst (VERB 139, NOUN 13), jet (VERB 129, PROPN 6, NOUN 3), hledět (VERB 39, ADP 1), škodit (VERB 18, NOUN 1), rozlišit (VERB 13, NOUN 1), drát (NOUN 25, VERB 4)
The 10 most frequent ambiguous types: je (VERB 11424, PRON 887, AUX 713), jsou (VERB 2884, AUX 371), má (VERB 2171, DET 15, PRON 1), není (VERB 1489, AUX 57), byl (VERB 1246, AUX 913), být (VERB 1317, AUX 745), bylo (VERB 1045, AUX 611), bude (AUX 1843, VERB 864), byla (VERB 765, AUX 733), byly (AUX 462, VERB 373)
- má
  - VERB 2171: A čeho si má při své návštěvě především všímat ?
  - DET 15: Kam se poděla má bojovnost ? ptala se sama sebe po utkání Martinezová .
  - PRON 1: Už J . Gebauer ve své Mluvnici z konce minulého století uvádí , že věty typu “ Slyším tě zpívat svou píseň “ jsou dvojznačné ( není jasné , zda je to píseň “ tvá “ nebo “ má “ ) .
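The per-tag counts in the ambiguity lists can be turned into proportions to see how strongly a lemma leans towards VERB. The sketch below copies three lemma entries from the list above; the dictionary AMBIGUOUS_LEMMAS and the function verb_share are hypothetical names introduced for illustration.

```python
# Tag counts copied from the list of ambiguous lemmas on this page.
AMBIGUOUS_LEMMAS = {
    "být": {"VERB": 25647, "AUX": 20737},
    "stát": {"VERB": 1542, "NOUN": 1446},
    "růst": {"NOUN": 353, "VERB": 149},
}


def verb_share(tag_counts):
    """Return the fraction of a lemma's occurrences that are tagged VERB."""
    total = sum(tag_counts.values())
    return tag_counts.get("VERB", 0) / total


for lemma, counts in AMBIGUOUS_LEMMAS.items():
    print(f"{lemma}: {verb_share(counts):.1%} VERB")
```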
Morphology
The form / lemma ratio of VERB is 4.124873 (the average of all parts of speech is 2.195970).
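The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas given in the treebank statistics. A quick check, assuming that this is indeed how the ratio was derived (the variable names are illustrative):

```python
# Counts taken from the treebank statistics on this page.
verb_types = 24444   # distinct word forms tagged VERB
verb_lemmas = 5926   # distinct lemmas tagged VERB

ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 4.124873
```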
The 1st highest number of forms (52) was observed with the lemma “být”: Buďme, Nebuďte, bolo, bude, budeme, budete, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bysme, být, býti, j, je, jest, jsa, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, nésó, si
The 2nd highest number of forms (36) was observed with the lemma “stát”: nestal, nestala, nestali, nestalo, nestaly, nestane, nestanou, nestojí, nestojíme, nestojíte, nestál, nestála, nestáli, nestálo, nestály, stal, stala, stali, stalo, staly, stane, stanete, stanou, stanu, stoje, stojí, stojím, stojíme, stál, stála, stáli, stálo, stály, stát, státi, stůj
The 3rd highest number of forms (34) was observed with the lemma “dát”: Dej, Nedejte, Nedám, dají, dal, dala, dali, dalo, daly, dejme, dejte, dá, dám, dáme, dán, dána, dáno, dánu, dány, dát, dáte, dáti, nedají, nedal, nedala, nedali, nedalo, nedaly, nedat, nedej, nedejme, nedá, nedáme, nedáš
VERB occurs with 15 features: cs-feat/VerbForm (165634; 100% instances), cs-feat/Negative (165619; 100% instances), cs-feat/Number (140097; 85% instances), cs-feat/Voice (139146; 84% instances), cs-feat/Tense (129620; 78% instances), cs-feat/Aspect (87377; 53% instances), cs-feat/Mood (76695; 46% instances), cs-feat/Person (76683; 46% instances), cs-feat/Gender (63395; 38% instances), cs-feat/Animacy (15642; 9% instances), cs-feat/Style (127; 0% instances), cs-feat/Foreign (120; 0% instances), cs-feat/Abbr (22; 0% instances), cs-feat/Case (20; 0% instances), cs-feat/NameType (13; 0% instances)
VERB occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Foreign=Foreign, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, NameType=Com, NameType=Oth, NameType=Pro, Negative=Neg, Negative=Pos, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Style=Arch, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Trans, Voice=Act, Voice=Pass
VERB occurs with 199 feature combinations.
The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (23257 tokens).
Examples: je, má, může, jde, musí, lze, chce, zdá, platí, stojí
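A feature combination such as the one above is a list of Name=Value pairs joined by "|". The following minimal sketch splits such a string into a dictionary; the function name parse_feats is hypothetical and the code is not tied to any particular library.

```python
def parse_feats(feats):
    """Split a 'Name=Value|Name=Value' string into a dict of features."""
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent VERB feature combination reported on this page.
combo = "Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act"
parsed = parse_feats(combo)

print(parsed["VerbForm"], parsed["Person"])  # Fin 3
```

Values that list several alternatives, such as Gender=Fem,Masc, stay as a single comma-separated string in this sketch.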
Relations
VERB nodes are attached to their parents using 18 different relations: cs-dep/root (63219; 38% instances), cs-dep/acl (20964; 13% instances), cs-dep/cop (20695; 12% instances), cs-dep/conj (19607; 12% instances), cs-dep/xcomp (14366; 9% instances), cs-dep/ccomp (9270; 6% instances), cs-dep/advcl (8330; 5% instances), cs-dep/csubj (5899; 4% instances), cs-dep/parataxis (1631; 1% instances), cs-dep/appos (785; 0% instances), cs-dep/dep (564; 0% instances), cs-dep/cc (177; 0% instances), cs-dep/foreign (69; 0% instances), cs-dep/case (30; 0% instances), cs-dep/csubjpass (17; 0% instances), cs-dep/advmod:emph (8; 0% instances), cs-dep/mwe (2; 0% instances), cs-dep/nmod (1; 0% instances)
Parents of VERB nodes belong to 16 different parts of speech: ROOT (63219; 38% instances), VERB (53456; 32% instances), NOUN (26831; 16% instances), ADJ (14143; 9% instances), PRON (3983; 2% instances), PROPN (2061; 1% instances), ADV (940; 1% instances), NUM (715; 0% instances), DET (114; 0% instances), PART (110; 0% instances), CONJ (26; 0% instances), SCONJ (13; 0% instances), SYM (10; 0% instances), INTJ (6; 0% instances), ADP (5; 0% instances), PUNCT (2; 0% instances)
23306 (14%) VERB nodes are leaves.
13471 (8%) VERB nodes have one child.
18592 (11%) VERB nodes have two children.
110265 (67%) VERB nodes have three or more children.
The highest child degree of a VERB node is 28.
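The four child-count groups above partition all VERB tokens, so the counts add up to the 165634 tokens reported in the treebank statistics. The sketch below checks this and recomputes the rounded percentages; the names CHILD_COUNTS, TOTAL_VERB_TOKENS and shares are illustrative.

```python
# Number of VERB nodes by number of children, copied from this page.
CHILD_COUNTS = {"0": 23306, "1": 13471, "2": 18592, "3+": 110265}
TOTAL_VERB_TOKENS = 165634

# The four groups should cover every VERB token exactly once.
assert sum(CHILD_COUNTS.values()) == TOTAL_VERB_TOKENS

shares = {
    group: round(100 * count / TOTAL_VERB_TOKENS)
    for group, count in CHILD_COUNTS.items()
}
print(shares)  # {'0': 14, '1': 8, '2': 11, '3+': 67}
```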
Children of VERB nodes are attached using 31 different relations: cs-dep/punct (126034; 22% instances), cs-dep/nsubj (77181; 14% instances), cs-dep/dobj (75224; 13% instances), cs-dep/nmod (68215; 12% instances), cs-dep/advmod (46847; 8% instances), cs-dep/cc (24269; 4% instances), cs-dep/conj (22837; 4% instances), cs-dep/mark (19660; 4% instances), cs-dep/expl (16638; 3% instances), cs-dep/xcomp (15348; 3% instances), cs-dep/aux (13933; 2% instances), cs-dep/ccomp (10811; 2% instances), cs-dep/iobj (8495; 2% instances), cs-dep/advcl (7745; 1% instances), cs-dep/auxpass (6065; 1% instances), cs-dep/auxpass:reflex (4897; 1% instances), cs-dep/nsubjpass (4257; 1% instances), cs-dep/csubj (3316; 1% instances), cs-dep/dep (2604; 0% instances), cs-dep/cop (2509; 0% instances), cs-dep/advmod:emph (1657; 0% instances), cs-dep/parataxis (1228; 0% instances), cs-dep/appos (333; 0% instances), cs-dep/discourse (299; 0% instances), cs-dep/foreign (65; 0% instances), cs-dep/vocative (65; 0% instances), cs-dep/amod (38; 0% instances), cs-dep/acl (22; 0% instances), cs-dep/csubjpass (17; 0% instances), cs-dep/neg (16; 0% instances), cs-dep/nummod (10; 0% instances)
Children of VERB nodes belong to 15 different parts of speech: NOUN (176628; 32% instances), PUNCT (126034; 22% instances), PRON (60721; 11% instances), VERB (53456; 10% instances), ADV (47807; 9% instances), PROPN (23717; 4% instances), CONJ (20059; 4% instances), AUX (19998; 4% instances), SCONJ (19378; 3% instances), ADJ (6653; 1% instances), NUM (3765; 1% instances), PART (2163; 0% instances), ADP (125; 0% instances), SYM (98; 0% instances), INTJ (33; 0% instances)