VERB
: verb
Definition
A verb typically signals events and actions; it can constitute a minimal predicate in a clause.
Verbs in Estonian associate with grammatical categories like person, number, tense, mood and voice.
The verb tag in Estonian UD v 1.3 does not cover auxiliaries AUX
.
Auxiliaries are:
olema “be” and in rare occasions saama “get” are auxiliaries that form periphrastic tense forms;
modal verbs are võima, tohtima “may”, saama “can”, pidama “must”, näima, paistma, tunduma “seem”;
ei and ära “not” in negative verb forms.
Participles are word forms that share properties and usage of adjectives and verbs. Depending on their syntactic function they are tagged as VERB
or ADJ
in Estonian UD.
Gerunds and infinitives are tagged as VERB
, except for grammatized word-forms.
Treebank Statistics (UD_Estonian)
There are 2228 VERB
lemmas (8%), 7620 VERB
types (15%) and 33457 VERB
tokens (14%).
Out of 15 observed tags, the rank of VERB
is: 4 in number of lemmas, 3 in number of types and 3 in number of tokens.
The 10 most frequent VERB
lemmas: olema, saama, tulema, tegema, ütlema, minema, jääma, võtma, hakkama, andma
The 10 most frequent VERB
types: on, oli, pole, tuleb, ole, olnud, teha, ütles, olid, olla
The 10 most frequent ambiguous lemmas: olema (VERB 6427, AUX 2444), saama (VERB 783, AUX 360), pidama (AUX 525, VERB 239), tunduma (VERB 80, AUX 6), ole (VERB 65, AUX 44), paistma (VERB 52, AUX 5), sõit (VERB 43, NOUN 18), tõus (VERB 28, NOUN 15), sisene (VERB 27, ADJ 2), näima (VERB 19, AUX 8)
The 10 most frequent ambiguous types: on (VERB 3459, AUX 1397), oli (VERB 825, AUX 309), pole (VERB 383, AUX 174), ole (VERB 245, AUX 63), olnud (VERB 225, AUX 19, ADJ 9), olid (VERB 179, AUX 84), olla (VERB 178, AUX 9), oleks (VERB 150, AUX 119), saab (VERB 134, AUX 122), sai (VERB 126, AUX 21, NOUN 2)
- on
- oli
- pole
- ole
- olnud
- olid
- olla
- oleks
- saab
- sai
Morphology
The form / lemma ratio of VERB
is 3.420108 (the average of all parts of speech is 1.839644).
The 1st highest number of forms (50) was observed with the lemma “olema”: Olge, olda, oldi, oldud, ole, oled, oledki, olegi, oleks, oleksid, olekski, olema, olemas, olemast, olemata, oleme, olemegi, olen, olengi, olete, oletegi, olevat, olgem, olgu, oli, olid, olidki, oligi, olime, olimegi, olin, olite, olla, ollagi, ollakse, ollaksegi, olles, olnud, olnudki, olnuks, on, ongi, ons, pole, polegi, poleks, polekski, polevat, polnud, polnudki.
The 2nd highest number of forms (38) was observed with the lemma “tegema”: tee, teeb, teebki, teed, teegi, teeks, teeksime, teeksite, teeme, teen, teengi, teete, teevad, tegema, tegemas, tegemast, tegemata, tegi, tegid, tegigi, tegime, tegin, tegingi, tegite, teha, tehakse, tehaksegi, tehes, tehke, tehku, tehta, tehtagi, tehtagu, tehtaks, tehti, tehtud, teinud, teinudki.
The 3rd highest number of forms (36) was observed with the lemma “saama”: saa, saab, saad, saada, saadaks, saadakse, saades, saadi, saadud, saage, saagi, saagu, saaks, saaksid, saaksime, saaksin, saaksite, saakski, saama, saamas, saamata, saame, saan, saand, saanud, saanudki, saanuks, saate, saavad, sai, said, saigi, saime, sain, saingi, saite.
VERB
occurs with 10 features: VerbForm (33457; 100% instances), Voice (28610; 86% instances), Tense (26729; 80% instances), Mood (24094; 72% instances), Number (19752; 59% instances), Person (19740; 59% instances), Connegative (2015; 6% instances), Case (1878; 6% instances), Negative (552; 2% instances), Abbr (12; 0% instances)
VERB
occurs with 28 feature-value pairs: Abbr=Yes
, Case=Abe
, Case=All
, Case=Ela
, Case=Ill
, Case=Ine
, Case=Tra
, Connegative=Yes
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Mood=Qot
, Negative=Neg
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Tense=Imp
, Tense=Past
, Tense=Pres
, VerbForm=Fin
, VerbForm=Ger
, VerbForm=Inf
, VerbForm=Part
, VerbForm=Sup
, Voice=Act
, Voice=Pass
VERB
occurs with 76 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act
(7887 tokens).
Examples: on, tuleb, saab, jääb, ütleb, läheb, hakkab, teeb, annab, tähendab
Relations
VERB
nodes are attached to their parents using 15 different relations: root (14007; 42% instances), conj (4263; 13% instances), cop (3458; 10% instances), advcl (2328; 7% instances), xcomp (2135; 6% instances), parataxis (1636; 5% instances), dep (1593; 5% instances), acl:relcl (1580; 5% instances), ccomp (982; 3% instances), csubj (622; 2% instances), acl (527; 2% instances), csubj:cop (313; 1% instances), compound (7; 0% instances), nmod (5; 0% instances), foreign (1; 0% instances)
Parents of VERB
nodes belong to 12 different parts of speech: ROOT (14007; 42% instances), VERB (11901; 36% instances), NOUN (3313; 10% instances), ADJ (3268; 10% instances), PRON (556; 2% instances), PROPN (227; 1% instances), ADV (114; 0% instances), NUM (54; 0% instances), ADP (9; 0% instances), AUX (4; 0% instances), SYM (2; 0% instances), X (2; 0% instances)
4232 (13%) VERB
nodes are leaves.
3663 (11%) VERB
nodes have one child.
3356 (10%) VERB
nodes have two children.
22206 (66%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 19.
Children of VERB
nodes are attached using 32 different relations: punct (27671; 23% instances), nmod (19148; 16% instances), nsubj (17034; 14% instances), dobj (11381; 10% instances), advmod (10838; 9% instances), conj (4357; 4% instances), mark (4244; 4% instances), aux (4038; 3% instances), cc (3489; 3% instances), xcomp (3102; 3% instances), compound:prt (2708; 2% instances), advcl (2398; 2% instances), neg (1944; 2% instances), dep (1682; 1% instances), parataxis (1571; 1% instances), ccomp (1203; 1% instances), csubj (581; 0% instances), amod (389; 0% instances), nummod (295; 0% instances), discourse (147; 0% instances), vocative (70; 0% instances), nsubj:cop (57; 0% instances), cc:preconj (34; 0% instances), list (17; 0% instances), acl:relcl (12; 0% instances), cop (12; 0% instances), foreign (12; 0% instances), compound (7; 0% instances), appos (6; 0% instances), auxpass (6; 0% instances), case (6; 0% instances), csubj:cop (2; 0% instances)
Children of VERB
nodes belong to 15 different parts of speech: NOUN (35580; 30% instances), PUNCT (27671; 23% instances), ADV (14463; 12% instances), VERB (11901; 10% instances), PRON (8887; 8% instances), AUX (5989; 5% instances), PROPN (4614; 4% instances), SCONJ (3527; 3% instances), CONJ (3489; 3% instances), ADJ (1692; 1% instances), NUM (462; 0% instances), INTJ (147; 0% instances), SYM (15; 0% instances), X (14; 0% instances), ADP (10; 0% instances)
VERB in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]