This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home et/pos issue tracker

VERB: verb

Definition

A verb typically signals events and actions; it can constitute a minimal predicate in a clause. Verbs in Estonian associate with grammatical categories like person, number, tense, mood and voice.
The verb tag in Estonian UD v 1.3 does not cover auxiliaries AUX.
Auxiliaries are:
olema “be” and in rare occasions saama “get” are auxiliaries that form periphrastic tense forms;
modal verbs are võima, tohtima “may”, saama “can”, pidama “must”, näima, paistma, tunduma “seem”;
ei and ära “not” in negative verb forms.

Participles are word forms that share properties and usage of adjectives and verbs. Depending on their syntactic function they are tagged as VERB or ADJ in Estonian UD.

Gerunds and infinitives are tagged as VERB, except for grammatized word-forms.


Treebank Statistics (UD_Estonian)

There are 2228 VERB lemmas (8%), 7620 VERB types (15%) and 33457 VERB tokens (14%). Out of 15 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: olema, saama, tulema, tegema, ütlema, minema, jääma, võtma, hakkama, andma

The 10 most frequent VERB types: on, oli, pole, tuleb, ole, olnud, teha, ütles, olid, olla

The 10 most frequent ambiguous lemmas: olema (VERB 6427, AUX 2444), saama (VERB 783, AUX 360), pidama (AUX 525, VERB 239), tunduma (VERB 80, AUX 6), ole (VERB 65, AUX 44), paistma (VERB 52, AUX 5), sõit (VERB 43, NOUN 18), tõus (VERB 28, NOUN 15), sisene (VERB 27, ADJ 2), näima (VERB 19, AUX 8)

The 10 most frequent ambiguous types: on (VERB 3459, AUX 1397), oli (VERB 825, AUX 309), pole (VERB 383, AUX 174), ole (VERB 245, AUX 63), olnud (VERB 225, AUX 19, ADJ 9), olid (VERB 179, AUX 84), olla (VERB 178, AUX 9), oleks (VERB 150, AUX 119), saab (VERB 134, AUX 122), sai (VERB 126, AUX 21, NOUN 2)

Morphology

The form / lemma ratio of VERB is 3.420108 (the average of all parts of speech is 1.839644).

The 1st highest number of forms (50) was observed with the lemma “olema”: Olge, olda, oldi, oldud, ole, oled, oledki, olegi, oleks, oleksid, olekski, olema, olemas, olemast, olemata, oleme, olemegi, olen, olengi, olete, oletegi, olevat, olgem, olgu, oli, olid, olidki, oligi, olime, olimegi, olin, olite, olla, ollagi, ollakse, ollaksegi, olles, olnud, olnudki, olnuks, on, ongi, ons, pole, polegi, poleks, polekski, polevat, polnud, polnudki.

The 2nd highest number of forms (38) was observed with the lemma “tegema”: tee, teeb, teebki, teed, teegi, teeks, teeksime, teeksite, teeme, teen, teengi, teete, teevad, tegema, tegemas, tegemast, tegemata, tegi, tegid, tegigi, tegime, tegin, tegingi, tegite, teha, tehakse, tehaksegi, tehes, tehke, tehku, tehta, tehtagi, tehtagu, tehtaks, tehti, tehtud, teinud, teinudki.

The 3rd highest number of forms (36) was observed with the lemma “saama”: saa, saab, saad, saada, saadaks, saadakse, saades, saadi, saadud, saage, saagi, saagu, saaks, saaksid, saaksime, saaksin, saaksite, saakski, saama, saamas, saamata, saame, saan, saand, saanud, saanudki, saanuks, saate, saavad, sai, said, saigi, saime, sain, saingi, saite.

VERB occurs with 10 features: VerbForm (33457; 100% instances), Voice (28610; 86% instances), Tense (26729; 80% instances), Mood (24094; 72% instances), Number (19752; 59% instances), Person (19740; 59% instances), Connegative (2015; 6% instances), Case (1878; 6% instances), Negative (552; 2% instances), Abbr (12; 0% instances)

VERB occurs with 28 feature-value pairs: Abbr=Yes, Case=Abe, Case=All, Case=Ela, Case=Ill, Case=Ine, Case=Tra, Connegative=Yes, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Qot, Negative=Neg, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

VERB occurs with 76 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (7887 tokens). Examples: on, tuleb, saab, jääb, ütleb, läheb, hakkab, teeb, annab, tähendab

Relations

VERB nodes are attached to their parents using 15 different relations: root (14007; 42% instances), conj (4263; 13% instances), cop (3458; 10% instances), advcl (2328; 7% instances), xcomp (2135; 6% instances), parataxis (1636; 5% instances), dep (1593; 5% instances), acl:relcl (1580; 5% instances), ccomp (982; 3% instances), csubj (622; 2% instances), acl (527; 2% instances), csubj:cop (313; 1% instances), compound (7; 0% instances), nmod (5; 0% instances), foreign (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: ROOT (14007; 42% instances), VERB (11901; 36% instances), NOUN (3313; 10% instances), ADJ (3268; 10% instances), PRON (556; 2% instances), PROPN (227; 1% instances), ADV (114; 0% instances), NUM (54; 0% instances), ADP (9; 0% instances), AUX (4; 0% instances), SYM (2; 0% instances), X (2; 0% instances)

4232 (13%) VERB nodes are leaves.

3663 (11%) VERB nodes have one child.

3356 (10%) VERB nodes have two children.

22206 (66%) VERB nodes have three or more children.

The highest child degree of a VERB node is 19.

Children of VERB nodes are attached using 32 different relations: punct (27671; 23% instances), nmod (19148; 16% instances), nsubj (17034; 14% instances), dobj (11381; 10% instances), advmod (10838; 9% instances), conj (4357; 4% instances), mark (4244; 4% instances), aux (4038; 3% instances), cc (3489; 3% instances), xcomp (3102; 3% instances), compound:prt (2708; 2% instances), advcl (2398; 2% instances), neg (1944; 2% instances), dep (1682; 1% instances), parataxis (1571; 1% instances), ccomp (1203; 1% instances), csubj (581; 0% instances), amod (389; 0% instances), nummod (295; 0% instances), discourse (147; 0% instances), vocative (70; 0% instances), nsubj:cop (57; 0% instances), cc:preconj (34; 0% instances), list (17; 0% instances), acl:relcl (12; 0% instances), cop (12; 0% instances), foreign (12; 0% instances), compound (7; 0% instances), appos (6; 0% instances), auxpass (6; 0% instances), case (6; 0% instances), csubj:cop (2; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (35580; 30% instances), PUNCT (27671; 23% instances), ADV (14463; 12% instances), VERB (11901; 10% instances), PRON (8887; 8% instances), AUX (5989; 5% instances), PROPN (4614; 4% instances), SCONJ (3527; 3% instances), CONJ (3489; 3% instances), ADJ (1692; 1% instances), NUM (462; 0% instances), INTJ (147; 0% instances), SYM (15; 0% instances), X (14; 0% instances), ADP (10; 0% instances)


VERB in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]