Statistics of VERB in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Estonian-EDT: POS Tags: `VERB`

There are 2350 VERB lemmas (5%), 10351 VERB types (12%) and 47861 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: saama, tulema, tegema, olema, ütlema, minema, võtma, jääma, hakkama, andma

The 10 most frequent VERB types: tuleb, on, teha, ütles, saada, sai, saanud, tuli, saab, jääb

The 10 most frequent ambiguous lemmas: saama (VERB 1306, AUX 670), tulema (VERB 1205, ADJ 1), olema (AUX 15463, VERB 1039), pidama (AUX 900, VERB 428), viima (VERB 227, ADV 1), kandma (VERB 121, ADJ 1), tunduma (VERB 94, AUX 50), paistma (VERB 83, AUX 11), tasuma (VERB 69, ADJ 1), paluma (VERB 61, INTJ 1)

The 10 most frequent ambiguous types: on (AUX 8813, VERB 385), saada (VERB 221, AUX 1), sai (VERB 210, AUX 37, NOUN 3), saanud (VERB 182, AUX 38, ADJ 33, NOUN 1), tuli (VERB 176, NOUN 6), saab (AUX 232, VERB 179), jäänud (VERB 112, ADJ 38, NOUN 3), pole (AUX 778, VERB 88), oli (AUX 2083, VERB 81), tulnud (VERB 99, ADJ 19)

on
- AUX 8813: Mind on Vermeeri looming alati fastsineerinud .
- VERB 385: Assisteerimas on talle kartulisalatiga täidetud tavalise singi rull .
saada
- VERB 221: Kangelaseks saada oli Otsmani suur ja salajane unistus .
- AUX 1: Võlur vastab , et see on kerge töö ja muuseas tahab ta teada saada , kuidas maailm toimib .
sai
- VERB 210: Meie uid sai filosoofiaks !
- AUX 37: Iga päev sai loomi kammitu ja klanitu .
- NOUN 3: Otse ahjust tulnud suur pannkoogi moodi sai maitseb hea iseäranis soojalt , ent suuremates linnades on sai standartne masstoodang .
saanud
- VERB 182: Ei ole vastust saanud .
- AUX 38: Õnnetuses ükski inimene kannatada ei saanud .
- ADJ 33: Ei lase mu dopingut saanud bronhid sugugi paremini õhku läbi .
- NOUN 1: Pea kõik 5. oktoobril 1949 kilinad-kulinad rinda saanud rabasid tööd teha Udeva mõisa vanas laudas .
tuli
- VERB 176: Kust tuli mõte kirjutada ooper “ Writing to Vermeer “ ?
- NOUN 6: Koridoris ei põlenud tuli .
saab
- AUX 232: Kuidas saab ikkagi nõnda olla ?
- VERB 179: “ Nüüd on kõik läind ja ei tea , kes peremeheks saab ! ”
jäänud
- VERB 112: Tegelikult oli rattarummu vahele jäänud puuoks , mis teda udjas .
- ADJ 38: Venno Loosaar ergutab vaiksemaks jäänud publikut ka saate ajal .
- NOUN 3: Kevadega võrreldes peavad EVPde varumisega hiljaks jäänud maksma laenu tagasi ligi kaks korda rohkem .
pole
- AUX 778: Kas pole mitte olulisem , et näevad mind praegu elusast peast ?
- VERB 88: Kui pole tõendeid , siis tuleb uurida .
oli
- AUX 2083: Ma kaldun arvama , et Vermeeri saatus oli teistsugune .
- VERB 81: Mehe väitel oli ta haiglasse jõudes juba suremas .
tulnud
- VERB 99: Greenspani fenomen ei tulnud äkki .
- ADJ 19: Reste täis , restide otsas äsja ahjust tulnud kalad .

Morphology

The form / lemma ratio of VERB is 4.404681 (the average of all parts of speech is 1.914465).

The 1st highest number of forms (41) was observed with the lemma “saama”: saa, saab, saabki, saad, saada, saadaks, saadakse, saades, saadi, saadud, saage, saagem, saagi, saagu, saaks, saaksid, saaksime, saaksin, saaksite, saakski, saama, saamas, saamata, saame, saamegi, saan, saand, saanud, saanudki, saanuks, saate, saavad, saavat, sai, said, saigi, saime, saimegi, sain, saingi, saite.

The 2nd highest number of forms (40) was observed with the lemma “tegema”: tee, teeb, teebki, teed, teegi, teeks, teeksid, teeksime, teeksite, teeme, teen, teengi, teete, teevad, teevadki, tegema, tegemas, tegemast, tegemata, tegi, tegid, tegigi, tegime, tegin, tegingi, tegite, teha, tehakse, tehaksegi, tehes, tehke, tehku, tehta, tehtagi, tehtagu, tehtaks, tehti, tehtud, teinud, teinudki.

The 3rd highest number of forms (37) was observed with the lemma “olema”: Olidki, Olin, Ons, ole, oled, olegi, oleks, oleksid, olema, olemas, olemast, olemata, oleme, olemegi, olen, olevat, olgu, olgugi, oli, olid, oligi, olime, olla, ollagi, ollakse, olles, olnud, olnudki, olnuks, on, ongi, pole, polegi, poleks, polekski, polnud, polnudki.

VERB occurs with 13 features: VerbForm (47861; 100% instances), Voice (39118; 82% instances), Tense (36168; 76% instances), Mood (31502; 66% instances), Number (25134; 53% instances), Person (25125; 52% instances), Connegative (2968; 6% instances), Case (2954; 6% instances), Polarity (163; 0% instances), Abbr (47; 0% instances), ExtPos (9; 0% instances), Typo (4; 0% instances), Hyph (1; 0% instances)

VERB occurs with 31 feature-value pairs: Abbr=Yes, Case=Abe, Case=Ela, Case=Ill, Case=Ine, Case=Tra, Connegative=Yes, ExtPos=ADV, ExtPos=SCONJ, Hyph=Yes, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Qot, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Pass

Relations

VERB nodes are attached to their parents using 24 different relations: root (21605; 45% instances), conj (7162; 15% instances), advcl (4756; 10% instances), xcomp (3799; 8% instances), ccomp (2918; 6% instances), acl:relcl (2662; 6% instances), acl (1952; 4% instances), parataxis (1100; 2% instances), csubj (966; 2% instances), csubj:cop (872; 2% instances), compound (17; 0% instances), appos (12; 0% instances), orphan (10; 0% instances), discourse (9; 0% instances), advmod (5; 0% instances), mark (4; 0% instances), amod (3; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), compound:prt (1; 0% instances), nmod (1; 0% instances), nsubj:cop (1; 0% instances), obj (1; 0% instances), vocative (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: (21605; 45% instances), VERB (17497; 37% instances), NOUN (4880; 10% instances), ADJ (2043; 4% instances), PRON (938; 2% instances), ADV (466; 1% instances), PROPN (380; 1% instances), NUM (34; 0% instances), SYM (8; 0% instances), AUX (4; 0% instances), X (3; 0% instances), ADP (1; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)

1035 (2%) VERB nodes are leaves.

3667 (8%) VERB nodes have one child.

5055 (11%) VERB nodes have two children.

38104 (80%) VERB nodes have three or more children.

The highest child degree of a VERB node is 26.

Children of VERB nodes are attached using 41 different relations: punct (41616; 22% instances), nsubj (25725; 14% instances), obj (20571; 11% instances), obl (15116; 8% instances), advmod (12882; 7% instances), aux (10450; 6% instances), obl:lmod (8750; 5% instances), conj (7250; 4% instances), mark (6779; 4% instances), xcomp (5577; 3% instances), cc (5574; 3% instances), advcl (4662; 2% instances), compound:prt (4377; 2% instances), ccomp (4136; 2% instances), obl:tmod (3237; 2% instances), obl:arg (2755; 1% instances), advmod:tmod (2683; 1% instances), parataxis (1611; 1% instances), csubj (1029; 1% instances), advmod:lmod (947; 1% instances), compound:idiom (305; 0% instances), obl:agent (194; 0% instances), compound (165; 0% instances), discourse (159; 0% instances), vocative (107; 0% instances), nsubj:cop (97; 0% instances), cop (73; 0% instances), cc:preconj (39; 0% instances), nmod (16; 0% instances), fixed (9; 0% instances), csubj:cop (8; 0% instances), case (7; 0% instances), nummod (7; 0% instances), dep (6; 0% instances), acl (4; 0% instances), amod (3; 0% instances), orphan (3; 0% instances), acl:relcl (1; 0% instances), appos (1; 0% instances), det (1; 0% instances), flat (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (58843; 31% instances), PUNCT (41616; 22% instances), ADV (22709; 12% instances), VERB (17497; 9% instances), PRON (14053; 8% instances), AUX (10527; 6% instances), PROPN (6531; 3% instances), CCONJ (5572; 3% instances), SCONJ (5538; 3% instances), ADJ (2779; 1% instances), NUM (838; 0% instances), SYM (230; 0% instances), INTJ (155; 0% instances), X (32; 0% instances), ADP (11; 0% instances), DET (1; 0% instances), PART (1; 0% instances)

Treebank Statistics: UD_Estonian-EDT: POS Tags: VERB

Morphology

Relations

Treebank Statistics: UD_Estonian-EDT: POS Tags: `VERB`