This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

ADV: adverb

Definition

Adverbs are words that typically modify verbs for such categories as time, place, direction or manner. They may also modify adjectives and other adverbs, as in väga hea ‘very good’ or väga hästi ‘very well’. Pronominal adverbs, e.g. siin ‘here’, seal ‘there’, siis ‘then’, millal ‘when’ and nii ‘so’, are tagged as adverbs in the current version of Estonian UD.
Some adverbs may also function as verbal particles in Estonian. They are still tagged ADV, not PART, e.g. välja in välja mõtlema ‘contrive’, lit. ‘think out’.


Treebank Statistics (UD_Estonian)

There are 1786 ADV lemmas (6%), 1783 ADV types (3%) and 23683 ADV tokens (10%). Out of 15 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 4 in number of tokens.

The 10 most frequent ADV lemmas: ka, siis, nii, kas, juba, välja, veel, mitte, ära, kus

The 10 most frequent ADV types: ka, siis, nii, kas, juba, välja, veel, mitte, ära, kus

The 10 most frequent ambiguous lemmas: ära (ADV 296, AUX 34), aga (CONJ 697, ADV 277), palju (ADV 271, PRON 132), enam (ADV 251, ADJ 8), küll (ADV 245, NOUN 2), vaid (ADV 218, CONJ 121), siin (ADV 172, ADP 1), tagasi (ADV 149, ADP 80), üle (ADP 257, ADV 130), edasi (ADV 129, ADP 1)

The 10 most frequent ambiguous types: ära (ADV 294, AUX 9), aga (CONJ 477, ADV 277), palju (ADV 256, PRON 4), enam (ADV 246, ADJ 1), küll (ADV 224, NOUN 1), kõige (ADV 173, PRON 6), vaid (ADV 203, CONJ 118), näiteks (ADV 149, NOUN 5), seal (ADV 141, NOUN 2), isegi (ADV 140, PRON 4)

Morphology

The form / lemma ratio of ADV is 0.998320 (the average of all parts of speech is 1.839644).
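
The reported ratio can be reproduced from the counts in the Treebank Statistics section. The sketch below assumes the ratio is the number of distinct ADV word forms (types) divided by the number of distinct ADV lemmas; the variable names are illustrative and not part of any UD tooling.

```python
# Counts quoted in the Treebank Statistics section of this page.
adv_lemmas = 1786
adv_types = 1783

# Assumption: "form / lemma ratio" means distinct forms per distinct lemma.
form_lemma_ratio = adv_types / adv_lemmas

print(f"{form_lemma_ratio:.6f}")  # 0.998320
```

A value below 1 means the treebank has slightly fewer distinct ADV forms than ADV lemmas, so adverbs show almost no inflectional variation compared with the all-tag average of 1.839644.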

The 1st highest number of forms (2) was observed with the lemma “edas_pidi”: edaspidi, edaspidigi.

The 2nd highest number of forms (2) was observed with the lemma “egas”: Egas’, egas.

The 3rd highest number of forms (2) was observed with the lemma “eks”: eks, eks..

ADV occurs with 7 features: Negative (358; 2% instances), Abbr (164; 1% instances), Degree (3; 0% instances), Foreign (2; 0% instances), Hyph (1; 0% instances), VerbForm (1; 0% instances), Voice (1; 0% instances)

ADV occurs with 7 feature-value pairs: Abbr=Yes, Degree=Pos, Foreign=Yes, Hyph=Yes, Negative=Neg, VerbForm=Part, Voice=Act

ADV occurs with 7 feature combinations. The most frequent feature combination is _ (23154 tokens). Examples: ka, siis, nii, kas, juba, välja, veel, ära, kus, aga
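
As a rough consistency check, the feature counts can be compared with the number of featureless tokens. The sketch below uses only figures quoted on this page. The closing interpretation (one token carrying two features at once) is an assumption that the source does not state explicitly.

```python
# Figures quoted on this page.
adv_tokens = 23683          # all ADV tokens
featureless_tokens = 23154  # tokens with the empty feature combination "_"

feature_instances = {
    "Negative": 358,
    "Abbr": 164,
    "Degree": 3,
    "Foreign": 2,
    "Hyph": 1,
    "VerbForm": 1,
    "Voice": 1,
}

# Tokens that carry at least one feature.
tokens_with_features = adv_tokens - featureless_tokens

# Feature instances summed over all features.
total_instances = sum(feature_instances.values())

# A surplus means some tokens carry more than one feature.
surplus = total_instances - tokens_with_features

print(tokens_with_features, total_instances, surplus)  # 529 530 1
```

The surplus of one instance would be explained if a single token carried both VerbForm=Part and Voice=Act. That reading also fits the count of 7 feature combinations, since the empty combination plus six non-empty ones gives seven.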

Relations

ADV nodes are attached to their parents using 16 different relations: advmod (18619; 79% instances), compound:prt (2776; 12% instances), mark (933; 4% instances), root (284; 1% instances), advmod:quant (269; 1% instances), cc:preconj (245; 1% instances), conj (242; 1% instances), nmod (162; 1% instances), advcl (121; 1% instances), foreign (9; 0% instances), parataxis (6; 0% instances), amod (5; 0% instances), nsubj (5; 0% instances), cc (3; 0% instances), dobj (2; 0% instances), list (2; 0% instances)
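
The percentages above appear to be each relation's count divided by the total number of ADV tokens, rounded to the nearest integer. A minimal sketch under that assumption, using the counts from the list above (the dictionary and helper name are illustrative):

```python
# Relation counts for ADV nodes, copied from the list above.
relation_counts = {
    "advmod": 18619, "compound:prt": 2776, "mark": 933, "root": 284,
    "advmod:quant": 269, "cc:preconj": 245, "conj": 242, "nmod": 162,
    "advcl": 121, "foreign": 9, "parataxis": 6, "amod": 5,
    "nsubj": 5, "cc": 3, "dobj": 2, "list": 2,
}

# Every ADV token has exactly one incoming relation, so the counts
# should add up to the ADV token total reported earlier (23683).
total = sum(relation_counts.values())


def share(relation: str) -> int:
    """Return the relation's share of ADV tokens as a rounded percentage."""
    return round(100 * relation_counts[relation] / total)


for relation in relation_counts:
    print(f"{relation}: {share(relation)}%")
```

With these counts, the total comes to 23683, matching the ADV token count, and the two adverbial uses described in the definition (advmod and compound:prt) together account for roughly nine out of ten ADV tokens.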

Parents of ADV nodes belong to 15 different parts of speech: VERB (14463; 61% instances), NOUN (3165; 13% instances), ADJ (3004; 13% instances), ADV (1354; 6% instances), PRON (491; 2% instances), NUM (396; 2% instances), PROPN (341; 1% instances), ROOT (284; 1% instances), SCONJ (110; 0% instances), ADP (52; 0% instances), AUX (17; 0% instances), SYM (2; 0% instances), X (2; 0% instances), CONJ (1; 0% instances), INTJ (1; 0% instances)

21542 (91%) ADV nodes are leaves.

1578 (7%) ADV nodes have one child.

328 (1%) ADV nodes have two children.

235 (1%) ADV nodes have three or more children.

The highest child degree of an ADV node is 10.
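
The child-degree figures above partition the ADV tokens, so they can be cross-checked against the token total. The sketch below assumes the percentages are rounded shares of all ADV tokens; the names are illustrative.

```python
# Number of ADV nodes by how many children they have (from the lines above).
degree_counts = {
    "leaf": 21542,
    "one child": 1578,
    "two children": 328,
    "three or more children": 235,
}

# The four groups should cover every ADV token exactly once.
total = sum(degree_counts.values())

# Rounded percentage share of each group.
shares = {group: round(100 * n / total) for group, n in degree_counts.items()}

print(total)   # 23683
print(shares)
```

The groups sum to 23683, the ADV token count given in the Treebank Statistics section, and the rounded shares reproduce the 91%, 7%, 1% and 1% reported above.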

Children of ADV nodes are attached using 23 different relations: advmod (1116; 35% instances), punct (732; 23% instances), nmod (399; 13% instances), conj (208; 7% instances), cc (183; 6% instances), mark (168; 5% instances), advcl (132; 4% instances), amod (51; 2% instances), parataxis (47; 1% instances), nummod (36; 1% instances), dep (27; 1% instances), discourse (18; 1% instances), cc:preconj (8; 0% instances), cop (8; 0% instances), nsubj:cop (8; 0% instances), dobj (7; 0% instances), xcomp (5; 0% instances), nsubj (4; 0% instances), case (3; 0% instances), det (2; 0% instances), csubj (1; 0% instances), csubj:cop (1; 0% instances), nmod:poss (1; 0% instances)

Children of ADV nodes belong to 12 different parts of speech: ADV (1354; 43% instances), PUNCT (732; 23% instances), NOUN (399; 13% instances), CONJ (183; 6% instances), PRON (117; 4% instances), VERB (114; 4% instances), SCONJ (93; 3% instances), ADJ (81; 3% instances), NUM (39; 1% instances), PROPN (31; 1% instances), INTJ (18; 1% instances), ADP (4; 0% instances)

