This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home hr/pos issue tracker

VERB: verb

This document is a placeholder for the language-specific documentation for VERB.


Treebank Statistics (UD_Croatian)

There are 1755 VERB lemmas (11%), 4166 VERB types (15%) and 11756 VERB tokens (8%). Out of 15 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: moći, kazati, imati, trebati, izjaviti, reći, morati, željeti, raditi, dobiti

The 10 most frequent VERB types: izjavio, rekao, kazao, ima, može, kaže, mogu, treba, mora, nema

The 10 most frequent ambiguous lemmas: moći (VERB 407, AUX 1), kazati (VERB 321, ADV 4, ADJ 1), imati (VERB 289, ADV 5), izjaviti (VERB 221, ADV 1), reći (VERB 204, ADJ 4, ADV 4), željeti (VERB 102, ADV 1), raditi (VERB 96, ADJ 1, ADV 1), dobiti (VERB 95, ADJ 1), očekivati (VERB 84, ADJ 1), smatrati (VERB 76, ADJ 1)

The 10 most frequent ambiguous types: mora (VERB 73, NOUN 18), radi (VERB 30, ADP 11), nalazi (VERB 30, NOUN 1), pomoći (VERB 28, NOUN 18), nalaze (VERB 26, NOUN 1), tvrdi (VERB 23, ADJ 1), dobiti (VERB 22, NOUN 1), poziva (VERB 21, NOUN 2), vodi (VERB 17, NOUN 4), koristi (VERB 15, NOUN 6)

Morphology

The form / lemma ratio of VERB is 2.373789 (the average of all parts of speech is 1.779790).

The 1st highest number of forms (13) was observed with the lemma “imati”: ima, imaj, imaju, imala, imale, imali, imalo, imam, imamo, imao, imat, imate, imati.

The 2nd highest number of forms (13) was observed with the lemma “moći”: Nemojmo, mogao, mogla, mogle, mogli, moglo, mogu, moći, može, možemo, možete, možeš, nemojte.

The 3rd highest number of forms (12) was observed with the lemma “morati”: MORAŠ, mora, moraju, morala, morali, moralo, moram, moramo, morao, morat, morate, morati.

VERB occurs with 6 features: Number (9542; 81% instances), VerbForm (6772; 58% instances), Person (4984; 42% instances), Tense (4888; 42% instances), Gender (4558; 39% instances), Mood (96; 1% instances)

VERB occurs with 13 feature-value pairs: Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Past, Tense=Pres, VerbForm=Inf, VerbForm=Part

VERB occurs with 17 feature combinations. The most frequent feature combination is Number=Sing|Person=3|Tense=Pres (2849 tokens). Examples: ima, može, kaže, treba, mora, nema, navodi, očekuje, postoji, smatra

Relations

VERB nodes are attached to their parents using 27 different relations: root (4404; 37% instances), acl (1804; 15% instances), conj (1451; 12% instances), xcomp (1147; 10% instances), ccomp (903; 8% instances), parataxis (876; 7% instances), advcl (841; 7% instances), csubj (156; 1% instances), nsubj (50; 0% instances), csubjpass (31; 0% instances), aux (16; 0% instances), advmod (14; 0% instances), nmod (13; 0% instances), appos (10; 0% instances), cop (6; 0% instances), dobj (6; 0% instances), compound (5; 0% instances), amod (4; 0% instances), remnant (4; 0% instances), case (3; 0% instances), nsubjpass (3; 0% instances), dep (2; 0% instances), discourse (2; 0% instances), iobj (2; 0% instances), cc (1; 0% instances), punct (1; 0% instances), vocative (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: ROOT (4404; 37% instances), VERB (4253; 36% instances), NOUN (1909; 16% instances), ADJ (590; 5% instances), ADV (210; 2% instances), PRON (164; 1% instances), PROPN (101; 1% instances), AUX (100; 1% instances), NUM (9; 0% instances), PART (4; 0% instances), SCONJ (4; 0% instances), ADP (3; 0% instances), X (3; 0% instances), CONJ (2; 0% instances)

225 (2%) VERB nodes are leaves.

1119 (10%) VERB nodes have one child.

1530 (13%) VERB nodes have two children.

8882 (76%) VERB nodes have three or more children.

The highest child degree of a VERB node is 15.

Children of VERB nodes are attached using 36 different relations: punct (7856; 17% instances), nmod (6060; 13% instances), nsubj (5946; 13% instances), dobj (5026; 11% instances), aux (4880; 11% instances), mark (3867; 8% instances), advmod (1762; 4% instances), xcomp (1694; 4% instances), compound (1560; 3% instances), conj (1415; 3% instances), cc (1359; 3% instances), ccomp (1128; 2% instances), parataxis (810; 2% instances), advcl (802; 2% instances), neg (404; 1% instances), iobj (393; 1% instances), discourse (354; 1% instances), nsubjpass (170; 0% instances), csubj (65; 0% instances), case (42; 0% instances), auxpass (37; 0% instances), nummod (23; 0% instances), csubjpass (19; 0% instances), remnant (19; 0% instances), amod (17; 0% instances), acl (13; 0% instances), cop (13; 0% instances), vocative (12; 0% instances), dislocated (4; 0% instances), expl (4; 0% instances), name (4; 0% instances), appos (3; 0% instances), dep (3; 0% instances), det (3; 0% instances), list (3; 0% instances), foreign (1; 0% instances)

Children of VERB nodes belong to 15 different parts of speech: NOUN (14119; 31% instances), PUNCT (7874; 17% instances), AUX (4982; 11% instances), PRON (4747; 10% instances), VERB (4253; 9% instances), ADV (2585; 6% instances), PROPN (2376; 5% instances), SCONJ (1922; 4% instances), CONJ (1356; 3% instances), ADJ (750; 2% instances), PART (532; 1% instances), NUM (126; 0% instances), ADP (118; 0% instances), X (25; 0% instances), INTJ (6; 0% instances)


VERB in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]