
This page pertains to UD version 2.

Treebank Statistics: UD_Slovenian-SST: POS Tags: VERB

There are 726 VERB lemmas (18%), 1482 VERB types (24%) and 3933 VERB tokens (13%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 1 in number of tokens.

The 10 most frequent VERB lemmas: biti, imeti, vedeti, iti, reči, misliti, dati, priti, videti, morati

The 10 most frequent VERB types: je, vem, veš, mislim, bilo, ni, recimo, ima, so, bo

The most frequent ambiguous lemmas (only two were observed): biti (AUX 1936, VERB 670), peti (ADJ 5, VERB 3)

The 10 most frequent ambiguous types: je (AUX 700, VERB 298, PRON 5, INTJ 3), mislim (VERB 78, NOUN 1), bilo (VERB 53, AUX 29), ni (AUX 76, VERB 53, X 1), so (AUX 198, VERB 44, X 2), bo (AUX 99, VERB 34), bil (AUX 41, VERB 33), pravi (VERB 30, ADJ 5), gre (VERB 28, X 1), bila (AUX 44, VERB 21)
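A list like the one above can be approximated directly from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten tab-separated CoNLL-U columns (ID, FORM, LEMMA, UPOS, ...), compares forms case-sensitively (an assumption), and uses a hypothetical function name `ambiguous_types`. It ranks forms by how often they are tagged VERB, which matches the ordering seen in the list.

```python
from collections import Counter, defaultdict


def ambiguous_types(conllu_text, upos="VERB"):
    """Return (form, tag counts) pairs for forms tagged `upos` and at least one other tag.

    Hypothetical helper for illustration; forms are compared case-sensitively.
    """
    tags_by_form = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        tags_by_form[cols[1]][cols[3]] += 1

    ambiguous = [
        (form, tags)
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    ]
    # Most frequent first, ranked by the count for the tag of interest.
    ambiguous.sort(key=lambda item: item[1][upos], reverse=True)
    return ambiguous
```

Taking the first ten items of the returned list (`ambiguous_types(text)[:10]`) gives a top-10 view comparable to the one on this page.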

Morphology

The form / lemma ratio of VERB is 2.041322 (the average of all parts of speech is 1.570645).
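The ratio is the number of distinct VERB word forms divided by the number of distinct VERB lemmas. A minimal sketch of that computation over CoNLL-U text follows; the function name `form_lemma_ratio` is hypothetical, and the case-sensitive treatment of forms is an assumption rather than a documented property of the statistics.

```python
def form_lemma_ratio(conllu_text, upos="VERB"):
    """Return (#distinct forms) / (#distinct lemmas) for one UPOS tag.

    Hypothetical helper for illustration; forms are compared case-sensitively.
    """
    forms = set()
    lemmas = set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges and empty nodes
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(forms) / len(lemmas) if lemmas else 0.0
```

As a check against the figures reported on this page, 1482 VERB types divided by 726 VERB lemmas gives approximately 2.041322.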

The highest number of forms (29) was observed with the lemma “biti”: bi, bil, bila, bile, bili, bilo, biti, bo, bodite, bodo, bojo, bom, bomo, bosta, bova, boš, je, ni, nisem, nismo, niso, niste, sem, si, smo, so, sta, ste, sva.

The 2nd highest number of forms (20) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imava, imaš, imejte, imel, imela, imele, imeli, imeti, nima, nimajo, nimam, nimamo, nimate, nimaš.

The 3rd highest number of forms (19) was observed with the lemma “iti”: gre, gredo, grejo, grem, gremo, gresta, greste, greva, greš, ide, idem, ideš, iti, pojdi, šel, šla, šle, šli, šlo.

VERB occurs with 8 features: VerbForm (3933; 100% instances), Number (3662; 93% instances), Aspect (2870; 73% instances), Mood (2513; 64% instances), Person (2498; 64% instances), Tense (2290; 58% instances), Gender (1164; 30% instances), Polarity (744; 19% instances)

VERB occurs with 22 feature-value pairs: Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Sup

VERB occurs with 97 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin (298 tokens). Examples: je
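The feature counts and feature combinations above come from the FEATS column, where feature-value pairs are joined with `|` and an underscore means no features. The sketch below shows one way such tallies could be reproduced; `feature_stats` is a hypothetical name, and treating each full FEATS string as one "combination" is an assumption about how the page counts them.

```python
from collections import Counter


def feature_stats(conllu_text, upos="VERB"):
    """Count feature names and whole FEATS strings for tokens with one UPOS tag.

    Hypothetical helper for illustration.
    Returns (feature_counts, combination_counts).
    """
    feature_counts = Counter()
    combination_counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue  # keep only regular tokens with the requested tag
        feats = cols[5]  # FEATS column, e.g. "Mood=Ind|Number=Sing"
        combination_counts[feats] += 1
        if feats != "_":
            for pair in feats.split("|"):
                name, _, _value = pair.partition("=")
                feature_counts[name] += 1
    return feature_counts, combination_counts
```

Calling `combination_counts.most_common(1)` on the second return value yields the most frequent combination, and `len(combination_counts)` gives the number of distinct combinations.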

Relations

VERB nodes are attached to their parents using 14 different relations: root (1474; 37% instances), parataxis (653; 17% instances), conj (368; 9% instances), advcl (284; 7% instances), acl (257; 7% instances), ccomp (231; 6% instances), parataxis:discourse (224; 6% instances), xcomp (186; 5% instances), csubj (78; 2% instances), reparandum (74; 2% instances), parataxis:restart (73; 2% instances), fixed (20; 1% instances), conj:extend (6; 0% instances), dislocated (5; 0% instances)

Parents of VERB nodes belong to 15 different parts of speech: VERB (1659; 42% instances), [no parent: root] (1474; 37% instances), NOUN (364; 9% instances), ADJ (164; 4% instances), DET (103; 3% instances), ADV (67; 2% instances), PRON (46; 1% instances), PROPN (21; 1% instances), NUM (11; 0% instances), PART (8; 0% instances), X (8; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances)

297 (8%) VERB nodes are leaves.

402 (10%) VERB nodes have one child.

622 (16%) VERB nodes have two children.

2612 (66%) VERB nodes have three or more children.

The highest child degree of a VERB node is 17.
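The child-degree figures above count, for each VERB node, how many other nodes in the same sentence name it as their HEAD. A minimal sketch of that tally follows. It assumes sentences are separated by a single blank line and that only regular integer-ID tokens are counted; `child_degree_histogram` is a hypothetical name, not part of any UD tool.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos="VERB"):
    """Map child count -> number of `upos` nodes with that many children.

    Hypothetical helper for illustration; assumes blank-line-separated sentences.
    """
    histogram = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        tag_by_id = {}
        children = Counter()
        for line in sentence.splitlines():
            cols = line.split("\t")
            if len(cols) != 10 or not cols[0].isdigit():
                continue  # skip comments, multiword ranges, empty nodes
            tag_by_id[cols[0]] = cols[3]  # ID -> UPOS
            children[cols[6]] += 1        # HEAD gains one child
        for node_id, tag in tag_by_id.items():
            if tag == upos:
                histogram[children[node_id]] += 1
    return histogram
```

With the result `h`, the number of leaves is `h[0]`, the highest child degree is `max(h)`, and the "three or more children" bucket is the sum of `h[d]` for all `d >= 3`. The four buckets reported on this page (297, 402, 622 and 2612) add up to the 3933 VERB tokens.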

Children of VERB nodes are attached using 33 different relations: advmod (2846; 20% instances), obj (1310; 9% instances), obl (1178; 8% instances), nsubj (1097; 8% instances), aux (1084; 8% instances), discourse (923; 7% instances), mark (902; 6% instances), punct (671; 5% instances), parataxis (664; 5% instances), cc (637; 5% instances), expl (437; 3% instances), conj (364; 3% instances), discourse:filler (350; 3% instances), ccomp (300; 2% instances), xcomp (239; 2% instances), advcl (235; 2% instances), reparandum (196; 1% instances), parataxis:discourse (139; 1% instances), iobj (97; 1% instances), parataxis:restart (70; 1% instances), csubj (51; 0% instances), vocative (48; 0% instances), dislocated (38; 0% instances), conj:extend (27; 0% instances), case (9; 0% instances), cc:preconj (8; 0% instances), nummod (6; 0% instances), dep (3; 0% instances), acl (2; 0% instances), cop (2; 0% instances), det (2; 0% instances), fixed (2; 0% instances), nmod (2; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (2091; 15% instances), ADV (1801; 13% instances), VERB (1659; 12% instances), PART (1596; 11% instances), PRON (1381; 10% instances), AUX (1120; 8% instances), CCONJ (917; 7% instances), SCONJ (879; 6% instances), PUNCT (671; 5% instances), DET (639; 5% instances), INTJ (444; 3% instances), PROPN (268; 2% instances), ADJ (257; 2% instances), X (108; 1% instances), NUM (81; 1% instances), ADP (27; 0% instances)