Treebank Statistics: UD_Slovenian-SSJ: POS Tags: VERB
There are 2897 VERB
lemmas (11%), 8403 VERB
types (17%) and 24593 VERB
tokens (9%).
Out of 17 observed tags, the rank of VERB
is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.
The 10 most frequent VERB
lemmas: biti, imeti, morati, iti, začeti, vedeti, dobiti, priti, moči, reči
The 10 most frequent VERB
types: je, ima, bilo, ni, gre, so, imajo, bo, mora, imel
The 10 most frequent ambiguous lemmas: biti (AUX 17326, VERB 1582), peti (ADJ 15, VERB 12)
The 10 most frequent ambiguous types: je (AUX 7643, VERB 629, PRON 23, X 2), bilo (AUX 236, VERB 186), ni (AUX 683, VERB 158), so (AUX 2616, VERB 143, X 1), bo (AUX 737, VERB 108), mora (VERB 98, NOUN 4), pomeni (VERB 81, NOUN 1), pravi (VERB 74, ADJ 36), bila (AUX 394, VERB 63), bil (AUX 482, VERB 55)
- je
- AUX 7643: Škoda je , da slovenski uporabniki iščejo informacije na tujih straneh .
- VERB 629: In to je način našega informiranja , da vemo , kako je z nami .
- PRON 23: Opazil je novo jahto , ki je prejšnji dan še ni bilo v pristanišču .
- X 2: » Pa pukla je Avstrija - preskupo je , ljudi nemaju para za Prater i gađanje . «
- bilo
- ni
- so
- AUX 2616: Za zadovoljitev pomembne želje so pripravljeni vložiti več truda .
- VERB 143: Zadružniki se zavedamo , kako pomembne odločitve so pred nami .
- X 1: “ The temperatures and winds on top of a building can be brutal , so the best plant material to use should be low - lying and need minimal maintenance . ”
- bo
- mora
- pomeni
- pravi
- bila
- bil
Morphology
The form / lemma ratio of VERB
is 2.900587 (the average of all parts of speech is 1.935546).
The 1st highest number of forms (33) was observed with the lemma “biti”: bi, bijejo, bil, bila, bile, bili, bilo, biti, bla, blo, bo, bodi, bodo, bom, bomo, bosta, boste, boš, je, ni, nisem, nisi, nismo, niso, niste, s, sem, si, smo, so, sta, ste, sva.
The 2nd highest number of forms (21) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imava, imaš, imejte, imel, imela, imele, imeli, imelo, imeti, nima, nimajo, nimam, nimamo, nimate, nimaš.
The 3rd highest number of forms (17) was observed with the lemma “hoteti”: hotel, hotela, hotele, hoteli, hoče, hočejo, hočem, hočemo, hočeta, hočete, hočeš, noče, nočejo, nočem, nočemo, nočete, nočeš.
VERB
occurs with 8 features: VerbForm (24593; 100% instances), Number (22412; 91% instances), Aspect (21155; 86% instances), Gender (11423; 46% instances), Mood (10995; 45% instances), Person (10989; 45% instances), Tense (10532; 43% instances), Polarity (1849; 8% instances)
VERB
occurs with 22 feature-value pairs: Aspect=Imp
, Aspect=Perf
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Dual
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Tense=Fut
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, VerbForm=Sup
VERB
occurs with 111 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin
(2761 tokens).
Examples: mora, zdi, velja, omogoča, more, deluje, obstaja, kaže, uporablja, želi
Relations
VERB
nodes are attached to their parents using 19 different relations: root (9733; 40% instances), acl (3536; 14% instances), conj (2981; 12% instances), parataxis (2584; 11% instances), advcl (1950; 8% instances), ccomp (1522; 6% instances), xcomp (1425; 6% instances), csubj (812; 3% instances), fixed (22; 0% instances), orphan (12; 0% instances), appos (3; 0% instances), list (3; 0% instances), dep (2; 0% instances), dislocated (2; 0% instances), nmod (2; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), obl (1; 0% instances)
Parents of VERB
nodes belong to 13 different parts of speech: (9733; 40% instances), VERB (9284; 38% instances), NOUN (3371; 14% instances), ADJ (1365; 6% instances), DET (445; 2% instances), PROPN (210; 1% instances), PRON (73; 0% instances), ADV (60; 0% instances), NUM (27; 0% instances), X (13; 0% instances), PART (7; 0% instances), AUX (4; 0% instances), INTJ (1; 0% instances)
138 (1%) VERB
nodes are leaves.
821 (3%) VERB
nodes have one child.
2127 (9%) VERB
nodes have two children.
21507 (87%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 13.
Children of VERB
nodes are attached using 29 different relations: punct (22449; 21% instances), obl (14834; 14% instances), advmod (12699; 12% instances), obj (11732; 11% instances), aux (10618; 10% instances), nsubj (9787; 9% instances), mark (6590; 6% instances), expl (3683; 3% instances), cc (3210; 3% instances), conj (2985; 3% instances), parataxis (2933; 3% instances), xcomp (1915; 2% instances), advcl (1890; 2% instances), ccomp (1801; 2% instances), iobj (1056; 1% instances), csubj (430; 0% instances), discourse (117; 0% instances), vocative (50; 0% instances), dep (44; 0% instances), cop (19; 0% instances), cc:preconj (9; 0% instances), appos (8; 0% instances), dislocated (8; 0% instances), fixed (5; 0% instances), case (3; 0% instances), orphan (3; 0% instances), nummod (2; 0% instances), amod (1; 0% instances), nmod (1; 0% instances)
Children of VERB
nodes belong to 17 different parts of speech: NOUN (28178; 26% instances), PUNCT (22449; 21% instances), AUX (10637; 10% instances), VERB (9284; 9% instances), PRON (8371; 8% instances), ADV (7205; 7% instances), SCONJ (6229; 6% instances), CCONJ (4684; 4% instances), PART (4332; 4% instances), PROPN (2927; 3% instances), DET (2088; 2% instances), ADJ (1977; 2% instances), NUM (272; 0% instances), X (115; 0% instances), ADP (62; 0% instances), SYM (54; 0% instances), INTJ (18; 0% instances)