Treebank Statistics: UD_Slovenian-SSJ: POS Tags: VERB
There are 2897 VERB
lemmas (11%), 8403 VERB
types (17%) and 24592 VERB
tokens (9%).
Out of 17 observed tags, the rank of VERB
is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.
The 10 most frequent VERB
lemmas: biti, imeti, morati, iti, začeti, vedeti, dobiti, priti, moči, reči
The 10 most frequent VERB
types: je, ima, bilo, ni, gre, so, imajo, bo, mora, imel
The 10 most frequent ambiguous lemmas: biti (AUX 17327, VERB 1581), peti (ADJ 15, VERB 12)
The 10 most frequent ambiguous types: je (AUX 7644, VERB 628, PRON 23, X 2), bilo (AUX 236, VERB 186), ni (AUX 683, VERB 158), so (AUX 2616, VERB 143, X 1), bo (AUX 737, VERB 108), mora (VERB 98, NOUN 4), pomeni (VERB 81, NOUN 1), pravi (VERB 74, ADJ 36), bila (AUX 394, VERB 63), bil (AUX 482, VERB 55)
- je
- AUX 7644: Škoda je , da slovenski uporabniki iščejo informacije na tujih straneh .
- VERB 628: In to je način našega informiranja , da vemo , kako je z nami .
- PRON 23: Opazil je novo jahto , ki je prejšnji dan še ni bilo v pristanišču .
- X 2: » Pa pukla je Avstrija - preskupo je , ljudi nemaju para za Prater i gađanje . «
- bilo
- ni
- so
- AUX 2616: Za zadovoljitev pomembne želje so pripravljeni vložiti več truda .
- VERB 143: Zadružniki se zavedamo , kako pomembne odločitve so pred nami .
- X 1: “ The temperatures and winds on top of a building can be brutal , so the best plant material to use should be low - lying and need minimal maintenance . ”
- bo
- mora
- pomeni
- pravi
- bila
- bil
Morphology
The form / lemma ratio of VERB
is 2.900587 (the average of all parts of speech is 1.932008).
The 1st highest number of forms (33) was observed with the lemma “biti”: bi, bijejo, bil, bila, bile, bili, bilo, biti, bla, blo, bo, bodi, bodo, bom, bomo, bosta, boste, boš, je, ni, nisem, nisi, nismo, niso, niste, s, sem, si, smo, so, sta, ste, sva.
The 2nd highest number of forms (21) was observed with the lemma “imeti”: ima, imajo, imam, imamo, imata, imate, imava, imaš, imejte, imel, imela, imele, imeli, imelo, imeti, nima, nimajo, nimam, nimamo, nimate, nimaš.
The 3rd highest number of forms (17) was observed with the lemma “hoteti”: hotel, hotela, hotele, hoteli, hoče, hočejo, hočem, hočemo, hočeta, hočete, hočeš, noče, nočejo, nočem, nočemo, nočete, nočeš.
VERB
occurs with 8 features: VerbForm (24592; 100% instances), Number (22411; 91% instances), Aspect (21155; 86% instances), Gender (11423; 46% instances), Mood (10994; 45% instances), Person (10988; 45% instances), Tense (10531; 43% instances), Polarity (1848; 8% instances)
VERB
occurs with 22 feature-value pairs: Aspect=Imp
, Aspect=Perf
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Dual
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Tense=Fut
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, VerbForm=Sup
VERB
occurs with 111 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin
(2761 tokens).
Examples: mora, zdi, velja, omogoča, more, deluje, obstaja, kaže, uporablja, želi
Relations
VERB
nodes are attached to their parents using 20 different relations: root (9764; 40% instances), acl (3537; 14% instances), conj (2982; 12% instances), parataxis (2439; 10% instances), advcl (1943; 8% instances), ccomp (1575; 6% instances), xcomp (1482; 6% instances), csubj (811; 3% instances), fixed (22; 0% instances), orphan (20; 0% instances), appos (3; 0% instances), list (3; 0% instances), dep (2; 0% instances), dislocated (2; 0% instances), nmod (2; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), nsubj (1; 0% instances), obl (1; 0% instances)
Parents of VERB
nodes belong to 12 different parts of speech: (9764; 40% instances), VERB (9283; 38% instances), NOUN (3363; 14% instances), ADJ (1351; 5% instances), DET (442; 2% instances), PROPN (207; 1% instances), PRON (71; 0% instances), ADV (57; 0% instances), NUM (26; 0% instances), X (14; 0% instances), AUX (7; 0% instances), PART (7; 0% instances)
137 (1%) VERB
nodes are leaves.
802 (3%) VERB
nodes have one child.
2118 (9%) VERB
nodes have two children.
21535 (88%) VERB
nodes have three or more children.
The highest child degree of a VERB
node is 13.
Children of VERB
nodes are attached using 28 different relations: punct (22480; 21% instances), obl (14834; 14% instances), advmod (11502; 11% instances), obj (11169; 10% instances), aux (10618; 10% instances), nsubj (9786; 9% instances), mark (6590; 6% instances), expl (3684; 3% instances), cc (3184; 3% instances), conj (2988; 3% instances), parataxis (2812; 3% instances), ccomp (1958; 2% instances), advcl (1953; 2% instances), xcomp (1912; 2% instances), iobj (1618; 2% instances), csubj (431; 0% instances), discourse (117; 0% instances), vocative (50; 0% instances), dep (44; 0% instances), cop (19; 0% instances), cc:preconj (9; 0% instances), appos (8; 0% instances), dislocated (8; 0% instances), fixed (5; 0% instances), case (3; 0% instances), amod (1; 0% instances), nummod (1; 0% instances), orphan (1; 0% instances)
Children of VERB
nodes belong to 17 different parts of speech: NOUN (28181; 26% instances), PUNCT (22480; 21% instances), AUX (10639; 10% instances), VERB (9283; 9% instances), PRON (8368; 8% instances), ADV (7192; 7% instances), SCONJ (6226; 6% instances), CCONJ (4686; 4% instances), PART (3124; 3% instances), PROPN (2903; 3% instances), DET (2089; 2% instances), ADJ (2055; 2% instances), NUM (272; 0% instances), X (151; 0% instances), ADP (63; 0% instances), SYM (54; 0% instances), INTJ (19; 0% instances)