Treebank Statistics: UD_Livvi-KKPP: POS Tags: VERB
There are 118 VERB lemmas (20%), 170 VERB types (22%) and 258 VERB tokens (16%).
Out of 14 observed tags, VERB ranks 2nd in number of lemmas, 2nd in number of types and 3rd in number of tokens.
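As a rough illustration of how lemma, type and token counts and the resulting ranks could be derived, here is a minimal sketch. It assumes the (form, lemma, UPOS) triples have already been read from the CoNLL-U file; the rows below are hand-made for the example (the forms and lemmas appear on this page, but the rows are not treebank data), and the function names `upos_counts` and `rank` are hypothetical.

```python
from collections import defaultdict

# Hand-made (form, lemma, UPOS) rows for illustration only; not treebank data.
ROWS = [
    ("sanoi", "sanuo", "VERB"),
    ("sanoo", "sanuo", "VERB"),
    ("tuli", "tulla", "VERB"),
    ("sanoi", "sanuo", "VERB"),
    ("oli", "olla", "AUX"),
    (".", ".", "PUNCT"),
    (".", ".", "PUNCT"),
]

def upos_counts(rows):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in rows:
        lemmas[upos].add(lemma)   # distinct lemmas per tag
        types[upos].add(form)     # distinct word forms per tag (case-sensitive)
        tokens[upos] += 1         # running token count per tag
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, upos, index):
    """1-based rank of `upos` by counts[...][index] (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1

counts = upos_counts(ROWS)
print(counts["VERB"])           # (2, 3, 4)
print(rank(counts, "VERB", 2))  # 1
```

Ties are broken by first appearance here; how the statistics generator breaks ties is not stated on this page.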
The 10 most frequent VERB lemmas: sanuo, olla, lähtie, tulla, sanella, kuulla, mennä, pidiä, tieteä, kyzyö
The 10 most frequent VERB types: sanoi, pietäh, rodieu, sanoo, kuulimo, kuulittogo, kuulluh, sanottih, tiezimö, tiezittö
The 10 most frequent ambiguous lemmas: olla (AUX 40, VERB 11), pidiä (VERB 6, AUX 3), voija (AUX 7, VERB 2)
The 10 most frequent ambiguous types: olen (VERB 3, AUX 2), oli (AUX 17, VERB 3), on (AUX 11, VERB 3), koskijoi (ADJ 1, VERB 1), ole (AUX 2, VERB 1), voibi (AUX 3, VERB 1)
- koskijoi
  - ADJ 1: “ Tverinkarjalazien ystävien ” piämies 2010-2014 Tapio Mustonen sanoi Lihoslavl’an konferensies ” Karjalazet 400 vuottu Tverin mual ” , ku Karjalua koskijoi kielitiijollizii tutkimuksii rodieu ainos vai enämbi Suomes da Ven’al .
  - VERB 1: – Tverin Karjalua koskijoi kielitiijollizii tutkimuksii rodieu ainos vai enämbi Suomes da Ven’al .
Morphology
The form / lemma ratio of VERB is 1.440678 (the average of all parts of speech is 1.337308).
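The VERB ratio can be reproduced from the counts given at the top of this page: 170 types divided by 118 lemmas. A minimal check (the variable names are illustrative):

```python
verb_types = 170   # distinct VERB word forms, from the summary above
verb_lemmas = 118  # distinct VERB lemmas, from the summary above

ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 1.440678
```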
The highest number of forms (9) was observed with the lemma “sanuo”: sanoi, sanomah, sanon, sanoo, sanottih, sanottu, sanottuu, sanou, sanuo.
The second highest number of forms (6) was observed with the lemma “tulla”: tule, tuli, tulla, tulluot, tuloo, tulou.
The third highest number of forms (5) was observed with the lemma “lähtie”: Lähtöö, lähtie, lähtiettih, lähtietäh, lähtöy.
VERB occurs with 11 features: VerbForm (253; 98% instances), Mood (203; 79% instances), Number (203; 79% instances), Tense (201; 78% instances), Voice (195; 76% instances), Person (194; 75% instances), Case (24; 9% instances), Connegative (18; 7% instances), Clitic (6; 2% instances), Degree (1; 0% instances), Reflex (1; 0% instances)
VERB occurs with 27 feature-value pairs: Case=Acc, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Clitic=Go, Connegative=Yes, Degree=Pos, Mood=Imp, Mood=Ind, Mood=Pot, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Reflex=Yes, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Ger, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Pass
VERB occurs with 52 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (41 tokens).
Examples: rodieu, kyzyy, on, sanou, tulou, löydyy, menöö, sanoo, työndää, Lähtöö
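The feature, feature-value and combination counts above could be tallied from the FEATS column roughly as follows. This is a sketch under assumptions: the input list is hand-made (only the first string is taken from this page; the others are illustrative), `parse_feats` and `feature_stats` are hypothetical names, and an empty FEATS value (`_`) is counted here as a combination of its own, which may differ from what the statistics generator does.

```python
from collections import Counter

def parse_feats(feats):
    """Split a CoNLL-U FEATS string such as 'Mood=Ind|Person=3' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_column):
    """Count features, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos

# Illustrative FEATS values for a few VERB tokens; not treebank data.
TOP = "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act"
FEATS = [TOP, TOP, "VerbForm=Inf", "_"]

features, pairs, combos = feature_stats(FEATS)
print(features["VerbForm"])   # 3
print(len(pairs))             # 7
print(combos.most_common(1))  # [(TOP, 2)]
```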
Relations
VERB nodes are attached to their parents using 13 different relations: root (105; 41% instances), conj (61; 24% instances), parataxis (33; 13% instances), advcl (14; 5% instances), xcomp (14; 5% instances), ccomp (12; 5% instances), acl:relcl (11; 4% instances), obl (2; 1% instances), xcomp:ds (2; 1% instances), amod (1; 0% instances), csubj:cop (1; 0% instances), nmod:poss (1; 0% instances), nsubj (1; 0% instances)
Parents of VERB nodes belong to 8 different parts of speech (counting the root, which has no parent): VERB (119; 46% instances), root, no parent (105; 41% instances), NOUN (26; 10% instances), ADJ (3; 1% instances), PRON (2; 1% instances), AUX (1; 0% instances), NUM (1; 0% instances), X (1; 0% instances)
11 (4%) VERB nodes are leaves.
31 (12%) VERB nodes have one child.
40 (16%) VERB nodes have two children.
176 (68%) VERB nodes have three or more children.
The highest child degree of a VERB node is 10.
Children of VERB nodes are attached using 21 different relations: punct (245; 28% instances), obl (133; 15% instances), nsubj (118; 13% instances), advmod (86; 10% instances), obj (86; 10% instances), conj (56; 6% instances), aux (34; 4% instances), parataxis (31; 4% instances), cc (27; 3% instances), advcl (16; 2% instances), ccomp (13; 1% instances), xcomp (12; 1% instances), mark (11; 1% instances), acl:relcl (4; 0% instances), amod (2; 0% instances), discourse (2; 0% instances), fixed (1; 0% instances), nmod (1; 0% instances), orphan (1; 0% instances), vocative (1; 0% instances), xcomp:ds (1; 0% instances)
Children of VERB nodes belong to 13 different parts of speech: NOUN (261; 30% instances), PUNCT (245; 28% instances), VERB (119; 14% instances), ADV (86; 10% instances), PRON (59; 7% instances), AUX (34; 4% instances), CCONJ (27; 3% instances), PROPN (25; 3% instances), SCONJ (12; 1% instances), ADJ (9; 1% instances), INTJ (2; 0% instances), NUM (1; 0% instances), X (1; 0% instances)
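The relation statistics in this section all follow from the HEAD and DEPREL columns. As a consistency check, the four child-degree buckets (11 + 31 + 40 + 176) add up to the 258 VERB tokens. The sketch below shows one way such counts could be computed for a single sentence; the five-token tree is hypothetical, and the function name `verb_attachment_stats` and the dict keys are assumptions made for the example.

```python
from collections import Counter

def verb_attachment_stats(sentence, upos="VERB"):
    """Return (child-degree buckets, parent POS counts, child relation counts)."""
    by_id = {tok["id"]: tok for tok in sentence}
    n_children = Counter(tok["head"] for tok in sentence)
    degree, parent_pos, child_rel = Counter(), Counter(), Counter()
    for tok in sentence:
        parent = by_id.get(tok["head"])  # None when head is 0 (root)
        if tok["upos"] == upos:
            k = n_children[tok["id"]]
            degree["3+" if k >= 3 else str(k)] += 1
            parent_pos[parent["upos"] if parent else "(root)"] += 1
        if parent is not None and parent["upos"] == upos:
            child_rel[tok["deprel"]] += 1
    return degree, parent_pos, child_rel

# Hypothetical five-token tree; not taken from the treebank.
SENTENCE = [
    {"id": 1, "upos": "PRON",  "head": 2, "deprel": "nsubj"},
    {"id": 2, "upos": "VERB",  "head": 0, "deprel": "root"},
    {"id": 3, "upos": "NOUN",  "head": 2, "deprel": "obj"},
    {"id": 4, "upos": "VERB",  "head": 2, "deprel": "conj"},
    {"id": 5, "upos": "PUNCT", "head": 2, "deprel": "punct"},
]

degree, parent_pos, child_rel = verb_attachment_stats(SENTENCE)
print(dict(degree))      # {'3+': 1, '0': 1}
print(dict(parent_pos))  # {'(root)': 1, 'VERB': 1}
```

For a whole treebank, the three counters would be summed over all sentences before computing percentages.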