Treebank Statistics: UD_Swedish-Talbanken: POS Tags: VERB
There are 1251 VERB lemmas (11%), 2569 VERB types (16%) and 9790 VERB tokens (10%).
Out of 17 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.
The 10 most frequent VERB lemmas: ha, få, bli, finnas, göra, ge, ta, gå, komma, se
The 10 most frequent VERB types: har, finns, blir, få, får, ha, gäller, behöver, ger, går
The 10 most frequent ambiguous lemmas: ha (VERB 589, AUX 577, INTJ 2), få (VERB 383, AUX 122, ADJ 18), bli (VERB 314, AUX 49), komma (VERB 162, AUX 123), tro (VERB 32, NOUN 3), bestämma (VERB 31, ADV 1), bo (VERB 31, NOUN 6), lära (VERB 30, NOUN 2), vara (AUX 1847, NOUN 24, VERB 24, PRON 9, ADV 8, DET 8), dela (VERB 22, NOUN 1)
The 10 most frequent ambiguous types: har (AUX 520, VERB 395), blir (VERB 175, AUX 22), få (VERB 164, AUX 25, ADJ 9), får (VERB 144, AUX 78), ha (VERB 117, AUX 25, INTJ 1), går (VERB 81, NOUN 3), kommer (AUX 110, VERB 76), bli (VERB 71, AUX 17), komma (VERB 49, AUX 2), fått (VERB 43, AUX 7)
- har
- blir
- få
- får
- ha
- går
- kommer
- bli
- komma
- fått
Morphology
The form / lemma ratio of VERB is 2.053557 (the average of all parts of speech is 1.421561).
The 1st highest number of forms (11) was observed with the lemma “säga”: sa, sade, sagt, sagts, säga, sägas, säger, sägs, säja, säjer, säjs.
The 2nd highest number of forms (10) was observed with the lemma “lägga”: Lägg, la, lade, lades, lagt, lagts, lägga, läggas, lägger, läggs.
The 3rd highest number of forms (9) was observed with the lemma “använda”: Använd, använda, användas, använde, använder, användes, används, använt, använts.
VERB occurs with 10 features: VerbForm (9779; 100% instances), Voice (9483; 97% instances), Mood (5766; 59% instances), Tense (5607; 57% instances), Case (49; 1% instances), Number (49; 1% instances), Definite (27; 0% instances), Gender (27; 0% instances), Abbr (7; 0% instances), ExtPos (3; 0% instances)
VERB occurs with 20 feature-value pairs: Abbr=Yes, Case=Nom, Definite=Ind, ExtPos=ADV, Gender=Com, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Stem, VerbForm=Sup, Voice=Act, Voice=Pass
VERB occurs with 27 feature combinations.
The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Act (3900 tokens).
Examples: har, blir, får, gäller, behöver, ger, går, kommer, gör, visar
Relations
VERB nodes are attached to their parents using 22 different relations: root (4386; 45% instances), advcl (1392; 14% instances), acl:relcl (1107; 11% instances), conj (1009; 10% instances), xcomp (403; 4% instances), ccomp (365; 4% instances), acl (355; 4% instances), csubj (309; 3% instances), parataxis (287; 3% instances), acl:cleft (93; 1% instances), csubj:pass (23; 0% instances), advcl:relcl (20; 0% instances), appos (11; 0% instances), fixed (11; 0% instances), dislocated (5; 0% instances), csubj:outer (4; 0% instances), advmod (3; 0% instances), orphan (3; 0% instances), amod (1; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances)
Parents of VERB nodes belong to 11 different parts of speech: (4386; 45% instances), VERB (3068; 31% instances), NOUN (1534; 16% instances), ADJ (469; 5% instances), PRON (240; 2% instances), ADV (54; 1% instances), PROPN (22; 0% instances), NUM (8; 0% instances), AUX (6; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances)
56 (1%) VERB nodes are leaves.
442 (5%) VERB nodes have one child.
1288 (13%) VERB nodes have two children.
8004 (82%) VERB nodes have three or more children.
The highest child degree of a VERB node is 24.
Children of VERB nodes are attached using 38 different relations: nsubj (6144; 16% instances), punct (5811; 15% instances), obl (5322; 14% instances), obj (4241; 11% instances), advmod (3606; 9% instances), mark (3150; 8% instances), aux (2313; 6% instances), advcl (1351; 3% instances), nsubj:pass (1318; 3% instances), conj (1104; 3% instances), xcomp (1095; 3% instances), cc (1066; 3% instances), compound:prt (800; 2% instances), ccomp (473; 1% instances), parataxis (280; 1% instances), expl (273; 1% instances), iobj (162; 0% instances), obl:agent (139; 0% instances), csubj (114; 0% instances), appos (55; 0% instances), nummod (55; 0% instances), aux:pass (49; 0% instances), dislocated (46; 0% instances), cop (37; 0% instances), csubj:pass (34; 0% instances), nsubj:outer (30; 0% instances), nmod (25; 0% instances), advcl:relcl (20; 0% instances), amod (15; 0% instances), discourse (14; 0% instances), acl (4; 0% instances), csubj:outer (3; 0% instances), fixed (3; 0% instances), acl:cleft (2; 0% instances), acl:relcl (2; 0% instances), case (1; 0% instances), list (1; 0% instances), vocative (1; 0% instances)
Children of VERB nodes belong to 16 different parts of speech: NOUN (12592; 32% instances), PUNCT (5811; 15% instances), PRON (4868; 12% instances), ADV (3795; 10% instances), VERB (3068; 8% instances), AUX (2400; 6% instances), PART (1625; 4% instances), SCONJ (1428; 4% instances), CCONJ (1077; 3% instances), ADJ (979; 3% instances), ADP (847; 2% instances), PROPN (424; 1% instances), NUM (225; 1% instances), INTJ (18; 0% instances), DET (1; 0% instances), SYM (1; 0% instances)