home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Swedish-Talbanken: POS Tags: VERB

There are 1250 VERB lemmas (11%), 2568 VERB types (16%) and 9870 VERB tokens (10%). Out of 17 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: ha, få, bli, finnas, göra, ge, ta, gå, komma, vara

The 10 most frequent VERB types: har, finns, blir, få, får, ha, är, gäller, ger, går

The 10 most frequent ambiguous lemmas: ha (VERB 589, AUX 577, INTJ 2), (VERB 383, AUX 122, ADJ 17, PRON 1), bli (VERB 314, AUX 49), komma (VERB 162, AUX 123), vara (AUX 1717, VERB 154, NOUN 24, PRON 9, ADV 8, DET 8), behöva (VERB 68, AUX 49), tro (VERB 32, NOUN 3), bestämma (VERB 31, ADV 1), bo (VERB 31, NOUN 6), betala (VERB 30, ADJ 1)

The 10 most frequent ambiguous types: har (AUX 520, VERB 395), blir (VERB 175, AUX 22), (VERB 164, AUX 25, ADJ 8, PRON 1), får (VERB 144, AUX 78), ha (VERB 117, AUX 25, INTJ 1), är (AUX 1357, VERB 101), går (VERB 81, NOUN 3), kommer (AUX 110, VERB 76), bli (VERB 71, AUX 17), komma (VERB 49, AUX 2)

Morphology

The form / lemma ratio of VERB is 2.054400 (the average of all parts of speech is 1.430604).

The 1st highest number of forms (11) was observed with the lemma “säga”: sa, sade, sagt, sagts, säga, sägas, säger, sägs, säja, säjer, säjs.

The 2nd highest number of forms (10) was observed with the lemma “lägga”: Lägg, la, lade, lades, lagt, lagts, lägga, läggas, lägger, läggs.

The 3rd highest number of forms (9) was observed with the lemma “använda”: Använd, använda, användas, använde, använder, användes, används, använt, använts.

VERB occurs with 9 features: VerbForm (9860; 100% instances), Voice (9559; 97% instances), Mood (5835; 59% instances), Tense (5676; 58% instances), Case (49; 0% instances), Number (49; 0% instances), Definite (27; 0% instances), Gender (27; 0% instances), Abbr (7; 0% instances)

VERB occurs with 19 feature-value pairs: Abbr=Yes, Case=Nom, Definite=Ind, Gender=Com, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Stem, VerbForm=Sup, Voice=Act, Voice=Pass

VERB occurs with 24 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin|Voice=Act (3956 tokens). Examples: har, blir, får, är, gäller, ger, går, kommer, gör, visar

Relations

VERB nodes are attached to their parents using 19 different relations: root (4426; 45% instances), advcl (1388; 14% instances), acl:relcl (1117; 11% instances), conj (1013; 10% instances), ccomp (399; 4% instances), xcomp (351; 4% instances), acl (345; 3% instances), csubj (311; 3% instances), parataxis (294; 3% instances), acl:cleft (87; 1% instances), dislocated (43; 0% instances), appos (39; 0% instances), csubj:pass (23; 0% instances), nmod (14; 0% instances), fixed (11; 0% instances), advmod (3; 0% instances), orphan (3; 0% instances), nsubj (2; 0% instances), amod (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: (4426; 45% instances), VERB (3127; 32% instances), NOUN (1535; 16% instances), ADJ (468; 5% instances), PRON (235; 2% instances), ADV (36; 0% instances), PROPN (22; 0% instances), NUM (10; 0% instances), AUX (4; 0% instances), DET (4; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances)

49 (0%) VERB nodes are leaves.

430 (4%) VERB nodes have one child.

1313 (13%) VERB nodes have two children.

8078 (82%) VERB nodes have three or more children.

The highest child degree of a VERB node is 50.

Children of VERB nodes are attached using 36 different relations: nsubj (6239; 16% instances), punct (5893; 15% instances), obl (5271; 13% instances), obj (4244; 11% instances), advmod (3717; 9% instances), mark (3110; 8% instances), aux (2367; 6% instances), advcl (1382; 3% instances), nsubj:pass (1330; 3% instances), conj (1112; 3% instances), cc (1078; 3% instances), xcomp (1038; 3% instances), compound:prt (798; 2% instances), ccomp (506; 1% instances), parataxis (292; 1% instances), expl (284; 1% instances), iobj (162; 0% instances), obl:agent (140; 0% instances), csubj (119; 0% instances), dislocated (92; 0% instances), appos (75; 0% instances), case (70; 0% instances), nummod (56; 0% instances), aux:pass (49; 0% instances), amod (48; 0% instances), csubj:pass (34; 0% instances), nmod (27; 0% instances), discourse (15; 0% instances), acl (4; 0% instances), cop (4; 0% instances), fixed (3; 0% instances), acl:cleft (2; 0% instances), acl:relcl (2; 0% instances), det (1; 0% instances), list (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (12672; 32% instances), PUNCT (5893; 15% instances), PRON (4863; 12% instances), ADV (3887; 10% instances), VERB (3127; 8% instances), AUX (2421; 6% instances), PART (1627; 4% instances), SCONJ (1399; 4% instances), CCONJ (1090; 3% instances), ADJ (1044; 3% instances), ADP (857; 2% instances), PROPN (426; 1% instances), NUM (229; 1% instances), INTJ (19; 0% instances), DET (11; 0% instances), SYM (1; 0% instances)