home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: VERB

There are 1739 VERB lemmas (7%), 3911 VERB types (12%) and 28776 VERB tokens (10%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: ha, seie, få, vere, kome, gå, gjere, ta, sjå, bli

The 10 most frequent VERB types: har, seier, er, få, kjem, får, meiner, ha, går, fekk

The 10 most frequent ambiguous lemmas: ha (AUX 2737, VERB 1765, INTJ 4, ADJ 1, X 1), seie (VERB 1301, ADJ 24), (VERB 1206, AUX 252, ADJ 95, X 1), vere (AUX 7734, VERB 855, ADJ 2), kome (VERB 840, ADJ 43), (VERB 826, ADJ 4, X 2), gjere (VERB 722, ADJ 56), ta (VERB 661, ADJ 56, ADP 1), sjå (VERB 653, ADJ 36), bli (VERB 608, AUX 572, ADJ 16, X 2)

The 10 most frequent ambiguous types: har (AUX 2197, VERB 1050, X 7, SCONJ 1), er (AUX 5260, VERB 558, X 12, NOUN 1), (VERB 361, AUX 82, ADJ 63, X 1), får (VERB 339, AUX 86, X 1), ha (VERB 337, AUX 201, INTJ 4, PRON 2, X 1), går (VERB 307, NOUN 23, X 2), fekk (VERB 301, AUX 58), blir (AUX 330, VERB 284), ta (VERB 263, ADP 1), (VERB 226, X 2)

Morphology

The form / lemma ratio of VERB is 2.248994 (the average of all parts of speech is 1.346455).

The 1st highest number of forms (11) was observed with the lemma “la”: Lat, la, ladd, lar, late, latt, let, lot, lèt, lét, lête.

The 2nd highest number of forms (10) was observed with the lemma “følgje”: følg, følgd, følgde, følge, følgja, følgjast, følgje, følgjer, følgt, følgte.

The 3rd highest number of forms (10) was observed with the lemma “seie”: sa, sagt, seg, segja, sei, seia, seiast, seie, seier, sier.

VERB occurs with 6 features: VerbForm (28776; 100% instances), Mood (17662; 61% instances), Tense (17326; 60% instances), Abbr (40; 0% instances), Definite (4; 0% instances), Gender (4; 0% instances)

VERB occurs with 12 feature-value pairs: Abbr=Yes, Definite=Ind, Gender=Fem,Masc, Gender=Neut, Mood=Imp, Mood=Ind, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Fin,Part, VerbForm=Inf, VerbForm=Part

VERB occurs with 9 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin (12221 tokens). Examples: har, seier, er, kjem, får, meiner, går, blir, ser, gjer

Relations

VERB nodes are attached to their parents using 18 different relations: root (11270; 39% instances), advcl (4379; 15% instances), conj (3621; 13% instances), acl:relcl (3581; 12% instances), ccomp (1792; 6% instances), xcomp (1581; 5% instances), acl (1264; 4% instances), csubj (908; 3% instances), parataxis (270; 1% instances), dislocated (51; 0% instances), flat:name (34; 0% instances), reparandum (10; 0% instances), iobj (6; 0% instances), compound (3; 0% instances), csubj:pass (3; 0% instances), csubj:outer (1; 0% instances), flat (1; 0% instances), nmod (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: (11270; 39% instances), VERB (9372; 33% instances), NOUN (4307; 15% instances), ADJ (2094; 7% instances), PRON (827; 3% instances), ADV (395; 1% instances), PROPN (287; 1% instances), ADP (86; 0% instances), DET (86; 0% instances), NUM (35; 0% instances), INTJ (6; 0% instances), X (6; 0% instances), AUX (4; 0% instances), PART (1; 0% instances)

104 (0%) VERB nodes are leaves.

757 (3%) VERB nodes have one child.

3221 (11%) VERB nodes have two children.

24694 (86%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.

Children of VERB nodes are attached using 25 different relations: nsubj (19492; 17% instances), punct (17424; 15% instances), obl (12834; 11% instances), obj (12411; 11% instances), mark (11425; 10% instances), advmod (11099; 10% instances), aux (6760; 6% instances), xcomp (4018; 3% instances), cc (3847; 3% instances), conj (3522; 3% instances), case (3418; 3% instances), advcl (3268; 3% instances), ccomp (2801; 2% instances), expl (1812; 2% instances), iobj (529; 0% instances), cop (379; 0% instances), csubj (250; 0% instances), parataxis (214; 0% instances), nsubj:outer (178; 0% instances), dislocated (169; 0% instances), discourse (89; 0% instances), nummod (40; 0% instances), reparandum (25; 0% instances), flat (2; 0% instances), csubj:outer (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (29187; 25% instances), PUNCT (17424; 15% instances), PRON (13051; 11% instances), VERB (9372; 8% instances), SCONJ (7484; 6% instances), AUX (7145; 6% instances), ADV (7141; 6% instances), ADJ (5943; 5% instances), PART (5488; 5% instances), PROPN (5325; 5% instances), CCONJ (3849; 3% instances), ADP (3545; 3% instances), NUM (491; 0% instances), DET (425; 0% instances), INTJ (95; 0% instances), X (42; 0% instances)