home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: VERB

There are 1873 VERB lemmas (8%), 4285 VERB types (13%) and 30723 VERB tokens (10%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent VERB lemmas: ha, seie, få, kome, vere, gå, gjere, ta, sjå, bli

The 10 most frequent VERB types: har, seier, er, få, kjem, får, meiner, ha, går, fekk

The 10 most frequent ambiguous lemmas: ha (AUX 2737, VERB 1766, INTJ 4, X 1), seie (VERB 1317, ADJ 8), (VERB 1206, AUX 252, ADJ 95, X 1), kome (VERB 862, ADJ 21), vere (AUX 7734, VERB 855, ADJ 2), (VERB 829, X 2, ADJ 1), gjere (VERB 776, ADJ 2), ta (VERB 708, ADJ 9, ADP 1), sjå (VERB 668, ADJ 21), bli (VERB 620, AUX 576, X 2)

The 10 most frequent ambiguous types: har (AUX 2197, VERB 1050, X 7, SCONJ 1), er (AUX 5259, VERB 559, X 12, NOUN 1), (VERB 361, AUX 82, ADJ 63, X 1), får (VERB 339, AUX 86, X 1), ha (VERB 337, AUX 201, INTJ 4, PRON 2, X 1), går (VERB 307, NOUN 23, X 2), fekk (VERB 301, AUX 58), blir (AUX 330, VERB 284), ta (VERB 263, ADP 1), (VERB 226, X 2)

Morphology

The form / lemma ratio of VERB is 2.287774 (the average of all parts of speech is 1.352830).

The 1st highest number of forms (11) was observed with the lemma “følgje”: fulgt, følg, følgd, følgde, følge, følgja, følgjast, følgje, følgjer, følgt, følgte.

The 2nd highest number of forms (11) was observed with the lemma “la”: Lat, la, ladd, lar, late, latt, let, lot, lèt, lét, lête.

The 3rd highest number of forms (10) was observed with the lemma “seie”: sa, sagt, seg, segja, sei, seia, seiast, seie, seier, sier.

VERB occurs with 7 features: VerbForm (30723; 100% instances), Mood (17694; 58% instances), Tense (17358; 56% instances), Number (1909; 6% instances), Definite (1538; 5% instances), Gender (863; 3% instances), Abbr (37; 0% instances)

VERB occurs with 13 feature-value pairs: Abbr=Yes, Definite=Ind, Gender=Fem, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part

VERB occurs with 10 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin (12263 tokens). Examples: har, seier, er, kjem, får, meiner, går, blir, ser, gjer

Relations

VERB nodes are attached to their parents using 18 different relations: root (11715; 38% instances), advcl (4572; 15% instances), conj (3817; 12% instances), acl:relcl (3338; 11% instances), acl (1695; 6% instances), ccomp (1568; 5% instances), parataxis (1511; 5% instances), xcomp (1147; 4% instances), csubj (951; 3% instances), acl:cleft (265; 1% instances), nmod (63; 0% instances), flat:name (35; 0% instances), orphan (23; 0% instances), reparandum (10; 0% instances), iobj (6; 0% instances), compound (3; 0% instances), csubj:pass (3; 0% instances), appos (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech: (11715; 38% instances), VERB (10260; 33% instances), NOUN (4865; 16% instances), ADJ (1949; 6% instances), PRON (903; 3% instances), ADV (519; 2% instances), PROPN (239; 1% instances), ADP (108; 0% instances), DET (94; 0% instances), NUM (40; 0% instances), X (18; 0% instances), INTJ (8; 0% instances), AUX (4; 0% instances), PART (1; 0% instances)

103 (0%) VERB nodes are leaves.

877 (3%) VERB nodes have one child.

4129 (13%) VERB nodes have two children.

25614 (83%) VERB nodes have three or more children.

The highest child degree of a VERB node is 13.

Children of VERB nodes are attached using 31 different relations: nsubj (22303; 18% instances), punct (18655; 15% instances), obl (14212; 11% instances), obj (12954; 10% instances), mark (12577; 10% instances), advmod (8902; 7% instances), aux (7711; 6% instances), cc (4071; 3% instances), conj (3792; 3% instances), advcl (3578; 3% instances), xcomp (3539; 3% instances), compound:prt (3067; 2% instances), expl (2094; 2% instances), ccomp (2034; 2% instances), aux:pass (1204; 1% instances), parataxis (1079; 1% instances), nsubj:pass (952; 1% instances), iobj (669; 1% instances), cop (387; 0% instances), csubj (293; 0% instances), case (229; 0% instances), nmod (175; 0% instances), discourse (91; 0% instances), appos (55; 0% instances), nummod (45; 0% instances), reparandum (27; 0% instances), acl (20; 0% instances), orphan (13; 0% instances), csubj:pass (3; 0% instances), flat:name (2; 0% instances), compound (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (32069; 26% instances), PUNCT (18655; 15% instances), PRON (16592; 13% instances), VERB (10260; 8% instances), AUX (9302; 7% instances), SCONJ (8610; 7% instances), PART (5628; 5% instances), ADJ (5564; 4% instances), PROPN (4812; 4% instances), ADV (4597; 4% instances), CCONJ (4072; 3% instances), ADP (3418; 3% instances), NUM (573; 0% instances), DET (448; 0% instances), INTJ (97; 0% instances), X (37; 0% instances)