
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: VERB

There are 2058 VERB lemmas (8%), 4456 VERB types (13%) and 33351 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 4 in number of types and 4 in number of tokens.
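The three counts above can be reproduced from a CoNLL-U file with a short script. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the toy sample are hypothetical, and it assumes that types are counted case-sensitively (which the capitalized forms listed under Morphology suggest) and that multiword-token ranges and empty nodes are skipped.

```python
def upos_counts(conllu_text, upos="VERB"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence breaks and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


# Toy sample (hypothetical, not taken from the treebank).
rows = [
    "1 Hun hun PRON _ _ 2 nsubj _ _",
    "2 kommer komme VERB _ _ 0 root _ _",
    "3 og og CCONJ _ _ 4 cc _ _",
    "4 går gå VERB _ _ 2 conj _ _",
    "5 . $. PUNCT _ _ 2 punct _ _",
    "",
    "1 Kom komme VERB _ _ 0 root _ _",
    "2 ! $! PUNCT _ _ 1 punct _ _",
]
SAMPLE = "\n".join("\t".join(r.split()) for r in rows)

print(upos_counts(SAMPLE))  # (2, 3, 3)
```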

The 10 most frequent VERB lemmas: ha, si, bli, få, komme, ta, være, gjøre, gå, se

The 10 most frequent VERB types: har, sier, er, blir, kommer, går, ha, få, bli, ta

The 10 most frequent ambiguous lemmas: ha (AUX 2853, VERB 1796, X 2), si (VERB 1387, ADJ 3), bli (AUX 1106, VERB 1042), få (VERB 949, AUX 250, ADJ 98), komme (VERB 782, ADJ 9), ta (VERB 782, ADJ 2, X 1), være (AUX 8101, VERB 766, ADJ 1), gjøre (VERB 763, ADJ 3), gå (VERB 735, ADJ 3, X 1), se (VERB 669, ADJ 7)

The 10 most frequent ambiguous types: har (AUX 2219, VERB 1104, X 1), er (AUX 5494, VERB 496, X 4, DET 2), blir (VERB 342, AUX 226, X 2), går (VERB 296, NOUN 56), ha (VERB 283, AUX 218, X 2), få (VERB 292, AUX 80, ADJ 69), bli (VERB 280, AUX 154), ta (VERB 271, X 1), ble (AUX 578, VERB 264), får (VERB 254, AUX 86)

Morphology

The form / lemma ratio of VERB is 2.165209 (the average of all parts of speech is 1.381903).
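The reported ratio is consistent with dividing the number of VERB types (4456) by the number of VERB lemmas (2058) given at the top of the page. A minimal sketch of that arithmetic, where the helper name `form_lemma_ratio` is hypothetical:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts taken from the summary at the top of this page.
print(f"{form_lemma_ratio(4456, 2058):.6f}")  # 2.165209
```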

The 1st highest number of forms (7) was observed with the lemma “bygge”: bygd, bygde, byge, bygge, bygger, bygges, bygget.

The 2nd highest number of forms (7) was observed with the lemma “fortelle”: Fortell, Fotelle, fortalt, fortalte, fortelle, forteller, fortelles.

The 3rd highest number of forms (7) was observed with the lemma “kalle”: Kall, kalle, kaller, kalles, kallet, kalt, kalte.

VERB occurs with 7 features: VerbForm (33351; 100% instances), Mood (19377; 58% instances), Tense (19147; 57% instances), Voice (1147; 3% instances), Abbr (19; 0% instances), Definite (1; 0% instances), Number (1; 0% instances)

VERB occurs with 11 feature-value pairs: Abbr=Yes, Definite=Ind, Mood=Imp, Mood=Ind, Number=Sing, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Pass

VERB occurs with 10 feature combinations. The most frequent feature combination is Mood=Ind|Tense=Pres|VerbForm=Fin (13057 tokens). Examples: har, sier, er, blir, kommer, går, mener, får, ser, gjør
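The feature, feature-value and combination counts above can all be derived from the FEATS column of the VERB tokens. The sketch below is an assumption about how such counts could be gathered, not the generator's actual code: `feature_stats` and the sample values are hypothetical, an empty FEATS value (`_`) is counted as its own combination, and multi-valued features such as `Case=Acc,Nom` are not split.

```python
from collections import Counter


def feature_stats(feats_column):
    """Count features, feature-value pairs and full combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1
        if feats == "_":
            continue  # token carries no features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Toy FEATS values (hypothetical, not taken from the treebank).
sample = [
    "Mood=Ind|Tense=Pres|VerbForm=Fin",
    "Mood=Ind|Tense=Pres|VerbForm=Fin",
    "VerbForm=Inf",
    "Mood=Imp|VerbForm=Fin",
]
features, pairs, combos = feature_stats(sample)
print(len(features), len(pairs), len(combos))  # 3 5 3
print(combos.most_common(1))  # [('Mood=Ind|Tense=Pres|VerbForm=Fin', 2)]
```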

Relations

VERB nodes are attached to their parents using 17 different relations: root (13780; 41% instances), advcl (4760; 14% instances), conj (4152; 12% instances), acl:relcl (3490; 10% instances), ccomp (1722; 5% instances), acl (1544; 5% instances), xcomp (1301; 4% instances), parataxis (1261; 4% instances), csubj (927; 3% instances), acl:cleft (267; 1% instances), nmod (69; 0% instances), orphan (34; 0% instances), flat:name (14; 0% instances), reparandum (11; 0% instances), compound (10; 0% instances), csubj:pass (8; 0% instances), iobj (1; 0% instances)

Parents of VERB nodes belong to 15 different parts of speech: ROOT (13780; 41% instances), VERB (10916; 33% instances), NOUN (4718; 14% instances), ADJ (2017; 6% instances), PRON (892; 3% instances), ADV (440; 1% instances), PROPN (354; 1% instances), DET (85; 0% instances), ADP (75; 0% instances), NUM (41; 0% instances), X (12; 0% instances), AUX (11; 0% instances), INTJ (8; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)

87 (0%) VERB nodes are leaves.

1264 (4%) VERB nodes have one child.

4380 (13%) VERB nodes have two children.

27620 (83%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.
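The child-degree figures above partition the VERB tokens: 87 + 1264 + 4380 + 27620 = 33351. A minimal sketch of how such a histogram could be computed from the ID, UPOS and HEAD columns follows; the function name `child_degree_histogram`, the tuple layout and the sample sentences are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="VERB"):
    """sentences: list of sentences, each a list of (id, upos, head) tuples.

    Returns (histogram, max_degree); bucket 3 means "three or more".
    """
    hist, max_degree = Counter(), 0
    for sent in sentences:
        # Number of children per node = how often its ID occurs as a HEAD.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                degree = n_children[tok_id]
                hist[min(degree, 3)] += 1
                max_degree = max(max_degree, degree)
    return hist, max_degree


# Toy sentences (hypothetical, not taken from the treebank).
sentences = [
    [(1, "PRON", 2), (2, "VERB", 0), (3, "CCONJ", 4), (4, "VERB", 2), (5, "PUNCT", 2)],
    [(1, "VERB", 0), (2, "PUNCT", 1)],
]
hist, max_degree = child_degree_histogram(sentences)
print(dict(hist), max_degree)  # {3: 1, 1: 2} 3
```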

Children of VERB nodes are attached using 31 different relations: nsubj (23342; 18% instances), punct (21227; 16% instances), obl (15382; 12% instances), obj (13810; 10% instances), mark (12717; 10% instances), advmod (10033; 8% instances), aux (8143; 6% instances), cc (4181; 3% instances), conj (3998; 3% instances), xcomp (3789; 3% instances), advcl (3774; 3% instances), compound:prt (2537; 2% instances), ccomp (2229; 2% instances), expl (1876; 1% instances), nsubj:pass (1853; 1% instances), aux:pass (1106; 1% instances), parataxis (962; 1% instances), iobj (695; 1% instances), cop (367; 0% instances), csubj (278; 0% instances), case (264; 0% instances), nmod (115; 0% instances), discourse (76; 0% instances), appos (42; 0% instances), reparandum (35; 0% instances), orphan (30; 0% instances), nummod (15; 0% instances), acl (10; 0% instances), flat:name (9; 0% instances), csubj:pass (8; 0% instances), compound (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (31629; 24% instances), PUNCT (21227; 16% instances), PRON (18809; 14% instances), VERB (10916; 8% instances), AUX (9616; 7% instances), SCONJ (8639; 7% instances), PROPN (6877; 5% instances), PART (5954; 4% instances), ADJ (5847; 4% instances), ADV (5239; 4% instances), CCONJ (4184; 3% instances), ADP (2906; 2% instances), NUM (569; 0% instances), DET (394; 0% instances), INTJ (77; 0% instances), X (20; 0% instances), SYM (1; 0% instances)