
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: NOUN

There are 12333 NOUN lemmas (50%), 16968 NOUN types (51%) and 56531 NOUN tokens (19%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
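Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment and the function name `pos_counts` are hypothetical, and the sketch assumes that a type is a distinct word form compared case-sensitively, which may differ from the convention behind the figures above.

```python
# Hypothetical CoNLL-U fragment, hand-made for illustration (not from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = (
    "1\tFolk\tfolk\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tles\tlese\tVERB\t_\t_\t0\troot\t_\t_\n"
    "3\tbøker\tbok\tNOUN\t_\t_\t2\tobj\t_\t_\n"
    "4\tog\tog\tCCONJ\t_\t_\t5\tcc\t_\t_\n"
    "5\tboka\tbok\tNOUN\t_\t_\t3\tconj\t_\t_\n"
)


def pos_counts(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # assumption: types are case-sensitive word forms
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "NOUN"))  # (2, 3, 3)
```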

The 10 most frequent NOUN lemmas: år, dag, land, tid, folk, språk, del, kommune, gong, sak

The 10 most frequent NOUN types: år, dag, folk, tid, prosent, språk, kroner, del, landet, regjeringa

The 10 most frequent ambiguous lemmas: år (NOUN 711, X 2), tid (NOUN 377, X 1), folk (NOUN 353, X 2), språk (NOUN 304, X 1), del (NOUN 296, X 2), prosent (NOUN 240, X 1), bok (NOUN 211, X 1), mann (NOUN 186, X 1), grunn (NOUN 181, ADJ 2), liv (NOUN 163, X 1)

The 10 most frequent ambiguous types: år (NOUN 502, X 2), folk (NOUN 284, X 2), tid (NOUN 247, X 1), prosent (NOUN 233, X 1), språk (NOUN 215, X 1), del (NOUN 187, VERB 3, X 2), grunn (NOUN 127, ADJ 1), departementet (NOUN 78, X 1), arbeidet (NOUN 105, X 1), bruk (NOUN 109, VERB 1, X 1)

Morphology

The form / lemma ratio of NOUN is 1.375821 (the average of all parts of speech is 1.346455).
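The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this page. A short check, assuming that is how the figure is derived:

```python
# Figures copied from this page.
noun_types = 16968
noun_lemmas = 12333

# Assumption: form / lemma ratio = distinct forms (types) / distinct lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.375821
```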

The 1st highest number of forms (9) was observed with the lemma “medlem”: medlammar, medlem, medlemar, medlemene, medlemer, medlemmane, medlemmar, medlemmene, medlemmer.

The 2nd highest number of forms (8) was observed with the lemma “tid”: tid, tida, tidene, tidenes, tider, tiders, tidi, tids.

The 3rd highest number of forms (7) was observed with the lemma “barn”: barn, barna, barnet, barnets, barns, born, borna.

NOUN occurs with 5 features: Gender (55466; 98% instances), Definite (33362; 59% instances), Number (14577; 26% instances), Case (525; 1% instances), Abbr (341; 1% instances)

NOUN occurs with 8 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 34 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc (11329 tokens). Examples: dag, del, gong, grunn, leiar, bruk, plass, politikk, måte, fredag

Relations

NOUN nodes are attached to their parents using 21 different relations: obl (14645; 26% instances), nmod (10923; 19% instances), nsubj (10005; 18% instances), obj (9629; 17% instances), conj (4655; 8% instances), root (2768; 5% instances), xcomp (1210; 2% instances), flat:name (687; 1% instances), appos (614; 1% instances), nsubj:pass (528; 1% instances), ccomp (363; 1% instances), iobj (155; 0% instances), nsubj:outer (137; 0% instances), dislocated (94; 0% instances), compound (37; 0% instances), csubj (36; 0% instances), parataxis (31; 0% instances), flat (8; 0% instances), reparandum (3; 0% instances), discourse (2; 0% instances), flat:foreign (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (29187; 52% instances), NOUN (15704; 28% instances), ADJ (5732; 10% instances), no parent, i.e. the node is the root (2768; 5% instances), PROPN (1443; 3% instances), PRON (460; 1% instances), NUM (347; 1% instances), DET (342; 1% instances), ADV (326; 1% instances), ADP (177; 0% instances), AUX (17; 0% instances), INTJ (9; 0% instances), X (9; 0% instances), SCONJ (6; 0% instances), PART (4; 0% instances)
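A parent-tag distribution like the one above can be tallied by looking up the tag of each node's head. In this minimal sketch the sentence, the function name `parent_pos_counts` and the label `ROOT` are all hypothetical; nodes whose head is 0 have no parent word, so they are grouped under `ROOT`.

```python
from collections import Counter

# (id, upos, head) triples for one hypothetical sentence; head 0 marks the root.
SENTENCE = [
    (1, "NOUN", 0),
    (2, "ADP", 3),
    (3, "NOUN", 1),
    (4, "NOUN", 5),
    (5, "VERB", 1),
]


def parent_pos_counts(sentence, upos):
    """Count the UPOS tags of the parents of all nodes tagged `upos`."""
    pos_by_id = {node_id: tag for node_id, tag, _ in sentence}
    counts = Counter()
    for _, tag, head in sentence:
        if tag == upos:
            # Head 0 is not a real node, so it falls back to the ROOT label.
            counts[pos_by_id.get(head, "ROOT")] += 1
    return dict(counts)


print(parent_pos_counts(SENTENCE, "NOUN"))
```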

9079 (16%) NOUN nodes are leaves.

19258 (34%) NOUN nodes have one child.

14881 (26%) NOUN nodes have two children.

13313 (24%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 13.
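The four child-degree groups above partition the NOUN tokens, so their counts should add up to the token total. The sketch below checks that and shows one way such buckets could be computed; the sentence and the function name `child_degree_buckets` are hypothetical, and bucket key 3 stands for "three or more children".

```python
from collections import Counter

# Sanity check on the figures from this page: the groups sum to all NOUN tokens.
assert 9079 + 19258 + 14881 + 13313 == 56531

# (id, upos, head) triples for one hypothetical sentence; head 0 marks the root.
SENTENCE = [
    (1, "DET", 2),
    (2, "NOUN", 3),
    (3, "VERB", 0),
    (4, "ADP", 6),
    (5, "ADJ", 6),
    (6, "NOUN", 3),
    (7, "NOUN", 6),
]


def child_degree_buckets(sentence, upos):
    """Map child degree (0, 1, 2, or 3 for 'three or more') to a node count."""
    children = Counter(head for _, _, head in sentence)  # children per head id
    buckets = Counter()
    for node_id, tag, _ in sentence:
        if tag == upos:
            buckets[min(children[node_id], 3)] += 1
    return dict(buckets)


print(child_degree_buckets(SENTENCE, "NOUN"))
```

In this sample, node 7 is a leaf, node 2 has one child and node 6 has three, so each of the buckets 0, 1 and 3 receives one node.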

Children of NOUN nodes are attached using 31 different relations: case (25274; 25% instances), amod (14307; 14% instances), nmod (14068; 14% instances), det (12753; 13% instances), punct (6635; 7% instances), conj (4539; 4% instances), cc (4000; 4% instances), acl:relcl (2960; 3% instances), cop (2714; 3% instances), nummod (2214; 2% instances), advmod (2061; 2% instances), nsubj (1893; 2% instances), nmod:poss (1535; 2% instances), appos (1336; 1% instances), acl (1190; 1% instances), flat:name (1047; 1% instances), obl (809; 1% instances), mark (688; 1% instances), expl (511; 1% instances), aux (284; 0% instances), advcl (207; 0% instances), csubj (156; 0% instances), parataxis (146; 0% instances), compound (136; 0% instances), xcomp (121; 0% instances), discourse (33; 0% instances), dislocated (27; 0% instances), nsubj:outer (11; 0% instances), reparandum (3; 0% instances), ccomp (2; 0% instances), obj (2; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (25591; 25% instances), NOUN (15704; 15% instances), ADJ (15622; 15% instances), DET (12904; 13% instances), PUNCT (6635; 7% instances), PROPN (5503; 5% instances), VERB (4307; 4% instances), CCONJ (3980; 4% instances), PRON (3203; 3% instances), AUX (3009; 3% instances), NUM (2780; 3% instances), ADV (1369; 1% instances), SCONJ (623; 1% instances), PART (336; 0% instances), X (35; 0% instances), INTJ (34; 0% instances), SYM (27; 0% instances)