
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: NOUN

There are 12759 NOUN lemmas (52%), 17405 NOUN types (53%) and 60030 NOUN tokens (20%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
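Counts of this kind can be reproduced from a treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the toy rows are hypothetical, and it assumes that a "type" is a case-sensitive word form and that multiword-token ranges and empty nodes are skipped.

```python
from collections import defaultdict

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive forms
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}

# Toy rows made up for illustration; they are not taken from the treebank.
ROWS = [
    ("1", "Dagane", "dag", "NOUN", "_", "Definite=Def|Gender=Masc|Number=Plur", "2", "nsubj", "_", "_"),
    ("2", "går", "gå", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "dag", "dag", "NOUN", "_", "Definite=Ind|Gender=Masc|Number=Sing", "2", "obl", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

counts = pos_counts(SAMPLE)
print(counts["NOUN"])  # (lemmas, types, tokens) for NOUN in the toy sample
```

In the toy sample the two NOUN tokens share one lemma but have two distinct forms, which is the same lemma/type/token distinction the figures above report.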

The 10 most frequent NOUN lemmas: år, dag, land, tid, folk, språk, del, kommune, gong, sak

The 10 most frequent NOUN types: år, dag, folk, tid, prosent, språk, kroner, del, landet, regjeringa

The 10 most frequent ambiguous lemmas: år (NOUN 711, X 2), tid (NOUN 377, X 1), folk (NOUN 353, X 2), språk (NOUN 304, X 1), del (NOUN 296, X 2), prosent (NOUN 240, X 1), bok (NOUN 211, X 1), liv (NOUN 163, X 1), mann (NOUN 186, X 1), grunn (NOUN 181, ADJ 2)

The 10 most frequent ambiguous types: år (NOUN 502, X 2), folk (NOUN 284, X 2), tid (NOUN 247, X 1), prosent (NOUN 233, X 1), språk (NOUN 215, X 1), del (NOUN 187, VERB 3, X 2), grunn (NOUN 127, ADJ 1), departementet (NOUN 78, X 1), SV (NOUN 114, X 1), arbeidet (NOUN 105, X 1)

Morphology

The form / lemma ratio of NOUN is 1.364135 (the average of all parts of speech is 1.352830).
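The page does not state how the ratio is defined, but dividing the number of NOUN types by the number of NOUN lemmas reported above reproduces the figure, so that definition is assumed in this short check. The variable names are illustrative only.

```python
# Figures reported on this page for NOUN.
noun_types = 17405
noun_lemmas = 12759

# Assumption: form / lemma ratio = distinct forms (types) / distinct lemmas.
ratio = noun_types / noun_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # 1.364135
```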

The highest number of forms (9) was observed with the lemma “medlem”: medlammar, medlem, medlemar, medlemene, medlemer, medlemmane, medlemmar, medlemmene, medlemmer.

The second-highest number of forms (8) was observed with the lemma “tid”: tid, tida, tidene, tidenes, tider, tiders, tidi, tids.

The third-highest number of forms (7) was observed with the lemma “barn”: barn, barna, barnet, barnets, barns, born, borna.

NOUN occurs with 5 features: Gender (58293; 97% instances), Number (55395; 92% instances), Definite (55391; 92% instances), Abbr (1013; 2% instances), Case (598; 1% instances)

NOUN occurs with 11 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Definite=Def,Ind, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Plur,Sing, Number=Sing

NOUN occurs with 41 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (11328 tokens). Examples: dag, del, gong, grunn, leiar, bruk, plass, politikk, måte, fredag
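The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file. The sketch below is hypothetical: `feature_stats` and the toy input are not part of any UD tool, and it assumes that a multi-valued entry such as Definite=Def,Ind counts as a single feature-value pair, as it does in the list above. Whether the empty value `_` counts as a combination on this page is not stated; here it is simply counted like any other string.

```python
from collections import Counter

def feature_stats(feats_values):
    """Count features, feature-value pairs and full combinations."""
    features = Counter()  # e.g. "Gender"
    pairs = Counter()     # e.g. "Gender=Masc" or "Definite=Def,Ind"
    combos = Counter()    # the whole FEATS string
    for feats in feats_values:
        combos[feats] += 1
        if feats == "_":
            continue  # token without morphological features
        for item in feats.split("|"):
            name, _, _value = item.partition("=")
            features[name] += 1
            pairs[item] += 1
    return features, pairs, combos

# Toy FEATS values made up for illustration.
FEATS = [
    "Definite=Ind|Gender=Masc|Number=Sing",
    "Definite=Ind|Gender=Masc|Number=Sing",
    "Definite=Def,Ind|Gender=Neut|Number=Plur,Sing",
    "_",
]
features, pairs, combos = feature_stats(FEATS)
top_combo = combos.most_common(1)[0]
print(top_combo)
```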

Relations

NOUN nodes are attached to their parents using 23 different relations: obl (14506; 24% instances), nmod (12467; 21% instances), nsubj (10942; 18% instances), obj (9635; 16% instances), conj (4860; 8% instances), root (3015; 5% instances), flat:name (1086; 2% instances), xcomp (1085; 2% instances), appos (692; 1% instances), nsubj:pass (547; 1% instances), acl (209; 0% instances), ccomp (192; 0% instances), advcl (191; 0% instances), iobj (163; 0% instances), acl:relcl (156; 0% instances), orphan (105; 0% instances), parataxis (80; 0% instances), compound (40; 0% instances), csubj (33; 0% instances), acl:cleft (20; 0% instances), reparandum (3; 0% instances), discourse (2; 0% instances), flat:foreign (1; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech (counting the empty category of root nodes, which have no parent): VERB (32069; 53% instances), NOUN (17597; 29% instances), ADJ (3387; 6% instances), [no parent] (3015; 5% instances), PROPN (2097; 3% instances), PRON (586; 1% instances), DET (356; 1% instances), NUM (354; 1% instances), ADV (309; 1% instances), ADP (228; 0% instances), X (13; 0% instances), INTJ (9; 0% instances), SCONJ (6; 0% instances), PART (4; 0% instances)
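Relation and parent statistics like these follow from the HEAD and DEPREL columns of a CoNLL-U file. The sketch below is one way to compute them under stated assumptions: `read_sentences`, `attachment_stats` and the `(root)` label are hypothetical names, the toy sentences are made up, and nodes with HEAD 0 are counted as having no parent.

```python
from collections import Counter

def read_sentences(conllu_text):
    """Yield each sentence as a dict {token id: list of columns}."""
    sentence = {}
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            if sentence:
                yield sentence
                sentence = {}
        elif not line.startswith("#"):
            cols = line.split("\t")
            if cols[0].isdigit():  # skip ranges and empty nodes
                sentence[cols[0]] = cols

def attachment_stats(conllu_text, upos="NOUN"):
    """Count incoming relations and parent POS tags for nodes tagged upos."""
    relations, parent_pos = Counter(), Counter()
    for sentence in read_sentences(conllu_text):
        for cols in sentence.values():
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            head = cols[6]
            # HEAD 0 marks the root, which has no parent node.
            parent_pos["(root)" if head == "0" else sentence[head][3]] += 1
    return relations, parent_pos

# Toy sentences made up for illustration.
SENT1 = [
    ("1", "Dagane", "dag", "NOUN", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "går", "gå", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "i", "i", "ADP", "_", "_", "4", "case", "_", "_"),
    ("4", "landet", "land", "NOUN", "_", "_", "2", "obl", "_", "_"),
]
SENT2 = [("1", "Språk", "språk", "NOUN", "_", "_", "0", "root", "_", "_")]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in (SENT1, SENT2)
)

relations, parent_pos = attachment_stats(SAMPLE)
print(relations, parent_pos)
```

In this reading, the number of NOUN nodes with no parent equals the number attached with the root relation, which matches the two figures of 3015 reported above.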

10886 (18%) NOUN nodes are leaves.

19900 (33%) NOUN nodes have one child.

15516 (26%) NOUN nodes have two children.

13728 (23%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 12.
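As a consistency check, the four child-degree counts above should add up to the 60030 NOUN tokens, and the rounded shares should match the stated percentages. The dictionary keys below are illustrative labels, not terms used by the generating tool.

```python
# Child-degree counts reported on this page for NOUN nodes.
degree_counts = {
    "leaves": 10886,
    "one child": 19900,
    "two children": 15516,
    "three or more children": 13728,
}

total = sum(degree_counts.values())
# Share of each group, rounded to whole percent as on the page.
shares = {label: round(100 * n / total) for label, n in degree_counts.items()}

print(total)   # 60030
print(shares)
```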

Children of NOUN nodes are attached using 29 different relations: case (24649; 23% instances), nmod (16336; 16% instances), amod (14425; 14% instances), det (12783; 12% instances), punct (7326; 7% instances), conj (4848; 5% instances), cc (4116; 4% instances), cop (2742; 3% instances), acl:relcl (2686; 3% instances), flat:name (2361; 2% instances), nummod (2212; 2% instances), nsubj (2075; 2% instances), advmod (2055; 2% instances), acl (1565; 1% instances), mark (1529; 1% instances), obl (805; 1% instances), appos (597; 1% instances), expl (524; 0% instances), parataxis (300; 0% instances), aux (288; 0% instances), advcl (216; 0% instances), acl:cleft (206; 0% instances), csubj (158; 0% instances), compound (138; 0% instances), xcomp (110; 0% instances), orphan (104; 0% instances), discourse (32; 0% instances), obj (14; 0% instances), reparandum (3; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (25008; 24% instances), NOUN (17597; 17% instances), ADJ (15385; 15% instances), DET (12948; 12% instances), PUNCT (7326; 7% instances), PROPN (5424; 5% instances), VERB (4865; 5% instances), CCONJ (4156; 4% instances), PRON (3396; 3% instances), AUX (3030; 3% instances), NUM (2819; 3% instances), SCONJ (1466; 1% instances), ADV (1369; 1% instances), PART (341; 0% instances), X (36; 0% instances), INTJ (33; 0% instances), SYM (4; 0% instances)