home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: NOUN

There are 12572 NOUN lemmas (51%), 17692 NOUN types (52%) and 57253 NOUN tokens (18%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: år, dag, land, gang, tid, barn, verden, kirke, del, folk

The 10 most frequent NOUN types: år, dag, prosent, gang, tid, folk, verden, land, barn, del

The 10 most frequent ambiguous lemmas: land (NOUN 387, X 1), tid (NOUN 362, X 1), del (NOUN 251, X 3, PROPN 1), mann (NOUN 198, X 1), problem (NOUN 163, X 1), krone (NOUN 114, VERB 2), ord (NOUN 107, PROPN 1), by (NOUN 102, VERB 14, X 1), person (NOUN 102, X 1), fall (NOUN 100, X 1)

The 10 most frequent ambiguous types: tid (NOUN 204, X 1), land (NOUN 175, X 1), del (NOUN 169, X 3, PROPN 1), landet (NOUN 131, VERB 8), kroner (NOUN 110, VERB 1), fall (NOUN 92, X 1), bruk (NOUN 67, VERB 1, X 1), leder (NOUN 68, VERB 30), mann (NOUN 67, PRON 1, X 1), rekke (NOUN 64, VERB 3)

Morphology

The form / lemma ratio of NOUN is 1.407254 (the average of all parts of speech is 1.381699).

The 1st highest number of forms (9) was observed with the lemma “tid”: tid, tida, tidas, tiden, tidene, tidenes, tider, tiders, tids.

The 2nd highest number of forms (8) was observed with the lemma “kirke”: Kirkenes, kirka, kirke, kirken, kirkene, kirkens, kirker, kirkes.

The 3rd highest number of forms (8) was observed with the lemma “kvinne”: kvinna, kvinne, kvinnen, kvinnene, kvinnenes, kvinnens, kvinner, kvinners.

NOUN occurs with 6 features: Gender (56327; 98% instances), Number (55823; 98% instances), Definite (55821; 97% instances), Case (1395; 2% instances), Abbr (176; 0% instances), PronType (1; 0% instances)

NOUN occurs with 13 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Definite=Def,Ind, Definite=Ind, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Gender=Neut, Number=Plur, Number=Plur,Sing, Number=Sing, PronType=Prs

NOUN occurs with 44 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (13885 tokens). Examples: dag, gang, verden, del, grunn, plass, vei, måte, politikk, grad

Relations

NOUN nodes are attached to their parents using 19 different relations: obl (15101; 26% instances), nmod (10817; 19% instances), nsubj (10393; 18% instances), obj (10017; 17% instances), conj (4824; 8% instances), root (2862; 5% instances), xcomp (1303; 2% instances), appos (525; 1% instances), flat:name (469; 1% instances), ccomp (385; 1% instances), nsubj:outer (155; 0% instances), iobj (146; 0% instances), parataxis (74; 0% instances), dislocated (72; 0% instances), compound (55; 0% instances), csubj (46; 0% instances), flat (6; 0% instances), reparandum (2; 0% instances), discourse (1; 0% instances)

Parents of NOUN nodes belong to 16 different parts of speech: VERB (32935; 58% instances), NOUN (15697; 27% instances), ADJ (3127; 5% instances), (2862; 5% instances), PROPN (1220; 2% instances), PRON (372; 1% instances), NUM (302; 1% instances), DET (295; 1% instances), ADV (288; 1% instances), ADP (127; 0% instances), PART (6; 0% instances), SCONJ (6; 0% instances), AUX (5; 0% instances), INTJ (5; 0% instances), X (5; 0% instances), CCONJ (1; 0% instances)

9524 (17%) NOUN nodes are leaves.

19282 (34%) NOUN nodes have one child.

15393 (27%) NOUN nodes have two children.

13054 (23%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 17.

Children of NOUN nodes are attached using 30 different relations: case (24757; 24% instances), nmod (14429; 14% instances), det (14289; 14% instances), amod (13737; 14% instances), punct (7209; 7% instances), conj (4653; 5% instances), cc (3946; 4% instances), acl:relcl (2878; 3% instances), cop (2661; 3% instances), nummod (2197; 2% instances), advmod (1961; 2% instances), nsubj (1825; 2% instances), appos (1287; 1% instances), flat:name (1173; 1% instances), acl (1040; 1% instances), obl (731; 1% instances), mark (714; 1% instances), expl (503; 0% instances), aux (252; 0% instances), compound (242; 0% instances), advcl (199; 0% instances), csubj (169; 0% instances), xcomp (169; 0% instances), parataxis (78; 0% instances), dislocated (24; 0% instances), discourse (18; 0% instances), nsubj:outer (15; 0% instances), obj (5; 0% instances), reparandum (2; 0% instances), ccomp (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (25010; 25% instances), NOUN (15697; 16% instances), ADJ (14453; 14% instances), DET (12391; 12% instances), PUNCT (7209; 7% instances), PROPN (6035; 6% instances), VERB (4590; 5% instances), CCONJ (3930; 4% instances), PRON (3830; 4% instances), AUX (2914; 3% instances), NUM (2749; 3% instances), ADV (1326; 1% instances), SCONJ (637; 1% instances), PART (323; 0% instances), SYM (34; 0% instances), INTJ (18; 0% instances), X (18; 0% instances)