This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home no/pos issue tracker

NOUN: noun

Definition

Nouns are a part of speech typically denoting a person, place, thing, animal or idea. The NOUN tag is used only for common nouns, see PROPN for proper nouns.

In Norwegian, nouns inflect for definiteness (bil-bilen) and usually also for number (bil - biler).

Examples

Treebank Statistics (UD_Norwegian)

There are 12572 NOUN lemmas (51%), 17692 NOUN types (52%) and 57252 NOUN tokens (18%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: år, dag, land, gang, tid, barn, verden, kirke, del, folk

The 10 most frequent NOUN types: år, dag, prosent, gang, tid, folk, verden, land, barn, del

The 10 most frequent ambiguous lemmas: land (NOUN 387, X 1), tid (NOUN 362, X 1), del (NOUN 251, X 3, PROPN 1), mann (NOUN 198, X 1), problem (NOUN 163, X 1), krone (NOUN 114, VERB 2), ord (NOUN 107, PROPN 1), by (NOUN 102, VERB 14, X 1), person (NOUN 102, X 1), fall (NOUN 100, X 1)

The 10 most frequent ambiguous types: tid (NOUN 204, X 1), land (NOUN 175, X 1), del (NOUN 169, X 3, PROPN 1), landet (NOUN 131, VERB 8), kroner (NOUN 110, VERB 1), fall (NOUN 92, X 1), bruk (NOUN 67, VERB 1, X 1), leder (NOUN 68, VERB 30), mann (NOUN 67, PRON 1, X 1), rekke (NOUN 64, VERB 3)

Morphology

The form / lemma ratio of NOUN is 1.407254 (the average of all parts of speech is 1.382778).

The 1st highest number of forms (9) was observed with the lemma “tid”: tid, tida, tidas, tiden, tidene, tidenes, tider, tiders, tids.

The 2nd highest number of forms (8) was observed with the lemma “kirke”: Kirkenes, kirka, kirke, kirken, kirkene, kirkens, kirker, kirkes.

The 3rd highest number of forms (8) was observed with the lemma “kvinne”: kvinna, kvinne, kvinnen, kvinnene, kvinnenes, kvinnens, kvinner, kvinners.

NOUN occurs with 4 features: Gender (56308; 98% instances), Number (55805; 97% instances), Definite (55803; 97% instances), Case (1395; 2% instances)

NOUN occurs with 10 feature-value pairs: Case=Gen, Definite=Def, Definite=Def,Ind, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Plur,Sing, Number=Sing

NOUN occurs with 39 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (13867 tokens). Examples: dag, gang, verden, del, grunn, plass, vei, måte, politikk, grad

Relations

NOUN nodes are attached to their parents using 23 different relations: nmod (24762; 43% instances), dobj (9929; 17% instances), nsubj (8808; 15% instances), conj (4567; 8% instances), root (2918; 5% instances), det (1657; 3% instances), xcomp (1152; 2% instances), nsubjpass (974; 2% instances), appos (540; 1% instances), name (475; 1% instances), acl (250; 0% instances), ccomp (197; 0% instances), advcl (194; 0% instances), acl:relcl (166; 0% instances), nummod (150; 0% instances), iobj (145; 0% instances), remnant (138; 0% instances), parataxis (73; 0% instances), dislocated (64; 0% instances), compound (55; 0% instances), csubj (35; 0% instances), goeswith (2; 0% instances), discourse (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (31629; 55% instances), NOUN (15396; 27% instances), ADJ (3063; 5% instances), ROOT (2918; 5% instances), PROPN (2728; 5% instances), PRON (455; 1% instances), NUM (306; 1% instances), DET (292; 1% instances), ADV (269; 0% instances), ADP (175; 0% instances), X (7; 0% instances), SCONJ (6; 0% instances), INTJ (5; 0% instances), AUX (2; 0% instances), CONJ (1; 0% instances)

12383 (22%) NOUN nodes are leaves.

17894 (31%) NOUN nodes have one child.

13940 (24%) NOUN nodes have two children.

13035 (23%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 20.

Children of NOUN nodes are attached using 29 different relations: case (23274; 24% instances), det (14852; 15% instances), nmod (14005; 14% instances), amod (13520; 14% instances), punct (7512; 8% instances), conj (4549; 5% instances), cc (3802; 4% instances), acl:relcl (2952; 3% instances), cop (2644; 3% instances), nummod (2340; 2% instances), nsubj (1986; 2% instances), advmod (1690; 2% instances), mark (1625; 2% instances), acl (1399; 1% instances), expl (495; 1% instances), appos (455; 0% instances), neg (416; 0% instances), compound (266; 0% instances), aux (252; 0% instances), parataxis (226; 0% instances), advcl (197; 0% instances), csubj (169; 0% instances), xcomp (159; 0% instances), remnant (43; 0% instances), discourse (18; 0% instances), name (17; 0% instances), dobj (11; 0% instances), goeswith (2; 0% instances), ccomp (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (24633; 25% instances), NOUN (15396; 16% instances), ADJ (14482; 15% instances), DET (14402; 15% instances), PUNCT (7512; 8% instances), VERB (7361; 7% instances), PROPN (4192; 4% instances), CONJ (3808; 4% instances), NUM (2739; 3% instances), PRON (1949; 2% instances), ADV (1536; 2% instances), SCONJ (466; 0% instances), AUX (252; 0% instances), PART (79; 0% instances), SYM (35; 0% instances), INTJ (18; 0% instances), X (17; 0% instances)


NOUN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]