home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: NOUN

There are 12572 NOUN lemmas (51%), 17692 NOUN types (52%) and 57252 NOUN tokens (18%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: år, dag, land, gang, tid, barn, verden, kirke, del, folk

The 10 most frequent NOUN types: år, dag, prosent, gang, tid, folk, verden, land, barn, del

The 10 most frequent ambiguous lemmas: land (NOUN 387, X 1), tid (NOUN 362, X 1), del (NOUN 251, X 3, PROPN 1), mann (NOUN 198, X 1), problem (NOUN 163, X 1), krone (NOUN 114, VERB 2), ord (NOUN 107, PROPN 1), by (NOUN 102, VERB 14, X 1), person (NOUN 102, X 1), fall (NOUN 100, X 1)

The 10 most frequent ambiguous types: tid (NOUN 204, X 1), land (NOUN 175, X 1), del (NOUN 169, X 3, PROPN 1), landet (NOUN 131, VERB 8), kroner (NOUN 110, VERB 1), fall (NOUN 92, X 1), bruk (NOUN 67, VERB 1, X 1), leder (NOUN 68, VERB 30), mann (NOUN 67, PRON 1, X 1), rekke (NOUN 64, VERB 3)

Morphology

The form / lemma ratio of NOUN is 1.407254 (the average of all parts of speech is 1.381903).

The 1st highest number of forms (9) was observed with the lemma “tid”: tid, tida, tidas, tiden, tidene, tidenes, tider, tiders, tids.

The 2nd highest number of forms (8) was observed with the lemma “kirke”: Kirkenes, kirka, kirke, kirken, kirkene, kirkens, kirker, kirkes.

The 3rd highest number of forms (8) was observed with the lemma “kvinne”: kvinna, kvinne, kvinnen, kvinnene, kvinnenes, kvinnens, kvinner, kvinners.

NOUN occurs with 5 features: Gender (56308; 98% instances), Number (55805; 97% instances), Definite (55803; 97% instances), Case (1395; 2% instances), Abbr (176; 0% instances)

NOUN occurs with 11 feature-value pairs: Abbr=Yes, Case=Gen, Definite=Def, Definite=Def,Ind, Definite=Ind, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Plur,Sing, Number=Sing

NOUN occurs with 43 feature combinations. The most frequent feature combination is Definite=Ind|Gender=Masc|Number=Sing (13867 tokens). Examples: dag, gang, verden, del, grunn, plass, vei, måte, politikk, grad

Relations

NOUN nodes are attached to their parents using 22 different relations: obl (14619; 26% instances), nmod (11885; 21% instances), obj (9917; 17% instances), nsubj (8869; 15% instances), conj (4705; 8% instances), root (2918; 5% instances), xcomp (1142; 2% instances), nsubj:pass (975; 2% instances), appos (540; 1% instances), flat:name (471; 1% instances), acl (252; 0% instances), ccomp (197; 0% instances), advcl (194; 0% instances), iobj (145; 0% instances), acl:relcl (141; 0% instances), orphan (94; 0% instances), parataxis (73; 0% instances), compound (55; 0% instances), csubj (35; 0% instances), acl:cleft (22; 0% instances), reparandum (2; 0% instances), discourse (1; 0% instances)

Parents of NOUN nodes belong to 16 different parts of speech: VERB (31629; 55% instances), NOUN (15397; 27% instances), ADJ (3063; 5% instances), (2918; 5% instances), PROPN (2728; 5% instances), PRON (458; 1% instances), NUM (306; 1% instances), DET (289; 1% instances), ADV (263; 0% instances), ADP (175; 0% instances), X (7; 0% instances), PART (6; 0% instances), SCONJ (6; 0% instances), INTJ (5; 0% instances), AUX (1; 0% instances), CCONJ (1; 0% instances)

10554 (18%) NOUN nodes are leaves.

18982 (33%) NOUN nodes have one child.

15188 (27%) NOUN nodes have two children.

12528 (22%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 17.

Children of NOUN nodes are attached using 29 different relations: case (23274; 24% instances), nmod (16202; 16% instances), amod (13542; 14% instances), det (12220; 12% instances), punct (7227; 7% instances), conj (4594; 5% instances), cc (3834; 4% instances), acl:relcl (2721; 3% instances), cop (2644; 3% instances), nummod (2197; 2% instances), nsubj (1986; 2% instances), advmod (1905; 2% instances), mark (1625; 2% instances), acl (1478; 1% instances), obl (697; 1% instances), expl (495; 1% instances), appos (407; 0% instances), aux (252; 0% instances), compound (242; 0% instances), parataxis (227; 0% instances), advcl (192; 0% instances), acl:cleft (185; 0% instances), csubj (168; 0% instances), xcomp (141; 0% instances), orphan (103; 0% instances), discourse (18; 0% instances), flat:name (18; 0% instances), obj (7; 0% instances), reparandum (2; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: ADP (23552; 24% instances), NOUN (15397; 16% instances), ADJ (14482; 15% instances), DET (12380; 13% instances), PUNCT (7227; 7% instances), VERB (4718; 5% instances), PROPN (4193; 4% instances), PRON (3972; 4% instances), CCONJ (3840; 4% instances), AUX (2896; 3% instances), NUM (2739; 3% instances), SCONJ (1547; 2% instances), ADV (1291; 1% instances), PART (323; 0% instances), INTJ (18; 0% instances), X (17; 0% instances), SYM (11; 0% instances)