
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: PROPN

There are 4073 PROPN lemmas (17%), 4198 PROPN types (13%) and 14302 PROPN tokens (5%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 4 in number of types and 9 in number of tokens.
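As a rough illustration of how lemma, type and token counts like these can be derived, the sketch below scans CoNLL-U text and counts distinct lemmas, distinct word forms and total tokens for one UPOS tag. The three-token sample and the function name `upos_counts` are hypothetical, and treating types as case-sensitive word forms is an assumption, not something this page states.

```python
# Hypothetical three-token sample in CoNLL-U format (10 tab-separated columns).
SAMPLE = "\n".join([
    "# text = Noreg og Oslo",
    "1\tNoreg\tNoreg\tPROPN\t_\t_\t0\troot\t_\t_",
    "2\tog\tog\tCCONJ\t_\t_\t3\tcc\t_\t_",
    "3\tOslo\tOslo\tPROPN\t_\t_\t1\tconj\t_\t_",
    "",
])


def upos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank separators and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            lemmas.add(cols[2])  # LEMMA column
            types.add(cols[1])   # FORM column, case-sensitive (assumption)
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "PROPN"))
```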

The 10 most frequent PROPN lemmas: Noreg, Førde, Språkrådet, Sogn, Høgre, Fjordane, Oslo, Kviteseid, Tyskland, Helse

The 10 most frequent PROPN types: Noreg, Førde, Språkrådet, Sogn, Fjordane, Oslo, Kviteseid, Høgre, Helse, Tyskland

The 10 most frequent ambiguous lemmas: Språkrådet (PROPN 167, X 1), Gud (PROPN 43, X 1), The (PROPN 34, X 2), den (DET 1927, PRON 148, X 12, PROPN 1), Statens (PROPN 21, X 1), Det (PROPN 19, X 8), Trondheim (PROPN 18, X 1), A (PROPN 17, NOUN 1), Dagens (PROPN 14, X 11), Stavanger (PROPN 14, X 1)

The 10 most frequent ambiguous types: Språkrådet (PROPN 164, NOUN 1, X 1), Helse (PROPN 82, NOUN 1), Norsk (PROPN 62, ADJ 15, NOUN 5), Klassekampen (PROPN 55, NOUN 1), Norge (PROPN 52, X 1), Norske (PROPN 40, ADJ 9), Regjeringa (PROPN 40, NOUN 15), Tokke (PROPN 35, NOUN 1), The (PROPN 34, X 2), Den (DET 260, PRON 32, PROPN 27)

Morphology

The form / lemma ratio of PROPN is 1.030690 (the average of all parts of speech is 1.352830).
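The page does not spell out how this ratio is computed, but the figure is consistent with dividing the number of PROPN types by the number of PROPN lemmas reported above. A minimal check, under that assumption:

```python
# Figures taken from the summary line of this page.
propn_types = 4198
propn_lemmas = 4073

# Assumed definition: distinct word forms divided by distinct lemmas.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # prints 1.030690
```

A ratio this close to 1 means that most proper-noun lemmas occur in a single form only.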

The highest number of forms (3) was observed with the lemma “Arbeiderpartiet”: Arbeidarpartiet, Arbeiderpartiet, Arbeiderpartiets.

The second-highest number of forms (2) was observed with the lemma “Afghanistan”: Afghanistan, Afghanistans.

The third-highest number of forms (2) was observed with the lemma “Albania”: Albania, Albanias.

PROPN occurs with 2 features: Case (337; 2% instances), Abbr (1; 0% instances)

PROPN occurs with 2 feature-value pairs: Abbr=Yes, Case=Gen

PROPN occurs with 3 feature combinations. The most frequent feature combination is _ (13964 tokens). Examples: Noreg, Førde, Språkrådet, Sogn, Fjordane, Oslo, Kviteseid, Høgre, Helse, Tyskland

Relations

PROPN nodes are attached to their parents using 20 different relations: flat:name (3968; 28% instances), nmod (3294; 23% instances), nsubj (3219; 23% instances), obl (1906; 13% instances), conj (838; 6% instances), obj (425; 3% instances), root (267; 2% instances), appos (75; 1% instances), nsubj:pass (70; 0% instances), parataxis (67; 0% instances), compound (49; 0% instances), xcomp (48; 0% instances), iobj (38; 0% instances), orphan (15; 0% instances), advcl (9; 0% instances), acl:relcl (7; 0% instances), acl (3; 0% instances), csubj (2; 0% instances), ccomp (1; 0% instances), reparandum (1; 0% instances)
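The percentages in lists like this one can be reproduced by dividing each count by the 14302 PROPN tokens and rounding to a whole percent. The sketch below does that for the three most frequent relations. The helper name `format_distribution` is hypothetical, and both the rounding and the alphabetical tie-break are assumptions about how the page was generated.

```python
def format_distribution(counts, total):
    """Format counts as 'label (count; pct% instances)', most frequent first."""
    # Sort by descending count, then alphabetically for ties (assumption).
    items = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ", ".join(
        f"{label} ({n}; {round(100 * n / total)}% instances)"
        for label, n in items
    )


# Counts copied from the list above; 14302 is the PROPN token total.
top_relations = {"nsubj": 3219, "flat:name": 3968, "nmod": 3294}
line = format_distribution(top_relations, 14302)
print(line)
```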

Parents of PROPN nodes belong to 12 different parts of speech: NOUN (5424; 38% instances), VERB (4812; 34% instances), PROPN (3040; 21% instances), ADJ (419; 3% instances), [no tag: artificial root] (267; 2% instances), PRON (182; 1% instances), ADV (46; 0% instances), ADP (42; 0% instances), DET (38; 0% instances), NUM (29; 0% instances), X (2; 0% instances), INTJ (1; 0% instances)

6806 (48%) PROPN nodes are leaves.

4003 (28%) PROPN nodes have one child.

1895 (13%) PROPN nodes have two children.

1598 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 13.
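The four child-degree counts above (6806, 4003, 1895 and 1598) add up to 14302, the total number of PROPN tokens. A minimal sketch of how such buckets could be computed from the HEAD column follows. The tuple layout, the sample sentence and the function name `child_degree_buckets` are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, upos):
    """Count nodes of one UPOS tag by number of children: 0, 1, 2, or 3+ (key 3)."""
    buckets = Counter()
    for sent in sentences:
        # Each token is (id, head, upos); count how often each id is a head.
        n_children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag == upos:
                buckets[min(n_children[tok_id], 3)] += 1
    return buckets


# Hypothetical sentence: token 1 is a leaf, token 4 has one child (token 3).
sample = [[(1, 2, "PROPN"), (2, 0, "VERB"), (3, 4, "ADP"), (4, 2, "PROPN")]]
buckets = child_degree_buckets(sample, "PROPN")
print(buckets[0], buckets[1], buckets[2], buckets[3])  # prints 1 1 0 0
```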

Children of PROPN nodes are attached using 27 different relations: case (4929; 35% instances), flat:name (3546; 25% instances), punct (1597; 11% instances), nmod (1273; 9% instances), conj (874; 6% instances), cc (632; 4% instances), amod (200; 1% instances), advmod (164; 1% instances), acl:relcl (160; 1% instances), appos (156; 1% instances), det (145; 1% instances), obl (106; 1% instances), cop (92; 1% instances), acl (80; 1% instances), nsubj (61; 0% instances), mark (46; 0% instances), orphan (35; 0% instances), expl (27; 0% instances), parataxis (22; 0% instances), nummod (20; 0% instances), xcomp (20; 0% instances), acl:cleft (19; 0% instances), advcl (14; 0% instances), aux (6; 0% instances), discourse (4; 0% instances), compound (3; 0% instances), obj (3; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: ADP (5104; 36% instances), PROPN (3040; 21% instances), NOUN (2097; 15% instances), PUNCT (1597; 11% instances), CCONJ (819; 6% instances), ADJ (338; 2% instances), X (245; 2% instances), VERB (239; 2% instances), DET (189; 1% instances), NUM (160; 1% instances), ADV (129; 1% instances), PRON (100; 1% instances), AUX (98; 1% instances), SCONJ (42; 0% instances), PART (21; 0% instances), SYM (12; 0% instances), INTJ (4; 0% instances)