
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: PROPN

There are 4491 PROPN lemmas (18%), 4631 PROPN types (14%) and 17802 PROPN tokens (6%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Noreg, Førde, Språkrådet, USA, Sogn, SV, Høgre, Fjordane, Oslo, Stortinget

The 10 most frequent PROPN types: Noreg, Førde, Språkrådet, Sogn, USA, SV, Fjordane, Oslo, Kviteseid, Stortinget

The 10 most frequent ambiguous lemmas: Språkrådet (PROPN 167, X 1), SV (PROPN 123, X 1), EU (PROPN 58, X 1), Gud (PROPN 43, X 1), The (PROPN 34, X 2), per (ADP 37, PROPN 1), den (DET 1927, PRON 148, X 12, PROPN 1), FN (PROPN 24, X 1), Statens (PROPN 21, X 1), Det (PROPN 19, X 8)

The 10 most frequent ambiguous types: Språkrådet (PROPN 164, NOUN 1, X 1), SV (PROPN 114, X 1), Stortinget (PROPN 94, X 1), Helse (PROPN 82, NOUN 1), Norsk (PROPN 62, ADJ 15, NOUN 5), Klassekampen (PROPN 55, NOUN 1), Norge (PROPN 52, X 1), EU (PROPN 49, X 1), Norske (PROPN 40, ADJ 9), Regjeringa (PROPN 40, NOUN 15)
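As an illustration of how such ambiguity lists can be derived, the following minimal sketch groups (item, tag) pairs and keeps the items observed with more than one tag. The function name `ambiguous_items` and the toy input are assumptions made for this example. They are not part of the UD tooling and are not treebank data.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs):
    """Return {item: Counter(tag -> count)} for items seen with more than one tag."""
    tags_by_item = defaultdict(Counter)
    for item, upos in pairs:
        tags_by_item[item][upos] += 1
    # An item is ambiguous when it occurs with at least two different tags.
    return {item: tags for item, tags in tags_by_item.items() if len(tags) > 1}

# Toy (lemma, UPOS) pairs for illustration only, not actual treebank counts.
sample = [("per", "ADP"), ("per", "ADP"), ("per", "PROPN"), ("Oslo", "PROPN")]
result = ambiguous_items(sample)
```

The same function works for types if word forms are passed instead of lemmas.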

Morphology

The form / lemma ratio of PROPN is 1.031173 (the average of all parts of speech is 1.346455).
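The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page. The short check below assumes that this is how the figure is computed. The exact definition used by the statistics generator is not stated here.

```python
# Counts taken from the summary line of this page.
propn_types = 4631
propn_lemmas = 4491

# Assumed definition: distinct word forms divided by distinct lemmas.
ratio = propn_types / propn_lemmas
formatted = f"{ratio:.6f}"
print(formatted)
```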

The lemma with the most forms (3) is “Ap”: AP, Ap, Aps.

The lemma ranked second by number of forms (3) is “Arbeiderpartiet”: Arbeidarpartiet, Arbeiderpartiet, Arbeiderpartiets.

The lemma ranked third by number of forms (2) is “Afghanistan”: Afghanistan, Afghanistans.

PROPN occurs with 3 features: Gender (2829; 16% instances), Abbr (673; 4% instances), Case (410; 2% instances)

PROPN occurs with 5 feature-value pairs: Abbr=Yes, Case=Gen, Gender=Fem, Gender=Masc, Gender=Neut

PROPN occurs with 11 feature combinations. The most frequent feature combination is _, i.e. no features (13964 tokens). Examples: Noreg, Førde, Språkrådet, Sogn, Fjordane, Oslo, Kviteseid, Høgre, Helse, Tyskland

Relations

PROPN nodes are attached to their parents using 20 different relations: flat:name (4840; 27% instances), nsubj (4020; 23% instances), nmod (3660; 21% instances), obl (2046; 11% instances), conj (1044; 6% instances), appos (913; 5% instances), obj (442; 2% instances), root (375; 2% instances), parataxis (117; 1% instances), nsubj:pass (79; 0% instances), nmod:poss (71; 0% instances), xcomp (59; 0% instances), compound (52; 0% instances), iobj (46; 0% instances), dislocated (17; 0% instances), flat (10; 0% instances), ccomp (4; 0% instances), nsubj:outer (4; 0% instances), csubj (2; 0% instances), reparandum (1; 0% instances)
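The percentages in such lists appear to be each count's share of all 17802 PROPN tokens, rounded to a whole number, which is why rare relations show as 0%. A minimal sketch under that assumption (the helper name `share` is hypothetical):

```python
def share(count, total):
    """Percentage of total, rounded to the nearest integer (assumed rounding)."""
    return round(100 * count / total)

PROPN_TOKENS = 17802  # token count reported at the top of this page

# Counts copied from the relation list above.
for relation, count in [("flat:name", 4840), ("nsubj", 4020), ("nsubj:pass", 79)]:
    print(f"{relation}: {share(count, PROPN_TOKENS)}%")
```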

Parents of PROPN nodes belong to 13 different categories (12 parts of speech plus the case where the PROPN node is the sentence root and has no parent word): NOUN (5503; 31% instances), PROPN (5402; 30% instances), VERB (5325; 30% instances), ADJ (790; 4% instances), no parent, i.e. root (375; 2% instances), PRON (224; 1% instances), ADV (65; 0% instances), DET (45; 0% instances), ADP (36; 0% instances), NUM (33; 0% instances), X (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)

8349 (47%) PROPN nodes are leaves.

5008 (28%) PROPN nodes have one child.

2544 (14%) PROPN nodes have two children.

1901 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 12.
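The child-degree figures above can be reproduced from a CoNLL-U file by counting, for each PROPN token, how many tokens name it as their head. The sketch below is a simplified illustration and not the script that generated this page. The function name `propn_child_degrees` and the toy sentence are assumptions made for this example.

```python
from collections import Counter

def propn_child_degrees(conllu_text):
    """Count how many PROPN nodes have 0, 1, 2, or 3+ children."""
    buckets = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep regular word lines only (skip ranges like "1-2" and empty nodes like "1.1").
        rows = [r for r in rows if r[0].isdigit()]
        # HEAD is column 7 (index 6); count the dependents of each head ID.
        children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == "PROPN":  # UPOS is column 4 (index 3)
                degree = children[r[0]]
                buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets

# Toy sentence for illustration only, not taken from the treebank.
sample = "\n".join("\t".join(cols) for cols in [
    ["1", "Kari", "Kari", "PROPN", "_", "_", "3", "nsubj", "_", "_"],
    ["2", "Nordmann", "Nordmann", "PROPN", "_", "_", "1", "flat:name", "_", "_"],
    ["3", "bur", "bu", "VERB", "_", "_", "0", "root", "_", "_"],
    ["4", "i", "i", "ADP", "_", "_", "5", "case", "_", "_"],
    ["5", "Oslo", "Oslo", "PROPN", "_", "_", "3", "obl", "_", "_"],
    ["6", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"],
])
degrees = propn_child_degrees(sample)
```

In the toy sentence, "Kari" and "Oslo" each have one child and "Nordmann" is a leaf.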

Children of PROPN nodes are attached using 29 different relations: case (5351; 31% instances), flat:name (5271; 30% instances), punct (2245; 13% instances), conj (1182; 7% instances), nmod (870; 5% instances), cc (786; 4% instances), acl:relcl (259; 1% instances), amod (250; 1% instances), appos (244; 1% instances), advmod (220; 1% instances), det (166; 1% instances), obl (128; 1% instances), cop (121; 1% instances), flat:foreign (81; 0% instances), nsubj (71; 0% instances), expl (40; 0% instances), parataxis (38; 0% instances), nmod:poss (37; 0% instances), nummod (31; 0% instances), xcomp (28; 0% instances), mark (20; 0% instances), obj (18; 0% instances), advcl (15; 0% instances), aux (8; 0% instances), compound (5; 0% instances), discourse (4; 0% instances), acl (3; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: ADP (5526; 32% instances), PROPN (5402; 31% instances), PUNCT (2245; 13% instances), NOUN (1443; 8% instances), CCONJ (995; 6% instances), ADJ (417; 2% instances), VERB (287; 2% instances), X (249; 1% instances), NUM (214; 1% instances), DET (213; 1% instances), ADV (174; 1% instances), AUX (129; 1% instances), PRON (127; 1% instances), SYM (27; 0% instances), PART (25; 0% instances), SCONJ (17; 0% instances), INTJ (4; 0% instances)