PROPN
: proper noun
Definition
A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object.
In Norwegian, proper nouns do usually not inflect for morphological properties like gender, number, etc. Proper nouns are furthermore written with a capital letter.
Examples
- Kari, Ola
- Oslo, Bergen
Treebank Statistics (UD_Norwegian)
There are 4640 PROPN
lemmas (19%), 5008 PROPN
types (15%) and 18260 PROPN
tokens (6%).
Out of 17 observed tags, the rank of PROPN
is: 2 in number of lemmas, 2 in number of types and 7 in number of tokens.
The 10 most frequent PROPN
lemmas: Norge, Regjeringen, Obama, USA, Oslo, Jan, Cathrine, Stortinget, Svalbard, Den
The 10 most frequent PROPN
types: Norge, Obama, Regjeringen, Jan, Oslo, USA, Den, Svalbard, Mayen, Stortinget
The 10 most frequent ambiguous lemmas: Oslo (PROPN 153, X 2), Den (PROPN 116, X 1, DET 1), The (PROPN 53, X 1), Fashanu (PROPN 44, X 2), Det (PROPN 27, X 3), Annan (PROPN 25, X 1), norsk (ADJ 498, NOUN 18, PROPN 1), de (PRON 1636, DET 1349, PROPN 11, X 6, ADV 1), Haram (PROPN 16, X 1), ©NTB (PROPN 14, X 1)
The 10 most frequent ambiguous types: Regjeringen (PROPN 168, NOUN 38), Oslo (PROPN 147, X 2), Den (DET 222, PROPN 116, PRON 82, X 1), The (PROPN 53, X 1), Regjeringens (PROPN 44, NOUN 5), Fashanu (PROPN 36, X 2), Arbeiderpartiet (PROPN 34, NOUN 1), Mitt (PROPN 28, DET 9), Det (PRON 1652, DET 159, PROPN 27, X 3), Annan (PROPN 25, X 1)
- Regjeringen
- Oslo
- Den
- The
- Regjeringens
- Fashanu
- Arbeiderpartiet
- Mitt
- Det
- Annan
Morphology
The form / lemma ratio of PROPN
is 1.079310 (the average of all parts of speech is 1.382778).
The 1st highest number of forms (3) was observed with the lemma “Demokratene”: Demokratene, demokratenes, demokretane.
The 2nd highest number of forms (3) was observed with the lemma “EU”: EU, EU’s, EUs.
The 3rd highest number of forms (3) was observed with the lemma “FN”: FN, FN’s, FNs.
PROPN
occurs with 2 features: Gender (2689; 15% instances), Case (1214; 7% instances)
PROPN
occurs with 4 feature-value pairs: Case=Gen
, Gender=Fem
, Gender=Masc
, Gender=Neut
PROPN
occurs with 8 feature combinations.
The most frequent feature combination is _
(14461 tokens).
Examples: Norge, Obama, Regjeringen, Oslo, USA, Den, Svalbard, Mayen, Cathrine, Bertelsen
Relations
PROPN
nodes are attached to their parents using 21 different relations: nmod (5183; 28% instances), nsubj (4683; 26% instances), name (4215; 23% instances), det (1235; 7% instances), conj (1152; 6% instances), dobj (570; 3% instances), root (520; 3% instances), nsubjpass (188; 1% instances), parataxis (120; 1% instances), appos (115; 1% instances), compound (110; 1% instances), xcomp (57; 0% instances), iobj (48; 0% instances), remnant (32; 0% instances), advcl (10; 0% instances), acl (6; 0% instances), acl:relcl (5; 0% instances), ccomp (5; 0% instances), foreign (3; 0% instances), csubj (2; 0% instances), goeswith (1; 0% instances)
Parents of PROPN
nodes belong to 13 different parts of speech: VERB (6878; 38% instances), PROPN (5865; 32% instances), NOUN (4192; 23% instances), ROOT (520; 3% instances), ADJ (477; 3% instances), DET (77; 0% instances), PRON (75; 0% instances), ADV (65; 0% instances), NUM (49; 0% instances), ADP (45; 0% instances), INTJ (8; 0% instances), X (5; 0% instances), SYM (4; 0% instances)
8985 (49%) PROPN
nodes are leaves.
4328 (24%) PROPN
nodes have one child.
2266 (12%) PROPN
nodes have two children.
2681 (15%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 20.
Children of PROPN
nodes are attached using 28 different relations: case (5271; 27% instances), name (5236; 26% instances), punct (2763; 14% instances), nmod (2407; 12% instances), conj (1234; 6% instances), cc (947; 5% instances), acl:relcl (314; 2% instances), advmod (259; 1% instances), appos (254; 1% instances), det (254; 1% instances), amod (232; 1% instances), cop (161; 1% instances), acl (111; 1% instances), nsubj (98; 0% instances), expl (59; 0% instances), mark (50; 0% instances), xcomp (35; 0% instances), parataxis (30; 0% instances), compound (29; 0% instances), advcl (25; 0% instances), nummod (22; 0% instances), neg (19; 0% instances), aux (10; 0% instances), csubj (2; 0% instances), remnant (2; 0% instances), discourse (1; 0% instances), dobj (1; 0% instances), goeswith (1; 0% instances)
Children of PROPN
nodes belong to 17 different parts of speech: PROPN (5865; 30% instances), ADP (5399; 27% instances), PUNCT (2763; 14% instances), NOUN (2728; 14% instances), CONJ (990; 5% instances), ADJ (527; 3% instances), VERB (515; 3% instances), NUM (255; 1% instances), ADV (229; 1% instances), DET (214; 1% instances), X (156; 1% instances), PRON (128; 1% instances), SYM (26; 0% instances), SCONJ (16; 0% instances), AUX (10; 0% instances), PART (5; 0% instances), INTJ (1; 0% instances)
PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]