Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: PROPN
There are 4621 PROPN lemmas (19%), 4989 PROPN types (15%) and 18229 PROPN tokens (6%).
Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 7 in number of tokens.
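The lemma, type and token counts and the per-tag ranks above can be reproduced in spirit with a short script. The following is a minimal sketch, not the tool that generated this page: the CoNLL-U fragment is hand-made for illustration, the function names `tag_counts` and `rank` are hypothetical, and it assumes that types are distinct word forms compared case-sensitively.

```python
from collections import defaultdict

# Hand-made CoNLL-U fragment for illustration only (not taken from the treebank).
SAMPLE = "\n".join([
    "1\tObama\tObama\tPROPN\t_\t_\t2\tnsubj\t_\t_",
    "2\tbesøkte\tbesøke\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tOslo\tOslo\tPROPN\t_\t_\t2\tobj\t_\t_",
    "4\tog\tog\tCCONJ\t_\t_\t6\tcc\t_\t_",
    "5\tOslos\tOslo\tPROPN\t_\tCase=Gen\t6\tnmod:poss\t_\t_",
    "6\tordfører\tordfører\tNOUN\t_\t_\t3\tconj\t_\t_",
])


def tag_counts(conllu_text):
    """Return {upos: (number of lemmas, number of types, number of tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword-token ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive forms
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}


def rank(counts, tag, index):
    """Rank of `tag` (1 = highest) by lemmas (0), types (1) or tokens (2)."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1


counts = tag_counts(SAMPLE)
print(counts["PROPN"])            # (lemmas, types, tokens) for PROPN
print(rank(counts, "PROPN", 2))   # rank of PROPN by token count
```

Ties between tags are broken arbitrarily in this sketch, so ranks are only meaningful when the counts differ.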
The 10 most frequent PROPN lemmas: Norge, Regjeringen, Obama, USA, Oslo, Jan, Cathrine, Stortinget, Svalbard, Den
The 10 most frequent PROPN types: Norge, Obama, Regjeringen, Jan, Oslo, USA, Den, Svalbard, Mayen, Stortinget
The 10 most frequent ambiguous lemmas: Oslo (PROPN 153, X 2), Den (PROPN 116, X 1), The (PROPN 51, X 3), Barack (PROPN 45, X 1), Fashanu (PROPN 44, X 2), Det (PROPN 27, X 3), Annan (PROPN 25, X 1), norsk (ADJ 498, NOUN 18, PROPN 1), de (PRON 1637, DET 1349, PROPN 11, X 6, ADV 1), Haram (PROPN 16, X 1)
The 10 most frequent ambiguous types: Regjeringen (PROPN 168, NOUN 38), Oslo (PROPN 147, X 2), Den (DET 222, PROPN 116, PRON 82, X 1), The (PROPN 51, X 3), Barack (PROPN 45, X 1), Regjeringens (PROPN 44, NOUN 5), Fashanu (PROPN 36, X 2), Arbeiderpartiet (PROPN 34, NOUN 1), Mitt (PROPN 28, PRON 9), Det (PRON 1652, DET 159, PROPN 27, X 3)
Morphology
The form / lemma ratio of PROPN is 1.079636 (the average of all parts of speech is 1.381641).
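The reported ratio agrees with dividing the number of PROPN types by the number of PROPN lemmas given at the top of the page. The sketch below only checks that arithmetic; that the page is computed exactly this way is an assumption, and the variable names are illustrative.

```python
# Counts quoted at the top of this page.
propn_types = 4989
propn_lemmas = 4621

# Assumption: "form / lemma ratio" means distinct forms (types) per lemma.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # 1.079636
```

A value this close to 1 means most proper-noun lemmas occur in a single form, compared with an average of about 1.38 forms per lemma across all parts of speech.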
The three top-ranked lemmas by number of forms each have 3 forms:
“Demokratene”: Demokratene, demokratenes, demokretane.
“EU”: EU, EU’s, EUs.
“FN”: FN, FN’s, FNs.
PROPN occurs with 3 features: Gender (2689; 15% instances), Case (1214; 7% instances), Abbr (653; 4% instances)
PROPN occurs with 5 feature-value pairs: Abbr=Yes, Case=Gen, Gender=Fem, Gender=Masc, Gender=Neut
PROPN occurs with 10 feature combinations.
The most frequent feature combination is _ (13885 tokens).
Examples: Norge, Obama, Regjeringen, Oslo, Den, Svalbard, Mayen, Cathrine, Bertelsen, Bergen
Relations
PROPN nodes are attached to their parents using 18 different relations: flat:name (4828; 26% instances), nsubj (3991; 22% instances), nmod (3761; 21% instances), obl (2088; 11% instances), conj (1069; 6% instances), appos (943; 5% instances), obj (483; 3% instances), root (413; 2% instances), nsubj:pass (161; 1% instances), nmod:poss (133; 1% instances), parataxis (117; 1% instances), compound (110; 1% instances), xcomp (55; 0% instances), iobj (47; 0% instances), ccomp (14; 0% instances), dislocated (10; 0% instances), flat (5; 0% instances), csubj (1; 0% instances)
Parents of PROPN nodes belong to 13 different parts of speech: NOUN (6051; 33% instances), VERB (5944; 33% instances), PROPN (5065; 28% instances), ADJ (429; 2% instances), (413; 2% instances), PRON (75; 0% instances), ADV (62; 0% instances), X (59; 0% instances), NUM (48; 0% instances), DET (38; 0% instances), ADP (33; 0% instances), INTJ (8; 0% instances), SYM (4; 0% instances)
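The two distributions above (relation to the parent, and part of speech of the parent) can be tallied as in the following minimal sketch. It assumes a single hand-made sentence in CoNLL-U format, and `parse` and `parent_stats` are hypothetical helper names. The unlabeled entry in the parent list has the same count as the root relation (413); one possible reading, assumed in this sketch, is that root nodes have no parent and are counted under an empty part of speech.

```python
from collections import Counter

# Hand-made single sentence for illustration only ("Barack Obama i Oslo").
SAMPLE = "\n".join([
    "1\tBarack\tBarack\tPROPN\t_\t_\t0\troot\t_\t_",
    "2\tObama\tObama\tPROPN\t_\t_\t1\tflat:name\t_\t_",
    "3\ti\ti\tADP\t_\t_\t4\tcase\t_\t_",
    "4\tOslo\tOslo\tPROPN\t_\t_\t1\tnmod\t_\t_",
])


def parse(conllu_text):
    """Map node id -> {upos, head, deprel} for one sentence."""
    rows = {}
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # comments, blank lines, multiword ranges, empty nodes
        rows[int(cols[0])] = {
            "upos": cols[3],
            "head": int(cols[6]),
            "deprel": cols[7],
        }
    return rows


def parent_stats(rows, tag):
    """Count relations and parent POS tags for all nodes tagged `tag`."""
    relations = Counter()
    parent_pos = Counter()
    for node in rows.values():
        if node["upos"] != tag:
            continue
        relations[node["deprel"]] += 1
        parent = rows.get(node["head"])  # None when head is 0 (root)
        parent_pos[parent["upos"] if parent else ""] += 1
    return relations, parent_pos


relations, parent_pos = parent_stats(parse(SAMPLE), "PROPN")
print(relations)
print(parent_pos)
```

For a whole treebank the same counters would be accumulated sentence by sentence, since node ids restart at 1 in every sentence.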
9193 (50%) PROPN nodes are leaves.
4915 (27%) PROPN nodes have one child.
2349 (13%) PROPN nodes have two children.
1772 (10%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 14.
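The leaf and child-degree figures above correspond to counting, for every PROPN node, how many other nodes name it as their head. A minimal sketch under the same assumptions as before (one hand-made sentence, hypothetical function name `child_degrees`):

```python
from collections import Counter

# Hand-made single sentence for illustration only ("Barack Obama i Oslo").
SAMPLE = "\n".join([
    "1\tBarack\tBarack\tPROPN\t_\t_\t0\troot\t_\t_",
    "2\tObama\tObama\tPROPN\t_\t_\t1\tflat:name\t_\t_",
    "3\ti\ti\tADP\t_\t_\t4\tcase\t_\t_",
    "4\tOslo\tOslo\tPROPN\t_\t_\t1\tnmod\t_\t_",
])


def child_degrees(conllu_text, tag):
    """Return Counter {number of children: number of `tag` nodes}."""
    upos = {}
    children = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # comments, blank lines, multiword ranges, empty nodes
        node_id, head = int(cols[0]), int(cols[6])
        upos[node_id] = cols[3]
        children[head] += 1  # head 0 is the artificial root and is never a node
    # Nodes that never appear as a head have degree 0 (leaves).
    return Counter(children[n] for n, t in upos.items() if t == tag)


degrees = child_degrees(SAMPLE, "PROPN")
print("leaves:", degrees[0])
print("highest child degree:", max(degrees))
```

Because node ids restart in each sentence, a treebank-wide version would call this per sentence and sum the resulting counters.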
Children of PROPN nodes are attached using 27 different relations: case (4904; 30% instances), flat:name (4575; 28% instances), punct (2478; 15% instances), conj (1181; 7% instances), cc (794; 5% instances), nmod (786; 5% instances), acl:relcl (271; 2% instances), advmod (251; 2% instances), amod (243; 1% instances), appos (227; 1% instances), det (150; 1% instances), cop (142; 1% instances), obl (125; 1% instances), nsubj (84; 1% instances), expl (51; 0% instances), xcomp (34; 0% instances), advcl (24; 0% instances), nmod:poss (24; 0% instances), mark (20; 0% instances), nummod (20; 0% instances), parataxis (20; 0% instances), aux (9; 0% instances), compound (4; 0% instances), acl (3; 0% instances), csubj (2; 0% instances), discourse (1; 0% instances), reparandum (1; 0% instances)
Children of PROPN nodes belong to 17 different parts of speech: PROPN (5065; 31% instances), ADP (4992; 30% instances), PUNCT (2478; 15% instances), NOUN (1214; 7% instances), CCONJ (818; 5% instances), ADJ (483; 3% instances), VERB (330; 2% instances), NUM (226; 1% instances), DET (198; 1% instances), ADV (194; 1% instances), AUX (151; 1% instances), PRON (125; 1% instances), X (83; 1% instances), SYM (27; 0% instances), PART (21; 0% instances), SCONJ (18; 0% instances), INTJ (1; 0% instances)