home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Danish-DDT: POS Tags: PROPN

There are 2246 PROPN lemmas (16%), 2449 PROPN types (13%) and 4978 PROPN tokens (5%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: Danmark, København, Nielsen, Lars, Poul, Peter, USA, Europa, Hafnia, Henrik

The 10 most frequent PROPN types: Danmark, København, Nielsen, Lars, Poul, Peter, Europa, Henrik, Hafnia, USA

The 10 most frequent ambiguous lemmas: CD (PROPN 10, NOUN 1), K. (PROPN 6, X 1), Ducato (PROPN 5, NOUN 1), de (PRON 483, PROPN 3, X 1), Citroën (PROPN 3, NOUN 2), ECU (NOUN 1, PROPN 1), PC (NOUN 2, PROPN 1), al (ADJ 233, PROPN 1)

The 10 most frequent ambiguous types: Per (PROPN 13, ADP 1), Hans (DET 15, PROPN 10), CD (PROPN 9, NOUN 1), K. (PROPN 6, X 1), Liv (PROPN 4, NOUN 1), de (DET 579, PRON 325, PROPN 3, X 1), Bank (NOUN 8, PROPN 3), IF (PROPN 3, NOUN 1), For (ADP 32, CCONJ 19, PROPN 2), Hotel (NOUN 3, PROPN 2)

Morphology

The form / lemma ratio of PROPN is 1.090383 (the average of all parts of speech is 1.355884).

The 1st highest number of forms (3) was observed with the lemma “Beatles”: BEATLES, Beatles, Beatles’.

The 2nd highest number of forms (3) was observed with the lemma “Christian”: CHRISTIAN, Chr., Christian.

The 3rd highest number of forms (3) was observed with the lemma “EF”: EF, EF’s, EFs.

PROPN occurs with 1 features: Case (353; 7% instances)

PROPN occurs with 1 feature-value pairs: Case=Gen

PROPN occurs with 2 feature combinations. The most frequent feature combination is _ (4625 tokens). Examples: Danmark, København, Nielsen, Lars, Poul, Peter, Europa, Henrik, Hafnia, USA

Relations

PROPN nodes are attached to their parents using 19 different relations: flat (1413; 28% instances), nsubj (1063; 21% instances), nmod (757; 15% instances), obl (510; 10% instances), conj (327; 7% instances), nmod:poss (307; 6% instances), appos (247; 5% instances), obj (143; 3% instances), root (92; 2% instances), list (61; 1% instances), vocative (17; 0% instances), dep (15; 0% instances), acl:relcl (14; 0% instances), iobj (6; 0% instances), xcomp (2; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)

Parents of PROPN nodes belong to 13 different parts of speech: PROPN (1855; 37% instances), VERB (1510; 30% instances), NOUN (1254; 25% instances), ADV (113; 2% instances), (92; 2% instances), ADJ (73; 1% instances), X (33; 1% instances), PRON (26; 1% instances), ADP (6; 0% instances), NUM (6; 0% instances), INTJ (5; 0% instances), SYM (4; 0% instances), DET (1; 0% instances)

2167 (44%) PROPN nodes are leaves.

1393 (28%) PROPN nodes have one child.

738 (15%) PROPN nodes have two children.

680 (14%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 11.

Children of PROPN nodes are attached using 22 different relations: flat (1622; 29% instances), case (1165; 21% instances), punct (749; 14% instances), nmod (438; 8% instances), conj (343; 6% instances), cc (207; 4% instances), amod (202; 4% instances), acl:relcl (161; 3% instances), det (158; 3% instances), nmod:poss (84; 2% instances), list (80; 1% instances), advmod (78; 1% instances), appos (51; 1% instances), nummod (44; 1% instances), nsubj (36; 1% instances), cop (32; 1% instances), dep (28; 1% instances), mark (10; 0% instances), aux (3; 0% instances), discourse (3; 0% instances), obj (3; 0% instances), obl (3; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (1855; 34% instances), ADP (1169; 21% instances), PUNCT (749; 14% instances), NOUN (703; 13% instances), ADJ (216; 4% instances), CCONJ (213; 4% instances), VERB (162; 3% instances), DET (158; 3% instances), ADV (84; 2% instances), NUM (64; 1% instances), X (46; 1% instances), PRON (38; 1% instances), AUX (35; 1% instances), INTJ (4; 0% instances), SCONJ (4; 0% instances)