
This page pertains to UD version 2.

Treebank Statistics: UD_Danish-DDT: POS Tags: PROPN

There are 2246 PROPN lemmas (16%), 2449 PROPN types (13%) and 4978 PROPN tokens (5%). Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 3 in number of types and 9 in number of tokens.

The 10 most frequent PROPN lemmas: Danmark, København, Nielsen, Lars, Poul, Peter, USA, Europa, Hafnia, Henrik

The 10 most frequent PROPN types: Danmark, København, Nielsen, Lars, Poul, Peter, Europa, Henrik, Hafnia, USA

The most frequent ambiguous lemmas: CD (PROPN 10, NOUN 1), K. (PROPN 6, X 1), Ducato (PROPN 5, NOUN 1), de (PRON 483, PROPN 3, X 1), Citroën (PROPN 3, NOUN 2), ECU (NOUN 1, PROPN 1), PC (NOUN 2, PROPN 1), al (ADJ 233, PROPN 1)

The 10 most frequent ambiguous types: Per (PROPN 13, ADP 1), Hans (DET 15, PROPN 10), CD (PROPN 9, NOUN 1), K. (PROPN 6, X 1), Liv (PROPN 4, NOUN 1), de (DET 579, PRON 325, PROPN 3, X 1), Bank (NOUN 8, PROPN 3), IF (PROPN 3, NOUN 1), For (ADP 32, CCONJ 19, PROPN 2), Hotel (NOUN 3, PROPN 2)

Morphology

The form / lemma ratio of PROPN is 1.090383 (the average of all parts of speech is 1.355946).
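The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given above. The following minimal sketch reproduces the figure under that assumption; the function name `form_lemma_ratio` is hypothetical and not part of any UD tool.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Return the number of distinct word forms per lemma.

    Assumption: the ratio is types / lemmas, which matches the
    numbers reported on this page.
    """
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts taken from this page: 2449 PROPN types, 2246 PROPN lemmas.
ratio = form_lemma_ratio(2449, 2246)
print(f"{ratio:.6f}")
```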

Three lemmas share the highest number of forms (3):

“Beatles”: BEATLES, Beatles, Beatles’.

“Christian”: CHRISTIAN, Chr., Christian.

“EF”: EF, EF’s, EFs.

PROPN occurs with 1 feature: Case (353; 7% instances)

PROPN occurs with 1 feature-value pair: Case=Gen

PROPN occurs with 2 feature combinations. The most frequent feature combination is _, that is, no features (4625 tokens). Examples: Danmark, København, Nielsen, Lars, Poul, Peter, Europa, Henrik, Hafnia, USA

Relations

PROPN nodes are attached to their parents using 17 different relations: flat (1413; 28% instances), nsubj (1062; 21% instances), nmod (760; 15% instances), obl (508; 10% instances), conj (324; 7% instances), nmod:poss (307; 6% instances), appos (251; 5% instances), obj (142; 3% instances), root (93; 2% instances), list (63; 1% instances), vocative (17; 0% instances), dep (15; 0% instances), acl:relcl (14; 0% instances), iobj (6; 0% instances), advmod (1; 0% instances), ccomp (1; 0% instances), mark (1; 0% instances)

Parents of PROPN nodes belong to 15 different parts of speech (counting the root, which has no parent, as one category): PROPN (1832; 37% instances), VERB (1493; 30% instances), NOUN (1252; 25% instances), ADV (110; 2% instances), no parent, i.e. root (93; 2% instances), ADJ (79; 2% instances), ADP (36; 1% instances), X (36; 1% instances), PRON (26; 1% instances), INTJ (6; 0% instances), NUM (6; 0% instances), SYM (4; 0% instances), AUX (3; 0% instances), DET (1; 0% instances), PUNCT (1; 0% instances)
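Counts like the incoming relations and parent parts of speech above could be gathered from a CoNLL-U file roughly as sketched below. This is an illustration only, not the script that generated this page: the function name `propn_attachment_stats`, the `(root)` label, and the three-PROPN sample sentence are all made up for the example and are not taken from UD_Danish-DDT.

```python
from collections import Counter

# Hand-made CoNLL-U sample (hypothetical, not from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = """\
1\tLars\tLars\tPROPN\t_\t_\t3\tnsubj\t_\t_
2\tNielsen\tNielsen\tPROPN\t_\t_\t1\tflat\t_\t_
3\tbor\tbo\tVERB\t_\t_\t0\troot\t_\t_
4\ti\ti\tADP\t_\t_\t5\tcase\t_\t_
5\tKøbenhavn\tKøbenhavn\tPROPN\t_\t_\t3\tobl\t_\t_
6\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""


def propn_attachment_stats(conllu_text):
    """Count incoming relations and parent UPOS tags of PROPN nodes."""
    deprels, parent_pos = Counter(), Counter()
    # Sentences are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        upos = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] != "PROPN":
                continue
            deprels[r[7]] += 1
            # HEAD 0 has no entry in upos, so it is labelled "(root)".
            parent_pos[upos.get(r[6], "(root)")] += 1
    return deprels, parent_pos


deprels, parent_pos = propn_attachment_stats(SAMPLE)
print(deprels.most_common())
print(parent_pos.most_common())
```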

2198 (44%) PROPN nodes are leaves.

1446 (29%) PROPN nodes have one child.

609 (12%) PROPN nodes have two children.

725 (15%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 11.
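As a quick consistency check, the four child-degree counts above should add up to the 4978 PROPN tokens, and the percentages follow from rounding each share. A small sketch using only the numbers on this page (the function name `degree_percentages` is hypothetical):

```python
def degree_percentages(buckets):
    """Map each child-degree bucket to its share of all nodes, in whole percent."""
    total = sum(buckets.values())
    return {name: round(100 * count / total) for name, count in buckets.items()}


# Counts copied from this page.
buckets = {
    "leaves": 2198,
    "one child": 1446,
    "two children": 609,
    "three or more children": 725,
}

print(sum(buckets.values()))        # total number of PROPN nodes
print(degree_percentages(buckets))  # rounded percentage per bucket
```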

Children of PROPN nodes are attached using 21 different relations: flat (1623; 29% instances), case (1157; 21% instances), punct (834; 15% instances), nmod (413; 7% instances), conj (341; 6% instances), cc (208; 4% instances), amod (200; 4% instances), acl:relcl (161; 3% instances), det (158; 3% instances), nmod:poss (84; 2% instances), advmod (78; 1% instances), list (75; 1% instances), appos (45; 1% instances), nummod (44; 1% instances), nsubj (35; 1% instances), cop (34; 1% instances), dep (27; 0% instances), mark (8; 0% instances), aux (4; 0% instances), obj (3; 0% instances), discourse (2; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (1832; 33% instances), ADP (1158; 21% instances), PUNCT (834; 15% instances), NOUN (690; 12% instances), ADJ (216; 4% instances), CCONJ (214; 4% instances), VERB (159; 3% instances), DET (158; 3% instances), ADV (80; 1% instances), NUM (64; 1% instances), X (47; 1% instances), AUX (38; 1% instances), PRON (36; 1% instances), SCONJ (5; 0% instances), INTJ (3; 0% instances)