This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: PROPN

There are 4934 PROPN lemmas (22%), 4934 PROPN types (22%) and 10741 PROPN tokens (9%). Out of 15 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.
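Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names `upos_stats` and `rank` are hypothetical, the toy sentence is invented for illustration, and the sketch assumes that a "type" is a distinct word form within a tag.

```python
from collections import defaultdict


def upos_stats(conllu_text):
    """Return {tag: (distinct lemmas, distinct forms, tokens)} for CoNLL-U text."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (ID like "1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


def rank(stats, tag, index):
    """1-based rank of `tag` by column `index` (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(stats, key=lambda t: stats[t][index], reverse=True)
    return ordered.index(tag) + 1


# Hypothetical toy sentence, not taken from the treebank.
ROWS = [
    ("1", "中國", "中國", "PROPN", "NNP", "_", "2", "nmod", "_", "_"),
    ("2", "政府", "政府", "NOUN", "NN", "_", "3", "nsubj", "_", "_"),
    ("3", "訪問", "訪問", "VERB", "VV", "_", "0", "root", "_", "_"),
    ("4", "日本", "日本", "PROPN", "NNP", "_", "3", "obj", "_", "_"),
    ("5", "。", "。", "PUNCT", ".", "_", "3", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

stats = upos_stats(SAMPLE)
print(stats["PROPN"], rank(stats, "PROPN", 2))
```

How the page's percentages are normalised (for example, against the sum of per-tag lemma counts) is not stated here, so the sketch reports raw counts and ranks only.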

The 10 most frequent PROPN lemmas: 中國、 美國、 日本、 香港、 李、 英國、 中華、 美、 台灣、 英

The 10 most frequent PROPN types: 中國、 美國、 日本、 香港、 李、 英國、 中華、 美、 台灣、 英

The 10 most frequent ambiguous lemmas: 美 (PROPN 68, PART 1), 英 (PROPN 59, NOUN 1), 王 (PROPN 55, PART 19, NOUN 5), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 中 (ADP 380, NOUN 47, PROPN 42, VERB 4, PART 3), 張 (PROPN 41, NOUN 21), 林 (PROPN 25, PART 5, NOUN 1), 港 (PROPN 23, PART 14), 周 (PROPN 22, NOUN 6, PART 1), 清 (PROPN 22, PART 3, NOUN 2)

The 10 most frequent ambiguous types: 美 (PROPN 68, PART 1), 英 (PROPN 59, NOUN 1), 王 (PROPN 55, PART 19, NOUN 5), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 中 (ADP 380, NOUN 47, PROPN 42, VERB 4, PART 3), 張 (PROPN 41, NOUN 21), 林 (PROPN 25, PART 5, NOUN 1), 港 (PROPN 23, PART 14), 周 (PROPN 22, NOUN 6, PART 1), 清 (PROPN 22, PART 3, NOUN 2)
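The two lists above appear to be ordered by each item's PROPN count, with the tags inside the parentheses ordered by frequency. A minimal sketch of that selection, assuming the input is a sequence of (form, tag) pairs; the name `ambiguous_forms` and the toy counts are hypothetical:

```python
from collections import Counter, defaultdict


def ambiguous_forms(tagged, tag="PROPN", limit=10):
    """Forms seen with `tag` and at least one other tag, most frequent first."""
    by_form = defaultdict(Counter)
    for form, upos in tagged:
        by_form[form][upos] += 1
    # Keep only forms that carry the target tag and at least one other tag.
    ambiguous = {f: c for f, c in by_form.items() if len(c) > 1 and tag in c}
    # Order by how often the form carries the target tag.
    ordered = sorted(ambiguous, key=lambda f: ambiguous[f][tag], reverse=True)
    return [(f, ambiguous[f].most_common()) for f in ordered[:limit]]


# Hypothetical toy counts, not the treebank figures.
tagged = (
    [("日", "NOUN")] * 3
    + [("日", "PROPN")] * 2
    + [("美", "PROPN")] * 4
    + [("美", "PART")]
    + [("中國", "PROPN")] * 5  # unambiguous, so it is excluded
)

for form, counts in ambiguous_forms(tagged):
    print(form, ", ".join(f"{t} {n}" for t, n in counts))
```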

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.000266).
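The PROPN value agrees with the counts reported earlier on this page: 4934 types over 4934 lemmas gives exactly 1. A minimal sketch of the calculation, assuming the ratio is the number of distinct forms divided by the number of distinct lemmas within one part of speech (the generating script may define it differently); `form_lemma_ratio` and the toy pairs are hypothetical:

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) tuples for one part of speech."""
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    # Raises ZeroDivisionError on empty input; the sketch does not guard for it.
    return len(forms) / len(lemmas)


# Hypothetical toy data: every form is identical to its lemma.
propn_pairs = [("中國", "中國"), ("美國", "美國"), ("中國", "中國")]
print(f"{form_lemma_ratio(propn_pairs):.6f}")
```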

The 1st highest number of forms (1) was observed with the lemma “14572”: 14572.

The 2nd highest number of forms (1) was observed with the lemma “360”: 360.

The 3rd highest number of forms (1) was observed with the lemma “Casey”: Casey.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 23 different relations: nmod (4711; 44% instances), case:suff (2019; 19% instances), nsubj (1725; 16% instances), obj (802; 7% instances), conj (524; 5% instances), det (519; 5% instances), obl (221; 2% instances), nsubj:pass (66; 1% instances), appos (33; 0% instances), root (30; 0% instances), dep (28; 0% instances), iobj (16; 0% instances), advmod (14; 0% instances), ccomp (11; 0% instances), nummod (6; 0% instances), nmod:tmod (5; 0% instances), dislocated (3; 0% instances), flat:foreign (3; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), case:pref (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: NOUN (3200; 30% instances), PART (2815; 26% instances), VERB (2680; 25% instances), PROPN (1906; 18% instances), ADJ (48; 0% instances), [no parent: root node] (30; 0% instances), ADP (23; 0% instances), X (17; 0% instances), NUM (14; 0% instances), PRON (5; 0% instances), ADV (2; 0% instances), SYM (1; 0% instances)
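Both tallies sum to the 10741 PROPN tokens, since every node has exactly one incoming relation. A minimal sketch of how such tallies could be collected, assuming each sentence is a list of dicts with `id`, `upos`, `head` and `deprel` keys (a hypothetical in-memory layout, not a real library's API). A head of 0 has no part of speech, so the sketch labels it `ROOT`:

```python
from collections import Counter


def attachment_stats(sentences, tag="PROPN"):
    """Count incoming relations and parent tags for nodes tagged `tag`."""
    relations = Counter()
    parent_tags = Counter()
    for sent in sentences:
        upos_by_id = {tok["id"]: tok["upos"] for tok in sent}
        for tok in sent:
            if tok["upos"] != tag:
                continue
            relations[tok["deprel"]] += 1
            # head == 0 is the artificial root, which is absent from upos_by_id.
            parent_tags[upos_by_id.get(tok["head"], "ROOT")] += 1
    return relations, parent_tags


# Hypothetical toy sentences, not taken from the treebank.
sentences = [
    [
        {"id": 1, "upos": "PROPN", "head": 2, "deprel": "nmod"},
        {"id": 2, "upos": "NOUN", "head": 3, "deprel": "nsubj"},
        {"id": 3, "upos": "VERB", "head": 0, "deprel": "root"},
        {"id": 4, "upos": "PROPN", "head": 3, "deprel": "obj"},
    ],
    [{"id": 1, "upos": "PROPN", "head": 0, "deprel": "root"}],
]

relations, parent_tags = attachment_stats(sentences)
print(relations.most_common())
print(parent_tags.most_common())
```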

7894 (73%) PROPN nodes are leaves.

1715 (16%) PROPN nodes have one child.

697 (6%) PROPN nodes have two children.

435 (4%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 20.
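The four buckets above partition the PROPN tokens: 7894 + 1715 + 697 + 435 = 10741. A minimal sketch of the bucketing, assuming each sentence is a list of dicts with `id`, `upos` and `head` keys (a hypothetical layout); `child_degree_histogram` is an illustrative name, and degrees of three or more share one bucket as in the figures above:

```python
from collections import Counter


def child_degree_histogram(sentences, tag="PROPN"):
    """Return (buckets, max_degree); bucket 3 means three or more children."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per node id, derived from the head column.
        n_children = Counter(tok["head"] for tok in sent)
        for tok in sent:
            if tok["upos"] != tag:
                continue
            degree = n_children[tok["id"]]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree


# Hypothetical toy sentence: node 2 has three children, node 4 is a leaf.
sentences = [
    [
        {"id": 1, "upos": "NOUN", "head": 2},
        {"id": 2, "upos": "PROPN", "head": 0},
        {"id": 3, "upos": "PUNCT", "head": 2},
        {"id": 4, "upos": "PROPN", "head": 2},
    ]
]

buckets, max_degree = child_degree_histogram(sentences)
print(dict(buckets), max_degree)
```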

Children of PROPN nodes are attached using 26 different relations: nmod (1506; 30% instances), punct (813; 16% instances), appos (616; 12% instances), conj (571; 11% instances), case:dec (518; 10% instances), case (228; 5% instances), cc (190; 4% instances), acl (130; 3% instances), det (73; 1% instances), acl:relcl (56; 1% instances), cop (54; 1% instances), nsubj (51; 1% instances), dep (47; 1% instances), case:pref (43; 1% instances), nummod (15; 0% instances), amod (14; 0% instances), clf (13; 0% instances), advmod (10; 0% instances), dislocated (8; 0% instances), case:suff (5; 0% instances), csubj (5; 0% instances), nmod:tmod (5; 0% instances), mark (4; 0% instances), flat:foreign (2; 0% instances), ccomp (1; 0% instances), mark:relcl (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (1906; 38% instances), PUNCT (812; 16% instances), NOUN (779; 16% instances), PART (723; 15% instances), ADP (262; 5% instances), CCONJ (190; 4% instances), VERB (96; 2% instances), X (81; 2% instances), AUX (54; 1% instances), DET (18; 0% instances), ADJ (15; 0% instances), NUM (15; 0% instances), ADV (13; 0% instances), PRON (13; 0% instances), SYM (2; 0% instances)