home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: PROPN

There are 4924 PROPN lemmas (22%), 4924 PROPN types (22%) and 10742 PROPN tokens (9%). Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.

The 10 most frequent PROPN lemmas: 中国、 美国、 日本、 香港、 李、 台湾、 英国、 中华、 美、 英

The 10 most frequent PROPN types: 中国、 美国、 日本、 香港、 李、 台湾、 英国、 中华、 美、 英

The 10 most frequent ambiguous lemmas: 美 (PROPN 68, PART 1), 英 (PROPN 59, NOUN 1), 王 (PROPN 55, PART 19, NOUN 5), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 中 (ADP 380, NOUN 47, PROPN 42, VERB 4, PART 3), 张 (PROPN 41, NOUN 21), 林 (PROPN 25, PART 5, NOUN 1), 港 (PROPN 23, PART 12), 周 (PROPN 22, NOUN 8, PART 2), 清 (PROPN 22, PART 3, NOUN 2)

The 10 most frequent ambiguous types: 美 (PROPN 68, PART 1), 英 (PROPN 59, NOUN 1), 王 (PROPN 55, PART 19, NOUN 5), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 中 (ADP 380, NOUN 47, PROPN 42, VERB 4, PART 3), 张 (PROPN 41, NOUN 21), 林 (PROPN 25, PART 5, NOUN 1), 港 (PROPN 23, PART 12), 周 (PROPN 22, NOUN 8, PART 2), 清 (PROPN 22, PART 3, NOUN 2)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.004572).

The 1st highest number of forms (1) was observed with the lemma “14572”: 14572.

The 2nd highest number of forms (1) was observed with the lemma “360”: 360.

The 3rd highest number of forms (1) was observed with the lemma “A330”: A330.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 22 different relations: nmod (3960; 37% instances), compound (2017; 19% instances), nsubj (1530; 14% instances), flat:name (1216; 11% instances), obj (746; 7% instances), conj (513; 5% instances), appos (392; 4% instances), obl (211; 2% instances), nsubj:pass (60; 1% instances), root (21; 0% instances), parataxis (18; 0% instances), iobj (11; 0% instances), obl:agent (11; 0% instances), obl:patient (10; 0% instances), ccomp (9; 0% instances), nmod:tmod (5; 0% instances), advcl (4; 0% instances), dislocated (2; 0% instances), flat (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), vocative (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: NOUN (3460; 32% instances), PART (2865; 27% instances), VERB (2442; 23% instances), PROPN (1882; 18% instances), ADJ (32; 0% instances), (21; 0% instances), X (17; 0% instances), NUM (14; 0% instances), PRON (5; 0% instances), ADP (2; 0% instances), ADV (1; 0% instances), SYM (1; 0% instances)

7743 (72%) PROPN nodes are leaves.

2119 (20%) PROPN nodes have one child.

588 (5%) PROPN nodes have two children.

292 (3%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 20.

Children of PROPN nodes are attached using 26 different relations: flat:name (1216; 27% instances), punct (819; 18% instances), case (802; 18% instances), conj (565; 13% instances), nmod (340; 8% instances), appos (235; 5% instances), cc (178; 4% instances), acl (98; 2% instances), acl:relcl (47; 1% instances), cop (40; 1% instances), nsubj (37; 1% instances), parataxis (34; 1% instances), nummod (23; 1% instances), det (20; 0% instances), amod (14; 0% instances), advmod (8; 0% instances), dislocated (8; 0% instances), nmod:tmod (8; 0% instances), compound (5; 0% instances), csubj (4; 0% instances), advcl (3; 0% instances), flat (3; 0% instances), mark (3; 0% instances), obl (2; 0% instances), mark:rel (1; 0% instances), nsubj:outer (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PROPN (1882; 42% instances), PUNCT (819; 18% instances), PART (659; 15% instances), NOUN (450; 10% instances), ADP (242; 5% instances), CCONJ (178; 4% instances), VERB (80; 2% instances), X (78; 2% instances), AUX (40; 1% instances), NUM (26; 1% instances), DET (21; 0% instances), ADJ (15; 0% instances), PRON (12; 0% instances), ADV (8; 0% instances), SCONJ (4; 0% instances)