home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: PROPN

There are 806 PROPN lemmas (14%), 806 PROPN types (14%) and 1361 PROPN tokens (6%). Out of 15 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent PROPN lemmas: 美國、 英國、 中國、 德國、 歐、 法國、 歐洲、 希臘、 特朗普、 地中海

The 10 most frequent PROPN types: 美國、 英國、 中國、 德國、 歐、 法國、 歐洲、 希臘、 特朗普、 地中海

The 10 most frequent ambiguous lemmas: 美 (PROPN 8, NOUN 1), 非 (VERB 4, PROPN 3), 法 (PROPN 2, PART 1), 印 (PROPN 1, VERB 1), 希斯帕尼亞 (NOUN 1, PROPN 1), 張 (NOUN 5, ADP 1, PROPN 1), 時代 (NOUN 7, PROPN 1)

The 10 most frequent ambiguous types: 美 (PROPN 8, NOUN 1), 非 (VERB 4, PROPN 3), 法 (PROPN 2, PART 1), 印 (PROPN 1, VERB 1), 希斯帕尼亞 (NOUN 1, PROPN 1), 張 (NOUN 5, ADP 1, PROPN 1), 時代 (NOUN 7, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.006233).

The 1st highest number of forms (1) was observed with the lemma “一世”: 一世.

The 2nd highest number of forms (1) was observed with the lemma “丁丁”: 丁丁.

The 3rd highest number of forms (1) was observed with the lemma “三世”: 三世.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 19 different relations: compound (435; 32% instances), nsubj (264; 19% instances), obj (152; 11% instances), flat:name (142; 10% instances), nmod (132; 10% instances), appos (68; 5% instances), conj (67; 5% instances), obl (63; 5% instances), nsubj:pass (12; 1% instances), root (5; 0% instances), dep (4; 0% instances), iobj (4; 0% instances), obl:agent (4; 0% instances), acl:relcl (3; 0% instances), obl:patient (2; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 10 different parts of speech: NOUN (623; 46% instances), VERB (524; 39% instances), PROPN (179; 13% instances), ADJ (8; 1% instances), X (7; 1% instances), ADP (6; 0% instances), (5; 0% instances), NUM (4; 0% instances), PART (4; 0% instances), PRON (1; 0% instances)

801 (59%) PROPN nodes are leaves.

408 (30%) PROPN nodes have one child.

104 (8%) PROPN nodes have two children.

48 (4%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 22 different relations: case (197; 25% instances), punct (166; 21% instances), flat:name (102; 13% instances), conj (66; 8% instances), appos (61; 8% instances), cc (49; 6% instances), compound (42; 5% instances), nmod (26; 3% instances), case:loc (21; 3% instances), acl:relcl (19; 2% instances), cop (8; 1% instances), det (6; 1% instances), nsubj (6; 1% instances), clf (4; 1% instances), dep (4; 1% instances), obl:tmod (4; 1% instances), mark:rel (3; 0% instances), csubj (2; 0% instances), nummod (2; 0% instances), amod (1; 0% instances), dislocated (1; 0% instances), parataxis (1; 0% instances)

Children of PROPN nodes belong to 13 different parts of speech: PROPN (179; 23% instances), PUNCT (166; 21% instances), PART (139; 18% instances), ADP (82; 10% instances), NOUN (77; 10% instances), X (55; 7% instances), CCONJ (49; 6% instances), VERB (23; 3% instances), AUX (8; 1% instances), DET (6; 1% instances), PRON (3; 0% instances), ADJ (2; 0% instances), NUM (2; 0% instances)