Treebank Statistics: UD_Chinese-GSD: POS Tags: PROPN
There are 4934 PROPN lemmas (22%), 4934 PROPN types (22%) and 10740 PROPN tokens (9%).
Out of 16 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 4 in number of tokens.
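The lemma, type and token counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names `upos_counts` and `rank` are hypothetical, and the sample rows are invented toy data rather than sentences from the treebank. It assumes the standard ten tab-separated CoNLL-U columns.

```python
from collections import defaultdict

# Toy CoNLL-U rows (invented for illustration, not taken from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
ROWS = [
    ("1", "李", "李", "PROPN", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "訪問", "訪問", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "中國", "中國", "PROPN", "_", "_", "2", "obj", "_", "_"),
    ("4", "和", "和", "CCONJ", "_", "_", "5", "cc", "_", "_"),
    ("5", "日本", "日本", "PROPN", "_", "_", "3", "conj", "_", "_"),
    (),  # an empty row becomes the blank line that separates sentences
    ("1", "中國", "中國", "PROPN", "_", "_", "3", "nsubj", "_", "_"),
    ("2", "很", "很", "ADV", "_", "_", "3", "advmod", "_", "_"),
    ("3", "大", "大", "ADJ", "_", "_", "0", "root", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)


def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword-token ranges and empty nodes.
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(counts, upos, column):
    """1-based rank of `upos` by column (0 = lemmas, 1 = types, 2 = tokens)."""
    ordered = sorted(counts, key=lambda u: -counts[u][column])
    return ordered.index(upos) + 1


counts = upos_counts(SAMPLE)
n_lemmas, n_types, n_tokens = counts["PROPN"]
total_tokens = sum(c[2] for c in counts.values())
print(n_lemmas, n_types, n_tokens, round(100 * n_tokens / total_tokens))
```

On the toy sample, PROPN has 3 lemmas, 3 types and 4 tokens, because 中國 occurs twice. In Chinese the lemma normally equals the form, which is why the lemma and type counts coincide.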
The 10 most frequent PROPN lemmas: 中國、 美國、 日本、 香港、 李、 英國、 中華、 美、 台灣、 英
The 10 most frequent PROPN types: 中國、 美國、 日本、 香港、 李、 英國、 中華、 美、 台灣、 英
The 10 most frequent ambiguous lemmas: 美 (PROPN 68, PART 1), 英 (PROPN 59, NOUN 1), 王 (PROPN 55, PART 19, NOUN 5), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 中 (ADP 380, NOUN 47, PROPN 42, VERB 4, PART 3), 張 (PROPN 41, NOUN 21), 林 (PROPN 25, PART 5, NOUN 1), 港 (PROPN 23, PART 12), 周 (PROPN 22, NOUN 6, PART 1), 清 (PROPN 22, PART 3, NOUN 2)
The 10 most frequent ambiguous types: 美 (PROPN 68, PART 1), 英 (PROPN 59, NOUN 1), 王 (PROPN 55, PART 19, NOUN 5), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 中 (ADP 380, NOUN 47, PROPN 42, VERB 4, PART 3), 張 (PROPN 41, NOUN 21), 林 (PROPN 25, PART 5, NOUN 1), 港 (PROPN 23, PART 12), 周 (PROPN 22, NOUN 6, PART 1), 清 (PROPN 22, PART 3, NOUN 2)
- 美
- 英
- 王
- 日
- 中
  - ADP 380: 而 解藥 的 出現 也 使得 他們 在 公眾 中 得到 了 威望 , 並 將 北方 之 火 推上 權力 寶座 。
  - NOUN 47: 他 花費 了 許多 時間 來 比較 加拿大 地質 調查 局 博物 館 中 的 恐龍 化石 。
  - PROPN 42: 教會 每週 有 許多 不同 的 聚會 , 分別 有 英 文 、 中 文 、 福建 話 、 廣東 話 、 與 印尼 文 聚會 。
  - VERB 4: 往返 的 航程 中 , 第一 砲塔 因 美 軍 的 轟炸 而 中 了 4 枚 炸彈 , 但 對 後續 戰鬥 沒有 影響 。
  - PART 3: 民國 二 年 ( 1913 年 ) , 山西 省 政府 接管 大 朔 中 學校 , 將 校名 改 為 山西 省 立 第三 中學 。
- 張
- 林
- 港
- 周
- 清
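An ambiguous type is a word form that occurs with more than one tag. The sketch below shows one way such a list could be derived, assuming the input is a sequence of (form, UPOS) pairs. The function name `ambiguous_forms` and the toy counts are hypothetical. Ordering by frequency under the target tag is inferred from the list above, not confirmed by the source.

```python
from collections import Counter, defaultdict


def ambiguous_forms(tagged, target="PROPN", top=10):
    """Forms seen with `target` and at least one other tag.

    Returns [(form, [(tag, count), ...]), ...], ordered by how often the
    form carries the target tag; tags are listed by descending count.
    """
    by_form = defaultdict(Counter)
    for form, upos in tagged:
        by_form[form][upos] += 1
    hits = [(f, c) for f, c in by_form.items() if target in c and len(c) > 1]
    hits.sort(key=lambda fc: -fc[1][target])
    return [(f, c.most_common()) for f, c in hits[:top]]


# Toy data with invented counts (not the treebank's figures).
TOY = (
    [("中", "ADP")] * 3
    + [("中", "PROPN")] * 2
    + [("美", "PROPN")] * 4
    + [("美", "PART")]
    + [("日本", "PROPN")] * 5  # unambiguous, so it is filtered out
)

for form, tags in ambiguous_forms(TOY):
    print(form, ", ".join(f"{tag} {n}" for tag, n in tags))
```

The same function applies to lemmas if (lemma, UPOS) pairs are passed instead of forms.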
Morphology
The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.004819).
The 1st highest number of forms (1) was observed with the lemma “14572”: 14572.
The 2nd highest number of forms (1) was observed with the lemma “360”: 360.
The 3rd highest number of forms (1) was observed with the lemma “Casey”: Casey.
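The ratio and the per-lemma form counts can be illustrated with a short sketch. The page does not define the ratio precisely, so this assumes it is the number of distinct (form, lemma) pairs divided by the number of distinct lemmas. That reading is consistent with 4934 types over 4934 lemmas giving 1.0. The names `forms_by_lemma` and `form_lemma_ratio` and the toy pairs are hypothetical.

```python
from collections import defaultdict


def forms_by_lemma(pairs):
    """Map each lemma to the set of distinct forms observed for it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Distinct (form, lemma) pairs per distinct lemma; assumes non-empty input."""
    forms = forms_by_lemma(pairs)
    return sum(len(f) for f in forms.values()) / len(forms)


# Toy pairs: every lemma has exactly one form, as for PROPN on this page.
TOY_PROPN = [("中國", "中國"), ("中國", "中國"), ("Casey", "Casey"), ("360", "360")]
# Contrasting toy case with inflection, to show a ratio above 1.
TOY_INFLECTED = [("cat", "cat"), ("cats", "cat")]

print(form_lemma_ratio(TOY_PROPN))
print(form_lemma_ratio(TOY_INFLECTED))
```

When every lemma has a single form, each lemma ties for the "highest number of forms", so which lemmas are listed first depends only on the sort order used.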
PROPN does not occur with any features.
Relations
PROPN nodes are attached to their parents using 21 different relations: nmod (3956; 37% instances), compound (2018; 19% instances), nsubj (1542; 14% instances), flat:name (1216; 11% instances), obj (746; 7% instances), conj (511; 5% instances), appos (395; 4% instances), obl (210; 2% instances), nsubj:pass (60; 1% instances), root (21; 0% instances), parataxis (18; 0% instances), iobj (11; 0% instances), obl:patient (10; 0% instances), ccomp (9; 0% instances), nmod:tmod (5; 0% instances), advcl (4; 0% instances), dislocated (2; 0% instances), flat:foreign (2; 0% instances), xcomp (2; 0% instances), acl:relcl (1; 0% instances), vocative (1; 0% instances)
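The relation list pairs each count with its share of all PROPN tokens, rounded to a whole percent. A minimal sketch of that tabulation follows, assuming the input is a sequence of (UPOS, DEPREL) pairs. The function name `relation_shares` and the toy data are hypothetical, and breaking ties alphabetically is an assumption.

```python
from collections import Counter


def relation_shares(nodes, upos="PROPN"):
    """Return [(deprel, count, percent)] for nodes tagged `upos`.

    Sorted by descending count; ties are broken alphabetically (an assumption).
    """
    counts = Counter(deprel for tag, deprel in nodes if tag == upos)
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rel, n, round(100 * n / total)) for rel, n in ordered]


# Toy (UPOS, DEPREL) pairs, invented for illustration.
TOY_NODES = [
    ("PROPN", "nsubj"), ("VERB", "root"), ("PROPN", "obj"), ("CCONJ", "cc"),
    ("PROPN", "conj"), ("PROPN", "nsubj"), ("ADV", "advmod"), ("ADJ", "root"),
]

for rel, n, pct in relation_shares(TOY_NODES):
    print(f"{rel} ({n}; {pct}% instances)")
```

The same arithmetic reproduces the reported figures: 3956 nmod attachments out of 10740 PROPN tokens round to 37%, and relations with only a few instances round down to 0%.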
Parents of PROPN nodes belong to 13 different parts of speech: NOUN (3451; 32% instances), PART (2865; 27% instances), VERB (2445; 23% instances), PROPN (1887; 18% instances), ADJ (31; 0% instances), no tag, i.e. the artificial root (21; 0% instances), X (16; 0% instances), NUM (13; 0% instances), PRON (6; 0% instances), ADP (2; 0% instances), ADV (1; 0% instances), DET (1; 0% instances), SYM (1; 0% instances)
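The parent counts above can be recomputed by looking up the tag of each node's head. In this sketch, a node whose HEAD is 0 is attached to the artificial root, which has no part-of-speech tag, so it is counted under an empty string. This matches the 21 PROPN nodes that carry the root relation. The function name `parent_upos` and the toy sentences are hypothetical. Each sentence is assumed to be a list of (ID, UPOS, HEAD) tuples.

```python
from collections import Counter


def parent_upos(sentences, upos="PROPN"):
    """Count the UPOS tags of the parents of all nodes tagged `upos`."""
    counts = Counter()
    for sent in sentences:
        tag_of = {node_id: tag for node_id, tag, _ in sent}
        for _, tag, head in sent:
            if tag == upos:
                # HEAD 0 has no entry in tag_of, so the root maps to "".
                counts[tag_of.get(head, "")] += 1
    return counts


# Toy sentences as (ID, UPOS, HEAD) tuples, invented for illustration.
TOY_SENTENCES = [
    [(1, "PROPN", 2), (2, "VERB", 0), (3, "PROPN", 2), (4, "CCONJ", 5), (5, "PROPN", 3)],
    [(1, "PROPN", 3), (2, "ADV", 3), (3, "ADJ", 0)],
    [(1, "PROPN", 0)],  # a one-word sentence whose root is a proper noun
]

for tag, n in parent_upos(TOY_SENTENCES).most_common():
    print(repr(tag), n)
```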
7715 (72%) PROPN nodes are leaves.
2110 (20%) PROPN nodes have one child.
601 (6%) PROPN nodes have two children.
314 (3%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 20.
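The child-degree figures can be derived by counting, within each sentence, how many nodes name a given node as their head. The sketch below is one way to do that, assuming each sentence is a list of (ID, UPOS, HEAD) tuples. The function names `child_degrees` and `degree_buckets` and the toy sentences are hypothetical.

```python
from collections import Counter


def child_degrees(sentences, upos="PROPN"):
    """Return the number of children of every node tagged `upos`."""
    degrees = []
    for sent in sentences:
        # How many nodes in this sentence point at each head ID.
        n_children = Counter(head for _, _, head in sent)
        degrees.extend(n_children[node_id] for node_id, tag, _ in sent if tag == upos)
    return degrees


def degree_buckets(degrees):
    """Group degrees into leaf / one / two / three-or-more, as on this page."""
    labels = ["leaf", "one child", "two children", "three or more"]
    buckets = dict.fromkeys(labels, 0)
    for d in degrees:
        buckets[labels[min(d, 3)]] += 1
    return buckets


# Toy sentences as (ID, UPOS, HEAD) tuples, invented for illustration.
TOY_SENTENCES = [
    [(1, "PROPN", 2), (2, "VERB", 0), (3, "PROPN", 2), (4, "CCONJ", 5), (5, "PROPN", 3)],
    [(1, "PROPN", 3), (2, "ADV", 3), (3, "ADJ", 0)],
]

degrees = child_degrees(TOY_SENTENCES)
print(degree_buckets(degrees), "max degree:", max(degrees))
```

As a consistency check on the reported figures, the four buckets sum to the PROPN token count: 7715 + 2110 + 601 + 314 = 10740.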
Children of PROPN nodes are attached using 27 different relations: flat:name (1216; 26% instances), case (834; 18% instances), punct (825; 18% instances), conj (569; 12% instances), nmod (349; 8% instances), appos (239; 5% instances), cc (190; 4% instances), acl (98; 2% instances), acl:relcl (55; 1% instances), cop (51; 1% instances), nsubj (49; 1% instances), parataxis (37; 1% instances), nummod (23; 0% instances), det (18; 0% instances), amod (14; 0% instances), advmod (9; 0% instances), dislocated (8; 0% instances), nmod:tmod (8; 0% instances), compound (5; 0% instances), csubj (4; 0% instances), advcl (3; 0% instances), clf (3; 0% instances), mark (3; 0% instances), flat:foreign (2; 0% instances), obl (2; 0% instances), ccomp (1; 0% instances), mark:rel (1; 0% instances)
Children of PROPN nodes belong to 16 different parts of speech: PROPN (1887; 41% instances), PUNCT (825; 18% instances), PART (663; 14% instances), NOUN (471; 10% instances), ADP (274; 6% instances), CCONJ (190; 4% instances), VERB (91; 2% instances), X (79; 2% instances), AUX (51; 1% instances), NUM (25; 1% instances), DET (18; 0% instances), ADJ (15; 0% instances), PRON (12; 0% instances), ADV (9; 0% instances), SCONJ (4; 0% instances), SYM (2; 0% instances)