home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PROPN

There are 1 PROPN lemmas (6%), 77 PROPN types (4%) and 166 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 12 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: _

The 10 most frequent PROPN types: 梁君彥、 香港、 陳奕迅、 廟街、 梁頌恒、 楊千嬅、 毛孟靜、 英國、 梁、 豪仔

The 10 most frequent ambiguous lemmas: _ (VERB 1848, NOUN 1766, PUNCT 1738, ADV 1158, PRON 875, PART 560, AUX 502, ADP 321, ADJ 301, DET 260, NUM 177, PROPN 166, CCONJ 99, SCONJ 68, INTJ 34, SYM 1)

The 10 most frequent ambiguous types: 梁耀忠 (PROPN 2, PRON 1)

Morphology

The form / lemma ratio of PROPN is 77.000000 (the average of all parts of speech is 107.937500).

The 1st highest number of forms (77) was observed with the lemma “”: Beyond, Philip, Wong, Yes!, 七十一, 三角形內角總和, 不再讓你孤單, 中國大陸, 九零三, 今天等我來, 今日, 伴遊, 分飛燕, 北極, 北極星, 吉隆坡, 塗謹申, 夏天的故事, 多多, 大家樂, 大陸, 天富, 姚松炎, 尹光, 廟街, 張國榮, 張學友, 從不喜歡孤單一個, 悟空, 抬起我的頭來, 撒亞, 新城電台, 有發生過, 李慧琼, 李紅, 梁, 梁君彥, 梁天琦, 梁美芬, 梁耀忠, 梁頌恒, 梅艷芳, 森美小儀dotdotdot, 楊千嬅, 每一個明天, 毛孟靜, 浮羅交怡, 港, 湯, 無線電視, 男兒當自強, 益力多, 秋, 紅姐, 羅文, 與我常在, 英國, 華星, 蟹蟹瘦身操, 許智峯, 謝偉俊, 讓一切隨風, 豪仔, 鄭多變, 金剛郵輪, 金燕子, 金田一, 阿彌陀佛, 阿B, 陳奕迅, 陳志全, 陳維安, 香港, 黎明, 黑夜不再來, 龍珠, 龜波氣功.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 17 different relations: obj (35; 21% instances), root (27; 16% instances), nsubj (25; 15% instances), nmod (20; 12% instances), compound (16; 10% instances), obl (15; 9% instances), conj (9; 5% instances), vocative (8; 5% instances), appos (3; 2% instances), acl (1; 1% instances), advcl (1; 1% instances), ccomp (1; 1% instances), dislocated (1; 1% instances), flat (1; 1% instances), obj:periph (1; 1% instances), obl:patient (1; 1% instances), xcomp (1; 1% instances)

Parents of PROPN nodes belong to 8 different parts of speech: VERB (81; 49% instances), NOUN (41; 25% instances), (27; 16% instances), PROPN (8; 5% instances), ADJ (3; 2% instances), ADV (2; 1% instances), AUX (2; 1% instances), PRON (2; 1% instances)

52 (31%) PROPN nodes are leaves.

67 (40%) PROPN nodes have one child.

28 (17%) PROPN nodes have two children.

19 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 21 different relations: punct (64; 32% instances), flat (40; 20% instances), case (33; 17% instances), advmod (9; 5% instances), conj (9; 5% instances), cop (8; 4% instances), cc (5; 3% instances), nsubj (5; 3% instances), case:loc (3; 2% instances), compound (3; 2% instances), det (3; 2% instances), discourse:sp (3; 2% instances), acl (2; 1% instances), amod (2; 1% instances), aux (2; 1% instances), discourse (2; 1% instances), nummod (2; 1% instances), parataxis (2; 1% instances), csubj (1; 1% instances), nmod (1; 1% instances), obl (1; 1% instances)

Children of PROPN nodes belong to 15 different parts of speech: PUNCT (64; 32% instances), NOUN (51; 26% instances), PART (22; 11% instances), ADP (17; 9% instances), AUX (10; 5% instances), ADV (8; 4% instances), PROPN (8; 4% instances), CCONJ (4; 2% instances), VERB (4; 2% instances), ADJ (3; 2% instances), DET (3; 2% instances), NUM (2; 1% instances), PRON (2; 1% instances), INTJ (1; 1% instances), SYM (1; 1% instances)