home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PROPN

There are 1 PROPN lemmas (6%), 75 PROPN types (5%) and 157 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 12 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: _

The 10 most frequent PROPN types: 梁君彥、 香港、 陳奕迅、 廟街、 楊千嬅、 毛孟靜、 英國、 梁、 豪仔、 龍珠

The 10 most frequent ambiguous lemmas: _ (VERB 1642, NOUN 1565, PUNCT 1533, ADV 1017, PRON 742, PART 521, AUX 405, ADP 282, ADJ 263, DET 234, NUM 168, PROPN 157, CCONJ 85, SCONJ 52, INTJ 34, SYM 1)

The 10 most frequent ambiguous types: 應 (AUX 6, PROPN 1, VERB 1), 梁耀忠 (PRON 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 75.000000 (the average of all parts of speech is 103.937500).

The 1st highest number of forms (75) was observed with the lemma “”: Beyond, Philip, Wong, Yes!, 七十一, 三角形內角總和, 不再讓你孤單, 中國大陸, 九零三, 今天等我來, 今日, 伴遊, 分飛燕, 北極, 北極星, 吉隆坡, 塗謹申, 夏天的故事, 多多, 大家樂, 大陸, 天富, 姚松炎, 尹光, 廟街, 張國榮, 張學友, 從不喜歡孤單一個, 悟空, 應, 抬起我的頭來, 撒亞, 新城電台, 有發生過, 李慧琼, 李紅, 梁, 梁君彥, 梁天琦, 梁耀忠, 梁頌恒, 梅艷芳, 森美小儀dotdotdot, 楊千嬅, 每一個明天, 毛孟靜, 浮羅交怡, 港, 湯, 無線電視, 男兒當自強, 益力多, 秋, 紅姐, 羅文, 與我常在, 英國, 華星, 蟹蟹瘦身操, 許智峯, 讓一切隨風, 豪仔, 鄭多變, 金剛郵輪, 金燕子, 金田一, 阿彌陀佛, 阿B, 陳奕迅, 陳維安, 香港, 黎明, 黑夜不再來, 龍珠, 龜波氣功.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 18 different relations: obj (34; 22% instances), root (25; 16% instances), nsubj (24; 15% instances), nmod (20; 13% instances), compound (16; 10% instances), obl (15; 10% instances), conj (9; 6% instances), appos (3; 2% instances), vocative (2; 1% instances), acl (1; 1% instances), advmod (1; 1% instances), aux (1; 1% instances), ccomp (1; 1% instances), dislocated (1; 1% instances), flat (1; 1% instances), obj:periph (1; 1% instances), obl:patient (1; 1% instances), xcomp (1; 1% instances)

Parents of PROPN nodes belong to 9 different parts of speech: VERB (74; 47% instances), NOUN (41; 26% instances), (25; 16% instances), PROPN (8; 5% instances), ADJ (3; 2% instances), AUX (2; 1% instances), PRON (2; 1% instances), ADP (1; 1% instances), ADV (1; 1% instances)

52 (33%) PROPN nodes are leaves.

59 (38%) PROPN nodes have one child.

29 (18%) PROPN nodes have two children.

17 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 6.

Children of PROPN nodes are attached using 21 different relations: punct (63; 35% instances), case (33; 18% instances), flat (31; 17% instances), conj (9; 5% instances), cop (8; 4% instances), cc (5; 3% instances), nsubj (5; 3% instances), case:loc (3; 2% instances), compound (3; 2% instances), det (3; 2% instances), discourse:sp (3; 2% instances), acl (2; 1% instances), amod (2; 1% instances), discourse (2; 1% instances), nummod (2; 1% instances), parataxis (2; 1% instances), advmod (1; 1% instances), aux (1; 1% instances), csubj (1; 1% instances), nmod (1; 1% instances), obl (1; 1% instances)

Children of PROPN nodes belong to 14 different parts of speech: PUNCT (63; 35% instances), NOUN (42; 23% instances), PART (22; 12% instances), ADP (17; 9% instances), AUX (8; 4% instances), PROPN (8; 4% instances), VERB (5; 3% instances), CCONJ (4; 2% instances), ADJ (3; 2% instances), DET (3; 2% instances), NUM (2; 1% instances), PRON (2; 1% instances), INTJ (1; 1% instances), SYM (1; 1% instances)