home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PROPN

There are 16 PROPN lemmas (3%), 22 PROPN types (4%) and 35 PROPN tokens (2%). Out of 17 observed tags, the rank of PROPN is: 9 in number of lemmas, 6 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: _、 廟街、 香港、 湯、 紅姐、 分飛燕、 大陸、 尹光、 從不喜歡孤單一個、 李紅

The 10 most frequent PROPN types: 廟街、 香港、 龍珠、 湯、 紅姐、 豪仔、 分飛燕、 多多、 大陸、 尹光

The 10 most frequent ambiguous lemmas: _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 1.375000 (the average of all parts of speech is 1.221258).

The 1st highest number of forms (7) was observed with the lemma “_”: 多多, 悟空, 撒亞, 益力多, 豪仔, 龍珠, 龜波氣功.

The 2nd highest number of forms (1) was observed with the lemma “分飛燕”: 分飛燕.

The 3rd highest number of forms (1) was observed with the lemma “大陸”: 大陸.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 12 different relations: obj (11; 31% instances), nsubj (6; 17% instances), nmod (4; 11% instances), root (4; 11% instances), compound (2; 6% instances), dislocated (2; 6% instances), advmod (1; 3% instances), appos (1; 3% instances), conj (1; 3% instances), obl (1; 3% instances), vocative (1; 3% instances), xcomp (1; 3% instances)

Parents of PROPN nodes belong to 5 different parts of speech: VERB (21; 60% instances), NOUN (7; 20% instances), (4; 11% instances), ADJ (2; 6% instances), PROPN (1; 3% instances)

20 (57%) PROPN nodes are leaves.

8 (23%) PROPN nodes have one child.

2 (6%) PROPN nodes have two children.

5 (14%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 14 different relations: punct (15; 43% instances), case (5; 14% instances), advmod (2; 6% instances), conj (2; 6% instances), discourse:sp (2; 6% instances), acl (1; 3% instances), amod (1; 3% instances), clf (1; 3% instances), cop (1; 3% instances), discourse (1; 3% instances), nmod (1; 3% instances), nsubj (1; 3% instances), nummod (1; 3% instances), parataxis (1; 3% instances)

Children of PROPN nodes belong to 10 different parts of speech: PUNCT (15; 43% instances), PART (5; 14% instances), ADP (3; 9% instances), ADJ (2; 6% instances), ADV (2; 6% instances), NOUN (2; 6% instances), PRON (2; 6% instances), VERB (2; 6% instances), NUM (1; 3% instances), PROPN (1; 3% instances)