home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: PROPN

There are 77 PROPN lemmas (5%), 77 PROPN types (4%) and 166 PROPN tokens (2%). Out of 16 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent PROPN lemmas: 梁君彥、 香港、 陳奕迅、 廟街、 梁頌恒、 楊千嬅、 毛孟靜、 英國、 梁、 豪仔

The 10 most frequent PROPN types: 梁君彥、 香港、 陳奕迅、 廟街、 梁頌恒、 楊千嬅、 毛孟靜、 英國、 梁、 豪仔

The 10 most frequent ambiguous lemmas: 梁耀忠 (PROPN 2, PRON 1)

The 10 most frequent ambiguous types: 梁耀忠 (PROPN 2, PRON 1)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.007013).

The 1st highest number of forms (1) was observed with the lemma “Beyond”: Beyond.

The 2nd highest number of forms (1) was observed with the lemma “Philip_”: Philip_.

The 3rd highest number of forms (1) was observed with the lemma “Wong”: Wong.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 17 different relations: obj (35; 21% instances), root (27; 16% instances), nsubj (25; 15% instances), nmod (20; 12% instances), compound (16; 10% instances), obl (15; 9% instances), conj (9; 5% instances), vocative (8; 5% instances), appos (3; 2% instances), acl (1; 1% instances), advcl (1; 1% instances), ccomp (1; 1% instances), dislocated (1; 1% instances), flat (1; 1% instances), obj:periph (1; 1% instances), obl:patient (1; 1% instances), xcomp (1; 1% instances)

Parents of PROPN nodes belong to 7 different parts of speech: VERB (83; 50% instances), NOUN (41; 25% instances), (27; 16% instances), PROPN (8; 5% instances), ADJ (3; 2% instances), ADV (2; 1% instances), PRON (2; 1% instances)

50 (30%) PROPN nodes are leaves.

64 (39%) PROPN nodes have one child.

34 (20%) PROPN nodes have two children.

18 (11%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 21 different relations: punct (70; 34% instances), flat (40; 19% instances), case (33; 16% instances), advmod (9; 4% instances), conj (9; 4% instances), cop (8; 4% instances), cc (5; 2% instances), nsubj (5; 2% instances), case:loc (3; 1% instances), compound (3; 1% instances), det (3; 1% instances), discourse:sp (3; 1% instances), acl (2; 1% instances), amod (2; 1% instances), aux (2; 1% instances), discourse (2; 1% instances), nummod (2; 1% instances), parataxis (2; 1% instances), csubj (1; 0% instances), nmod (1; 0% instances), obl (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: PUNCT (70; 34% instances), NOUN (51; 25% instances), PART (22; 11% instances), ADP (17; 8% instances), AUX (10; 5% instances), ADV (8; 4% instances), PROPN (8; 4% instances), CCONJ (4; 2% instances), VERB (4; 2% instances), ADJ (3; 1% instances), DET (3; 1% instances), NUM (2; 1% instances), PRON (2; 1% instances), INTJ (1; 0% instances), SYM (1; 0% instances)