
This page pertains to UD version 2.

Treebank Statistics: UD_Cantonese-HK: POS Tags: PROPN

There are 65 PROPN lemmas (6%), 65 PROPN types (6%) and 115 PROPN tokens (2%). Out of 15 observed tags, the rank of PROPN is: 5 in number of lemmas, 5 in number of types and 10 in number of tokens.
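Counts of this kind can be derived from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is a hypothetical toy example rather than data from UD_Cantonese-HK, and the helper names `read_tokens`, `pos_counts` and `rank` are made up for illustration. How the original report breaks ties when ranking is not stated, so `rank` simply keeps first-seen order among equal counts.

```python
from collections import defaultdict

# Toy CoNLL-U fragment (hypothetical, NOT taken from UD_Cantonese-HK).
SAMPLE = """\
1\t香港\t香港\tPROPN\t_\t_\t2\tnsubj\t_\t_
2\t好\t好\tADJ\t_\t_\t0\troot\t_\t_
3\t。\t。\tPUNCT\t_\t_\t2\tpunct\t_\t_

1\t我\t我\tPRON\t_\t_\t2\tnsubj\t_\t_
2\t去\t去\tVERB\t_\t_\t0\troot\t_\t_
3\t香港\t香港\tPROPN\t_\t_\t2\tobj\t_\t_
4\t。\t。\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for every basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def pos_counts(text):
    """Map each UPOS tag to (distinct lemmas, distinct forms, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(text):
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(counts, upos, index):
    """1-based rank of `upos` by one count (0 = lemmas, 1 = types, 2 = tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(upos) + 1


counts = pos_counts(SAMPLE)
print(counts["PROPN"])
```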

The 10 most frequent PROPN lemmas: 廟街, 香港, Eason, 豪仔, 陳奕迅, Yes!, 九零三, 龍珠, 千嬅, 楊千嬅

The 10 most frequent PROPN types: 廟街, 香港, Eason, 豪仔, 陳奕迅, Yes!, 九零三, 龍珠, 千嬅, 楊千嬅

The 10 most frequent ambiguous lemmas: 多多 (PROPN 2, ADV 1)

The 10 most frequent ambiguous types: 多多 (PROPN 2, ADV 1)
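An ambiguous lemma or type is one that occurs with more than one tag. A small sketch of how such a list might be collected is given here; the function name `ambiguous` is hypothetical, and the input reproduces only the 多多 counts reported above, plus a single illustrative unambiguous item.

```python
from collections import Counter, defaultdict


def ambiguous(items):
    """items: iterable of (key, upos), where key is a lemma or a word form.

    Returns the keys seen with more than one tag, with per-tag counts.
    """
    tags = defaultdict(Counter)
    for key, upos in items:
        tags[key][upos] += 1
    return {key: dict(c) for key, c in tags.items() if len(c) > 1}


# 多多 as reported on this page: PROPN twice, ADV once.
# The single 香港 occurrence is only there to show an unambiguous item.
observed = [("多多", "PROPN"), ("多多", "PROPN"), ("多多", "ADV"), ("香港", "PROPN")]
print(ambiguous(observed))
```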

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 0.998084).

The 1st highest number of forms (1) was observed with the lemma “Beyond”: Beyond.

The 2nd highest number of forms (1) was observed with the lemma “Eason”: Eason.

The 3rd highest number of forms (1) was observed with the lemma “Philip”: Philip.
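The ratio and the per-lemma form counts above can be illustrated with a short sketch. It assumes the form / lemma ratio is the total number of distinct forms per lemma divided by the number of distinct lemmas; the page does not spell out the formula, so this definition is an assumption, and `form_lemma_ratio` is a hypothetical helper name.

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) for one part of speech.

    Assumed definition: sum of distinct forms per lemma / number of lemmas.
    """
    forms_by_lemma = {}
    for form, lemma in pairs:
        forms_by_lemma.setdefault(lemma, set()).add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# The three lemmas listed above each occur with exactly one form.
pairs = [("Beyond", "Beyond"), ("Eason", "Eason"), ("Philip", "Philip")]
print(f"{form_lemma_ratio(pairs):.6f}")
```

Under this definition, a part of speech in which every lemma has a single form always yields exactly 1, which agrees with the value reported for PROPN.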

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 15 different relations: obj (33; 29% instances), root (22; 19% instances), nsubj (14; 12% instances), obl (11; 10% instances), compound (9; 8% instances), conj (8; 7% instances), nmod (7; 6% instances), vocative (3; 3% instances), dislocated (2; 2% instances), advmod (1; 1% instances), appos (1; 1% instances), ccomp (1; 1% instances), flat (1; 1% instances), nsubj:periph (1; 1% instances), reparandum (1; 1% instances)
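The percentages in this list can be checked against the raw counts. The sketch below uses the counts exactly as reported above and assumes each percentage is the count divided by the total, rounded to the nearest integer; the rounding rule is an assumption, not something the page documents.

```python
# Incoming relation counts for PROPN, copied from the list above.
RELATIONS = {
    "obj": 33, "root": 22, "nsubj": 14, "obl": 11, "compound": 9,
    "conj": 8, "nmod": 7, "vocative": 3, "dislocated": 2, "advmod": 1,
    "appos": 1, "ccomp": 1, "flat": 1, "nsubj:periph": 1, "reparandum": 1,
}

# Every PROPN token has exactly one incoming relation, so the counts
# should add up to the number of PROPN tokens.
total = sum(RELATIONS.values())

# Assumed rounding: nearest integer percent.
shares = {rel: round(100 * n / total) for rel, n in RELATIONS.items()}

for rel, n in RELATIONS.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

The total comes to 115, which matches the number of PROPN tokens given at the top of the page.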

Parents of PROPN nodes fall into 8 different categories (7 parts of speech plus the artificial root, which has no tag): VERB (60; 52% instances), [root, no tag] (22; 19% instances), NOUN (17; 15% instances), PROPN (9; 8% instances), ADJ (3; 3% instances), ADV (2; 2% instances), ADP (1; 1% instances), PRON (1; 1% instances)

51 (44%) PROPN nodes are leaves.

25 (22%) PROPN nodes have one child.

19 (17%) PROPN nodes have two children.

20 (17%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.
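The child-degree figures above follow from the HEAD column of the treebank: a node's degree is the number of tokens whose head is that node. A minimal sketch, using a hypothetical five-token sentence (not from UD_Cantonese-HK) and made-up helper names `child_degrees` and `bucket`:

```python
from collections import Counter


def child_degrees(sentence, upos):
    """sentence: list of (id, upos, head) triples for one tree.

    Returns the number of children of every node tagged `upos`.
    """
    n_children = Counter(head for _, _, head in sentence)
    return [n_children[i] for i, tag, _ in sentence if tag == upos]


def bucket(degrees):
    """Group degrees the way the report does: 0, 1, 2, and 3 or more."""
    labels = ["leaf", "one child", "two children", "three or more"]
    hist = Counter(labels[min(d, 3)] for d in degrees)
    return {label: hist[label] for label in labels}


# Hypothetical tree: token 2 is the root; token 3 (ADP) depends on token 4.
toy = [(1, "PROPN", 2), (2, "VERB", 0), (3, "ADP", 4), (4, "PROPN", 2), (5, "PUNCT", 2)]
degrees = child_degrees(toy, "PROPN")
print(bucket(degrees), max(degrees))
```

As a consistency check on the reported figures, the four groups (51, 25, 19 and 20) add up to the 115 PROPN tokens.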

Children of PROPN nodes are attached using 19 different relations: punct (64; 44% instances), case (12; 8% instances), discourse:sp (11; 7% instances), conj (10; 7% instances), det (8; 5% instances), cop (7; 5% instances), discourse (5; 3% instances), nsubj (5; 3% instances), advmod (4; 3% instances), cc (4; 3% instances), reparandum (4; 3% instances), acl (3; 2% instances), case:loc (3; 2% instances), amod (2; 1% instances), ccomp (1; 1% instances), csubj (1; 1% instances), flat (1; 1% instances), nmod (1; 1% instances), parataxis (1; 1% instances)

Children of PROPN nodes belong to 13 different parts of speech: PUNCT (64; 44% instances), PART (18; 12% instances), NOUN (12; 8% instances), ADP (10; 7% instances), PROPN (9; 6% instances), VERB (7; 5% instances), AUX (6; 4% instances), INTJ (6; 4% instances), ADV (4; 3% instances), DET (4; 3% instances), CCONJ (3; 2% instances), ADJ (2; 1% instances), PRON (2; 1% instances)