home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PROPN

There are 714 PROPN lemmas (13%), 714 PROPN types (13%) and 4741 PROPN tokens (4%). Out of 13 observed tags, the rank of PROPN is: 3 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: 孔子、 孟子、 周、 曾子、 舜、 齊、 文、 殷、 魯、 夏

The 10 most frequent PROPN types: 孔子、 孟子、 周、 曾子、 舜、 齊、 文、 殷、 魯、 夏

The 10 most frequent ambiguous lemmas: 周 (PROPN 145, VERB 25, ADV 3, NOUN 1), 齊 (VERB 119, PROPN 105, NOUN 31, ADV 9), 文 (PROPN 93, NOUN 70, VERB 33), 殷 (PROPN 87, VERB 5, ADV 1), 魯 (PROPN 80, VERB 1), 夏 (PROPN 74, NOUN 36, VERB 1), 武 (PROPN 51, NOUN 29, VERB 9), 虞 (PROPN 43, VERB 30, NOUN 10), 衞 (PROPN 43, VERB 2), 季 (PROPN 42, NUM 7, NOUN 3)

The 10 most frequent ambiguous types: 周 (PROPN 145, VERB 25, ADV 3, NOUN 1), 齊 (VERB 119, PROPN 105, NOUN 31, ADV 9), 文 (PROPN 93, NOUN 70, VERB 33), 殷 (PROPN 87, VERB 5, ADV 1), 魯 (PROPN 80, VERB 1), 夏 (PROPN 74, NOUN 36, VERB 1), 武 (PROPN 51, NOUN 29, VERB 9), 虞 (PROPN 43, VERB 30, NOUN 10), 衛 (PROPN 43, VERB 2), 季 (PROPN 42, NUM 7, NOUN 3)

Morphology

The form / lemma ratio of PROPN is 1.000000 (the average of all parts of speech is 1.002166).

The 1st highest number of forms (1) was observed with the lemma “丁”: 丁.

The 2nd highest number of forms (1) was observed with the lemma “三危”: 三危.

The 3rd highest number of forms (1) was observed with the lemma “不勝”: 不勝.

PROPN occurs with 2 features: NameType (4740; 100% instances), Case (1019; 21% instances)

PROPN occurs with 6 feature-value pairs: Case=Loc, NameType=Geo, NameType=Giv, NameType=Nat, NameType=Prs, NameType=Sur

PROPN occurs with 6 feature combinations. The most frequent feature combination is NameType=Giv (1779 tokens). Examples: 舜、 子貢、 子路、 堯、 子夏、 子游、 湯、 禹、 子張、 子思

Relations

PROPN nodes are attached to their parents using 18 different relations: nsubj (2008; 42% instances), nmod (698; 15% instances), flat (617; 13% instances), obj (471; 10% instances), compound (345; 7% instances), conj (213; 4% instances), obl:lmod (121; 3% instances), obl (120; 3% instances), root (94; 2% instances), dislocated (22; 0% instances), vocative (12; 0% instances), list (8; 0% instances), iobj (6; 0% instances), acl (2; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), flat:vv (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of PROPN nodes belong to 8 different parts of speech: VERB (2636; 56% instances), NOUN (1187; 25% instances), PROPN (771; 16% instances), (94; 2% instances), PART (33; 1% instances), NUM (8; 0% instances), PRON (7; 0% instances), AUX (5; 0% instances)

3356 (71%) PROPN nodes are leaves.

1080 (23%) PROPN nodes have one child.

239 (5%) PROPN nodes have two children.

66 (1%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.

Children of PROPN nodes are attached using 22 different relations: case (639; 36% instances), flat (609; 34% instances), conj (212; 12% instances), nmod (65; 4% instances), discourse:sp (64; 4% instances), nsubj (61; 3% instances), amod (35; 2% instances), advmod (20; 1% instances), cc (17; 1% instances), nummod (12; 1% instances), list (10; 1% instances), cop (9; 1% instances), det (8; 0% instances), acl (7; 0% instances), discourse (7; 0% instances), csubj (6; 0% instances), mark (4; 0% instances), advcl (2; 0% instances), aux (2; 0% instances), dislocated (2; 0% instances), compound (1; 0% instances), obl (1; 0% instances)

Children of PROPN nodes belong to 10 different parts of speech: PROPN (771; 43% instances), SCONJ (328; 18% instances), ADP (257; 14% instances), PART (163; 9% instances), NOUN (160; 9% instances), VERB (51; 3% instances), ADV (20; 1% instances), PRON (20; 1% instances), NUM (12; 1% instances), AUX (11; 1% instances)