home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PROPN

There are 2990 PROPN lemmas (31%), 2988 PROPN types (30%) and 19162 PROPN tokens (8%). Out of 13 observed tags, the rank of PROPN is: 1 in number of lemmas, 1 in number of types and 4 in number of tokens.

The 10 most frequent PROPN lemmas: 孔子、 周、 孟子、 秦、 齊、 魏、 漢、 晉、 武、 李

The 10 most frequent PROPN types: 孔子、 周、 孟子、 秦、 齊、 魏、 漢、 晉、 武、 李

The 10 most frequent ambiguous lemmas: 周 (PROPN 332, VERB 35, ADV 6, NOUN 2), 齊 (PROPN 262, VERB 162, NOUN 35, ADV 9), 漢 (PROPN 216, NOUN 1), 武 (PROPN 198, NOUN 59, VERB 20, ADV 2), 李 (PROPN 181, NOUN 8), 王 (NOUN 1438, PROPN 178, VERB 66), 楚 (PROPN 166, NOUN 1), 文 (PROPN 164, NOUN 117, VERB 44, ADV 1), 梁 (PROPN 138, NOUN 12), 劉 (PROPN 137, NOUN 4, SYM 1)

The 10 most frequent ambiguous types: 周 (PROPN 332, VERB 35, ADV 6, NOUN 2), 齊 (PROPN 262, VERB 162, NOUN 35, ADV 9), 漢 (PROPN 216, NOUN 1), 武 (PROPN 198, NOUN 59, VERB 20, ADV 2), 李 (PROPN 181, NOUN 8), 王 (NOUN 1438, PROPN 178, VERB 66), 楚 (PROPN 166, NOUN 1), 文 (PROPN 164, NOUN 117, VERB 44, ADV 1), 梁 (PROPN 138, NOUN 12), 劉 (PROPN 137, NOUN 4, SYM 1)

Morphology

The form / lemma ratio of PROPN is 0.999331 (the average of all parts of speech is 1.011910).

The 1st highest number of forms (2) was observed with the lemma “健”: 健, 建.

The 2nd highest number of forms (2) was observed with the lemma “吳”: 吳, 呉.

The 3rd highest number of forms (2) was observed with the lemma “啟”: 啓, 啟.

PROPN occurs with 2 features: NameType (19160; 100% instances), Case (5524; 29% instances)

PROPN occurs with 6 feature-value pairs: Case=Loc, NameType=Geo, NameType=Giv, NameType=Nat, NameType=Prs, NameType=Sur

PROPN occurs with 6 feature combinations. The most frequent feature combination is NameType=Giv (8179 tokens). Examples: 舜、 堯、 禹、 子貢、 子路、 信、 羽、 淵、 湯、 亮

Relations

PROPN nodes are attached to their parents using 21 different relations: nsubj (5627; 29% instances), nmod (3614; 19% instances), flat (3511; 18% instances), obj (2977; 16% instances), compound (1054; 6% instances), conj (1041; 5% instances), obl:lmod (398; 2% instances), root (367; 2% instances), obl (307; 2% instances), iobj (65; 0% instances), amod (60; 0% instances), list (46; 0% instances), dislocated (27; 0% instances), acl (18; 0% instances), vocative (14; 0% instances), advcl (11; 0% instances), parataxis (10; 0% instances), flat:vv (6; 0% instances), ccomp (5; 0% instances), nsubj:pass (3; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech: VERB (8918; 47% instances), NOUN (6081; 32% instances), PROPN (3607; 19% instances), (367; 2% instances), PART (136; 1% instances), NUM (34; 0% instances), PRON (11; 0% instances), AUX (7; 0% instances), ADV (1; 0% instances)

14159 (74%) PROPN nodes are leaves.

3764 (20%) PROPN nodes have one child.

957 (5%) PROPN nodes have two children.

282 (1%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 28.

Children of PROPN nodes are attached using 28 different relations: flat (2574; 38% instances), case (1344; 20% instances), conj (1087; 16% instances), nmod (895; 13% instances), nsubj (390; 6% instances), amod (95; 1% instances), discourse:sp (78; 1% instances), cc (49; 1% instances), cop (48; 1% instances), advmod (35; 1% instances), nummod (34; 1% instances), list (33; 0% instances), acl (32; 0% instances), compound (15; 0% instances), det (15; 0% instances), mark (11; 0% instances), csubj (9; 0% instances), discourse (8; 0% instances), flat:vv (4; 0% instances), obl:tmod (4; 0% instances), dislocated (3; 0% instances), obl (3; 0% instances), obl:lmod (3; 0% instances), advcl (2; 0% instances), aux (2; 0% instances), parataxis (2; 0% instances), expl (1; 0% instances), fixed (1; 0% instances)

Children of PROPN nodes belong to 11 different parts of speech: PROPN (3607; 53% instances), NOUN (1335; 20% instances), ADP (774; 11% instances), SCONJ (554; 8% instances), PART (194; 3% instances), VERB (141; 2% instances), AUX (50; 1% instances), PRON (45; 1% instances), ADV (42; 1% instances), NUM (34; 1% instances), CCONJ (1; 0% instances)