Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PROPN
There are 4783 PROPN
lemmas (35%), 4798 PROPN
types (34%) and 45765 PROPN
tokens (11%).
Out of 14 observed tags, the rank of PROPN
is: 1 in number of lemmas, 1 in number of types and 3 in number of tokens.
The 10 most frequent PROPN
lemmas: 秦、 齊、 魏、 楚、 趙、 韓、 周、 燕、 張、 李
The 10 most frequent PROPN
types: 秦、 齊、 魏、 楚、 趙、 韓、 周、 燕、 張、 李
The 10 most frequent ambiguous lemmas: 齊 (PROPN 1625, VERB 178, NOUN 34, ADV 11), 楚 (PROPN 1360, NOUN 2, VERB 1), 周 (PROPN 762, VERB 50, ADV 10, NOUN 2), 燕 (PROPN 705, VERB 28, ADV 13, NOUN 11), 張 (PROPN 433, VERB 37, NOUN 1), 李 (PROPN 399, NOUN 11), 王 (NOUN 4834, PROPN 348, VERB 100), 梁 (PROPN 316, NOUN 40, VERB 1), 漢 (PROPN 295, NOUN 2), 文 (PROPN 280, NOUN 155, VERB 52, ADV 1)
The 10 most frequent ambiguous types: 齊 (PROPN 1625, VERB 178, NOUN 34, ADV 11), 楚 (PROPN 1360, NOUN 2, VERB 1), 周 (PROPN 762, VERB 50, ADV 10, NOUN 2), 燕 (PROPN 705, VERB 28, ADV 13, NOUN 11), 張 (PROPN 433, VERB 37, NOUN 1), 李 (PROPN 399, NOUN 11), 王 (NOUN 4834, PROPN 348, VERB 100), 梁 (PROPN 316, NOUN 40, VERB 1), 漢 (PROPN 295, NOUN 2), 文 (PROPN 280, NOUN 155, VERB 52, ADV 1)
- 齊
- 楚
- 周
- 燕
- 張
- 李
- 王
- 梁
- 漢
- 文
Morphology
The form / lemma ratio of PROPN
is 1.003136 (the average of all parts of speech is 1.013130).
The 1st highest number of forms (2) was observed with the lemma “上黨”: 上党, 上黨.
The 2nd highest number of forms (2) was observed with the lemma “侂冑”: 侂冑, 侂胄.
The 3rd highest number of forms (2) was observed with the lemma “傅巖”: 傅巌, 傅巖.
PROPN
occurs with 2 features: NameType (45764; 100% instances), Case (19092; 42% instances)
PROPN
occurs with 6 feature-value pairs: Case=Loc
, NameType=Geo
, NameType=Giv
, NameType=Nat
, NameType=Prs
, NameType=Sur
PROPN
occurs with 6 feature combinations.
The most frequent feature combination is NameType=Giv
(15455 tokens).
Examples: 儀、 舜、 須菩提、 堯、 秦、 禹、 湯、 衍、 光、 茂
Relations
PROPN
nodes are attached to their parents using 23 different relations: nsubj (11860; 26% instances), obj (8433; 18% instances), flat (8051; 18% instances), nmod (7192; 16% instances), compound (3605; 8% instances), conj (2976; 7% instances), obl:lmod (1496; 3% instances), root (865; 2% instances), obl (612; 1% instances), nsubj:outer (154; 0% instances), vocative (120; 0% instances), iobj (119; 0% instances), amod (96; 0% instances), list (46; 0% instances), acl (35; 0% instances), dislocated (28; 0% instances), ccomp (22; 0% instances), parataxis (22; 0% instances), advcl (20; 0% instances), nsubj:pass (6; 0% instances), xcomp (4; 0% instances), csubj (2; 0% instances), clf (1; 0% instances)
Parents of PROPN
nodes belong to 10 different parts of speech: VERB (21930; 48% instances), NOUN (13195; 29% instances), PROPN (9460; 21% instances), (865; 2% instances), PART (221; 0% instances), NUM (58; 0% instances), PRON (21; 0% instances), AUX (11; 0% instances), ADV (3; 0% instances), ADP (1; 0% instances)
32526 (71%) PROPN
nodes are leaves.
10569 (23%) PROPN
nodes have one child.
2101 (5%) PROPN
nodes have two children.
569 (1%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 41.
Children of PROPN
nodes are attached using 29 different relations: flat (6612; 39% instances), case (4014; 24% instances), conj (3038; 18% instances), nmod (1414; 8% instances), nsubj (421; 2% instances), amod (339; 2% instances), discourse:sp (166; 1% instances), nummod (162; 1% instances), nsubj:outer (142; 1% instances), cc (140; 1% instances), compound (124; 1% instances), cop (82; 0% instances), advmod (63; 0% instances), acl (58; 0% instances), mark (42; 0% instances), det (36; 0% instances), list (33; 0% instances), csubj (16; 0% instances), obl:tmod (13; 0% instances), discourse (9; 0% instances), dislocated (7; 0% instances), obl (6; 0% instances), advcl (5; 0% instances), aux (4; 0% instances), flat:vv (4; 0% instances), obl:lmod (4; 0% instances), parataxis (3; 0% instances), expl (1; 0% instances), vocative (1; 0% instances)
Children of PROPN
nodes belong to 12 different parts of speech: PROPN (9460; 56% instances), ADP (2276; 13% instances), NOUN (2201; 13% instances), SCONJ (1832; 11% instances), VERB (383; 2% instances), PART (350; 2% instances), NUM (168; 1% instances), ADV (101; 1% instances), PRON (91; 1% instances), AUX (86; 1% instances), CCONJ (10; 0% instances), INTJ (1; 0% instances)