
This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: PROPN

There are 4783 PROPN lemmas (35%), 4798 PROPN types (34%) and 45765 PROPN tokens (11%). Out of 14 observed tags, the rank of PROPN is: 1 in number of lemmas, 1 in number of types and 3 in number of tokens.
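Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming that "types" means distinct word forms and "lemmas" means distinct values of the LEMMA column. The function name `upos_counts` and the toy input are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
def upos_counts(conllu_text, upos="PROPN"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, forms = set(), set()
    tokens = 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            forms.add(cols[1])   # column 2 is FORM
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(forms), tokens


# Toy input for illustration only; these lines are not quoted from the treebank.
SAMPLE = "\n".join([
    "1\t秦\t秦\tPROPN\t_\t_\t2\tnsubj\t_\t_",
    "2\t伐\t伐\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t齊\t齊\tPROPN\t_\t_\t2\tobj\t_\t_",
    "",
    "1\t上党\t上黨\tPROPN\t_\t_\t0\troot\t_\t_",
    "",
    "1\t上黨\t上黨\tPROPN\t_\t_\t0\troot\t_\t_",
])

n_lemmas, n_types, n_tokens = upos_counts(SAMPLE)
print(n_lemmas, n_types, n_tokens)
```

In the toy input the two spellings 上党 and 上黨 share one lemma, so the type count exceeds the lemma count by one.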

The 10 most frequent PROPN lemmas: 秦、 齊、 魏、 楚、 趙、 韓、 周、 燕、 張、 李

The 10 most frequent PROPN types: 秦、 齊、 魏、 楚、 趙、 韓、 周、 燕、 張、 李

The 10 most frequent ambiguous lemmas: 齊 (PROPN 1625, VERB 178, NOUN 34, ADV 11), 楚 (PROPN 1360, NOUN 2, VERB 1), 周 (PROPN 762, VERB 50, ADV 10, NOUN 2), 燕 (PROPN 705, VERB 28, ADV 13, NOUN 11), 張 (PROPN 433, VERB 37, NOUN 1), 李 (PROPN 399, NOUN 11), 王 (NOUN 4834, PROPN 348, VERB 100), 梁 (PROPN 316, NOUN 40, VERB 1), 漢 (PROPN 295, NOUN 2), 文 (PROPN 280, NOUN 155, VERB 52, ADV 1)

The 10 most frequent ambiguous types: 齊 (PROPN 1625, VERB 178, NOUN 34, ADV 11), 楚 (PROPN 1360, NOUN 2, VERB 1), 周 (PROPN 762, VERB 50, ADV 10, NOUN 2), 燕 (PROPN 705, VERB 28, ADV 13, NOUN 11), 張 (PROPN 433, VERB 37, NOUN 1), 李 (PROPN 399, NOUN 11), 王 (NOUN 4834, PROPN 348, VERB 100), 梁 (PROPN 316, NOUN 40, VERB 1), 漢 (PROPN 295, NOUN 2), 文 (PROPN 280, NOUN 155, VERB 52, ADV 1)
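The ambiguity lists above appear to be ordered by PROPN frequency (王 is ranked by its 348 PROPN occurrences, not by its 4834 NOUN occurrences). A minimal sketch of such a ranking follows, assuming the input is a sequence of (lemma, tag) pairs. The function name `ambiguous_lemmas` and the toy counts are hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(pairs, upos="PROPN"):
    """Rank lemmas seen with `upos` and at least one other tag,
    ordered by how often they carry `upos` (most frequent first)."""
    by_lemma = defaultdict(Counter)
    for lemma, tag in pairs:
        by_lemma[lemma][tag] += 1
    # Keep only lemmas that occur with the target tag and some other tag.
    ambiguous = [(lemma, tags) for lemma, tags in by_lemma.items()
                 if upos in tags and len(tags) > 1]
    return sorted(ambiguous, key=lambda item: -item[1][upos])


# Toy counts for illustration only; not the treebank figures.
toy_pairs = (
    [("齊", "PROPN")] * 3 + [("齊", "VERB")]
    + [("王", "NOUN")] * 5 + [("王", "PROPN")] * 2
    + [("秦", "PROPN")] * 4  # unambiguous, so it is filtered out
)

ranking = ambiguous_lemmas(toy_pairs)
for lemma, tags in ranking:
    print(lemma, dict(tags))
```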

Morphology

The form / lemma ratio of PROPN is 1.003136 (the average of all parts of speech is 1.013130).
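The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of this page. This is an observation about the figures, not a statement of how the generating tool defines the ratio. A short check:

```python
# Figures reported on this page for PROPN.
n_types = 4798
n_lemmas = 4783

ratio = n_types / n_lemmas
print(f"{ratio:.6f}")

# The surplus of types over lemmas is small, which is why
# only a few lemmas have more than one form.
extra_forms = n_types - n_lemmas
print(extra_forms)
```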

The highest number of forms observed for a single PROPN lemma is 2. The top three lemmas listed are “上黨” (上党, 上黨), “侂冑” (侂冑, 侂胄) and “傅巖” (傅巌, 傅巖).

PROPN occurs with 2 features: NameType (45764; 100% instances), Case (19092; 42% instances)

PROPN occurs with 6 feature-value pairs: Case=Loc, NameType=Geo, NameType=Giv, NameType=Nat, NameType=Prs, NameType=Sur

PROPN occurs with 6 feature combinations. The most frequent feature combination is NameType=Giv (15455 tokens). Examples: 儀、 舜、 須菩提、 堯、 秦、 禹、 湯、 衍、 光、 茂

Relations

PROPN nodes are attached to their parents using 23 different relations: nsubj (11860; 26% instances), obj (8433; 18% instances), flat (8051; 18% instances), nmod (7192; 16% instances), compound (3605; 8% instances), conj (2976; 7% instances), obl:lmod (1496; 3% instances), root (865; 2% instances), obl (612; 1% instances), nsubj:outer (154; 0% instances), vocative (120; 0% instances), iobj (119; 0% instances), amod (96; 0% instances), list (46; 0% instances), acl (35; 0% instances), dislocated (28; 0% instances), ccomp (22; 0% instances), parataxis (22; 0% instances), advcl (20; 0% instances), nsubj:pass (6; 0% instances), xcomp (4; 0% instances), csubj (2; 0% instances), clf (1; 0% instances)

Parents of PROPN nodes belong to 9 different parts of speech, and a tenth group has no parent because the PROPN node is itself the root: VERB (21930; 48% instances), NOUN (13195; 29% instances), PROPN (9460; 21% instances), no parent (865; 2% instances), PART (221; 0% instances), NUM (58; 0% instances), PRON (21; 0% instances), AUX (11; 0% instances), ADV (3; 0% instances), ADP (1; 0% instances)

32526 (71%) PROPN nodes are leaves.

10569 (23%) PROPN nodes have one child.

2101 (5%) PROPN nodes have two children.

569 (1%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 41.
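The child-degree figures above can be derived by counting, for each PROPN node, how many tokens name it as their head. The sketch below assumes each sentence is given as a list of (id, upos, head) tuples. The function name `child_degrees` and the toy tree are hypothetical. The last part checks that the four reported counts add up to the 45765 PROPN tokens and reproduce the rounded percentages.

```python
from collections import Counter


def child_degrees(sentence, upos="PROPN"):
    """Map child count -> number of `upos` nodes with that many children.

    `sentence` is a list of (id, upos, head) tuples for one tree.
    """
    n_children = Counter(head for _, _, head in sentence)
    return Counter(n_children[tok_id]
                   for tok_id, tag, _ in sentence if tag == upos)


# Toy tree for illustration only: token 4 is attached to token 3.
toy_sentence = [(1, "PROPN", 2), (2, "VERB", 0), (3, "PROPN", 2), (4, "PROPN", 3)]
toy_degrees = child_degrees(toy_sentence)
print(dict(toy_degrees))

# Consistency check on the figures reported on this page.
reported = {"0": 32526, "1": 10569, "2": 2101, "3+": 569}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
print(total, shares)
```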

Children of PROPN nodes are attached using 29 different relations: flat (6612; 39% instances), case (4014; 24% instances), conj (3038; 18% instances), nmod (1414; 8% instances), nsubj (421; 2% instances), amod (339; 2% instances), discourse:sp (166; 1% instances), nummod (162; 1% instances), nsubj:outer (142; 1% instances), cc (140; 1% instances), compound (124; 1% instances), cop (82; 0% instances), advmod (63; 0% instances), acl (58; 0% instances), mark (42; 0% instances), det (36; 0% instances), list (33; 0% instances), csubj (16; 0% instances), obl:tmod (13; 0% instances), discourse (9; 0% instances), dislocated (7; 0% instances), obl (6; 0% instances), advcl (5; 0% instances), aux (4; 0% instances), flat:vv (4; 0% instances), obl:lmod (4; 0% instances), parataxis (3; 0% instances), expl (1; 0% instances), vocative (1; 0% instances)

Children of PROPN nodes belong to 12 different parts of speech: PROPN (9460; 56% instances), ADP (2276; 13% instances), NOUN (2201; 13% instances), SCONJ (1832; 11% instances), VERB (383; 2% instances), PART (350; 2% instances), NUM (168; 1% instances), ADV (101; 1% instances), PRON (91; 1% instances), AUX (86; 1% instances), CCONJ (10; 0% instances), INTJ (1; 0% instances)