Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NOUN
There are 3800 NOUN lemmas (28%), 3857 NOUN types (28%) and 123065 NOUN tokens (28%).
Out of 14 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.
The 10 most frequent NOUN lemmas: 王、 人、 子、 君、 天、 國、 下、 臣、 公、 上
The 10 most frequent NOUN types: 王、 人、 子、 君、 天、 國、 下、 臣、 公、 上
The 10 most frequent ambiguous lemmas: 王 (NOUN 4834, PROPN 348, VERB 100), 子 (NOUN 3054, PRON 417, VERB 28, PROPN 8, PART 2, NUM 1), 君 (NOUN 2442, PROPN 23, VERB 23), 天 (NOUN 2196, VERB 6), 國 (NOUN 1870, PROPN 7, VERB 2), 下 (NOUN 1822, VERB 335, ADV 7), 臣 (NOUN 1752, VERB 60, PROPN 5, ADV 2), 公 (NOUN 1420, VERB 43, PROPN 14, ADV 8, PRON 1), 上 (NOUN 1409, VERB 181, ADV 29), 兵 (NOUN 1278, VERB 1)
The 10 most frequent ambiguous types: 王 (NOUN 4834, PROPN 348, VERB 100), 子 (NOUN 3054, PRON 417, VERB 28, PROPN 8, PART 2, NUM 1), 君 (NOUN 2442, PROPN 23, VERB 23), 天 (NOUN 2196, VERB 6), 國 (NOUN 1870, PROPN 7, VERB 2), 下 (NOUN 1822, VERB 335, ADV 7), 臣 (NOUN 1752, VERB 60, PROPN 5, ADV 2), 公 (NOUN 1420, VERB 43, PROPN 14, ADV 8, PRON 1), 上 (NOUN 1409, VERB 181, ADV 29), 兵 (NOUN 1278, VERB 1)
- 王
- 子
- 君
- 天
- 國
- 下
- 臣
- 公
- 上
- 兵
Morphology
The form / lemma ratio of NOUN is 1.015000 (the average of all parts of speech is 1.013130).
The 1st highest number of forms (2) was observed with the lemma “內”: 內, 内.
The 2nd highest number of forms (2) was observed with the lemma “冰”: 冰, 氷.
The 3rd highest number of forms (2) was observed with the lemma “勳”: 勛, 勳.
NOUN occurs with 3 features: Case (32519; 26% instances), NounType (913; 1% instances), Degree (1; 0% instances)
NOUN occurs with 4 feature-value pairs: Case=Loc, Case=Tem, Degree=Pos, NounType=Clf
NOUN occurs with 5 feature combinations.
The most frequent feature combination is _ (89632 tokens).
Examples: 王、 人、 子、 君、 國、 臣、 公、 兵、 事、 帝
Relations
NOUN nodes are attached to their parents using 28 different relations: obj (36528; 30% instances), nsubj (29541; 24% instances), nmod (22804; 19% instances), conj (7534; 6% instances), root (5707; 5% instances), obl:tmod (4059; 3% instances), obl (3480; 3% instances), obl:lmod (3473; 3% instances), flat (3379; 3% instances), clf (2066; 2% instances), compound (1605; 1% instances), nsubj:outer (675; 1% instances), iobj (439; 0% instances), parataxis (402; 0% instances), amod (291; 0% instances), ccomp (213; 0% instances), acl (208; 0% instances), advcl (171; 0% instances), csubj (100; 0% instances), dislocated (91; 0% instances), vocative (71; 0% instances), list (67; 0% instances), flat:foreign (64; 0% instances), compound:redup (58; 0% instances), xcomp (27; 0% instances), nsubj:pass (6; 0% instances), orphan (4; 0% instances), csubj:outer (2; 0% instances)
Parents of NOUN nodes belong to 13 different parts of speech: VERB (76666; 62% instances), NOUN (34146; 28% instances), (5707; 5% instances), NUM (3191; 3% instances), PROPN (2205; 2% instances), PART (925; 1% instances), PRON (122; 0% instances), AUX (51; 0% instances), ADV (44; 0% instances), ADP (3; 0% instances), INTJ (2; 0% instances), SYM (2; 0% instances), SCONJ (1; 0% instances)
61050 (50%) NOUN nodes are leaves.
43121 (35%) NOUN nodes have one child.
13319 (11%) NOUN nodes have two children.
5575 (5%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 27.
Children of NOUN nodes are attached using 35 different relations: nmod (27428; 31% instances), amod (12814; 14% instances), case (10081; 11% instances), conj (7313; 8% instances), compound (5226; 6% instances), det (5012; 6% instances), flat (4785; 5% instances), nummod (4310; 5% instances), nsubj (3140; 4% instances), cop (2211; 2% instances), discourse:sp (2117; 2% instances), acl (1835; 2% instances), advmod (1069; 1% instances), csubj (385; 0% instances), cc (372; 0% instances), mark (218; 0% instances), obl:tmod (175; 0% instances), obl (112; 0% instances), discourse (96; 0% instances), nsubj:outer (86; 0% instances), parataxis (83; 0% instances), aux (76; 0% instances), list (71; 0% instances), advcl (69; 0% instances), flat:foreign (64; 0% instances), obl:lmod (62; 0% instances), compound:redup (56; 0% instances), clf (36; 0% instances), expl (34; 0% instances), dislocated (32; 0% instances), flat:vv (31; 0% instances), fixed (15; 0% instances), vocative (9; 0% instances), csubj:outer (2; 0% instances), nsubj:pass (1; 0% instances)
Children of NOUN nodes belong to 13 different parts of speech: NOUN (34146; 38% instances), VERB (14217; 16% instances), PROPN (13195; 15% instances), PRON (5723; 6% instances), SCONJ (5649; 6% instances), ADP (4702; 5% instances), NUM (4516; 5% instances), PART (3003; 3% instances), AUX (2293; 3% instances), ADV (1805; 2% instances), CCONJ (161; 0% instances), INTJ (11; 0% instances), SYM (5; 0% instances)