home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NOUN

There are 3800 NOUN lemmas (28%), 3857 NOUN types (28%) and 123065 NOUN tokens (28%). Out of 14 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: 王、 人、 子、 君、 天、 國、 下、 臣、 公、 上

The 10 most frequent NOUN types: 王、 人、 子、 君、 天、 國、 下、 臣、 公、 上

The 10 most frequent ambiguous lemmas: 王 (NOUN 4834, PROPN 348, VERB 100), 子 (NOUN 3054, PRON 417, VERB 28, PROPN 8, PART 2, NUM 1), 君 (NOUN 2442, PROPN 23, VERB 23), 天 (NOUN 2196, VERB 6), 國 (NOUN 1870, PROPN 7, VERB 2), 下 (NOUN 1822, VERB 335, ADV 7), 臣 (NOUN 1752, VERB 60, PROPN 5, ADV 2), 公 (NOUN 1420, VERB 43, PROPN 14, ADV 8, PRON 1), 上 (NOUN 1409, VERB 181, ADV 29), 兵 (NOUN 1278, VERB 1)

The 10 most frequent ambiguous types: 王 (NOUN 4834, PROPN 348, VERB 100), 子 (NOUN 3054, PRON 417, VERB 28, PROPN 8, PART 2, NUM 1), 君 (NOUN 2442, PROPN 23, VERB 23), 天 (NOUN 2196, VERB 6), 國 (NOUN 1870, PROPN 7, VERB 2), 下 (NOUN 1822, VERB 335, ADV 7), 臣 (NOUN 1752, VERB 60, PROPN 5, ADV 2), 公 (NOUN 1420, VERB 43, PROPN 14, ADV 8, PRON 1), 上 (NOUN 1409, VERB 181, ADV 29), 兵 (NOUN 1278, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.015000 (the average of all parts of speech is 1.013130).

The 1st highest number of forms (2) was observed with the lemma “內”: 內, 内.

The 2nd highest number of forms (2) was observed with the lemma “冰”: 冰, 氷.

The 3rd highest number of forms (2) was observed with the lemma “勳”: 勛, 勳.

NOUN occurs with 3 features: Case (32519; 26% instances), NounType (913; 1% instances), Degree (1; 0% instances)

NOUN occurs with 4 feature-value pairs: Case=Loc, Case=Tem, Degree=Pos, NounType=Clf

NOUN occurs with 5 feature combinations. The most frequent feature combination is _ (89632 tokens). Examples: 王、 人、 子、 君、 國、 臣、 公、 兵、 事、 帝

Relations

NOUN nodes are attached to their parents using 27 different relations: obj (36544; 30% instances), nsubj (29531; 24% instances), nmod (22803; 19% instances), conj (7535; 6% instances), root (5705; 5% instances), obl:tmod (4058; 3% instances), obl (3480; 3% instances), obl:lmod (3473; 3% instances), flat (3379; 3% instances), clf (2067; 2% instances), compound (1605; 1% instances), nsubj:outer (676; 1% instances), iobj (438; 0% instances), parataxis (402; 0% instances), amod (291; 0% instances), ccomp (212; 0% instances), acl (208; 0% instances), advcl (171; 0% instances), csubj (101; 0% instances), dislocated (91; 0% instances), vocative (71; 0% instances), list (67; 0% instances), flat:foreign (64; 0% instances), compound:redup (58; 0% instances), xcomp (27; 0% instances), nsubj:pass (6; 0% instances), csubj:outer (2; 0% instances)

Parents of NOUN nodes belong to 13 different parts of speech: VERB (76675; 62% instances), NOUN (34144; 28% instances), (5705; 5% instances), NUM (3190; 3% instances), PROPN (2201; 2% instances), PART (925; 1% instances), PRON (122; 0% instances), AUX (51; 0% instances), ADV (44; 0% instances), ADP (3; 0% instances), INTJ (2; 0% instances), SYM (2; 0% instances), SCONJ (1; 0% instances)

61052 (50%) NOUN nodes are leaves.

43121 (35%) NOUN nodes have one child.

13320 (11%) NOUN nodes have two children.

5572 (5%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 27.

Children of NOUN nodes are attached using 35 different relations: nmod (27426; 31% instances), amod (12815; 14% instances), case (10081; 11% instances), conj (7311; 8% instances), compound (5226; 6% instances), det (5013; 6% instances), flat (4785; 5% instances), nummod (4310; 5% instances), nsubj (3138; 4% instances), cop (2211; 2% instances), discourse:sp (2116; 2% instances), acl (1832; 2% instances), advmod (1068; 1% instances), csubj (387; 0% instances), cc (372; 0% instances), mark (218; 0% instances), obl:tmod (174; 0% instances), obl (113; 0% instances), discourse (96; 0% instances), nsubj:outer (87; 0% instances), parataxis (83; 0% instances), aux (76; 0% instances), list (71; 0% instances), advcl (67; 0% instances), flat:foreign (64; 0% instances), obl:lmod (62; 0% instances), compound:redup (56; 0% instances), clf (37; 0% instances), expl (34; 0% instances), dislocated (32; 0% instances), flat:vv (31; 0% instances), fixed (15; 0% instances), vocative (9; 0% instances), csubj:outer (2; 0% instances), nsubj:pass (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (34144; 38% instances), VERB (14214; 16% instances), PROPN (13195; 15% instances), PRON (5723; 6% instances), SCONJ (5649; 6% instances), ADP (4702; 5% instances), NUM (4516; 5% instances), PART (3001; 3% instances), AUX (2293; 3% instances), ADV (1805; 2% instances), CCONJ (161; 0% instances), INTJ (11; 0% instances), SYM (5; 0% instances)