home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NOUN

There are 2832 NOUN lemmas (29%), 2863 NOUN types (29%) and 69890 NOUN tokens (30%). Out of 13 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: 人、 子、 王、 天、 君、 下、 禮、 上、 帝、 公

The 10 most frequent NOUN types: 人、 子、 王、 天、 君、 下、 禮、 上、 帝、 公

The 10 most frequent ambiguous lemmas: 子 (NOUN 2315, PRON 149, VERB 19, PROPN 6, NUM 1), 王 (NOUN 1438, PROPN 178, VERB 66), 天 (NOUN 1329, VERB 6), 君 (NOUN 972, VERB 19), 下 (NOUN 950, VERB 147, ADV 6), 禮 (NOUN 899, VERB 25, ADV 1, PROPN 1), 上 (NOUN 886, VERB 110, ADV 14), 帝 (NOUN 823, VERB 4), 公 (NOUN 810, VERB 34, ADV 5, PROPN 5, PRON 1), 國 (NOUN 737, PROPN 4, VERB 1)

The 10 most frequent ambiguous types: 子 (NOUN 2315, PRON 149, VERB 19, PROPN 6, NUM 1), 王 (NOUN 1438, PROPN 178, VERB 66), 天 (NOUN 1329, VERB 6), 君 (NOUN 972, VERB 19), 下 (NOUN 950, VERB 147, ADV 6), 禮 (NOUN 900, VERB 25, ADV 1, PROPN 1), 上 (NOUN 886, VERB 110, ADV 14), 帝 (NOUN 823, VERB 4), 公 (NOUN 810, VERB 34, ADV 5, PROPN 5, PRON 1), 國 (NOUN 737, PROPN 4, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.010946 (the average of all parts of speech is 1.011910).

The 1st highest number of forms (2) was observed with the lemma “內”: 內, 内.

The 2nd highest number of forms (2) was observed with the lemma “冰”: 冰, 氷.

The 3rd highest number of forms (2) was observed with the lemma “古”: 古, 禮.

NOUN occurs with 3 features: Case (17287; 25% instances), NounType (540; 1% instances), Degree (3; 0% instances)

NOUN occurs with 4 feature-value pairs: Case=Loc, Case=Tem, Degree=Pos, NounType=Class

NOUN occurs with 5 feature combinations. The most frequent feature combination is _ (52060 tokens). Examples: 人、 子、 王、 君、 禮、 帝、 公、 民、 國、 夫

Relations

NOUN nodes are attached to their parents using 25 different relations: obj (20962; 30% instances), nsubj (16582; 24% instances), nmod (12835; 18% instances), conj (4793; 7% instances), root (3526; 5% instances), flat (2190; 3% instances), obl (2094; 3% instances), obl:tmod (2073; 3% instances), obl:lmod (1822; 3% instances), clf (1139; 2% instances), compound (672; 1% instances), iobj (234; 0% instances), parataxis (218; 0% instances), advcl (142; 0% instances), acl (132; 0% instances), ccomp (89; 0% instances), dislocated (88; 0% instances), csubj (77; 0% instances), list (66; 0% instances), amod (65; 0% instances), flat:vv (30; 0% instances), compound:redup (23; 0% instances), vocative (18; 0% instances), xcomp (14; 0% instances), nsubj:pass (6; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (42688; 61% instances), NOUN (19809; 28% instances), (3526; 5% instances), NUM (1788; 3% instances), PROPN (1335; 2% instances), PART (645; 1% instances), PRON (59; 0% instances), AUX (26; 0% instances), ADV (10; 0% instances), SYM (2; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances)

35788 (51%) NOUN nodes are leaves.

23220 (33%) NOUN nodes have one child.

7581 (11%) NOUN nodes have two children.

3301 (5%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 27.

Children of NOUN nodes are attached using 31 different relations: nmod (14861; 30% instances), amod (6661; 13% instances), case (6034; 12% instances), conj (4692; 9% instances), flat (3102; 6% instances), det (3064; 6% instances), nummod (2419; 5% instances), nsubj (2001; 4% instances), compound (1794; 4% instances), cop (1364; 3% instances), discourse:sp (1291; 3% instances), acl (792; 2% instances), advmod (759; 2% instances), csubj (267; 1% instances), cc (211; 0% instances), mark (157; 0% instances), obl:tmod (75; 0% instances), list (70; 0% instances), obl (67; 0% instances), parataxis (56; 0% instances), aux (47; 0% instances), advcl (39; 0% instances), discourse (36; 0% instances), compound:redup (23; 0% instances), obl:lmod (21; 0% instances), clf (19; 0% instances), flat:vv (18; 0% instances), dislocated (17; 0% instances), fixed (13; 0% instances), expl (10; 0% instances), vocative (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (19809; 40% instances), VERB (7512; 15% instances), PROPN (6081; 12% instances), PRON (3448; 7% instances), SCONJ (3226; 6% instances), ADP (2917; 6% instances), NUM (2532; 5% instances), PART (1894; 4% instances), AUX (1414; 3% instances), ADV (1056; 2% instances), CCONJ (82; 0% instances), INTJ (5; 0% instances), SYM (5; 0% instances)