home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NOUN

There are 1929 NOUN lemmas (35%), 1931 NOUN types (35%) and 38700 NOUN tokens (30%). Out of 13 observed tags, the rank of NOUN is: 2 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent NOUN lemmas: 人、 子、 天、 禮、 君、 民、 下、 王、 君子、 道

The 10 most frequent NOUN types: 人、 子、 天、 禮、 君、 民、 下、 王、 君子、 道

The 10 most frequent ambiguous lemmas: 子 (NOUN 1534, PRON 115, VERB 13, NUM 1), 天 (NOUN 960, VERB 4), 禮 (NOUN 816, VERB 17, ADV 1), 君 (NOUN 671, VERB 15), 下 (NOUN 524, VERB 56, ADV 6), 王 (NOUN 504, VERB 35, PROPN 10), 道 (NOUN 456, VERB 44), 夫 (NOUN 446, PART 182, PRON 32), 國 (NOUN 437, PROPN 3), 父 (NOUN 360, VERB 2)

The 10 most frequent ambiguous types: 子 (NOUN 1534, PRON 115, VERB 13, NUM 1), 天 (NOUN 960, VERB 4), 禮 (NOUN 817, VERB 17, ADV 1), 君 (NOUN 671, VERB 15), 下 (NOUN 524, VERB 56, ADV 6), 王 (NOUN 504, VERB 35, PROPN 10), 道 (NOUN 456, VERB 44), 夫 (NOUN 446, PART 182, PRON 32), 國 (NOUN 437, PROPN 3), 父 (NOUN 360, VERB 2)

Morphology

The form / lemma ratio of NOUN is 1.001037 (the average of all parts of speech is 1.002166).

The 1st highest number of forms (2) was observed with the lemma “內”: 內, 内.

The 2nd highest number of forms (2) was observed with the lemma “古”: 古, 禮.

The 3rd highest number of forms (2) was observed with the lemma “將”: 將, 牂.

NOUN occurs with 3 features: Case (8549; 22% instances), NounType (345; 1% instances), Degree (1; 0% instances)

NOUN occurs with 4 feature-value pairs: Case=Loc, Case=Tem, Degree=Pos, NounType=Class

NOUN occurs with 5 feature combinations. The most frequent feature combination is _ (29805 tokens). Examples: 人、 子、 禮、 君、 民、 王、 君子、 夫、 國、 道

Relations

NOUN nodes are attached to their parents using 25 different relations: obj (11584; 30% instances), nsubj (9962; 26% instances), nmod (6372; 16% instances), conj (2779; 7% instances), root (1849; 5% instances), obl (1367; 4% instances), flat (1207; 3% instances), obl:tmod (1066; 3% instances), obl:lmod (1047; 3% instances), clf (458; 1% instances), compound (398; 1% instances), advcl (122; 0% instances), acl (88; 0% instances), dislocated (82; 0% instances), csubj (57; 0% instances), iobj (54; 0% instances), ccomp (50; 0% instances), list (45; 0% instances), parataxis (31; 0% instances), amod (24; 0% instances), compound:redup (19; 0% instances), vocative (14; 0% instances), xcomp (11; 0% instances), flat:vv (10; 0% instances), nsubj:pass (4; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (24486; 63% instances), NOUN (10936; 28% instances), (1849; 5% instances), NUM (745; 2% instances), PART (454; 1% instances), PROPN (160; 0% instances), PRON (40; 0% instances), AUX (17; 0% instances), ADV (9; 0% instances), SYM (2; 0% instances), ADP (1; 0% instances), INTJ (1; 0% instances)

20141 (52%) NOUN nodes are leaves.

12832 (33%) NOUN nodes have one child.

4253 (11%) NOUN nodes have two children.

1474 (4%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 20.

Children of NOUN nodes are attached using 31 different relations: nmod (6605; 25% instances), case (4409; 17% instances), amod (3311; 13% instances), conj (2767; 10% instances), det (2097; 8% instances), nummod (1223; 5% instances), flat (1221; 5% instances), nsubj (1120; 4% instances), discourse:sp (965; 4% instances), compound (797; 3% instances), advmod (507; 2% instances), cop (437; 2% instances), acl (271; 1% instances), csubj (171; 1% instances), cc (145; 1% instances), mark (123; 0% instances), aux (38; 0% instances), list (37; 0% instances), parataxis (32; 0% instances), discourse (31; 0% instances), obl:tmod (27; 0% instances), obl (26; 0% instances), advcl (22; 0% instances), compound:redup (19; 0% instances), dislocated (15; 0% instances), obl:lmod (13; 0% instances), fixed (12; 0% instances), clf (9; 0% instances), expl (7; 0% instances), flat:vv (6; 0% instances), vocative (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (10936; 41% instances), VERB (3710; 14% instances), SCONJ (2510; 9% instances), PRON (2281; 9% instances), ADP (1931; 7% instances), PART (1425; 5% instances), NUM (1289; 5% instances), PROPN (1187; 4% instances), ADV (651; 2% instances), AUX (478; 2% instances), CCONJ (61; 0% instances), INTJ (3; 0% instances), SYM (2; 0% instances)