home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-TueCL: POS Tags: NOUN

There are 97 NOUN lemmas (33%), 97 NOUN types (33%) and 174 NOUN tokens (27%). Out of 13 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: 里、 天、 南、 冥、 名、 歲、 水、 上、 世、 人

The 10 most frequent NOUN types: 里、 天、 南、 冥、 名、 歲、 水、 上、 世、 人

The 10 most frequent ambiguous lemmas: 南 (NOUN 6, ADJ 1, VERB 1), 冥 (NOUN 4, VERB 2), 上 (NOUN 3, VERB 2), 後 (NOUN 3, ADV 1), 知 (VERB 7, NOUN 3), 者 (PART 15, NOUN 3), 風 (NOUN 3, VERB 1), 鵬 (NOUN 3, PROPN 1), 下 (NOUN 2, VERB 1), 今 (NOUN 2, ADV 1)

The 10 most frequent ambiguous types: 南 (NOUN 6, ADJ 1, VERB 1), 冥 (NOUN 4, VERB 2), 上 (NOUN 3, VERB 2), 後 (NOUN 3, ADV 1), 知 (VERB 7, NOUN 3), 者 (PART 15, NOUN 3), 風 (NOUN 3, VERB 1), 鵬 (NOUN 3, PROPN 1), 下 (NOUN 2, VERB 1), 今 (NOUN 2, ADV 1)

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.006873).

The 1st highest number of forms (1) was observed with the lemma “上”: 上.

The 2nd highest number of forms (1) was observed with the lemma “下”: 下.

The 3rd highest number of forms (1) was observed with the lemma “世”: 世.

NOUN occurs with 3 features: Case (62; 36% instances), NounType (11; 6% instances), Degree (6; 3% instances)

NOUN occurs with 4 feature-value pairs: Case=Loc, Case=Tem, Degree=Pos, NounType=Clf

NOUN occurs with 5 feature combinations. The most frequent feature combination is _ (95 tokens). Examples: 名、 水、 人、 知、 翼、 者、 舟、 雲、 風、 鳥

Relations

NOUN nodes are attached to their parents using 17 different relations: obj (54; 31% instances), nsubj (38; 22% instances), nmod (27; 16% instances), root (12; 7% instances), obl:tmod (8; 5% instances), parataxis (7; 4% instances), conj (5; 3% instances), obl:lmod (5; 3% instances), advcl (4; 2% instances), obl (4; 2% instances), ccomp (2; 1% instances), dislocated (2; 1% instances), flat (2; 1% instances), amod (1; 1% instances), clf (1; 1% instances), compound (1; 1% instances), csubj (1; 1% instances)

Parents of NOUN nodes belong to 10 different parts of speech: VERB (109; 63% instances), NOUN (35; 20% instances), (12; 7% instances), AUX (6; 3% instances), PROPN (4; 2% instances), ADJ (3; 2% instances), PART (2; 1% instances), CCONJ (1; 1% instances), NUM (1; 1% instances), PRON (1; 1% instances)

65 (37%) NOUN nodes are leaves.

75 (43%) NOUN nodes have one child.

26 (15%) NOUN nodes have two children.

8 (5%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 4.

Children of NOUN nodes are attached using 19 different relations: nmod (31; 20% instances), amod (24; 16% instances), nummod (22; 14% instances), case (21; 14% instances), nsubj (12; 8% instances), discourse:sp (7; 5% instances), parataxis (6; 4% instances), det (5; 3% instances), discourse (5; 3% instances), acl (4; 3% instances), advmod (3; 2% instances), flat (3; 2% instances), mark (3; 2% instances), cc (2; 1% instances), cop (2; 1% instances), compound (1; 1% instances), conj (1; 1% instances), dislocated (1; 1% instances), obl:tmod (1; 1% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (35; 23% instances), NUM (24; 16% instances), VERB (19; 12% instances), PART (17; 11% instances), ADJ (16; 10% instances), PRON (14; 9% instances), SCONJ (12; 8% instances), ADP (5; 3% instances), ADV (4; 3% instances), AUX (2; 1% instances), CCONJ (2; 1% instances), DET (2; 1% instances), PROPN (2; 1% instances)