home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-CFL: POS Tags: NOUN

There are 612 NOUN lemmas (37%), 612 NOUN types (37%) and 1352 NOUN tokens (19%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: 个、 人、 时候、 天、 次、 朋友、 自行车、 旅行、 事、 时间

The 10 most frequent NOUN types: 个、 人、 时候、 天、 次、 朋友、 自行车、 旅行、 事、 时间

The 10 most frequent ambiguous lemmas: 个 (NOUN 85, DET 1), 人 (NOUN 42, PRON 1), 旅行 (NOUN 16, VERB 10), 生活 (NOUN 13, VERB 2), 以前 (NOUN 5, ADP 2), 以后 (ADP 13, NOUN 4), 上 (ADP 28, VERB 10, NOUN 3), 后来 (NOUN 3, ADV 1), 下 (VERB 6, NOUN 2, ADP 1), 之前 (ADP 3, NOUN 2)

The 10 most frequent ambiguous types: 个 (NOUN 85, DET 1), 人 (NOUN 42, PRON 1), 旅行 (NOUN 16, VERB 10), 生活 (NOUN 13, VERB 2), 以前 (NOUN 5, ADP 2), 印象 (NOUN 5, VERB 1), 以后 (ADP 13, NOUN 4), 上 (ADP 28, VERB 10, NOUN 3), 后来 (NOUN 3, ADV 1), 下 (VERB 6, NOUN 2, ADP 1)

Morphology

The form / lemma ratio of NOUN is 1.000000 (the average of all parts of speech is 1.001198).

The 1st highest number of forms (1) was observed with the lemma “12月”: 12月.

The 2nd highest number of forms (1) was observed with the lemma “2012年”: 2012年.

The 3rd highest number of forms (1) was observed with the lemma “2013年”: 2013年.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 32 different relations: obj (396; 29% instances), nsubj (192; 14% instances), obl:tmod (133; 10% instances), clf (127; 9% instances), nmod (111; 8% instances), obl (111; 8% instances), conj (43; 3% instances), compound (38; 3% instances), root (33; 2% instances), compound:vo (25; 2% instances), appos (17; 1% instances), advcl (16; 1% instances), parataxis (15; 1% instances), advmod:df (13; 1% instances), ccomp (13; 1% instances), det (13; 1% instances), dislocated (10; 1% instances), xcomp (7; 1% instances), advmod (6; 0% instances), dep (6; 0% instances), obl:patient (5; 0% instances), acl (4; 0% instances), obl:agent (4; 0% instances), amod (3; 0% instances), flat (3; 0% instances), case:loc (2; 0% instances), case (1; 0% instances), cc (1; 0% instances), compound:dir (1; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances), vocative (1; 0% instances)

Parents of NOUN nodes belong to 10 different parts of speech: VERB (861; 64% instances), NOUN (231; 17% instances), NUM (90; 7% instances), ADJ (62; 5% instances), DET (33; 2% instances), PRON (33; 2% instances), (33; 2% instances), PROPN (6; 0% instances), AUX (2; 0% instances), ADP (1; 0% instances)

510 (38%) NOUN nodes are leaves.

488 (36%) NOUN nodes have one child.

224 (17%) NOUN nodes have two children.

130 (10%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 8.

Children of NOUN nodes are attached using 29 different relations: nmod (249; 17% instances), case (173; 12% instances), amod (142; 10% instances), det (132; 9% instances), nummod (131; 9% instances), punct (115; 8% instances), acl (101; 7% instances), case:loc (74; 5% instances), cop (51; 4% instances), compound (50; 3% instances), nsubj (47; 3% instances), advmod (34; 2% instances), conj (34; 2% instances), cc (31; 2% instances), parataxis (21; 1% instances), mark:rel (15; 1% instances), obl (9; 1% instances), mark (8; 1% instances), appos (6; 0% instances), discourse:sp (6; 0% instances), advcl (5; 0% instances), dep (5; 0% instances), flat (3; 0% instances), clf (2; 0% instances), obj (2; 0% instances), discourse (1; 0% instances), mark:adv (1; 0% instances), obl:tmod (1; 0% instances), xcomp (1; 0% instances)

Children of NOUN nodes belong to 14 different parts of speech: NOUN (231; 16% instances), ADP (181; 12% instances), PRON (154; 11% instances), ADJ (145; 10% instances), NUM (133; 9% instances), VERB (127; 9% instances), DET (120; 8% instances), PUNCT (116; 8% instances), PART (87; 6% instances), AUX (51; 4% instances), PROPN (35; 2% instances), ADV (32; 2% instances), CCONJ (30; 2% instances), SCONJ (8; 1% instances)