Statistics of NOUN in UD


This page pertains to UD version 2.


Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: `NOUN`

There are 8125 NOUN lemmas (36%), 8126 NOUN types (36%) and 34044 NOUN tokens (28%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
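The three counts above can be reproduced from the LEMMA, FORM and UPOS columns of a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the `TOKENS` list is a hypothetical toy sample, and `pos_stats` and `rank` are illustrative helper names.

```python
from collections import defaultdict

# Hypothetical toy tokens as (FORM, LEMMA, UPOS) triples; in a real run these
# would be read from columns 2-4 of the treebank's CoNLL-U files.
TOKENS = [
    ("人", "人", "NOUN"),
    ("人们", "人", "NOUN"),
    ("年", "年", "NOUN"),
    ("年", "年", "NOUN"),
    ("来", "来", "VERB"),
    ("年", "年", "PART"),
]

def pos_stats(tokens):
    """Return {upos: (distinct lemmas, distinct word forms, token count)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    counts = defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)
        types[upos].add(form)
        counts[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), counts[u]) for u in counts}

def rank(stats, upos, index):
    """1-based rank of `upos` by one statistic (0=lemmas, 1=types, 2=tokens)."""
    ordered = sorted(stats, key=lambda u: stats[u][index], reverse=True)
    return ordered.index(upos) + 1

stats = pos_stats(TOKENS)
print(stats["NOUN"])           # (lemmas, types, tokens) for NOUN
print(rank(stats, "NOUN", 2))  # rank of NOUN by token count
```

In the toy sample the lemma 人 contributes two types (人 and 人们), which is why the type count can exceed the lemma count.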

The 10 most frequent NOUN lemmas: 年、个、月、人、日、等、种、次、人口、名

The 10 most frequent NOUN types: 年、个、月、日、人、等、种、次、人口、名

The 10 most frequent ambiguous lemmas: 年 (NOUN 1558, PART 6), 月 (NOUN 604, PART 1), 人 (NOUN 385, PART 240, VERB 1), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 等 (NOUN 231, VERB 4, PART 1), 种 (NOUN 187, PART 5, VERB 1), 次 (NOUN 149, VERB 4, PART 3, NUM 1), 名 (NOUN 128, PART 6, VERB 3), 大学 (NOUN 120, PROPN 1), 世界 (NOUN 107, PROPN 1)

The 10 most frequent ambiguous types: 年 (NOUN 1558, PART 6), 月 (NOUN 604, PART 1), 日 (NOUN 382, PROPN 53, PART 7, NUM 2), 人 (NOUN 365, PART 240, VERB 1), 等 (NOUN 231, VERB 3, PART 1), 种 (NOUN 187, PART 5, VERB 1), 次 (NOUN 149, VERB 4, PART 3, NUM 1), 名 (NOUN 128, PART 6, VERB 3), 大学 (NOUN 120, PROPN 1), 世界 (NOUN 107, PROPN 1)

年
- NOUN 1558: 1355 年，勃兰登堡被神圣罗马帝国皇帝查理四世升为选侯国。
- PART 6: 1961 年 3 月 3 日，奇尔沙治号再次前往西太平洋巡航。
月
- NOUN 604: 五月二十一日，努尔哈赤出城迎接前来沈阳的科尔沁部奥巴贝勒。
- PART 1: 1961 年 3 月 3 日，奇尔沙治号再次前往西太平洋巡航。
日
- NOUN 382: 五月二十一日，努尔哈赤出城迎接前来沈阳的科尔沁部奥巴贝勒。
- PROPN 53: 虽则日语存在 “ 二段音阶 ” 的变化，但只凭这二音变化，也没有协音的可能。
- PART 7: 1961 年 3 月 3 日，奇尔沙治号再次前往西太平洋巡航。
- NUM 2: 经过进一步改良后，由 2009 年 8 月起，每逢星期日会有两列韩制列车行走将军澳线。
人
- NOUN 365: 总面积 24.44 平方公里，人口 3108 人，人口密度 127.2 人 / 平方公里（ 2009 年）。
- PART 240: 随后爱斯基摩人和维京人相继定居于此。
- VERB 1: 1920 年 11 月 21 日，柯林斯的小队在都柏林的不同地区干掉了 18 个英国特工（人称 “ 开罗帮 ” ）。
等
- NOUN 231: 南京其他重要的水体还包括从六合区流过的滁河、高淳的固城湖、溧水的石臼湖等。
- VERB 3: 当电流等于零时，量度截止电压，就可以得到光电子的最大动能。
- PART 1: NX-01 最初的武器装备只有等离子炮与氚核鱼雷，并由电磁极化船甲保护。
种
- NOUN 187: 半腰座椅，亦称半腰位、半截座椅，铁路车辆座位的一种。
- PART 5: 岛蚺是模里西斯的地方特有种，因此只分布于当地一带。
- VERB 1: 有些种的花是两性同花的，有些是单性的。
次
- NOUN 149: 古巨基于 2006 年度得到四台联颁音乐大奖歌曲大奖成为继陈慧琳之后连续夺得最多次歌曲奖的歌手。
- VERB 4: 根据美国舆论调查指出，当时隆美尔是美国仅次于希特勒的知名度最高的德国人。
- PART 3: 这个结果来自于次文化成员多样化的喜好。
- NUM 1: 2 月 2 日，中日代表在外交部迎宾馆开始极端秘密的会谈，中方代表是外交部长陆征祥和次长曹汝霖。
名
- NOUN 128: 他每天赶着马车到灾区逐村收养灾童，总人数近 800 名。
- PART 6: 国际天文联会（ IAU ）将为该组卫星保留因纽特神话名。
- VERB 3: 杜兰戈，又名杜兰戈维多利亚，杜兰戈城，为墨西哥中部杜兰戈州的首府。
大学
- NOUN 120: 该部位于北京大学医学部逸夫楼 7 楼。
- PROPN 1: 其校区背靠马鞍山麓，并被西洋坪路、大学西路、学府路这三条公路所包围。
世界
- NOUN 107: 由于加拿大在二战后签署了联合国世界人权宣言，加拿大政府必须废除与宣言抵触的排华法案。
- PROPN 1: 2006 年，第18 届世界杯足球赛在慕尼黑的专业足球场安联球场开幕。
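
An ambiguous type, as listed above, is a word form that occurs with NOUN and with at least one other tag. A minimal sketch of how such a list could be derived is shown here. The `TOKENS` sample is hypothetical, and ordering by frequency under the focus tag is an assumption that happens to match the order of the list on this page.

```python
from collections import Counter, defaultdict

# Hypothetical toy (FORM, UPOS) pairs, not taken from the treebank.
TOKENS = [
    ("年", "NOUN"), ("年", "NOUN"), ("年", "PART"),
    ("人", "NOUN"), ("人", "PART"), ("人", "PART"),
    ("月", "NOUN"),
]

def ambiguous_types(tokens, focus="NOUN"):
    """List forms tagged `focus` at least once and another tag at least once."""
    tags_by_form = defaultdict(Counter)
    for form, upos in tokens:
        tags_by_form[form][upos] += 1
    hits = [
        (form, tags.most_common())
        for form, tags in tags_by_form.items()
        if focus in tags and len(tags) > 1
    ]
    # Assumed ordering: most frequent under the focus tag first.
    hits.sort(key=lambda hit: tags_by_form[hit[0]][focus], reverse=True)
    return hits

for form, tags in ambiguous_types(TOKENS):
    print(form, ", ".join(f"{tag} {n}" for tag, n in tags))
```

Replacing the form with the lemma in each pair would give the ambiguous lemmas instead.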

Morphology

The form / lemma ratio of NOUN is 1.000123 (the average of all parts of speech is 1.004572).
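
The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given earlier on this page. The check below assumes that definition; the generator's exact formula is not stated here.

```python
# Counts taken from this page; the definition ratio = types / lemmas is an assumption.
noun_types = 8126
noun_lemmas = 8125

ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")
```

A ratio this close to 1 means that almost every NOUN lemma appears in a single form.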

The 1st highest number of forms (2) was observed with the lemma “人”: 人, 人们.

The 2nd highest number of forms (1) was observed with the lemma “m”: m.

The 3rd highest number of forms (1) was observed with the lemma “n=1”: n=1.

NOUN occurs with 1 feature: Number (20; 0% instances)

NOUN occurs with 1 feature-value pair: Number=Plur

NOUN occurs with 2 feature combinations. The most frequent feature combination is _, i.e. no features (34024 tokens). Examples: 年、个、月、日、人、等、种、次、人口、名
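
Feature statistics of this kind come from the FEATS column of CoNLL-U, where `_` marks an empty feature set and `|` separates `Name=Value` pairs. The sketch below counts feature combinations and individual features over a hypothetical column; `parse_feats` and `FEATS_COLUMN` are illustrative names.

```python
from collections import Counter

def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a dict; "_" means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values of four NOUN tokens.
FEATS_COLUMN = ["_", "_", "Number=Plur", "_"]

# Each distinct FEATS string is one feature combination.
combinations = Counter(FEATS_COLUMN)
# Count how many tokens carry each feature name.
features = Counter(name for feats in FEATS_COLUMN for name in parse_feats(feats))

print(combinations.most_common())
print(features.most_common())
```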

Relations

NOUN nodes are attached to their parents using 27 different relations: nmod (9827; 29% instances), obj (5675; 17% instances), nsubj (5518; 16% instances), obl (2581; 8% instances), clf (2247; 7% instances), compound (1954; 6% instances), conj (1659; 5% instances), nmod:tmod (1555; 5% instances), acl (571; 2% instances), root (571; 2% instances), appos (515; 2% instances), parataxis (398; 1% instances), advcl (225; 1% instances), ccomp (193; 1% instances), nsubj:pass (159; 0% instances), obl:patient (141; 0% instances), xcomp (78; 0% instances), iobj (48; 0% instances), obl:agent (44; 0% instances), csubj (34; 0% instances), acl:relcl (15; 0% instances), amod (10; 0% instances), dislocated (10; 0% instances), nsubj:outer (6; 0% instances), nummod (6; 0% instances), case (2; 0% instances), orphan (2; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech, counting the untagged artificial root as one: VERB (14740; 43% instances), NOUN (11231; 33% instances), PART (3706; 11% instances), NUM (2233; 7% instances), ADJ (761; 2% instances), artificial root (571; 2% instances), PROPN (450; 1% instances), DET (216; 1% instances), X (65; 0% instances), ADP (36; 0% instances), PRON (17; 0% instances), ADV (14; 0% instances), SYM (3; 0% instances), AUX (1; 0% instances)
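
Both distributions above can be read off the HEAD and DEPREL columns: the relation is the token's own DEPREL, and the parent's part of speech is the UPOS of the token that HEAD points to. A minimal sketch follows, using a hypothetical four-token sentence whose annotation is invented for illustration. `"ROOT"` is a placeholder label for HEAD = 0, which has no tag.

```python
from collections import Counter

# Hypothetical toy sentence as (ID, FORM, UPOS, HEAD, DEPREL) tuples.
SENTENCE = [
    (1, "三", "NUM", 3, "nummod"),
    (2, "个", "NOUN", 1, "clf"),
    (3, "人", "NOUN", 4, "nsubj"),
    (4, "来", "VERB", 0, "root"),
]

def parent_stats(sentence, focus="NOUN"):
    """Count incoming relations and parent UPOS tags of `focus` nodes."""
    upos_by_id = {tid: upos for tid, _, upos, _, _ in sentence}
    relations, parent_pos = Counter(), Counter()
    for _, _, upos, head, deprel in sentence:
        if upos != focus:
            continue
        relations[deprel] += 1
        # HEAD 0 is the artificial root, which carries no UPOS tag.
        parent_pos[upos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

relations, parent_pos = parent_stats(SENTENCE)
print(relations.most_common())
print(parent_pos.most_common())
```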

14838 (44%) NOUN nodes are leaves.

8112 (24%) NOUN nodes have one child.

5403 (16%) NOUN nodes have two children.

5691 (17%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 14.
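
The four child-count figures above add up to the 34044 NOUN tokens (14838 + 8112 + 5403 + 5691). A sketch of how such a breakdown could be computed: count how often each token ID appears as a HEAD, then bucket the NOUN nodes by that count. The sentence is a hypothetical toy example and `child_degree_histogram` is an illustrative name.

```python
from collections import Counter

# Hypothetical toy sentence as (ID, FORM, UPOS, HEAD, DEPREL) tuples.
SENTENCE = [
    (1, "三", "NUM", 3, "nummod"),
    (2, "个", "NOUN", 1, "clf"),
    (3, "人", "NOUN", 4, "nsubj"),
    (4, "来", "VERB", 0, "root"),
]

def child_degree_histogram(sentence, focus="NOUN"):
    """Bucket `focus` nodes by number of children; bucket 3 means three or more."""
    n_children = Counter(head for _, _, _, head, _ in sentence)
    buckets = Counter()
    max_degree = 0
    for tid, _, upos, _, _ in sentence:
        if upos != focus:
            continue
        degree = n_children[tid]
        max_degree = max(max_degree, degree)
        buckets[min(degree, 3)] += 1
    return buckets, max_degree

buckets, max_degree = child_degree_histogram(SENTENCE)
print(dict(buckets), max_degree)

# Consistency check on the figures reported on this page.
print(14838 + 8112 + 5403 + 5691)
```

Token IDs restart in every sentence, so a full-treebank run would apply the function per sentence and sum the buckets.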

Children of NOUN nodes are attached using 32 different relations: nmod (12921; 31% instances), nummod (6104; 15% instances), case (5958; 14% instances), punct (4637; 11% instances), amod (1793; 4% instances), conj (1648; 4% instances), acl:relcl (1518; 4% instances), det (1319; 3% instances), cop (1165; 3% instances), nsubj (1110; 3% instances), cc (995; 2% instances), appos (866; 2% instances), acl (501; 1% instances), parataxis (360; 1% instances), advmod (208; 1% instances), mark (82; 0% instances), advcl (71; 0% instances), obl (68; 0% instances), csubj (60; 0% instances), nmod:tmod (54; 0% instances), dislocated (36; 0% instances), compound (33; 0% instances), ccomp (20; 0% instances), xcomp (17; 0% instances), mark:rel (15; 0% instances), obj (12; 0% instances), aux (10; 0% instances), discourse (8; 0% instances), nsubj:outer (2; 0% instances), orphan (2; 0% instances), mark:adv (1; 0% instances), obl:patient (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (11231; 27% instances), NUM (6201; 15% instances), PUNCT (4637; 11% instances), PART (4381; 11% instances), PROPN (3460; 8% instances), ADP (3409; 8% instances), VERB (2114; 5% instances), ADJ (1726; 4% instances), AUX (1176; 3% instances), DET (1128; 3% instances), CCONJ (992; 2% instances), PRON (588; 1% instances), X (255; 1% instances), ADV (207; 0% instances), SCONJ (86; 0% instances), SYM (4; 0% instances)
