Statistics of NUM in UD_Chinese-GSDSimp

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: `NUM`

There are 1254 NUM lemmas (6%), 1254 NUM types (6%) and 6659 NUM tokens (5%). Out of 16 observed tags, the rank of NUM is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent NUM lemmas: 一、两、三、 1、第一、 3、 12、 5、 2、 8

The 10 most frequent NUM types: 一、两、三、 1、第一、 3、 12、 5、 2、 8

The 10 most frequent ambiguous lemmas: 一 (NUM 1124, NOUN 1), 第一 (NUM 117, ADJ 1, PROPN 1), 多 (NUM 83, ADV 28, ADJ 16, PART 3), 双 (NUM 35, NOUN 1), 很多 (NUM 33, ADJ 4), 单 (NUM 26, PART 2), 半 (NUM 24, PART 6), 数 (NUM 22, PART 15), 九 (NUM 16, PROPN 2), 众多 (ADJ 8, NUM 8)

The 10 most frequent ambiguous types: 一 (NUM 1124, NOUN 1), 第一 (NUM 117, ADJ 1, PROPN 1), 多 (NUM 83, ADV 28, ADJ 16, PART 3), 双 (NUM 35, NOUN 1), 很多 (NUM 33, ADJ 4), 单 (NUM 26, PART 2), 半 (NUM 24, PART 6), 数 (NUM 22, PART 15), 九 (NUM 16, PROPN 2), 众多 (ADJ 8, NUM 8)

一
- NUM 1124: 其测试包含了美术治疗法，认知行为治疗和洞察疗法，同时给行为分析提供了一个理论性的交流平台。
- NOUN 1: 这一修正案涉及公民权利和平等法律保护，最初提出是为了解决南北战争后昔日奴隶的相关问题。
第一
- NUM 117: 北京站是当时中国大陆规模最大、设备最先进的铁路车站，也是第一个现代化大型铁路客运站。
- ADJ 1: KKR 的资本募集主要局限于一小部分投资者，这其中就包括希尔曼（ Hillman ）家族和第一芝加哥银行。
- PROPN 1: 而此前《第一财经日报》的报道称中国的油价已经高于美国，在这些问题下人们开始疑问为什么中国油价会如此之高。
多
- NUM 83: 而且学校的伙食和住宿条件也多年遭到在校生的诟病。
- ADV 28: 近年来，肯亚女子长距离田径项目也开始崭露头角，而这些女运动员们也多为卡伦金人。
- ADJ 16: 后来卡通造型的桑德斯上校（由演员 Randy Quaid 配音），出现在越来越多的肯德基广告中。
- PART 3: 研讨会和讲座可以享用多媒体演示、视频会议和同时由数个不同地点的通讯的设备的支持。
双
- NUM 35: 实际上的双筒望远镜当然多少有些误差。
- NOUN 1: 而复写眼持有者因为这双特殊的眼睛可以直接跳过这一步，在其后的学习中也要比一般修习者快上数倍。
很多
- NUM 33: 由于这次失事原因涉及很多敏感的争议性，因此最后仍未有一个具体及统一的事故调查报告。
- ADJ 4: 在很多城市，罗素被扣上异端的帽子，随之而来的批评家数量也是直线上升。
单
- NUM 26: 一般来说，同一款间格的单边单位比非单边的呎价约贵 20% 。
- PART 2: 各地的分部办事处之前会准备好足够的邀请单，以便在发放时给住户。
半
- NUM 24: 半腰座椅，亦称半腰位、半截座椅，铁路车辆座位的一种。
- PART 6: 她的姥姥讲俄语，并且是半俄国血统半威尔士血统。
数
- NUM 22: 研讨会和讲座可以享用多媒体演示、视频会议和同时由数个不同地点的通讯的设备的支持。
- PART 15: 与静态酒不同，较高糖份的葡萄并不是气泡酒的上选原料，所以葡萄植株的挂果数也会比较多。
九
- NUM 16: 1920 年（大正九年） 8 月，泽田俊郎归国到仙台结婚，后偕新婚妻子再度回到满洲。
- PROPN 2: 2000 年代九广铁路计划兴建落马洲支线经过该地，引起了对该处生态影响的关注。
众多
- ADJ 8: 在众多书迷的推动下，恶灵系列在 2010 年 11 月 19 日再度以 Ghost Hunt 之名重新出版。
- NUM 8: 酒店成为众多社会名流，富商聚集的场所。

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.004572).

The 1st highest number of forms (1) was observed with the lemma “-15”: -15.

The 2nd highest number of forms (1) was observed with the lemma “-154”: -154.

The 3rd highest number of forms (1) was observed with the lemma “-300”: -300.

NUM occurs with 1 features: NumType (6658; 100% instances)

NUM occurs with 2 feature-value pairs: NumType=Card, NumType=Ord

NUM occurs with 3 feature combinations. The most frequent feature combination is NumType=Card (6257 tokens). Examples: 一、两、三、 1、 3、 12、 5、 2、 8、 10

Relations

NUM nodes are attached to their parents using 18 different relations: nummod (6237; 94% instances), obj (61; 1% instances), obl (58; 1% instances), conj (57; 1% instances), root (53; 1% instances), nmod (51; 1% instances), parataxis (44; 1% instances), nsubj (30; 0% instances), acl (12; 0% instances), nmod:tmod (12; 0% instances), appos (11; 0% instances), compound (10; 0% instances), advcl (6; 0% instances), amod (6; 0% instances), ccomp (5; 0% instances), xcomp (4; 0% instances), flat (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (6201; 93% instances), VERB (171; 3% instances), PART (95; 1% instances), NUM (72; 1% instances), (53; 1% instances), PROPN (26; 0% instances), X (24; 0% instances), ADJ (16; 0% instances), SYM (1; 0% instances)

4238 (64%) NUM nodes are leaves.

2233 (34%) NUM nodes have one child.

72 (1%) NUM nodes have two children.

116 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 24 different relations: clf (2033; 71% instances), punct (193; 7% instances), nmod (142; 5% instances), nsubj (97; 3% instances), cop (93; 3% instances), case (59; 2% instances), conj (57; 2% instances), cc (44; 2% instances), advmod (35; 1% instances), acl (22; 1% instances), det (15; 1% instances), parataxis (12; 0% instances), nummod (10; 0% instances), appos (8; 0% instances), nmod:tmod (7; 0% instances), obl (5; 0% instances), csubj (4; 0% instances), mark (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: NOUN (2233; 79% instances), PUNCT (193; 7% instances), AUX (93; 3% instances), PART (77; 3% instances), NUM (72; 3% instances), CCONJ (44; 2% instances), ADV (35; 1% instances), VERB (24; 1% instances), ADP (17; 1% instances), DET (15; 1% instances), PROPN (14; 0% instances), PRON (12; 0% instances), X (6; 0% instances), SYM (4; 0% instances), ADJ (3; 0% instances), SCONJ (2; 0% instances)

Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: NUM

Morphology

Relations

Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: `NUM`