home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NUM

There are 193 NUM lemmas (2%), 193 NUM types (2%) and 4616 NUM tokens (2%). Out of 13 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent NUM lemmas: 三、 一、 五、 二、 四、 百、 六、 十、 九、 七

The 10 most frequent NUM types: 三、 一、 五、 二、 四、 百、 六、 十、 九、 七

The 10 most frequent ambiguous lemmas: 一 (NUM 600, VERB 11), 二 (NUM 337, VERB 2), 百 (NUM 323, PROPN 1), 萬 (NUM 126, PROPN 28), 丁 (NUM 11, PROPN 9, NOUN 1), 仲 (PROPN 52, NUM 9, NOUN 3), 季 (PROPN 73, NUM 7, NOUN 5), 兆 (NOUN 8, NUM 6, VERB 3, ADV 1, PROPN 1), 己 (PRON 137, NUM 5), 孟 (PROPN 42, NOUN 6, NUM 4)

The 10 most frequent ambiguous types: 一 (NUM 600, VERB 11), 二 (NUM 337, VERB 2), 百 (NUM 323, PROPN 1), 萬 (NUM 126, PROPN 28), 丁 (NUM 11, PROPN 9, NOUN 1), 仲 (PROPN 52, NUM 9, NOUN 3), 季 (PROPN 73, NUM 7, NOUN 5), 兆 (NOUN 8, NUM 6, VERB 3, ADV 1, PROPN 1), 己 (PRON 137, NUM 5), 孟 (PROPN 42, NOUN 6, NUM 4)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.011910).

The 1st highest number of forms (1) was observed with the lemma “一”: 一.

The 2nd highest number of forms (1) was observed with the lemma “一十”: 一十.

The 3rd highest number of forms (1) was observed with the lemma “一十七”: 一十七.

NUM occurs with 1 features: NumType (55; 1% instances)

NUM occurs with 1 feature-value pairs: NumType=Ord

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (4561 tokens). Examples: 三、 一、 五、 二、 四、 百、 六、 十、 九、 七

Relations

NUM nodes are attached to their parents using 19 different relations: nummod (2982; 65% instances), root (872; 19% instances), obj (265; 6% instances), nsubj (205; 4% instances), conj (104; 2% instances), compound (84; 2% instances), acl (30; 1% instances), obl (26; 1% instances), flat (17; 0% instances), ccomp (8; 0% instances), advcl (5; 0% instances), obl:tmod (4; 0% instances), csubj (3; 0% instances), dislocated (3; 0% instances), parataxis (3; 0% instances), iobj (2; 0% instances), clf (1; 0% instances), list (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (2532; 55% instances), VERB (888; 19% instances), (872; 19% instances), PART (141; 3% instances), NUM (140; 3% instances), PROPN (34; 1% instances), PRON (4; 0% instances), AUX (3; 0% instances), ADV (2; 0% instances)

2939 (64%) NUM nodes are leaves.

791 (17%) NUM nodes have one child.

652 (14%) NUM nodes have two children.

234 (5%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 27 different relations: clf (1117; 39% instances), nsubj (664; 23% instances), csubj (200; 7% instances), conj (193; 7% instances), nmod (177; 6% instances), case (94; 3% instances), advmod (64; 2% instances), discourse:sp (60; 2% instances), amod (46; 2% instances), cc (39; 1% instances), det (37; 1% instances), nummod (31; 1% instances), parataxis (26; 1% instances), cop (23; 1% instances), flat (16; 1% instances), obj (16; 1% instances), obl:tmod (16; 1% instances), acl (14; 0% instances), advcl (6; 0% instances), dislocated (6; 0% instances), obl (5; 0% instances), obl:lmod (5; 0% instances), discourse (3; 0% instances), list (3; 0% instances), aux (2; 0% instances), mark (2; 0% instances), nsubj:pass (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NOUN (1788; 62% instances), VERB (331; 12% instances), PART (274; 10% instances), NUM (140; 5% instances), SCONJ (85; 3% instances), ADV (83; 3% instances), PRON (49; 2% instances), ADP (45; 2% instances), PROPN (34; 1% instances), AUX (25; 1% instances), CCONJ (12; 0% instances)