home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-Kyoto: POS Tags: NUM

There are 97 NUM lemmas (2%), 97 NUM types (2%) and 2337 NUM tokens (2%). Out of 13 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 10 in number of tokens.

The 10 most frequent NUM lemmas: 三、 一、 五、 百、 四、 二、 九、 六、 七、 十

The 10 most frequent NUM types: 三、 一、 五、 百、 四、 二、 九、 六、 七、 十

The 10 most frequent ambiguous lemmas: 一 (NUM 314, VERB 8), 二 (NUM 140, VERB 2), 萬 (NUM 53, PROPN 25), 仲 (PROPN 30, NUM 9, NOUN 3), 季 (PROPN 42, NUM 7, NOUN 3), 丁 (NUM 6, PROPN 1), 兆 (NOUN 6, NUM 5, VERB 3, ADV 1), 孟 (PROPN 27, NOUN 5, NUM 4), 甲 (NOUN 14, NUM 4), 辛 (NUM 4, NOUN 2, VERB 2, PROPN 1)

The 10 most frequent ambiguous types: 一 (NUM 314, VERB 8), 二 (NUM 140, VERB 2), 萬 (NUM 53, PROPN 25), 仲 (PROPN 30, NUM 9, NOUN 3), 季 (PROPN 42, NUM 7, NOUN 3), 丁 (NUM 6, PROPN 1), 兆 (NOUN 6, NUM 5, VERB 3, ADV 1), 孟 (PROPN 27, NOUN 5, NUM 4), 甲 (NOUN 14, NUM 4), 辛 (NUM 4, NOUN 2, VERB 2, PROPN 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.002166).

The 1st highest number of forms (1) was observed with the lemma “一”: 一.

The 2nd highest number of forms (1) was observed with the lemma “一百”: 一百.

The 3rd highest number of forms (1) was observed with the lemma “丁”: 丁.

NUM occurs with 1 features: NumType (34; 1% instances)

NUM occurs with 1 feature-value pairs: NumType=Ord

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (2303 tokens). Examples: 三、 一、 五、 百、 四、 二、 九、 六、 七、 十

Relations

NUM nodes are attached to their parents using 16 different relations: nummod (1557; 67% instances), root (319; 14% instances), obj (140; 6% instances), nsubj (134; 6% instances), conj (68; 3% instances), compound (54; 2% instances), acl (30; 1% instances), obl (13; 1% instances), flat (11; 0% instances), advcl (3; 0% instances), dislocated (3; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (1289; 55% instances), VERB (505; 22% instances), (319; 14% instances), PART (121; 5% instances), NUM (85; 4% instances), PROPN (12; 1% instances), AUX (3; 0% instances), PRON (3; 0% instances)

1663 (71%) NUM nodes are leaves.

289 (12%) NUM nodes have one child.

294 (13%) NUM nodes have two children.

91 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.

Children of NUM nodes are attached using 25 different relations: clf (448; 38% instances), nsubj (290; 24% instances), conj (89; 7% instances), case (77; 6% instances), nmod (50; 4% instances), discourse:sp (38; 3% instances), cc (30; 3% instances), csubj (26; 2% instances), advmod (25; 2% instances), det (23; 2% instances), cop (15; 1% instances), obj (13; 1% instances), nummod (12; 1% instances), flat (11; 1% instances), amod (8; 1% instances), acl (5; 0% instances), dislocated (5; 0% instances), obl:tmod (5; 0% instances), advcl (3; 0% instances), discourse (3; 0% instances), list (3; 0% instances), obl (3; 0% instances), obl:lmod (3; 0% instances), aux (2; 0% instances), nsubj:pass (1; 0% instances)

Children of NUM nodes belong to 11 different parts of speech: NOUN (745; 63% instances), PART (100; 8% instances), NUM (85; 7% instances), SCONJ (67; 6% instances), VERB (62; 5% instances), ADP (35; 3% instances), PRON (34; 3% instances), ADV (26; 2% instances), AUX (17; 1% instances), CCONJ (9; 1% instances), PROPN (8; 1% instances)