home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Classical_Chinese-TueCL: POS Tags: NUM

There are 13 NUM lemmas (4%), 14 NUM types (5%) and 27 NUM tokens (4%). Out of 13 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent NUM lemmas: 一、 九、 千、 三、 五百、 八、 六、 萬、 三千、 二

The 10 most frequent NUM types: 一、 千、 三、 九、 九萬、 五百、 八千、 六、 萬、 三千

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 1.076923 (the average of all parts of speech is 1.006873).

The 1st highest number of forms (2) was observed with the lemma “九”: 九, 九萬.

The 2nd highest number of forms (1) was observed with the lemma “一”: 一.

The 3rd highest number of forms (1) was observed with the lemma “三”: 三.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 3 different relations: nummod (23; 85% instances), nmod (3; 11% instances), nsubj (1; 4% instances)

Parents of NUM nodes belong to 3 different parts of speech: NOUN (24; 89% instances), NUM (2; 7% instances), VERB (1; 4% instances)

21 (78%) NUM nodes are leaves.

6 (22%) NUM nodes have one child.

The highest child degree of a NUM node is 1.

Children of NUM nodes are attached using 5 different relations: amod (2; 33% instances), clf (1; 17% instances), det (1; 17% instances), nmod (1; 17% instances), nummod (1; 17% instances)

Children of NUM nodes belong to 4 different parts of speech: ADV (2; 33% instances), NUM (2; 33% instances), DET (1; 17% instances), NOUN (1; 17% instances)