Treebank Statistics: UD_Chinese-HK: POS Tags: NUM
There are 30 NUM lemmas (2%), 30 NUM types (2%) and 177 NUM tokens (2%).
Out of 16 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.
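As a rough illustration of how lemma, type and token counts like these could be derived, the sketch below tallies them for one tag. The sentence is invented for the example and is not taken from UD_Chinese-HK, the token records are a simplified stand-in for the ten-column CoNLL-U format, and the helper name `pos_summary` is hypothetical. It also assumes that "types" means distinct word forms carrying the tag.

```python
# Simplified token records (id, form, lemma, upos, head, deprel). A real
# CoNLL-U file has ten tab-separated columns. This sentence is invented
# for illustration and is NOT taken from UD_Chinese-HK.
SAMPLE = [
    (1, "三", "三", "NUM", 3, "nummod"),
    (2, "個", "個", "NOUN", 1, "clf"),
    (3, "人", "人", "NOUN", 4, "nsubj"),
    (4, "食", "食", "VERB", 0, "root"),
    (5, "兩", "兩", "NUM", 7, "nummod"),
    (6, "碗", "碗", "NOUN", 5, "clf"),
    (7, "飯", "飯", "NOUN", 4, "obj"),
]


def pos_summary(tokens, upos):
    """Return (lemma count, type count, token count, token share in %) for one tag."""
    selected = [t for t in tokens if t[3] == upos]
    lemmas = {t[2] for t in selected}   # distinct lemmas with this tag
    types = {t[1] for t in selected}    # distinct word forms with this tag (assumption)
    share = round(100 * len(selected) / len(tokens))
    return len(lemmas), len(types), len(selected), share


num_summary = pos_summary(SAMPLE, "NUM")
noun_summary = pos_summary(SAMPLE, "NOUN")
print(num_summary, noun_summary)
```

Ranking the tags would then amount to computing this summary for every observed tag and sorting by the column of interest.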
The 10 most frequent NUM lemmas: 一、 兩、 三、 三十、 五、 十、 一百、 二、 四、 二十
The 10 most frequent NUM types: 一、 兩、 三、 三十、 五、 十、 一百、 二、 四、 二十
The 10 most frequent ambiguous lemmas: 一 (NUM 76, ADV 1), 個 (NOUN 54, NUM 1, PART 1), 幾 (ADV 1, DET 1, NUM 1)
The 10 most frequent ambiguous types: 一 (NUM 76, ADV 1), 個 (NOUN 54, NUM 1, PART 1), 幾 (ADV 1, DET 1, NUM 1)
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.007013).
The 1st highest number of forms (1) was observed with the lemma “一”: 一.
The 2nd highest number of forms (1) was observed with the lemma “一百”: 一百.
The 3rd highest number of forms (1) was observed with the lemma “七”: 七.
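The ratio above can be read as the number of distinct forms divided by the number of distinct lemmas, which is consistent with 30 types over 30 lemmas giving 1.000000. A minimal sketch under that assumption follows. The function name `form_lemma_ratio` is hypothetical, and the second data set is invented English-like data included only to show a ratio above 1.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Return (ratio, forms_by_lemma) for an iterable of (form, lemma) pairs.

    Assumes the ratio is: total distinct forms per lemma, summed over lemmas,
    divided by the number of distinct lemmas.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), dict(forms_by_lemma)


# Three NUM lemmas named in the text, each with a single form.
num_pairs = [("一", "一"), ("一", "一"), ("一百", "一百"), ("七", "七")]
num_ratio, num_forms = form_lemma_ratio(num_pairs)

# Invented data: one lemma with two forms pushes the ratio above 1.
toy_pairs = [("go", "go"), ("went", "go"), ("cat", "cat")]
toy_ratio, toy_forms = form_lemma_ratio(toy_pairs)

print(num_ratio, toy_ratio)
```

When every lemma has exactly one form, as reported for NUM here, the ratio is exactly 1 and every lemma ties for the "highest number of forms".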
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 6 different relations: nummod (162; 92% instances), conj (10; 6% instances), advcl (2; 1% instances), clf (1; 1% instances), obj (1; 1% instances), root (1; 1% instances)
Parents of NUM nodes belong to 6 different parts of speech: NOUN (163; 92% instances), VERB (6; 3% instances), NUM (4; 2% instances), PROPN (2; 1% instances), DET (1; 1% instances), no tag (1; 1% instances; presumably the artificial root, which is the parent of the one NUM node attached as root)
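The percentages in these two lists can be reproduced from the raw counts. The sketch below does so with the figures reported above, assuming each percentage is the count divided by the total and rounded to the nearest whole percent. The helper name `shares` and the `(no tag)` label are hypothetical.

```python
def shares(counts):
    """Map each key to its share of the total, as a rounded whole percent."""
    total = sum(counts.values())
    return {key: round(100 * n / total) for key, n in counts.items()}


# Counts as reported in the text for NUM nodes.
parent_relations = {"nummod": 162, "conj": 10, "advcl": 2,
                    "clf": 1, "obj": 1, "root": 1}
# "(no tag)" is a hypothetical label for the entry printed without a tag.
parent_pos = {"NOUN": 163, "VERB": 6, "NUM": 4,
              "PROPN": 2, "DET": 1, "(no tag)": 1}

relation_shares = shares(parent_relations)
pos_shares = shares(parent_pos)
print(relation_shares)
print(pos_shares)
```

Both dictionaries sum to 177, the number of NUM tokens, since every node has exactly one incoming relation and one parent. The rounded percentages need not sum to 100.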
107 (60%) NUM nodes are leaves.
65 (37%) NUM nodes have one child.
4 (2%) NUM nodes have two children.
1 (1%) NUM node has three or more children.
The highest child degree of a NUM node is 4.
Children of NUM nodes are attached using 7 different relations: clf (60; 78% instances), punct (8; 10% instances), advmod (3; 4% instances), conj (3; 4% instances), appos (1; 1% instances), cc (1; 1% instances), det (1; 1% instances)
Children of NUM nodes belong to 6 different parts of speech: NOUN (60; 78% instances), PUNCT (8; 10% instances), NUM (4; 5% instances), ADV (3; 4% instances), CCONJ (1; 1% instances), DET (1; 1% instances)
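The degree figures and the child counts can be cross-checked against each other. The sketch below uses only numbers reported above. It takes the single node with "three or more children" as the unknown and solves for the degree it must have for the totals to agree. The variable names are illustrative.

```python
# Number of NUM nodes with exactly 0, 1 and 2 children, as reported.
degree_counts = {0: 107, 1: 65, 2: 4}
nodes_with_three_or_more = 1

# Children of NUM nodes by relation and by part of speech, as reported.
child_relations = {"clf": 60, "punct": 8, "advmod": 3, "conj": 3,
                   "appos": 1, "cc": 1, "det": 1}
child_pos = {"NOUN": 60, "PUNCT": 8, "NUM": 4, "ADV": 3, "CCONJ": 1, "DET": 1}

total_nodes = sum(degree_counts.values()) + nodes_with_three_or_more
total_children = sum(child_relations.values())

# Children already accounted for by nodes of degree 0, 1 and 2.
accounted = sum(degree * n for degree, n in degree_counts.items())

# Degree the remaining node must have for the totals to match.
implied_degree = (total_children - accounted) // nodes_with_three_or_more

print(total_nodes, total_children, implied_degree)
```

Under these figures the node counts add up to the 177 NUM tokens, both child breakdowns total 77, and the remaining node must have 4 children, which agrees with the reported highest child degree.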