Treebank Statistics: UD_Chinese-HK: POS Tags: DET
There are 38 DET
lemmas (2%), 38 DET
types (2%) and 260 DET
tokens (3%).
Out of 16 observed tags, the rank of DET
is: 8 in number of lemmas, 8 in number of types and 10 in number of tokens.
The 10 most frequent DET
lemmas: 這、 那、 什麼、 其他、 每、 任何、 這些、 所有、 多、 這個
The 10 most frequent DET
types: 這、 那、 什麼、 其他、 每、 任何、 這些、 所有、 多、 這個
The 10 most frequent ambiguous lemmas: 這 (DET 52, PRON 19), 那 (DET 28, ADV 9, PRON 3), 什麼 (DET 22, PRON 10), 這些 (DET 13, PRON 3), 多 (DET 8, ADJ 5, ADV 4), 這個 (DET 8, PRON 1), 下 (VERB 10, DET 6, ADP 5, ADV 4), 此 (DET 6, PRON 2), 甚麼 (DET 5, PRON 4, NOUN 1), 一些 (DET 4, ADV 1)
The 10 most frequent ambiguous types: 這 (DET 52, PRON 19), 那 (DET 28, ADV 9, PRON 3), 什麼 (DET 22, PRON 10), 這些 (DET 13, PRON 3), 多 (DET 8, ADJ 5, ADV 4), 這個 (DET 8, PRON 1), 下 (VERB 10, DET 6, ADP 5, ADV 4), 此 (DET 6, PRON 2), 甚麼 (DET 5, PRON 4, NOUN 1), 一些 (DET 4, ADV 1)
- 這
- 那
- 什麼
- 這些
- 多
- 這個
- 下
- 此
- 甚麼
- 一些
Morphology
The form / lemma ratio of DET
is 1.000000 (the average of all parts of speech is 1.007013).
The 1st highest number of forms (1) was observed with the lemma “一些”: 一些.
The 2nd highest number of forms (1) was observed with the lemma “上”: 上.
The 3rd highest number of forms (1) was observed with the lemma “下”: 下.
DET
does not occur with any features.
Relations
DET
nodes are attached to their parents using 8 different relations: det (251; 97% instances), amod (2; 1% instances), obj (2; 1% instances), conj (1; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), nsubj (1; 0% instances), root (1; 0% instances)
Parents of DET
nodes belong to 6 different parts of speech: NOUN (247; 95% instances), VERB (7; 3% instances), PROPN (3; 1% instances), ADJ (1; 0% instances), NUM (1; 0% instances), (1; 0% instances)
210 (81%) DET
nodes are leaves.
49 (19%) DET
nodes have one child.
0 (0%) DET
nodes have two children.
1 (0%) DET
nodes have three or more children.
The highest child degree of a DET
node is 3.
Children of DET
nodes are attached using 7 different relations: clf (43; 83% instances), acl (2; 4% instances), advmod (2; 4% instances), discourse:sp (2; 4% instances), cc (1; 2% instances), nummod (1; 2% instances), punct (1; 2% instances)
Children of DET
nodes belong to 7 different parts of speech: NOUN (43; 83% instances), ADV (2; 4% instances), PART (2; 4% instances), VERB (2; 4% instances), CCONJ (1; 2% instances), NUM (1; 2% instances), PUNCT (1; 2% instances)