home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-HK: POS Tags: DET

There are 13 DET lemmas (3%), 15 DET types (3%) and 44 DET tokens (2%). Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 10 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: 這、 _、 那、 什麼、 每、 上、 多少、 整、 有些、 幾

The 10 most frequent DET types: 這、 那、 什麼、 每、 上、 多少、 整、 有些、 這些、 些

The 10 most frequent ambiguous lemmas: 這 (DET 12, PRON 1), _ (VERB 114, PUNCT 111, NOUN 69, ADV 63, PART 54, PRON 49, ADJ 21, NUM 19, AUX 18, ADP 10, PROPN 10, DET 8, INTJ 5, SCONJ 1, X 1), 那 (DET 6, ADV 2, PRON 2), 什麼 (DET 3, PRON 3)

The 10 most frequent ambiguous types: 這 (DET 12, PRON 3), 那 (DET 7, ADV 3, PRON 2), 什麼 (DET 6, PRON 4), 多少 (DET 2, PRON 1), 這些 (PRON 3, DET 2), 多 (ADV 3, DET 1)

Morphology

The form / lemma ratio of DET is 1.153846 (the average of all parts of speech is 1.221258).

The 1st highest number of forms (5) was observed with the lemma “_”: 些, 什麼, 多, 這些, 那.

The 2nd highest number of forms (1) was observed with the lemma “上”: 上.

The 3rd highest number of forms (1) was observed with the lemma “什麼”: 什麼.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 1 different relations: det (44; 100% instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (43; 98% instances), PRON (1; 2% instances)

33 (75%) DET nodes are leaves.

11 (25%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 2 different relations: clf (10; 91% instances), advmod (1; 9% instances)

Children of DET nodes belong to 2 different parts of speech: NOUN (10; 91% instances), ADV (1; 9% instances)