home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-CFL: POS Tags: DET

There are 30 DET lemmas (2%), 30 DET types (2%) and 127 DET tokens (2%). Out of 15 observed tags, the rank of DET is: 8 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: 这、 那、 很多、 每、 什么、 一些、 几、 哪、 有的、 这些

The 10 most frequent DET types: 这、 那、 很多、 每、 什么、 一些、 几、 哪、 有的、 这些

The 10 most frequent ambiguous lemmas: 这 (DET 24, PRON 15, VERB 1), 那 (DET 21, PRON 6), 每 (DET 13, PRON 3, NOUN 1), 什么 (DET 7, PRON 6), 几 (DET 4, NUM 4), 有的 (PRON 4, DET 3), 所有 (DET 2, PRON 2), 本 (DET 2, NOUN 1), 那样 (PRON 4, DET 2), 一点 (ADV 1, DET 1, NOUN 1)

The 10 most frequent ambiguous types: 这 (DET 24, PRON 15), 那 (DET 21, PRON 6), 每 (DET 13, PRON 3, NOUN 1), 什么 (DET 7, PRON 6), 几 (DET 4, NUM 4), 有的 (PRON 4, DET 3), 所有 (DET 2, PRON 2), 本 (DET 2, NOUN 1), 那样 (PRON 4, DET 2), 一点 (ADV 1, DET 1, NOUN 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.009709).

The 1st highest number of forms (1) was observed with the lemma “一些”: 一些.

The 2nd highest number of forms (1) was observed with the lemma “一点”: 一点.

The 3rd highest number of forms (1) was observed with the lemma “个”: 个.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 6 different relations: det (119; 94% instances), nmod (3; 2% instances), advmod (2; 2% instances), clf (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances)

Parents of DET nodes belong to 5 different parts of speech: NOUN (121; 95% instances), ADJ (2; 2% instances), VERB (2; 2% instances), DET (1; 1% instances), PROPN (1; 1% instances)

85 (67%) DET nodes are leaves.

39 (31%) DET nodes have one child.

3 (2%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 4 different relations: clf (34; 76% instances), case (7; 16% instances), advmod (3; 7% instances), nummod (1; 2% instances)

Children of DET nodes belong to 5 different parts of speech: NOUN (33; 73% instances), PART (7; 16% instances), ADV (3; 7% instances), DET (1; 2% instances), NUM (1; 2% instances)