This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Chinese)

There is 1 DET lemma (7%), 104 DET types (0%) and 1297 DET tokens (1%). Out of 15 observed tags, DET ranks 6th in number of lemmas, 10th in number of types and 12th in number of tokens.
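Counts of this kind can be derived from the FORM, LEMMA and UPOS columns of a treebank. The following is a minimal sketch, not the script that produced this page: the token list and the helper name `tag_counts` are invented for illustration, and the lemma is "_" throughout to mirror a treebank whose lemmas are unfilled.

```python
from collections import defaultdict

# Toy tokens as (FORM, LEMMA, UPOS) triples, invented for illustration;
# they are not taken from UD_Chinese.
TOKENS = [
    ("這", "_", "DET"),
    ("書", "_", "NOUN"),
    ("這", "_", "PRON"),
    ("每", "_", "DET"),
    ("這", "_", "DET"),
]

def tag_counts(tokens):
    """Return {upos: (n_lemmas, n_types, n_tokens)} (hypothetical helper)."""
    lemmas = defaultdict(set)   # distinct lemmas seen with each tag
    types = defaultdict(set)    # distinct word forms seen with each tag
    n_tokens = defaultdict(int) # running token count per tag
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)
        types[upos].add(form)
        n_tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), n_tokens[t]) for t in n_tokens}

print(tag_counts(TOKENS)["DET"])  # (1, 2, 3)
```

In the toy data DET has one lemma, two types and three tokens, the same shape as the 1 / 104 / 1297 figures reported above.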

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: 這、 該、 這些、 其他、 此、 所有、 各、 另、 任何、 每

The 10 most frequent ambiguous lemmas: _ (NOUN 34043, VERB 20468, PUNCT 17047, PART 13172, PROPN 10741, NUM 6659, ADV 5749, ADP 5424, ADJ 3032, PRON 1776, CONJ 1740, DET 1297, X 1209, AUX 889, SYM 37)

The 10 most frequent ambiguous types: 這 (DET 316, PRON 77), 這些 (DET 86, PRON 1), 此 (PRON 129, DET 65, ADP 1), 所有 (DET 55, VERB 2), 各 (DET 43, ADV 1), 另 (DET 38, ADV 2), 每 (DET 29, ADV 8), 全 (DET 23, ADV 3, PROPN 1), 整個 (DET 23, NOUN 1), 同 (DET 22, ADP 13, PART 2, CONJ 1, NOUN 1, ADV 1)
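A type is ambiguous here when the same word form occurs with more than one tag. As a rough sketch of how such a list could be built (the data and the helper name `ambiguous_types` are assumptions made for illustration, not part of the UD tooling):

```python
from collections import Counter, defaultdict

# Toy (FORM, UPOS) pairs, invented for illustration.
TOKENS = [
    ("這", "DET"),
    ("這", "PRON"),
    ("這", "DET"),
    ("每", "DET"),
]

def ambiguous_types(tokens):
    """Map each form seen with more than one tag to its (tag, count) list."""
    per_form = defaultdict(Counter)
    for form, upos in tokens:
        per_form[form][upos] += 1
    # Keep only forms with at least two distinct tags; most_common()
    # orders the tags by descending frequency.
    return {f: c.most_common() for f, c in per_form.items() if len(c) > 1}

print(ambiguous_types(TOKENS))  # {'這': [('DET', 2), ('PRON', 1)]}
```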

Morphology

The form / lemma ratio of DET is 104.000000, that is, 104 forms to 1 lemma, because the only DET lemma is the placeholder “_”. The average of all parts of speech is 1499.200000.

The highest number of forms (104) was observed with the lemma “_”: 一切, 上, 下, 以上, 以下, 任, 任何, 何, 全, 全套, 全部, 全體, 其他, 其它, 其餘, 別, 前, 前任, 另, 另外, 各, 各個, 各州, 各式, 各種, 各種各樣, 各級, 各項, 各類, 同, 同年, 後, 所有, 整, 整個, 整場, 整塊, 整套, 整所, 整架, 整片, 整顆, 是次, 有的, 本, 本屆, 本班, 某, 某些, 某個, 某種, 此, 此套, 此次, 此種, 此等, 此項, 此類, 歷屆, 毎年, 每, 每位, 每個, 每元, 每卡, 每周, 每天, 每年, 每座, 每戶, 每所, 每日, 每枚, 每次, 每段, 每片, 每秒, 每組, 每週, 每邊, 每間, 每隊, 每集, 當屆, 眾, 該, 該屆, 該批, 該族, 該條, 該段, 該組, 該集, 諸, 這, 這些, 這次, 這種, 那, 那些, 首, 首任, 首條, 首部.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 9 different relations: det (1206; 93% instances), advmod (31; 2% instances), nsubj (27; 2% instances), nmod:tmod (20; 2% instances), nmod (4; 0% instances), amod (3; 0% instances), conj (3; 0% instances), case:pref (2; 0% instances), acl (1; 0% instances)
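The percentages in the relation list above are whole-number roundings of each count over the 1297 DET tokens, which is why several small but non-zero counts show as 0%. A short sketch that recomputes them from the counts on this page (the names `DET_RELATIONS` and `shares` are illustrative only):

```python
# Relation counts copied from the list above.
DET_RELATIONS = {
    "det": 1206, "advmod": 31, "nsubj": 27, "nmod:tmod": 20, "nmod": 4,
    "amod": 3, "conj": 3, "case:pref": 2, "acl": 1,
}

def shares(counts):
    """Return each count as a percentage of the total, rounded to an integer."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}

print(sum(DET_RELATIONS.values()))   # 1297
print(shares(DET_RELATIONS)["det"])  # 93
print(shares(DET_RELATIONS)["acl"])  # 0
```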

Parents of DET nodes belong to 6 different parts of speech: NOUN (1117; 86% instances), PART (79; 6% instances), VERB (67; 5% instances), PROPN (18; 1% instances), NUM (15; 1% instances), X (1; 0% instances)

1266 (98%) DET nodes are leaves.

29 (2%) DET nodes have one child.

2 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 4 different relations: case:dec (27; 82% instances), advmod (3; 9% instances), nmod (2; 6% instances), case (1; 3% instances)

Children of DET nodes belong to 4 different parts of speech: PART (27; 82% instances), ADV (3; 9% instances), NOUN (2; 6% instances), ADP (1; 3% instances)


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]