home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-GSD: POS Tags: DET

There are 137 DET lemmas (1%), 137 DET types (1%) and 1329 DET tokens (1%). Out of 16 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent DET lemmas: 這、 該、 這些、 其他、 此、 所有、 各、 另、 任何、 每

The 10 most frequent DET types: 這、 該、 這些、 其他、 此、 所有、 各、 另、 任何、 每

The 10 most frequent ambiguous lemmas: 這 (DET 316, PRON 77), 該 (DET 159, ADP 1, AUX 1), 這些 (DET 86, PRON 1), 此 (PRON 129, DET 65, ADP 1), 所有 (DET 55, VERB 2), 各 (DET 43, ADV 1), 另 (DET 38, ADV 2), 每 (DET 29, ADV 8), 全 (DET 23, ADV 3, PROPN 1), 整個 (DET 23, NOUN 1)

The 10 most frequent ambiguous types: 這 (DET 316, PRON 77), 該 (DET 159, ADP 1), 這些 (DET 86, PRON 1), 此 (PRON 129, DET 65, ADP 1), 所有 (DET 55, VERB 2), 各 (DET 43, ADV 1), 另 (DET 38, ADV 2), 每 (DET 29, ADV 8), 全 (DET 23, ADV 3, PROPN 1), 整個 (DET 23, NOUN 1)

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.004819).

The 1st highest number of forms (1) was observed with the lemma “$5,000”: $5,000.

The 2nd highest number of forms (1) was observed with the lemma “A330”: A330.

The 3rd highest number of forms (1) was observed with the lemma “AEG”: AEG.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 8 different relations: det (1240; 93% instances), obl (32; 2% instances), nsubj (27; 2% instances), nmod:tmod (20; 2% instances), amod (3; 0% instances), conj (3; 0% instances), nmod (3; 0% instances), acl (1; 0% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (1144; 86% instances), PART (83; 6% instances), VERB (67; 5% instances), PROPN (18; 1% instances), NUM (15; 1% instances), X (2; 0% instances)

1261 (95%) DET nodes are leaves.

49 (4%) DET nodes have one child.

11 (1%) DET nodes have two children.

8 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 5.

Children of DET nodes are attached using 15 different relations: case (63; 60% instances), nmod (9; 9% instances), punct (6; 6% instances), conj (5; 5% instances), advmod (4; 4% instances), flat:foreign (4; 4% instances), cc (3; 3% instances), nsubj (2; 2% instances), nummod (2; 2% instances), obj (2; 2% instances), acl (1; 1% instances), advcl (1; 1% instances), amod (1; 1% instances), nmod:tmod (1; 1% instances), xcomp (1; 1% instances)

Children of DET nodes belong to 10 different parts of speech: PART (63; 60% instances), NOUN (13; 12% instances), X (7; 7% instances), PUNCT (6; 6% instances), ADV (4; 4% instances), CCONJ (3; 3% instances), NUM (3; 3% instances), VERB (3; 3% instances), ADP (2; 2% instances), PROPN (1; 1% instances)