Treebank Statistics: UD_Chinese-GSDSimp: POS Tags: DET
There are 135 DET lemmas (1%), 135 DET types (1%) and 1329 DET tokens (1%).
Out of 16 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 14 in number of tokens.
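Counts and ranks of this kind can be recomputed from a treebank's CoNLL-U files. The following is a minimal sketch, not the tool that generated this page. It assumes that "types" means distinct word forms and that a tag's rank is one plus the number of tags with a strictly larger count. The names `pos_counts`, `rank`, `row` and `SAMPLE` are illustrative, and the sample sentences are made up rather than taken from the treebank.

```python
from collections import defaultdict

def pos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas, forms, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        forms[upos].add(form)
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(forms[t]), tokens[t]) for t in tokens}

def rank(counts, tag, index):
    """1-based rank of `tag`; index 0 = lemmas, 1 = types, 2 = tokens."""
    return 1 + sum(1 for c in counts.values() if c[index] > counts[tag][index])

def row(i, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line, leaving unused columns as '_'."""
    return "\t".join([str(i), form, lemma, upos, "_", "_",
                      str(head), deprel, "_", "_"])

# Hypothetical two-sentence sample, not taken from the treebank.
SAMPLE = "\n".join([
    row(1, "这", "这", "DET", 2, "det"),
    row(2, "书", "书", "NOUN", 3, "nsubj"),
    row(3, "好", "好", "ADJ", 0, "root"),
    "",
    row(1, "这些", "这些", "DET", 2, "det"),
    row(2, "书", "书", "NOUN", 3, "nsubj"),
    row(3, "好", "好", "ADJ", 0, "root"),
])

counts = pos_counts(SAMPLE)
print(counts["DET"])           # (2, 2, 2): lemmas, types, tokens
print(rank(counts, "NOUN", 0)) # 2: only DET has more lemmas in the sample
```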
The 10 most frequent DET lemmas: 这、 该、 这些、 其他、 此、 所有、 各、 另、 任何、 每
The 10 most frequent DET types: 这、 该、 这些、 其他、 此、 所有、 各、 另、 任何、 每
The 10 most frequent ambiguous lemmas: 这 (DET 316, PRON 77), 该 (DET 159, ADP 1, AUX 1), 这些 (DET 86, PRON 1), 此 (PRON 129, DET 65, ADP 1), 所有 (DET 55, VERB 2), 各 (DET 43, ADV 1), 另 (DET 38, ADV 2), 每 (DET 29, ADV 8), 全 (DET 23, ADV 3, PROPN 1), 整个 (DET 23, NOUN 1)
The 10 most frequent ambiguous types: 这 (DET 316, PRON 77), 该 (DET 159, ADP 1), 这些 (DET 86, PRON 1), 此 (PRON 129, DET 65, ADP 1), 所有 (DET 55, VERB 2), 各 (DET 43, ADV 1), 另 (DET 38, ADV 2), 每 (DET 29, ADV 8), 全 (DET 23, ADV 3, PROPN 1), 整个 (DET 23, NOUN 1)
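The ambiguity lists above appear to be ordered by how often each item is tagged DET, with the tags inside each entry ordered by their own frequency. A minimal sketch of that tabulation follows, assuming exactly this ordering. The function name `ambiguous_lemmas` is illustrative. The counts for 这, 此 and 这些 are copied from the list above, while the single 其他 token is a placeholder for an unambiguous lemma and not a treebank figure.

```python
from collections import Counter, defaultdict

def ambiguous_lemmas(tokens, tag):
    """tokens: iterable of (lemma, upos) pairs.

    Return lemmas that occur with `tag` and with at least one other tag,
    sorted by their frequency under `tag` (highest first).
    """
    by_lemma = defaultdict(Counter)
    for lemma, upos in tokens:
        by_lemma[lemma][upos] += 1
    hits = [(lemma, c) for lemma, c in by_lemma.items()
            if tag in c and len(c) > 1]
    hits.sort(key=lambda item: item[1][tag], reverse=True)
    # most_common() orders each lemma's tags by descending count.
    return [(lemma, c.most_common()) for lemma, c in hits]

tokens = (
    [("这", "DET")] * 316 + [("这", "PRON")] * 77
    + [("此", "PRON")] * 129 + [("此", "DET")] * 65 + [("此", "ADP")]
    + [("这些", "DET")] * 86 + [("这些", "PRON")]
    + [("其他", "DET")]  # placeholder count, only to show an unambiguous lemma
)

result = ambiguous_lemmas(tokens, "DET")
for lemma, tags in result:
    print(lemma, tags)
```

Note that 此 is listed even though PRON is its most frequent tag; the filter only requires that DET is one of the observed tags.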
Morphology
The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.004660).
The 1st highest number of forms (1) was observed with the lemma “$5,000”: $5,000.
The 2nd highest number of forms (1) was observed with the lemma “A330”: A330.
The 3rd highest number of forms (1) was observed with the lemma “AEG”: AEG.
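A form / lemma ratio of 1.000000 is consistent with the 135 DET types and 135 DET lemmas reported above (135 / 135 = 1). The sketch below assumes the ratio is the number of distinct forms divided by the number of distinct lemmas; the exact definition used by the statistics generator is not stated here. The function names are illustrative, and the English pairs are a made-up contrast case.

```python
from collections import defaultdict

def forms_per_lemma(pairs):
    """pairs: list of (form, lemma). Return {lemma: set of observed forms}."""
    table = defaultdict(set)
    for form, lemma in pairs:
        table[lemma].add(form)
    return table

def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas (assumed definition)."""
    n_forms = len({form for form, _ in pairs})
    n_lemmas = len({lemma for _, lemma in pairs})
    return n_forms / n_lemmas

# Three of the frequent DET items listed above: form and lemma coincide.
det_pairs = [("这", "这"), ("该", "该"), ("这些", "这些"), ("这", "这")]
print(form_lemma_ratio(det_pairs))   # 1.0

# Made-up inflecting example for contrast: one lemma with two forms.
other_pairs = [("is", "be"), ("was", "be"), ("the", "the")]
print(form_lemma_ratio(other_pairs))                          # 1.5
print(max(len(f) for f in forms_per_lemma(other_pairs).values()))  # 2
```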
DET does not occur with any features.
Relations
DET nodes are attached to their parents using 8 different relations: det (1240; 93% instances), obl (32; 2% instances), nsubj (27; 2% instances), nmod:tmod (20; 2% instances), amod (3; 0% instances), conj (3; 0% instances), nmod (3; 0% instances), acl (1; 0% instances)
Parents of DET nodes belong to 6 different parts of speech: NOUN (1144; 86% instances), PART (83; 6% instances), VERB (67; 5% instances), PROPN (18; 1% instances), NUM (15; 1% instances), X (2; 0% instances)
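The relation and parent-POS breakdowns above can be tabulated from the HEAD and DEPREL columns of each sentence. The sketch below is one way to do it, assuming percentages are rounded to the nearest integer over all DET tokens; that assumption reproduces the figures listed above. The names `attachment_stats` and `with_percent` and the two sample sentences are illustrative only.

```python
from collections import Counter

def attachment_stats(sentences, tag):
    """sentences: list of sentences, each a list of (id, upos, head, deprel).

    Return (relation counts, parent-POS counts) for nodes tagged `tag`.
    """
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        upos_by_id = {tid: upos for tid, upos, _, _ in sent}
        for tid, upos, head, deprel in sent:
            if upos != tag:
                continue
            relations[deprel] += 1
            if head != 0:  # head 0 is the artificial root, which has no POS
                parent_pos[upos_by_id[head]] += 1
    return relations, parent_pos

def with_percent(counter):
    """Return [(key, count, rounded percent)] sorted by descending count."""
    total = sum(counter.values())
    return [(k, n, round(100 * n / total)) for k, n in counter.most_common()]

# Hypothetical sentences, not taken from the treebank.
sentences = [
    [(1, "DET", 2, "det"), (2, "NOUN", 3, "nsubj"), (3, "VERB", 0, "root")],
    [(1, "DET", 2, "nsubj"), (2, "VERB", 0, "root")],
]
relations, parent_pos = attachment_stats(sentences, "DET")
print(with_percent(relations))

# Relation counts copied from the list above, to check the rounding.
reported = Counter({"det": 1240, "obl": 32, "nsubj": 27, "nmod:tmod": 20,
                    "amod": 3, "conj": 3, "nmod": 3, "acl": 1})
print(with_percent(reported)[:4])
```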
1261 (95%) DET nodes are leaves.
49 (4%) DET nodes have one child.
11 (1%) DET nodes have two children.
8 (1%) DET nodes have three or more children.
The highest child degree of a DET node is 5.
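The child-degree figures above count, for every DET node, how many other nodes name it as their head. A minimal sketch of that count follows; the function name `child_degree_histogram` and the sample sentences are illustrative and not drawn from the treebank.

```python
from collections import Counter

def child_degree_histogram(sentences, tag):
    """sentences: list of sentences, each a list of (id, upos, head).

    Return a Counter mapping number of children -> number of `tag` nodes.
    """
    hist = Counter()
    for sent in sentences:
        # How many nodes in this sentence point at each head ID.
        n_children = Counter(head for _, _, head in sent)
        for tid, upos, _ in sent:
            if upos == tag:
                hist[n_children[tid]] += 1  # a missing key counts as 0 children
    return hist

# Hypothetical sentences: a leaf DET, and a DET with one PART child.
sentences = [
    [(1, "DET", 2), (2, "NOUN", 0)],
    [(1, "DET", 3), (2, "PART", 1), (3, "NOUN", 0)],
]
hist = child_degree_histogram(sentences, "DET")
print(hist[0], hist[1])  # 1 1
print(max(hist))         # 1, the highest child degree in the sample
```

As a consistency check on the reported figures, 1261 + 49 + 11 + 8 = 1329, which equals the total number of DET tokens.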
Children of DET nodes are attached using 15 different relations: case (63; 60% instances), nmod (9; 9% instances), punct (6; 6% instances), conj (5; 5% instances), advmod (4; 4% instances), flat:foreign (4; 4% instances), cc (3; 3% instances), nsubj (2; 2% instances), nummod (2; 2% instances), obj (2; 2% instances), acl (1; 1% instances), advcl (1; 1% instances), amod (1; 1% instances), nmod:tmod (1; 1% instances), xcomp (1; 1% instances)
Children of DET nodes belong to 10 different parts of speech: PART (63; 60% instances), NOUN (13; 12% instances), X (7; 7% instances), PUNCT (6; 6% instances), ADV (4; 4% instances), CCONJ (3; 3% instances), NUM (3; 3% instances), VERB (3; 3% instances), ADP (2; 2% instances), PROPN (1; 1% instances)