home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PatentChar: POS Tags: DET

There are 1 DET lemmas (7%), 14 DET types (2%) and 39 DET tokens (1%). Out of 15 observed tags, the rank of DET is: 6 in number of lemmas, 9 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: 该、 一种、 所有、 各、 每、 多、 一个、 一条、 两个、 个

The 10 most frequent ambiguous lemmas: _ (NOUN 1661, VERB 948, PUNCT 560, ADJ 474, PART 346, ADP 259, NUM 185, CCONJ 106, ADV 68, PROPN 60, PRON 48, DET 39, X 14, SCONJ 10, AUX 6)

The 10 most frequent ambiguous types: 一种 (DET 6, NUM 5), 所有 (DET 5, ADJ 2), 各 (DET 3, ADJ 1), 多 (ADJ 6, DET 2, NUM 1), 个 (NOUN 9, DET 1, NUM 1), 第一 (NUM 67, VERB 2, DET 1, NOUN 1)

Morphology

The form / lemma ratio of DET is 14.000000 (the average of all parts of speech is 50.400000).

The 1st highest number of forms (14) was observed with the lemma “_”: 一个, 一条, 一种, 两个, 个, 各, 哪个, 多, 所有, 每, 每个, 第一, 该, 这.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 3 different relations: dep (32; 82% instances), det (6; 15% instances), conj (1; 3% instances)

Parents of DET nodes belong to 5 different parts of speech: NOUN (35; 90% instances), DET (1; 3% instances), NUM (1; 3% instances), PROPN (1; 3% instances), VERB (1; 3% instances)

36 (92%) DET nodes are leaves.

2 (5%) DET nodes have one child.

0 (0%) DET nodes have two children.

1 (3%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 4 different relations: advmod (2; 40% instances), dep (1; 20% instances), obl (1; 20% instances), punct (1; 20% instances)

Children of DET nodes belong to 4 different parts of speech: ADV (2; 40% instances), DET (1; 20% instances), NOUN (1; 20% instances), PUNCT (1; 20% instances)