home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PUD: POS Tags: DET

There are 1 DET lemmas (7%), 45 DET types (1%) and 355 DET tokens (2%). Out of 15 observed tags, the rank of DET is: 6 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: 這、 該、 這些、 其他、 那、 所有、 另、 任何、 此、 整個

The 10 most frequent ambiguous lemmas: _ (NOUN 5410, VERB 3467, PUNCT 2902, PART 1881, PROPN 1361, ADP 1288, ADV 1283, NUM 873, PRON 710, ADJ 650, AUX 618, DET 355, X 306, CCONJ 283, SCONJ 28)

The 10 most frequent ambiguous types: 這 (DET 107, PRON 45), 那 (DET 21, PRON 6, ADV 2), 此 (PRON 20, DET 9), 上 (ADP 63, DET 4, NOUN 2, VERB 1), 下 (ADP 18, NOUN 5, DET 4), 全 (DET 4, ADV 2), 前 (ADP 23, DET 3), 另外 (ADV 4, DET 2), 任 (DET 1, NOUN 1), 後 (ADP 38, ADV 2, DET 1)

Morphology

The form / lemma ratio of DET is 45.000000 (the average of all parts of speech is 388.466667).

The 1st highest number of forms (45) was observed with the lemma “_”: 上, 下, 任, 任何, 全, 全副, 全部, 其他, 其它, 其餘, 前, 另, 另外, 各, 各個, 各天, 各樣, 各種, 同, 後, 所有, 整個, 有的, 本, 某, 某些, 某種, 此, 此次, 每, 每位, 每個, 每升, 每天, 每幅, 每年, 每首, 該, 該項, 這, 這些, 那, 那些, 頭, 頭個.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 10 different relations: det (335; 94% instances), nmod (6; 2% instances), advmod (3; 1% instances), case:loc (3; 1% instances), nsubj (2; 1% instances), obl:tmod (2; 1% instances), ccomp (1; 0% instances), compound (1; 0% instances), obj (1; 0% instances), obl:agent (1; 0% instances)

Parents of DET nodes belong to 7 different parts of speech: NOUN (325; 92% instances), VERB (16; 5% instances), PROPN (6; 2% instances), ADJ (3; 1% instances), NUM (3; 1% instances), DET (1; 0% instances), PRON (1; 0% instances)

347 (98%) DET nodes are leaves.

5 (1%) DET nodes have one child.

2 (1%) DET nodes have two children.

1 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 4.

Children of DET nodes are attached using 7 different relations: case (6; 46% instances), compound (2; 15% instances), acl:relcl (1; 8% instances), advmod (1; 8% instances), cop (1; 8% instances), mark (1; 8% instances), nsubj (1; 8% instances)

Children of DET nodes belong to 7 different parts of speech: PART (6; 46% instances), NOUN (2; 15% instances), ADP (1; 8% instances), ADV (1; 8% instances), AUX (1; 8% instances), DET (1; 8% instances), VERB (1; 8% instances)