home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean: POS Tags: DET

There are 1 DET lemmas (3%), 22 DET types (0%) and 539 DET tokens (1%). Out of 11 observed tags, the rank of DET is: 6 in number of lemmas, 9 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: 이, 그, 몇, 모든, 각, 어느, 어떤, 이들, 무슨, “그

The 10 most frequent ambiguous lemmas: _ (NOUN 32099, VERB 18517, ADV 11605, ADJ 2715, PUNCT 1972, ADP 835, PRON 677, DET 539, NUM 532, CCONJ 176, X 23)

The 10 most frequent ambiguous types: 이 (DET 216, PRON 29, NOUN 19), 그 (DET 128, PRON 36), 모든 (DET 36, NOUN 1), 각 (DET 22, NOUN 3), 어떤 (DET 11, ADJ 2), 이들 (DET 11, PRON 3, NOUN 1), 양 (DET 5, NOUN 5), 이러한 (ADJ 21, DET 5), 여느 (DET 3, ADJ 1), “이 (PRON 3, DET 2, NOUN 1)

Morphology

The form / lemma ratio of DET is 22.000000 (the average of all parts of speech is 963.631579).

The 1st highest number of forms (22) was observed with the lemma “_”: “그, “어떤, “이, 각, 그, 그런, 몇, 몇몇, 모든, 무슨, 뭔, 아무, 양, 어느, 어떠한, 어떤, 여느, 온, 이, 이들, 이러한, 이번.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 1 different relations: det (539; 100% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (380; 71% instances), ADV (126; 23% instances), VERB (30; 6% instances), ADJ (1; 0% instances), NUM (1; 0% instances), PRON (1; 0% instances)

537 (100%) DET nodes are leaves.

2 (0%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 2 different relations: advmod (1; 50% instances), obj (1; 50% instances)

Children of DET nodes belong to 2 different parts of speech: ADV (1; 50% instances), NOUN (1; 50% instances)