
This page pertains to UD version 2.

Treebank Statistics: UD_Korean-GSD: POS Tags: DET

There are 29 DET lemmas (0%), 29 DET types (0%) and 573 DET tokens (1%). Out of 16 observed tags, the rank of DET is: 12 in number of lemmas, 12 in number of types and 9 in number of tokens.
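Counts of this kind can be reproduced from a CoNLL-U file. The sketch below assumes that "types" means distinct word forms and that only ordinary syntactic words are counted (multiword-token ranges and empty nodes are skipped). The helper names `row`, `read_tokens`, and `tag_counts` and the toy sentences are invented for illustration and are not taken from the treebank.

```python
from collections import Counter

def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line (unused columns set to '_')."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank lines separate sentences, '#' lines are comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]

def tag_counts(conllu_text, upos):
    """Count distinct lemmas, distinct forms (types), and tokens for one UPOS tag."""
    forms, lemmas = Counter(), Counter()
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            forms[form] += 1
            lemmas[lemma] += 1
    return {"lemmas": len(lemmas), "types": len(forms), "tokens": sum(forms.values())}

# Toy data invented for illustration, not taken from UD_Korean-GSD.
SAMPLE = "\n".join([
    "# toy sentence 1",
    row(1, "이", "이", "DET", 2, "det"),
    row(2, "사람", "사람", "NOUN", 0, "root"),
    "",
    "# toy sentence 2",
    row(1, "그", "그", "DET", 2, "det"),
    row(2, "사람", "사람", "NOUN", 0, "root"),
    "",
    "# toy sentence 3",
    row(1, "이", "이", "DET", 2, "det"),
    row(2, "책", "책", "NOUN", 0, "root"),
])

det_stats = tag_counts(SAMPLE, "DET")
print(det_stats)
```

Running `tag_counts` over the real treebank files with `"DET"` would be one way to check the figures reported here, provided the assumptions about what counts as a type hold.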

The 10 most frequent DET lemmas: 이, 그, 몇, 모든, 각, 어떤, 어느, 이+들, 약, 무슨

The 10 most frequent DET types: 이, 그, 몇, 모든, 각, 어떤, 어느, 이들, 약, 무슨

The 10 most frequent ambiguous lemmas: 이 (DET 223, ADP 45, PRON 29, NOUN 19), 그 (DET 133, PRON 36), 모든 (DET 36, NOUN 1), 각 (DET 24, NOUN 4), 어떤 (DET 12, ADJ 2), 이+들 (DET 11, PRON 3, NOUN 1), 약 (NOUN 25, DET 8), 양 (NOUN 6, DET 5), 이러+하+ㄴ (ADJ 21, DET 5), 그런 (ADJ 12, DET 4)

The 10 most frequent ambiguous types: 이 (DET 223, ADP 45, PRON 29, NOUN 19), 그 (DET 133, PRON 36), 모든 (DET 36, NOUN 1), 각 (DET 24, NOUN 4), 어떤 (DET 12, ADJ 2), 이들 (DET 11, PRON 3, NOUN 1), 약 (NOUN 25, DET 8), 양 (NOUN 6, DET 5), 이러한 (ADJ 21, DET 5), 그런 (ADJ 12, DET 4)
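An item is ambiguous when it occurs with more than one tag. The following sketch groups (item, tag) pairs and ranks the ambiguous items by their DET frequency, which appears consistent with the ordering of the lists above but is an assumption about how the page was generated. The function name `ambiguous_items` is hypothetical; the counts for 약, 양, and 그런 are copied from the list above, while the single 몇 token is a placeholder.

```python
from collections import Counter, defaultdict

def ambiguous_items(pairs, upos):
    """Return items seen with `upos` and at least one other tag, ranked by `upos` count."""
    by_item = defaultdict(Counter)
    for item, tag in pairs:
        by_item[item][tag] += 1
    ambiguous = {
        item: tags for item, tags in by_item.items()
        if len(tags) > 1 and upos in tags
    }
    # Rank by frequency under the tag of interest (assumed ordering).
    ranked = sorted(ambiguous.items(), key=lambda kv: -kv[1][upos])
    # Within each item, list tags from most to least frequent.
    return [(item, tags.most_common()) for item, tags in ranked]

pairs = (
    [("약", "NOUN")] * 25 + [("약", "DET")] * 8
    + [("양", "NOUN")] * 6 + [("양", "DET")] * 5
    + [("그런", "ADJ")] * 12 + [("그런", "DET")] * 4
    + [("몇", "DET")]  # placeholder count; only ever DET here, so not ambiguous
)

result = ambiguous_items(pairs, "DET")
for item, tags in result:
    print(item, tags)
```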

Morphology

The form / lemma ratio of DET is 1.000000 (the average of all parts of speech is 1.000681).
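As a rough illustration, the ratio can be read as the average number of distinct forms per lemma. That definition is an assumption, not something this page states; it is consistent with a value of exactly 1 when every DET lemma has a single form. The function name `form_lemma_ratio` is hypothetical, and the three (form, lemma) pairs are taken from the frequency lists above.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma, plus the per-lemma form sets."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma), dict(forms_by_lemma)

# (form, lemma) pairs; repeated tokens do not change the ratio.
pairs = [("이", "이"), ("그", "그"), ("이들", "이+들"), ("이", "이")]

ratio, forms_by_lemma = form_lemma_ratio(pairs)
print(f"{ratio:.6f}")
```

A lemma with two distinct surface forms would push the ratio above 1, which is presumably why the average over all parts of speech is slightly higher than the DET value.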

The 1st highest number of forms (1) was observed with the lemma “각”: 각.

The 2nd highest number of forms (1) was observed with the lemma “그”: 그.

The 3rd highest number of forms (1) was observed with the lemma “그런”: 그런.

DET occurs with 1 feature: NumType (2; 0% instances)

DET occurs with 1 feature-value pair: NumType=Card

DET occurs with 2 feature combinations. The most frequent feature combination is _ (571 tokens). Examples: 이, 그, 몇, 모든, 각, 어떤, 어느, 이들, 약, 무슨

Relations

DET nodes are attached to their parents using 3 different relations: det (569; 99% instances), appos (3; 1% instances), root (1; 0% instances)
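The percentages follow from the raw counts. The short check below uses the three counts from the sentence above and assumes rounding to the nearest whole percent, which is why a relation seen once out of 573 tokens is shown as 0%.

```python
# Relation counts copied from the sentence above.
relations = {"det": 569, "appos": 3, "root": 1}

total = sum(relations.values())

# Share of each relation, rounded to the nearest whole percent (assumed rounding).
shares = {rel: round(100 * count / total) for rel, count in relations.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```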

Parents of DET nodes belong to 8 different parts of speech: NOUN (401; 70% instances), ADV (127; 22% instances), VERB (38; 7% instances), ADJ (2; 0% instances), NUM (2; 0% instances), PRON (1; 0% instances), PUNCT (1; 0% instances), no part of speech, i.e. the artificial root node (1; 0% instances)

564 (98%) DET nodes are leaves.

4 (1%) DET nodes have one child.

1 (0%) DET node has two children.

4 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 6.

Children of DET nodes are attached using 6 different relations: flat (14; 52% instances), punct (9; 33% instances), acl:relcl (1; 4% instances), advmod (1; 4% instances), amod (1; 4% instances), obj (1; 4% instances)

Children of DET nodes belong to 6 different parts of speech: NOUN (13; 48% instances), PUNCT (9; 33% instances), ADV (2; 7% instances), ADJ (1; 4% instances), PROPN (1; 4% instances), VERB (1; 4% instances)