
This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: DET

There are 62 DET lemmas (0%), 61 DET types (0%) and 2067 DET tokens (2%). Out of 14 observed tags, the rank of DET is: 10 in number of lemmas, 12 in number of types and 9 in number of tokens.
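The three counts above (lemmas, types, tokens) can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the function name `det_counts` and the tiny sample sentences are hypothetical and made up for illustration. It assumes the standard ten tab-separated CoNLL-U columns, with FORM, LEMMA and UPOS in columns 2 to 4.

```python
def det_counts(conllu_text, upos="DET"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        tok_id, form, lemma, tag = cols[0], cols[1], cols[2], cols[3]
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in tok_id or "." in tok_id:
            continue
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical sample (NOT taken from UD_Korean-KSL), three short sentences.
_rows = [
    ["1", "그", "그", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "사람", "사람", "NOUN", "_", "_", "0", "root", "_", "_"],
    [],
    ["1", "이", "이", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "책", "책", "NOUN", "_", "_", "0", "root", "_", "_"],
    [],
    ["1", "그", "그", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "집", "집", "NOUN", "_", "_", "0", "root", "_", "_"],
]
SAMPLE = "\n".join("\t".join(r) for r in _rows)

lemma_count, type_count, token_count = det_counts(SAMPLE)
print(lemma_count, type_count, token_count)
```

Run over the real treebank files, a count of this kind would be expected to yield the 62 / 61 / 2067 figures reported above, assuming the page applies no extra normalization to forms or lemmas.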

The 10 most frequent DET lemmas: 그, 이, 어떤, 이런, 여러, 다른, 그런, 모든, 어느, 몇

The 10 most frequent DET types: 그, 이, 어떤, 이런, 여러, 다른, 그런, 모든, 어느, 몇

The 10 most frequent ambiguous lemmas: 그 (DET 726, PRON 2, ADV 1), 이 (DET 515, ADP 6, ADJ 2, AUX 2, NUM 2, NOUN 1, VERB 1), 한 (NUM 166, DET 25, SCONJ 3), 각 (DET 9, ADV 1, NOUN 1), 두 (NUM 112, DET 7, ADP 2, ADV 1), 새 (DET 6, NOUN 1), 저 (PRON 14, DET 6, ADV 1), 둘째 (NUM 26, DET 5, ADV 1), 그것 (DET 4, NOUN 1, PRON 1), 이러하+ㄴ (ADJ 12, DET 4)

The 10 most frequent ambiguous types: 그 (DET 726, ADV 2, PRON 2), 이 (DET 516, ADP 6, NUM 3, NOUN 2), 다른 (ADJ 401, DET 91, VERB 1), 그런 (DET 68, ADJ 2), 한 (NUM 167, VERB 64, DET 25, AUX 11, X 10, SCONJ 3, ADV 2, NOUN 1), 아무 (DET 18, ADV 1), 각 (DET 9, ADV 1, NOUN 1), 두 (NUM 112, DET 7, ADP 2, ADV 1), 새 (DET 6, NOUN 1), 저 (PRON 14, DET 6, ADV 1)

Morphology

The form / lemma ratio of DET is 0.983871 (the average of all parts of speech is 1.007876).
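The reported ratio is consistent with dividing the 61 DET types by the 62 DET lemmas given at the top of this page. The short check below works under that assumption; the variable names are illustrative only.

```python
# Counts taken from the summary at the top of this page.
det_lemmas = 62
det_types = 61  # distinct word forms

# Assumption: form / lemma ratio = distinct forms / distinct lemmas.
ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # prints 0.983871
```

Under this reading, a ratio below 1 means there are fewer distinct forms than lemmas, so at least one surface form is shared by more than one lemma.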

The 1st highest number of forms (1) was observed with the lemma “"+이”: "이.

The 2nd highest number of forms (1) was observed with the lemma “각”: 각.

The 3rd highest number of forms (1) was observed with the lemma “그”: 그.

DET occurs with 1 feature: Typo (12; 1% instances)

DET occurs with 1 feature-value pair: Typo=Yes

DET occurs with 2 feature combinations. The most frequent feature combination is _ (2055 tokens). Examples: 그, 이, 어떤, 이런, 여러, 다른, 그런, 모든, 어느, 몇

Relations

DET nodes are attached to their parents using 9 different relations: det (1867; 90% instances), amod (152; 7% instances), obl (17; 1% instances), nmod (12; 1% instances), nsubj (7; 0% instances), acl (4; 0% instances), flat (4; 0% instances), appos (3; 0% instances), nmod:poss (1; 0% instances)
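The percentages in the relation list can be recomputed from the raw counts. The sketch below assumes the shares are rounded to the nearest integer, which reproduces the figures shown; the dictionary name is illustrative.

```python
# Relation counts for DET nodes, copied from the list above.
det_relations = {
    "det": 1867, "amod": 152, "obl": 17, "nmod": 12, "nsubj": 7,
    "acl": 4, "flat": 4, "appos": 3, "nmod:poss": 1,
}

total = sum(det_relations.values())  # should equal the 2067 DET tokens

# Assumption: each share is rounded to the nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in det_relations.items()}

for rel, n in det_relations.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

The counts sum to 2067, matching the DET token total, so every DET token is accounted for by exactly one incoming relation.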

Parents of DET nodes belong to 7 different parts of speech: NOUN (1543; 75% instances), ADV (455; 22% instances), ADJ (34; 2% instances), VERB (29; 1% instances), PRON (3; 0% instances), NUM (2; 0% instances), DET (1; 0% instances)

2031 (98%) DET nodes are leaves.

27 (1%) DET nodes have one child.

7 (0%) DET nodes have two children.

2 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.
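The child-degree figures can be cross-checked against the token total and the number of children. Because the highest child degree is 3, the two nodes with "three or more" children must have exactly three each. The sketch below uses illustrative names and only numbers stated on this page.

```python
# Number of DET nodes by child count (degree), from the lines above.
# The "three or more" bucket is exactly 3 because the maximum degree is 3.
degree_counts = {0: 2031, 1: 27, 2: 7, 3: 2}

total_nodes = sum(degree_counts.values())
total_children = sum(degree * n for degree, n in degree_counts.items())

print(total_nodes)     # prints 2067
print(total_children)  # prints 47
```

The 2067 nodes match the DET token count, and the 47 children match the sum of the child-relation counts listed on this page.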

Children of DET nodes are attached using 9 different relations: flat (22; 47% instances), case (15; 32% instances), obl (3; 6% instances), nsubj (2; 4% instances), advcl (1; 2% instances), det (1; 2% instances), goeswith (1; 2% instances), nmod (1; 2% instances), obj (1; 2% instances)

Children of DET nodes belong to 7 different parts of speech: NOUN (17; 36% instances), ADP (16; 34% instances), ADV (9; 19% instances), PRON (2; 4% instances), DET (1; 2% instances), VERB (1; 2% instances), X (1; 2% instances)