home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: DET

There are 85 DET lemmas (0%), 84 DET types (0%) and 2402 DET tokens (2%). Out of 16 observed tags, the rank of DET is: 8 in number of lemmas, 10 in number of types and 9 in number of tokens.

The 10 most frequent DET lemmas: 그, 이, 어떤, 이런, 여러, 그런, 다른, 모든, 어느, 몇

The 10 most frequent DET types: 그, 이, 어떤, 이런, 여러, 그런, 다른, 모든, 어느, 몇

The 10 most frequent ambiguous lemmas: 그 (DET 847, PRON 2, ADV 1), 이 (DET 549, ADP 9, ADJ 2, AUX 2, NUM 2, NOUN 1, VERB 1), 한 (NUM 193, DET 26, SCONJ 3), 각 (DET 13, ADV 1, NOUN 1), 두 (NUM 129, DET 8, ADP 2, ADV 1), 새 (DET 7, NOUN 1), 첫 (NUM 28, DET 7, ADV 1), 저 (PRON 16, DET 6, ADV 1), 둘째 (NUM 26, DET 5, ADV 1), 전 (DET 5, ADP 4)

The 10 most frequent ambiguous types: 그 (DET 847, ADV 2, PRON 2, CCONJ 1), 이 (DET 550, ADP 9, NUM 3, NOUN 2), 그런 (DET 98, ADJ 2), 다른 (ADJ 513, DET 91, VERB 1), 한 (NUM 195, VERB 74, DET 26, X 12, AUX 11, SCONJ 3, ADV 2, NOUN 1), 아무 (DET 20, ADV 1), 각 (DET 13, ADV 1, NOUN 1), 두 (NUM 129, DET 8, ADP 2, ADV 1), 새 (DET 7, NOUN 1), 첫 (NUM 29, DET 7, ADV 1)

Morphology

The form / lemma ratio of DET is 0.988235 (the average of all parts of speech is 1.008073).

The 1st highest number of forms (1) was observed with the lemma “”+이”: “이.

The 2nd highest number of forms (1) was observed with the lemma “각”: 각.

The 3rd highest number of forms (1) was observed with the lemma “그”: 그.

DET occurs with 2 features: PronType (2402; 100% instances), Typo (23; 1% instances)

DET occurs with 2 feature-value pairs: PronType=Dem, Typo=Yes

DET occurs with 2 feature combinations. The most frequent feature combination is PronType=Dem (2379 tokens). Examples: 그, 이, 어떤, 이런, 여러, 그런, 다른, 모든, 어느, 몇

Relations

DET nodes are attached to their parents using 12 different relations: det (2162; 90% instances), amod (161; 7% instances), obl (25; 1% instances), nmod (13; 1% instances), nsubj (11; 0% instances), root (11; 0% instances), acl (5; 0% instances), nmod:poss (5; 0% instances), flat (4; 0% instances), appos (3; 0% instances), advcl (1; 0% instances), obj (1; 0% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (1779; 74% instances), ADV (522; 22% instances), VERB (44; 2% instances), ADJ (37; 2% instances), (11; 0% instances), PRON (5; 0% instances), NUM (2; 0% instances), AUX (1; 0% instances), DET (1; 0% instances)

2361 (98%) DET nodes are leaves.

29 (1%) DET nodes have one child.

10 (0%) DET nodes have two children.

2 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 13 different relations: flat (21; 38% instances), case (14; 25% instances), nsubj (6; 11% instances), obl (3; 5% instances), punct (3; 5% instances), acl (1; 2% instances), advcl (1; 2% instances), amod (1; 2% instances), det (1; 2% instances), goeswith (1; 2% instances), nmod (1; 2% instances), nmod:poss (1; 2% instances), obj (1; 2% instances)

Children of DET nodes belong to 9 different parts of speech: NOUN (21; 38% instances), ADP (15; 27% instances), ADV (9; 16% instances), PUNCT (3; 5% instances), ADJ (2; 4% instances), PRON (2; 4% instances), DET (1; 2% instances), VERB (1; 2% instances), X (1; 2% instances)