Treebank Statistics: UD_Korean: POS Tags: PUNCT
There are 28 PUNCT lemmas (74%), 53 PUNCT types (0%) and 6634 PUNCT tokens (9%).
Out of 11 observed tags, the rank of PUNCT is: 1 in number of lemmas, 8 in number of types and 4 in number of tokens.
The 10 most frequent PUNCT lemmas: ., _, !, ?, ), ‘, .., …, %, ,
The 10 most frequent PUNCT types: ., ,, !, ?, ),, ), ‘, .., …, %
The 10 most frequent ambiguous lemmas: _ (NOUN 32099, VERB 18517, ADV 11605, ADJ 2715, PUNCT 1972, ADP 835, PRON 677, DET 539, NUM 532, CCONJ 176, X 23)
The 10 most frequent ambiguous types: ~ (PUNCT 6, X 1), 둘러보고 (VERB 2, PUNCT 1), 미니홈피 (NOUN 1, PUNCT 1), 부처 (NOUN 1, PUNCT 1), 의미를 (NOUN 6, PUNCT 1), 타고 (VERB 31, PUNCT 1), 티아라 (NOUN 9, PUNCT 1)
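A type counts as ambiguous here when the same word form occurs with PUNCT and with at least one other tag. The following is a minimal sketch of how such a list could be derived from CoNLL-U data. It is not the script that produced this page. The helper names (read_tokens, ambiguous_types) and the toy rows are hypothetical, chosen only for illustration.

```python
from collections import Counter, defaultdict

def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]

def ambiguous_types(conllu_text, upos="PUNCT"):
    """Return forms seen with `upos` and at least one other tag, with counts."""
    tags_by_form = defaultdict(Counter)
    for form, _lemma, tag in read_tokens(conllu_text):
        tags_by_form[form][tag] += 1
    return {
        form: dict(tags)
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    }

# Toy rows (hypothetical, not taken from the treebank).
ROWS = [
    ("1", "타고", "_", "VERB", "_", "_", "0", "root", "_", "_"),
    ("2", "~", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ("3", "타고", "_", "PUNCT", "_", "_", "1", "punct", "_", "_"),
    ("4", "~", "_", "X", "_", "_", "1", "dep", "_", "_"),
    ("5", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

print(ambiguous_types(SAMPLE))
```

In the toy data, "." is excluded because it only ever occurs as PUNCT.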
Morphology
The form / lemma ratio of PUNCT is 1.892857 (the average of all parts of speech is 963.631579).
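The reported ratio appears to be the number of PUNCT types divided by the number of PUNCT lemmas. This short check uses only the counts given above; the variable names are illustrative.

```python
# Counts taken from the statistics above.
punct_types = 53   # distinct PUNCT word forms
punct_lemmas = 28  # distinct PUNCT lemmas

ratio = punct_types / punct_lemmas
print(f"{ratio:.6f}")  # 1.892857
```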
The 1st highest number of forms (31) was observed with the lemma “_”: !, “,, %),, %,, ‘’,, ‘,, ()’,, )),, ),, )., )》,, ,, -, ., /SMS, ?, ~, •, 〉,, 》,, 」,, 것이였다, 둘러보고, 미니홈피, 복원, 부처, 의미를, 인권, 타고, 티아라, 효민.
The 2nd highest number of forms (1) was observed with the lemma “!”: !.
The 3rd highest number of forms (1) was observed with the lemma “””: “.
PUNCT does not occur with any features.
Relations
PUNCT nodes are attached to their parents using 2 different relations: punct (6633; 100% instances), root (1; 0% instances)
Parents of PUNCT nodes belong to 10 different parts of speech: VERB (4347; 66% instances), NOUN (1444; 22% instances), ADJ (661; 10% instances), ADV (138; 2% instances), NUM (35; 1% instances), CCONJ (3; 0% instances), PRON (3; 0% instances), ADP (1; 0% instances), PUNCT (1; 0% instances), no part of speech, i.e. the artificial root (1; 0% instances)
6633 (100%) PUNCT nodes are leaves.
0 (0%) PUNCT nodes have one child.
1 (0%) PUNCT node has two children.
The highest child degree of a PUNCT node is 2.
Children of PUNCT nodes are attached using 2 different relations: dep (1; 50% instances), punct (1; 50% instances)
Children of PUNCT nodes belong to 2 different parts of speech: PUNCT (1; 50% instances), VERB (1; 50% instances)
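The relation, parent and child-degree counts above can be reproduced from the HEAD and DEPREL columns of a CoNLL-U file. The following is a minimal sketch under stated assumptions, not the tool that generated this page. The function names (parse_sentences, upos_relation_stats), the "ROOT" label for the artificial root, and the toy sentences are hypothetical.

```python
from collections import Counter

def parse_sentences(conllu_text):
    """Split CoNLL-U text into sentences; each is a list of 10-column rows."""
    sentences, current = [], []
    for line in conllu_text.splitlines():
        if not line.strip():
            # A blank line ends the current sentence.
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep regular tokens only (no multiword ranges or empty nodes).
        if len(cols) == 10 and cols[0].isdigit():
            current.append(cols)
    if current:
        sentences.append(current)
    return sentences

def upos_relation_stats(conllu_text, upos="PUNCT"):
    """Count incoming relations, parent tags and child degrees for `upos` nodes."""
    deprels, parent_tags, degrees = Counter(), Counter(), Counter()
    for sent in parse_sentences(conllu_text):
        tag_of = {row[0]: row[3] for row in sent}
        n_children = Counter(row[6] for row in sent)  # HEAD id -> child count
        for row in sent:
            if row[3] != upos:
                continue
            deprels[row[7]] += 1
            # HEAD 0 is the artificial root, which has no UPOS tag.
            parent_tags[tag_of.get(row[6], "ROOT")] += 1
            degrees[n_children[row[0]]] += 1
    return deprels, parent_tags, degrees

# Two toy sentences (hypothetical, not taken from the treebank).
SENT1 = [
    ("1", "가다", "_", "VERB", "_", "_", "0", "root", "_", "_"),
    ("2", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
SENT2 = [
    ("1", "~", "_", "PUNCT", "_", "_", "0", "root", "_", "_"),
    ("2", "보고", "_", "VERB", "_", "_", "1", "dep", "_", "_"),
    ("3", ".", ".", "PUNCT", "_", "_", "1", "punct", "_", "_"),
]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in (SENT1, SENT2)
)

deprels, parent_tags, degrees = upos_relation_stats(SAMPLE)
print(dict(deprels), dict(parent_tags), dict(degrees))
```

The second toy sentence mirrors the single exceptional case reported above: a PUNCT node attached as root with two children, one via dep and one via punct.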