Treebank Statistics: UD_Korean-PUD: POS Tags: PART
There are 98 PART lemmas (4%), 101 PART types (1%) and 496 PART tokens (3%).
Out of 13 observed tags, the rank of PART is: 4 in number of lemmas, 7 in number of types and 8 in number of tokens.
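The lemma, type and token counts above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the helper names `make_line` and `count_tag` and the sample rows are hypothetical, and it assumes that a "type" is a distinct word form compared without case folding.

```python
def make_line(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

# Hypothetical two-sentence sample; these rows are NOT taken from UD_Korean-PUD.
SAMPLE = "\n".join([
    "# text = hypothetical sentence 1",
    make_line(1, "학교", "학교", "NOUN", 3, "obl"),
    make_line(2, "에", "에", "PART", 1, "case"),
    make_line(3, "갔다", "가다", "VERB", 0, "root"),
    "",
    "# text = hypothetical sentence 2",
    make_line(1, "집", "집", "NOUN", 3, "obl"),
    make_line(2, "에", "에", "PART", 1, "case"),
    make_line(3, "왔다", "오다", "VERB", 0, "root"),
    make_line(4, "고", "고", "PART", 3, "case"),
    "",
])

def count_tag(conllu_text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens

part_counts = count_tag(SAMPLE, "PART")
print(part_counts)
```

Running the same function over every tag and sorting the results would give the per-tag ranks quoted above.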
The 10 most frequent PART lemmas: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이
The 10 most frequent PART types: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이
The 10 most frequent ambiguous lemmas: 도 (PART 26, NOUN 2), 가 (PART 20, NOUN 1), 이 (PRON 23, PART 16), 과 (PART 10, NOUN 1), 만 (PART 5, NUM 1), 있다 (ADJ 8, PART 4), 열기 (NOUN 1, PART 1)
The 10 most frequent ambiguous types: 고 (PART 38, AUX 1), 도 (PART 26, NOUN 1), 가 (PART 20, AUX 7), 와 (PART 19, VERB 1), 이 (DET 52, PART 16, NOUN 3, PRON 1, PROPN 1), 만 (DET 12, PART 5, NUM 1), 있다고 (ADJ 8, PART 4), 보다 (ADV 2, PART 2), 들 (VERB 3, PART 1), 뿐 (NOUN 2, PART 1)
- 고
- 도
- 가
- 와
- 이
  - DET 52: 이 부분에서 게임과 우리 일상 생활 사이의 유사점을 찾을 수 있습니다 .
  - PART 16: 경찰 대변인은 “ 몇 마디 주고 받은 “ 후 “ 언쟁 “ 이 벌어졌지만 부상자는 없었다고 연합 통신에 밝혔다 .
  - NOUN 3: 카리브 해는 1492 년까지 유라시아 사람들에게 는 알려지지 않은 곳 이었는데 이 때 크리스토퍼 콜럼버스 가 아시아 항로 탐색을 위한 목적으로 처음으로 카리브 해를 항해했다 .
  - PRON 1: 원래 기단은 1950 년대에 일기 예보를 위해 사용되었지만 기상학자들은 1973 년 이 아이디어를 기반으로 종관 기후학 이란 분야를 만들기 시작했다 .
  - PROPN 1: 이 박사는 “ 아고라 ( Agora ) 는 초대를 받아야만 입장할 수 있었지만 이 시장의 대부분은 검색 방법만 알면 쉽게 접근할 수 있다 ” 라고 덧붙였다 .
- 만
- 있다고
- 보다
- 들
- 뿐
Morphology
The form / lemma ratio of PART is 1.030612 (the average of all parts of speech is 3.165468).
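The reported ratio is consistent with dividing the number of PART types (101) by the number of PART lemmas (98) given at the top of this page. That reading is an inference from the figures, not something the page states, so the following is only a sanity check under that assumption:

```python
# Counts copied from the summary at the top of the page.
part_types = 101
part_lemmas = 98

# Assumed definition: form / lemma ratio = distinct forms / distinct lemmas.
ratio = part_types / part_lemmas
ratio_text = f"{ratio:.6f}"
print(ratio_text)
```

A value this close to 1 means that almost every PART lemma occurs in a single surface form.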
The 1st highest number of forms (2) was observed with the lemma “되기”: 되기를, 되기에.
The 2nd highest number of forms (2) was observed with the lemma “됨”: 됨으로써, 됨은.
The 3rd highest number of forms (2) was observed with the lemma “쓰기”: 쓰기도, 쓰기로.
PART occurs with 8 features: Polite (282; 57% instances), Case (167; 34% instances), VerbForm (72; 15% instances), Mood (40; 8% instances), Tense (22; 4% instances), Form (18; 4% instances), Voice (3; 1% instances), PronType (2; 0% instances)
PART occurs with 15 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Form=Aux, Form=Compl, Mood=Imp, Mood=Ind, Polite=Form, PronType=Int, Tense=Fut, Tense=Past, VerbForm=Fin, VerbForm=Ger, Voice=Cau, Voice=Pass
PART occurs with 25 feature combinations.
The most frequent feature combination is _ (155 tokens).
Examples: 는, 고, 도, 라고, 만, 까지, 이라고, 밖에, 들, 마다
Relations
PART nodes are attached to their parents using 9 different relations: case (404; 81% instances), advcl (61; 12% instances), aux (9; 2% instances), ccomp (9; 2% instances), root (4; 1% instances), conj (3; 1% instances), csubj (3; 1% instances), advmod (2; 0% instances), nsubj (1; 0% instances)
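The percentages in the relation list can be checked against the 496 PART tokens with a few lines of arithmetic. The sketch below assumes that each percentage is the relation count divided by the total and rounded to the nearest integer; the page does not state its rounding rule.

```python
# Counts copied from the relation list above.
relation_counts = {
    "case": 404, "advcl": 61, "aux": 9, "ccomp": 9, "root": 4,
    "conj": 3, "csubj": 3, "advmod": 2, "nsubj": 1,
}

# The counts should add up to the number of PART tokens.
total = sum(relation_counts.values())

# Assumed rounding: nearest whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in percentages.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

Under that assumption the relations with one or two instances round down to 0%, which matches the figures shown for advmod and nsubj.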
Parents of PART nodes belong to 8 different parts of speech: NOUN (257; 52% instances), PROPN (172; 35% instances), VERB (37; 7% instances), ADJ (11; 2% instances), PART (8; 2% instances), PRON (6; 1% instances), (4; 1% instances), ADV (1; 0% instances)
411 (83%) PART nodes are leaves.
20 (4%) PART nodes have one child.
26 (5%) PART nodes have two children.
39 (8%) PART nodes have three or more children.
The highest child degree of a PART node is 7.
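The leaf and child-degree figures above group nodes by how many dependents they have. A minimal sketch of that bucketing, given only the HEAD column of a sentence, might look like this. The function name `child_degree_buckets` and the six-node tree are hypothetical, and the sketch counts every node, whereas the page restricts the tally to PART nodes (which would need an extra filter on the UPOS column).

```python
from collections import Counter

def child_degree_buckets(heads):
    """Bucket nodes by number of children.

    heads[i] is the 1-based head index of node i + 1; 0 marks the root.
    Returns a dict with keys "0", "1", "2" and "3+" (only non-empty buckets).
    """
    # Number of children per node, ignoring the artificial root (head 0).
    degree = Counter(h for h in heads if h != 0)
    buckets = Counter()
    for node in range(1, len(heads) + 1):
        n = degree[node]
        buckets["3+" if n >= 3 else str(n)] += 1
    return dict(buckets)

# Hypothetical tree: node 2 is the root with children 1, 3 and 4;
# node 4 has children 5 and 6; all other nodes are leaves.
heads = [2, 0, 2, 2, 4, 4]
buckets = child_degree_buckets(heads)
max_degree = max(Counter(h for h in heads if h != 0).values())
print(buckets, max_degree)
```

The four reported counts (411, 20, 26 and 39) add up to the 496 PART tokens, so every PART node falls into exactly one bucket.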
Children of PART nodes are attached using 15 different relations: advmod (51; 23% instances), obj (40; 18% instances), advcl (39; 17% instances), nsubj (35; 16% instances), aux (25; 11% instances), punct (14; 6% instances), case (5; 2% instances), compound:lvc (4; 2% instances), ccomp (3; 1% instances), iobj (3; 1% instances), compound (1; 0% instances), conj (1; 0% instances), csubj (1; 0% instances), nsubj:pass (1; 0% instances), obl (1; 0% instances)
Children of PART nodes belong to 9 different parts of speech: NOUN (110; 49% instances), VERB (41; 18% instances), ADV (17; 8% instances), PUNCT (14; 6% instances), PROPN (12; 5% instances), PRON (11; 5% instances), ADJ (10; 4% instances), PART (8; 4% instances), NUM (1; 0% instances)