home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-PUD: POS Tags: PART

There are 98 PART lemmas (4%), 101 PART types (1%) and 496 PART tokens (3%). Out of 13 observed tags, the rank of PART is: 4 in number of lemmas, 7 in number of types and 8 in number of tokens.

The 10 most frequent PART lemmas: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이

The 10 most frequent PART types: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이

The 10 most frequent ambiguous lemmas: 도 (PART 26, NOUN 2), 가 (PART 20, NOUN 1), 이 (PRON 23, PART 16), 과 (PART 10, NOUN 1), 만 (PART 5, NUM 1), 있다 (ADJ 8, PART 4), 열기 (NOUN 1, PART 1)

The 10 most frequent ambiguous types: 고 (PART 38, AUX 1), 도 (PART 26, NOUN 1), 가 (PART 20, AUX 7), 와 (PART 19, VERB 1), 이 (DET 52, PART 16, NOUN 3, PRON 1, PROPN 1), 만 (DET 12, PART 5, NUM 1), 있다고 (ADJ 8, PART 4), 보다 (ADV 2, PART 2), 들 (VERB 3, PART 1), 뿐 (NOUN 2, PART 1)

Morphology

The form / lemma ratio of PART is 1.030612 (the average of all parts of speech is 3.165468).

The 1st highest number of forms (2) was observed with the lemma “되기”: 되기를, 되기에.

The 2nd highest number of forms (2) was observed with the lemma “됨”: 됨으로써, 됨은.

The 3rd highest number of forms (2) was observed with the lemma “쓰기”: 쓰기도, 쓰기로.

PART occurs with 8 features: Polite (282; 57% instances), Case (258; 52% instances), VerbForm (72; 15% instances), Mood (40; 8% instances), Tense (22; 4% instances), Form (18; 4% instances), Voice (3; 1% instances), PronType (2; 0% instances)

PART occurs with 17 feature-value pairs: Case=Acc, Case=Advb, Case=Comp, Case=Gen, Case=Nom, Form=Aux, Form=Compl, Mood=Imp, Mood=Ind, Polite=Form, PronType=Int, Tense=Fut, Tense=Past, VerbForm=Fin, VerbForm=Ger, Voice=Cau, Voice=Pass

PART occurs with 28 feature combinations. The most frequent feature combination is _ (155 tokens). Examples: 는, 고, 도, 라고, 만, 까지, 이라고, 밖에, 들, 마다

Relations

PART nodes are attached to their parents using 9 different relations: dep:prt (404; 81% instances), advcl (61; 12% instances), aux (9; 2% instances), ccomp (9; 2% instances), root (4; 1% instances), conj (3; 1% instances), csubj (3; 1% instances), advmod (2; 0% instances), nsubj (1; 0% instances)

Parents of PART nodes belong to 8 different parts of speech: NOUN (257; 52% instances), PROPN (172; 35% instances), VERB (37; 7% instances), ADJ (11; 2% instances), PART (8; 2% instances), PRON (6; 1% instances), (4; 1% instances), ADV (1; 0% instances)

411 (83%) PART nodes are leaves.

20 (4%) PART nodes have one child.

26 (5%) PART nodes have two children.

39 (8%) PART nodes have three or more children.

The highest child degree of a PART node is 7.

Children of PART nodes are attached using 15 different relations: advmod (51; 23% instances), obj (40; 18% instances), advcl (39; 17% instances), nsubj (35; 16% instances), aux (25; 11% instances), punct (14; 6% instances), dep:prt (5; 2% instances), compound:lvc (4; 2% instances), ccomp (3; 1% instances), iobj (3; 1% instances), compound (1; 0% instances), conj (1; 0% instances), csubj (1; 0% instances), nsubj:pass (1; 0% instances), obl (1; 0% instances)

Children of PART nodes belong to 9 different parts of speech: NOUN (110; 49% instances), VERB (41; 18% instances), ADV (17; 8% instances), PUNCT (14; 6% instances), PROPN (12; 5% instances), PRON (11; 5% instances), ADJ (10; 4% instances), PART (8; 4% instances), NUM (1; 0% instances)