This page pertains to UD version 2.

Treebank Statistics: UD_Korean-PUD: POS Tags: PART

There are 91 PART lemmas (3%), 94 PART types (1%) and 478 PART tokens (3%). Out of 13 observed tags, the rank of PART is: 4 in number of lemmas, 8 in number of types and 9 in number of tokens.

The 10 most frequent PART lemmas: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이

The 10 most frequent PART types: 는, 의, 고, 에, 도, 라고, 가, 와, 에서, 이

The most frequent ambiguous lemmas (6 are listed): 도 (PART 26, NOUN 2), 가 (PART 20, NOUN 1), 이 (AUX 436, PRON 23, PART 16), 과 (PART 10, NOUN 1), 만 (PART 5, NUM 1), 열기 (NOUN 1, PART 1)

The 10 most frequent ambiguous types: 고 (PART 38, AUX 1), 도 (PART 26, NOUN 1), 가 (PART 20, AUX 7), 와 (PART 19, VERB 1), 이 (DET 52, PART 16, NOUN 3, PRON 1, PROPN 1), 만 (DET 12, PART 5, NUM 1), 보다 (ADV 2, PART 2), 있음을 (PART 2, AUX 1), 들 (VERB 3, PART 1), 뿐 (NOUN 2, PART 1)

Morphology

The form / lemma ratio of PART is 1.032967 (the average of all parts of speech is 3.181543).
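The reported ratio is consistent with dividing the number of PART types (94) by the number of PART lemmas (91). The sketch below only reproduces that arithmetic; the function name is hypothetical, and whether the statistics generator computes the ratio in exactly this way is an assumption.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    return n_types / n_lemmas

# Counts taken from the PART summary on this page: 94 types, 91 lemmas.
ratio = form_lemma_ratio(94, 91)
print(f"{ratio:.6f}")  # prints 1.032967
```

A ratio this close to 1 means that almost every PART lemma occurs in a single surface form.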

The 1st highest number of forms (2) was observed with the lemma “되기”: 되기를, 되기에.

The 2nd highest number of forms (2) was observed with the lemma “됨”: 됨으로써, 됨은.

The 3rd highest number of forms (2) was observed with the lemma “쓰기”: 쓰기도, 쓰기로.

PART occurs with 8 features: Polite (278; 58% instances), Case (166; 35% instances), VerbForm (54; 11% instances), Mood (26; 5% instances), Tense (19; 4% instances), Form (18; 4% instances), Voice (3; 1% instances), PronType (2; 0% instances)

PART occurs with 15 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Form=Aux, Form=Compl, Mood=Imp, Mood=Ind, Polite=Form, PronType=Int, Tense=Fut, Tense=Past, VerbForm=Fin, VerbForm=Ger, Voice=Cau, Voice=Pass

PART occurs with 25 feature combinations. The most frequent feature combination is _ (155 tokens). Examples: 는, 고, 도, 라고, 만, 까지, 이라고, 밖에, 들, 마다
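The three feature counts above (features, feature-value pairs, feature combinations) can be derived from the FEATS column of a CoNLL-U file. The following is a minimal sketch, assuming the FEATS strings of the PART tokens have already been collected into a list; the function name and the toy input are hypothetical and are not taken from the treebank.

```python
from collections import Counter

def feature_stats(feats_column):
    """Count features, feature-value pairs and whole FEATS combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1          # the full string, "_" meaning no features
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _sep, _value = pair.partition("=")
            features[name] += 1     # e.g. Case
            pairs[pair] += 1        # e.g. Case=Nom
    return features, pairs, combos

# Toy FEATS values, invented for illustration only.
toy_feats = ["_", "Case=Nom", "Case=Acc|Polite=Form", "_"]
features, pairs, combos = feature_stats(toy_feats)
print(len(features), len(pairs), len(combos))
print(combos.most_common(1))
```

Note that the empty value `_` counts as a combination of its own, which is how it can be reported as the most frequent one.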

Relations

PART nodes are attached to their parents using 10 different relations: case (404; 85% instances), advcl (49; 10% instances), ccomp (8; 2% instances), root (7; 1% instances), csubj (3; 1% instances), advmod (2; 0% instances), conj (2; 0% instances), acl:relcl (1; 0% instances), dep (1; 0% instances), nsubj (1; 0% instances)

Parents of PART nodes belong to 8 different parts of speech: NOUN (246; 51% instances), PROPN (172; 36% instances), VERB (30; 6% instances), ADJ (11; 2% instances), [no parent: root nodes] (7; 1% instances), PRON (6; 1% instances), PART (5; 1% instances), ADV (1; 0% instances)
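The relation and parent counts above can be reproduced by reading the HEAD and DEPREL columns of a CoNLL-U file. The sketch below uses only the standard library; the helper names, the toy sentence, and the convention of recording an empty parent tag for root-attached nodes are all assumptions made for illustration, not the statistics generator's own code.

```python
from collections import Counter

# Toy CoNLL-U fragment, invented for illustration (not from UD_Korean-PUD).
SAMPLE = (
    "# text = toy example\n"
    "1\t고양이\t고양이\tNOUN\t_\t_\t3\tnsubj\t_\t_\n"
    "2\t가\t가\tPART\t_\tCase=Nom\t1\tcase\t_\t_\n"
    "3\t잔다\t자다\tVERB\t_\t_\t0\troot\t_\t_\n"
)

def read_sentences(text):
    """Parse CoNLL-U text into lists of token dicts (basic tokens only)."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():            # blank line ends a sentence
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):        # sentence-level comment
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue                    # skip multiword ranges and empty nodes
        current.append({"id": int(cols[0]), "upos": cols[3],
                        "head": int(cols[6]), "deprel": cols[7]})
    if current:
        sentences.append(current)
    return sentences

def attachment_stats(sentences, upos="PART"):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_tags = Counter(), Counter()
    for sent in sentences:
        tag_of = {tok["id"]: tok["upos"] for tok in sent}
        for tok in sent:
            if tok["upos"] != upos:
                continue
            relations[tok["deprel"]] += 1
            # HEAD 0 has no token, so root nodes get an empty parent tag.
            parent_tags[tag_of.get(tok["head"], "")] += 1
    return relations, parent_tags

relations, parent_tags = attachment_stats(read_sentences(SAMPLE))
print(relations)
print(parent_tags)
```

On this page the two breakdowns each sum to 478, the number of PART tokens, because every token has exactly one incoming relation and at most one parent.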

402 (84%) PART nodes are leaves.

20 (4%) PART nodes have one child.

24 (5%) PART nodes have two children.

32 (7%) PART nodes have three or more children.

The highest child degree of a PART node is 6.
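The child-degree breakdown above (leaves, one child, two children, three or more) amounts to counting, for each PART node, how many tokens name it as their head. A minimal sketch follows, assuming each sentence is given as a list of (id, upos, head) tuples; that representation, the function name, and the toy data are hypothetical.

```python
from collections import Counter

def child_degree_buckets(sentences, upos="PART"):
    """Bucket nodes tagged `upos` by number of children; also return the maximum."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of dependents per head id within this sentence.
        n_children = Counter(head for _tok_id, _tag, head in sent)
        for tok_id, tag, _head in sent:
            if tag != upos:
                continue
            degree = n_children[tok_id]
            max_degree = max(max_degree, degree)
            if degree == 0:
                buckets["leaf"] += 1
            elif degree == 1:
                buckets["one"] += 1
            elif degree == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets, max_degree

# Two toy sentences, invented for illustration only.
toy = [
    [(1, "NOUN", 3), (2, "PART", 1), (3, "VERB", 0)],   # PART is a leaf
    [(1, "NOUN", 2), (2, "PART", 0), (3, "ADV", 2)],    # PART has two children
]
buckets, max_degree = child_degree_buckets(toy)
print(dict(buckets), max_degree)

# Sanity check on the figures reported on this page: the buckets cover all PART tokens.
assert 402 + 20 + 24 + 32 == 478
```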

Children of PART nodes are attached using 14 different relations: advcl (37; 19% instances), obj (35; 18% instances), obl (35; 18% instances), nsubj (30; 16% instances), advmod (16; 8% instances), punct (14; 7% instances), aux (9; 5% instances), case (3; 2% instances), ccomp (3; 2% instances), compound:lvc (3; 2% instances), iobj (3; 2% instances), conj (1; 1% instances), csubj (1; 1% instances), nsubj:pass (1; 1% instances)

Children of PART nodes belong to 11 different parts of speech: NOUN (105; 55% instances), VERB (22; 12% instances), ADV (14; 7% instances), PUNCT (14; 7% instances), AUX (9; 5% instances), ADJ (7; 4% instances), PRON (7; 4% instances), PROPN (6; 3% instances), PART (5; 3% instances), CCONJ (1; 1% instances), NUM (1; 1% instances)