home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-Kaist: POS Tags: PART

There are 168 PART lemmas (0%), 167 PART types (0%) and 268 PART tokens (0%). Out of 17 observed tags, the rank of PART is: 11 in number of lemmas, 12 in number of types and 15 in number of tokens.

The 10 most frequent PART lemmas: 는, 무+의미+하+ㄴ, 유+의미+하+ㄴ, 제+2+차, 라는, 제+1+의, 제+2+의, 반+소모사, 제+1, 제+3+세계

The 10 most frequent PART types: 는, 제2차, 무의미한, 유의미한, 라는, 제1의, 제2의, 반소모사, 제1, 제3세계

The 10 most frequent ambiguous lemmas: 는 (ADP 69, PART 30), 라는 (ADP 7, PART 7), 들+은 (PART 2, VERB 2, NOUN 1), 하+ㄴ (VERB 173, PART 2), 하+는 (VERB 394, PART 2, PROPN 2), 대 (NOUN 31, PART 1, PROPN 1), 되+어+가+ㅆ+다 (PART 1, VERB 1), 들 (NOUN 2, PART 1), 들+을 (NOUN 1, PART 1, VERB 1), 시키+고 (CCONJ 3, PART 1)

The 10 most frequent ambiguous types: 는 (ADP 69, PART 30, VERB 1), 무의미한 (PART 8, VERB 6), 유의미한 (PART 8, VERB 2), 라는 (AUX 55, ADP 8, PART 7), 무의미하게 (PART 3, VERB 2, SCONJ 1), 불가능한 (VERB 19, PART 3, ADJ 1), 들은 (VERB 12, PART 2, NOUN 1), 하는 (VERB 394, AUX 198, PART 2, PROPN 2), 한 (NUM 577, VERB 173, ADJ 69, NOUN 46, AUX 41, PROPN 32, DET 4, PART 2), 대 (NOUN 31, PART 1, PROPN 1, SCONJ 1)

Morphology

The form / lemma ratio of PART is 0.994048 (the average of all parts of speech is 0.998034).

The 1st highest number of forms (1) was observed with the lemma “고+엔트로피+의”: 고엔트로피의.

The 2nd highest number of forms (1) was observed with the lemma “구+지배+계급+의”: 구지배계급의.

The 3rd highest number of forms (1) was observed with the lemma “는”: 는.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 16 different relations: case (64; 24% instances), compound (57; 21% instances), acl (38; 14% instances), nmod (27; 10% instances), amod (15; 6% instances), conj (15; 6% instances), root (11; 4% instances), nsubj (8; 3% instances), obl (7; 3% instances), obj (6; 2% instances), advcl (5; 2% instances), dislocated (5; 2% instances), advmod (3; 1% instances), ccomp (3; 1% instances), dep (2; 1% instances), xcomp (2; 1% instances)

Parents of PART nodes belong to 10 different parts of speech: NOUN (103; 38% instances), VERB (73; 27% instances), ADV (44; 16% instances), CCONJ (13; 5% instances), (11; 4% instances), SCONJ (8; 3% instances), NUM (5; 2% instances), PROPN (5; 2% instances), ADJ (4; 1% instances), PART (2; 1% instances)

208 (78%) PART nodes are leaves.

31 (12%) PART nodes have one child.

10 (4%) PART nodes have two children.

19 (7%) PART nodes have three or more children.

The highest child degree of a PART node is 5.

Children of PART nodes are attached using 19 different relations: punct (15; 13% instances), dislocated (14; 12% instances), dep (13; 11% instances), advmod (9; 8% instances), conj (9; 8% instances), advcl (8; 7% instances), nsubj (8; 7% instances), ccomp (7; 6% instances), obj (7; 6% instances), cc (5; 4% instances), nmod (5; 4% instances), amod (4; 3% instances), obl (4; 3% instances), aux (3; 3% instances), acl (2; 2% instances), compound (1; 1% instances), iobj (1; 1% instances), nummod (1; 1% instances), xcomp (1; 1% instances)

Children of PART nodes belong to 13 different parts of speech: NOUN (46; 39% instances), ADV (17; 15% instances), PUNCT (15; 13% instances), VERB (13; 11% instances), CCONJ (8; 7% instances), SCONJ (5; 4% instances), AUX (3; 3% instances), ADJ (2; 2% instances), PART (2; 2% instances), PROPN (2; 2% instances), X (2; 2% instances), NUM (1; 1% instances), PRON (1; 1% instances)