home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-Kaist: POS Tags: X

There are 506 X lemmas (0%), 506 X types (1%) and 699 X tokens (0%). Out of 17 observed tags, the rank of X is: 9 in number of lemmas, 9 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: EC, 프롤레타리아, TV, S, the, km, ATM, PCS, 에퀴티, ABC

The 10 most frequent X types: EC, 프롤레타리아, TV, S, the, km, ATM, PCS, 에퀴티, ABC

The 10 most frequent ambiguous lemmas: 프롤레타리아 (X 17, NOUN 16), km (X 7, SYM 4), 부르조아 (NOUN 7, X 4), C (X 2, SYM 1), KGB (PROPN 3, X 2), 꼬뻬라찌프 (PROPN 2, X 1), 더 (ADV 401, X 1), 레코드 (NOUN 1, X 1), 리스크 (NOUN 1, X 1), 삐로제닉 (PROPN 1, X 1)

The 10 most frequent ambiguous types: 프롤레타리아 (X 17, NOUN 16), km (X 7, SYM 4), 부르조아 (NOUN 7, X 4), C (X 2, SYM 1), KGB (PROPN 3, X 2), 꼬뻬라찌프 (PROPN 2, X 1), 더 (ADV 401, X 1), 레코드 (NOUN 1, X 1), 리스크 (NOUN 1, X 1), 삐로제닉 (PROPN 1, X 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 0.998034).

The 1st highest number of forms (1) was observed with the lemma “230+b”: 230b.

The 2nd highest number of forms (1) was observed with the lemma “3+D”: 3D.

The 3rd highest number of forms (1) was observed with the lemma “A”: A.

X does not occur with any features.

Relations

X nodes are attached to their parents using 13 different relations: appos (253; 36% instances), flat (182; 26% instances), compound (170; 24% instances), conj (32; 5% instances), dislocated (12; 2% instances), dep (11; 2% instances), root (11; 2% instances), nmod (10; 1% instances), obj (7; 1% instances), advcl (5; 1% instances), nsubj (4; 1% instances), csubj (1; 0% instances), obl (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: NOUN (311; 44% instances), X (197; 28% instances), PROPN (79; 11% instances), ADV (40; 6% instances), VERB (31; 4% instances), CCONJ (16; 2% instances), (11; 2% instances), NUM (8; 1% instances), PART (2; 0% instances), SCONJ (2; 0% instances), ADJ (1; 0% instances), PRON (1; 0% instances)

334 (48%) X nodes are leaves.

48 (7%) X nodes have one child.

169 (24%) X nodes have two children.

148 (21%) X nodes have three or more children.

The highest child degree of a X node is 10.

Children of X nodes are attached using 19 different relations: punct (564; 62% instances), flat (197; 22% instances), case (28; 3% instances), appos (26; 3% instances), dep (26; 3% instances), conj (22; 2% instances), nummod (13; 1% instances), acl (7; 1% instances), nmod (7; 1% instances), cop (5; 1% instances), dislocated (5; 1% instances), cc (3; 0% instances), ccomp (2; 0% instances), nsubj (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), compound (1; 0% instances), obl (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: PUNCT (564; 62% instances), X (197; 22% instances), NOUN (67; 7% instances), ADP (27; 3% instances), VERB (16; 2% instances), NUM (14; 2% instances), PROPN (7; 1% instances), CCONJ (6; 1% instances), AUX (5; 1% instances), ADV (4; 0% instances), ADJ (2; 0% instances), PRON (2; 0% instances), SCONJ (1; 0% instances)