home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-Kaist: POS Tags: CCONJ

There are 7793 CCONJ lemmas (8%), 7750 CCONJ types (8%) and 19002 CCONJ tokens (5%). Out of 17 observed tags, the rank of CCONJ is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent CCONJ lemmas: 그러나, 그리고, 따라서, 즉, 그런데, 또한, 및, 다음+과, 또는, 이+와

The 10 most frequent CCONJ types: 그러나, 그리고, 따라서, 즉, 그런데, 또한, 및, 다음과, 또는, 이와

The 10 most frequent ambiguous lemmas: 그러나 (CCONJ 1271, ADV 8), 그리고 (CCONJ 690, ADV 18), 따라서 (CCONJ 457, ADV 24), 즉 (CCONJ 343, ADV 41), 그런데 (CCONJ 300, ADV 1), 또한 (CCONJ 265, ADV 150), 또는 (CCONJ 192, ADV 8), 이+와 (CCONJ 179, ADV 4), 또 (ADV 202, CCONJ 176), 그래서 (CCONJ 174, ADV 11)

The 10 most frequent ambiguous types: 그러나 (CCONJ 1271, ADV 8), 그리고 (CCONJ 689, ADV 18, VERB 2, ADJ 1, SCONJ 1), 따라서 (CCONJ 457, SCONJ 37, ADV 24), 즉 (CCONJ 343, ADV 41), 그런데 (CCONJ 300, ADV 1), 또한 (CCONJ 266, ADV 150), 또는 (CCONJ 191, ADV 8), 이와 (CCONJ 179, ADV 4), 또 (ADV 202, CCONJ 176), 그래서 (CCONJ 174, ADV 11, SCONJ 1)

Morphology

The form / lemma ratio of CCONJ is 0.994482 (the average of all parts of speech is 0.998034).

The 1st highest number of forms (2) was observed with the lemma “갖+고”: 갖고, 작고.

The 2nd highest number of forms (2) was observed with the lemma “거치+며”: 거치며, 거치며출판산업도.

The 3rd highest number of forms (2) was observed with the lemma “견디+어+내+고”: 견뎌내고, 견디어내고.

CCONJ does not occur with any features.

Relations

CCONJ nodes are attached to their parents using 22 different relations: cc (5504; 29% instances), root (3137; 17% instances), acl (1344; 7% instances), conj (1217; 6% instances), obj (1208; 6% instances), nmod (888; 5% instances), ccomp (850; 4% instances), amod (777; 4% instances), nsubj (704; 4% instances), advcl (664; 3% instances), compound (654; 3% instances), dislocated (609; 3% instances), obl (456; 2% instances), xcomp (414; 2% instances), advmod (211; 1% instances), dep (137; 1% instances), case (81; 0% instances), fixed (80; 0% instances), iobj (24; 0% instances), flat (22; 0% instances), csubj (17; 0% instances), mark (4; 0% instances)

Parents of CCONJ nodes belong to 13 different parts of speech: VERB (8117; 43% instances), NOUN (3314; 17% instances), (3137; 17% instances), ADV (1305; 7% instances), CCONJ (1254; 7% instances), SCONJ (857; 5% instances), ADJ (800; 4% instances), PROPN (164; 1% instances), PRON (24; 0% instances), NUM (13; 0% instances), PART (8; 0% instances), X (6; 0% instances), INTJ (3; 0% instances)

5843 (31%) CCONJ nodes are leaves.

5777 (30%) CCONJ nodes have one child.

4393 (23%) CCONJ nodes have two children.

2989 (16%) CCONJ nodes have three or more children.

The highest child degree of a CCONJ node is 10.

Children of CCONJ nodes are attached using 23 different relations: conj (13662; 55% instances), obj (2077; 8% instances), nsubj (1069; 4% instances), advcl (992; 4% instances), nmod (958; 4% instances), dislocated (841; 3% instances), compound (761; 3% instances), advmod (705; 3% instances), ccomp (669; 3% instances), punct (649; 3% instances), obl (619; 3% instances), amod (576; 2% instances), acl (546; 2% instances), xcomp (173; 1% instances), dep (114; 0% instances), csubj (91; 0% instances), det (66; 0% instances), iobj (50; 0% instances), cc (49; 0% instances), nummod (49; 0% instances), discourse (2; 0% instances), appos (1; 0% instances), case (1; 0% instances)

Children of CCONJ nodes belong to 16 different parts of speech: NOUN (9221; 37% instances), VERB (5812; 24% instances), ADV (3484; 14% instances), SCONJ (1698; 7% instances), CCONJ (1254; 5% instances), ADJ (1184; 5% instances), PROPN (980; 4% instances), PUNCT (649; 3% instances), PRON (215; 1% instances), NUM (118; 0% instances), DET (66; 0% instances), X (16; 0% instances), PART (13; 0% instances), SYM (7; 0% instances), INTJ (2; 0% instances), ADP (1; 0% instances)