
This page pertains to UD version 2.

Treebank Statistics: UD_Korean-PUD: POS Tags: AUX

There are 6 AUX lemmas (0%), 144 AUX types (2%) and 663 AUX tokens (4%). Out of 13 observed tags, the rank of AUX is: 9 in number of lemmas, 7 in number of types and 5 in number of tokens.
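The lemma, type and token counts above can be recomputed from the treebank's CoNLL-U file. The following is a minimal sketch, not the script that generated this page. The function name `upos_counts` and the two-sentence sample are invented for illustration, and the sketch assumes that a type is a distinct word form, compared case-sensitively.

```python
from collections import defaultdict

# Hand-made CoNLL-U fragment for illustration only (not taken from UD_Korean-PUD).
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    "1\t학생\t학생\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\t이다\t이\tAUX\t_\t_\t1\tcop\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
    "",
    "# sent_id = toy-2",
    "1\t의사\t의사\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\t였다\t이\tAUX\t_\t_\t1\tcop\t_\t_",
])


def upos_counts(conllu_text):
    """Map each UPOS tag to (number of lemmas, number of types, number of tokens)."""
    lemmas = defaultdict(set)
    forms = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        forms[upos].add(form)
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(forms[tag]), tokens[tag]) for tag in tokens}


counts = upos_counts(SAMPLE)
print(counts["AUX"])  # (1, 2, 2): one lemma, two forms, two tokens
```

Ranking the tags by any of the three numbers is then a matter of sorting `counts` on the corresponding tuple position.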

The 6 AUX lemmas, most frequent first: 이, _, 있, 않, 하, 싶 (“_” is the CoNLL-U placeholder for an unspecified lemma)

The 10 most frequent AUX types: 인, 이다, 이었다, 이라, 였다, 있다, 있는, 이며, 일, 라

The 2 ambiguous lemmas (lemmas that occur with AUX and at least one other tag): 이 (AUX 436, PRON 23, PART 16), _ (NOUN 4295, VERB 1439, PROPN 1030, ADJ 596, ADV 516, DET 462, CCONJ 125, AUX 104, X 47, NUM 27, PRON 24, PUNCT 1)

The 10 most frequent ambiguous types: 인 (AUX 149, PROPN 1), 있다 (ADJ 45, VERB 24, AUX 20), 있는 (ADJ 50, AUX 16, VERB 7), 일 (NOUN 32, AUX 12, PROPN 3), 라 (AUX 9, NOUN 1), 않았다 (AUX 8, VERB 5), 가 (PART 20, AUX 7), 한다 (VERB 20, AUX 7), 못했다 (AUX 6, VERB 4), 않은 (AUX 6, VERB 3, ADJ 1)
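An ambiguous type here is a word form that occurs with AUX and with at least one other tag. The sketch below shows one way such a list could be built from (form, tag) pairs. The function name `ambiguous_types` and the toy input are hypothetical, and sorting by AUX count is an assumption that happens to match the order of the list above.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens, upos="AUX"):
    """Return [(form, Counter of tags)] for forms seen with `upos` and another tag."""
    by_form = defaultdict(Counter)
    for form, tag in tagged_tokens:
        by_form[form][tag] += 1
    ambiguous = [(f, c) for f, c in by_form.items() if upos in c and len(c) > 1]
    # Assumed ordering: most frequent as `upos` first.
    return sorted(ambiguous, key=lambda item: -item[1][upos])


# Toy input, invented for illustration.
toy = [
    ("있다", "ADJ"), ("있다", "AUX"), ("있다", "ADJ"),
    ("이다", "AUX"),   # only ever AUX, so not ambiguous
    ("한다", "VERB"),  # never AUX, so not listed
]
result = ambiguous_types(toy)
for form, tags in result:
    print(form, dict(tags))
```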

Morphology

The form / lemma ratio of AUX is 24.000000 (the average of all parts of speech is 3.181543).
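The ratio is consistent with dividing the number of AUX types by the number of AUX lemmas reported earlier on this page (144 and 6). A small check, assuming that this is indeed how the figure is defined; `form_lemma_ratio` is a name chosen here for illustration:

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


ratio = form_lemma_ratio(144, 6)  # 144 AUX types, 6 AUX lemmas
print(f"{ratio:.6f}")  # prints 24.000000
```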

The 1st highest number of forms (53) was observed with the lemma “_”: 가, 가는, 고, 나서, 내기, 내는, 낼, 냈다, 놓고, 놓는, 놓았다, 놓은, 니, 달라고, 두고, 둘, 라, 라는, 라면, 말고, 말라고, 못하게, 못하고, 못한다는, 못할, 못했다, 못했을, 버렸고, 버렸다, 버렸으며, 버린, 본, 봤으며, 여서, 오는, 왔지만, 주는, 준, 준다, 치우고, 한, 한다, 한다고, 한다는, 한다던, 할, 합니다, 해, 해야, 했는데, 했다, 했다고, 했을.

The 2nd highest number of forms (43) was observed with the lemma “이”: 였고, 였기, 였는데, 였는지, 였다, 였던, 였어, 였으며, 였음, 였지만, 이거나, 이고, 이기, 이다, 이다., 이던, 이든, 이라, 이라는, 이란, 이며, 이세요, 이어서, 이어야, 이었고, 이었기, 이었는데, 이었다, 이었던, 이었으며, 이었을, 이었음, 이자, 이지, 이지만, 이진, 인, 인가, 인데, 인지, 일, 일까, 일지.

The 3rd highest number of forms (23) was observed with the lemma “않”: 않게, 않고, 않기, 않는, 않는다, 않는다고, 않는다면, 않다, 않다고, 않다는, 않도록, 않아, 않았고, 않았는데, 않았다, 않았어요, 않았으면, 않았지만, 않으며, 않으면, 않으므로, 않은, 않을.

AUX occurs with 7 features: Form (362; 55% of instances), VerbForm (301; 45% of instances), Mood (265; 40% of instances), Tense (150; 23% of instances), PronType (12; 2% of instances), Polite (5; 1% of instances), Case (1; 0% of instances)

AUX occurs with 12 feature-value pairs: Case=Acc, Form=Adn, Form=Aux, Form=Compl, Mood=Imp, Mood=Ind, Polite=Form, PronType=Int, Tense=Fut, Tense=Past, VerbForm=Fin, VerbForm=Ger

AUX occurs with 17 feature combinations. The most frequent feature combination is Form=Adn (218 tokens). Examples: 인, 있는, 일, 이라는, 이란, 않은, 가는, 있던, 내는, 낼

Relations

AUX nodes are attached to their parents using 2 different relations: cop (458; 69% of instances), aux (205; 31% of instances)
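The percentages are shares of the 663 AUX tokens, rounded to whole numbers. A minimal sketch of that arithmetic, using the two counts from the sentence above; `percent_shares` is a hypothetical helper, and Python's built-in `round` is assumed to be close enough to whatever rounding the page generator uses:

```python
def percent_shares(counts):
    """Convert raw counts to whole-number percentages of their total."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


relations = {"cop": 458, "aux": 205}  # counts reported for AUX nodes
shares = percent_shares(relations)
print(shares)  # {'cop': 69, 'aux': 31}
```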

Parents of AUX nodes belong to 8 different parts of speech: NOUN (432; 65% of instances), VERB (185; 28% of instances), PROPN (13; 2% of instances), ADJ (12; 2% of instances), PART (9; 1% of instances), PRON (8; 1% of instances), ADV (2; 0% of instances), NUM (2; 0% of instances)

604 (91%) AUX nodes are leaves.

55 (8%) AUX nodes have one child.

4 (1%) AUX nodes have two children.

The highest child degree of an AUX node is 2.
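The leaf, one-child and two-children figures form a histogram of child degree, and they add up to the 663 AUX tokens (604 + 55 + 4). The sketch below shows how such a histogram could be computed. It assumes each sentence is given as a list of (id, head, upos) tuples; the function name `child_degree_histogram` and the toy sentences are invented for illustration.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="AUX"):
    """Count how many `upos` nodes have 0, 1, 2, ... children."""
    hist = Counter()
    for sent in sentences:
        # Number of dependents attached to each head id within this sentence.
        n_children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1  # missing ids count as 0 (leaf)
    return hist


# Toy sentences as (id, head, upos); invented for illustration.
toy_sentences = [
    [(1, 0, "NOUN"), (2, 1, "AUX"), (3, 1, "PUNCT")],  # AUX is a leaf
    [(1, 0, "NOUN"), (2, 1, "AUX"), (3, 2, "PUNCT")],  # AUX has one child
]
hist = child_degree_histogram(toy_sentences)
print(dict(hist))  # {0: 1, 1: 1}
print(max(hist))   # highest child degree: 1
```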

Children of AUX nodes are attached using 2 different relations: punct (57; 90% of instances), conj (6; 10% of instances)

Children of AUX nodes belong to 3 different parts of speech: PUNCT (57; 90% of instances), VERB (4; 6% of instances), NOUN (2; 3% of instances)