Treebank Statistics: UD_Korean-KSL: POS Tags: AUX
There are 5 AUX lemmas (0%), 331 AUX types (1%) and 4599 AUX tokens (3%).
Out of 14 observed tags, the rank of AUX is: 13 in number of lemmas, 5 in number of types and 6 in number of tokens.
The most frequent AUX lemmas (only 5 exist): 있, 하, 싶, 않, 이
The 10 most frequent AUX types: 있다, 있는, 싶습니다, 한다, 있습니다, 있고, 있어요, 싶은, 있다고, 싶다
The 10 most frequent ambiguous lemmas: 있 (AUX 2309, VERB 3), 하 (AUX 849, VERB 5), 않 (AUX 652, ADV 1, VERB 1), 이 (DET 515, ADP 6, ADJ 2, AUX 2, NUM 2, NOUN 1, VERB 1)
The 10 most frequent ambiguous types: 있다 (AUX 750, VERB 340, ADJ 40), 있는 (AUX 344, VERB 217, ADJ 10), 한다 (AUX 269, VERB 100), 있습니다 (VERB 236, AUX 205, ADJ 22), 있고 (AUX 116, VERB 96, ADJ 20), 있어요 (AUX 106, VERB 69, ADJ 4), 있다고 (AUX 93, VERB 37, ADJ 1), 합니다 (VERB 117, AUX 79, X 5), 있기 (AUX 73, VERB 31, ADJ 4), 한다고 (AUX 72, VERB 4)
Morphology
The form / lemma ratio of AUX is 66.2 (the average of all parts of speech is 1.007876).
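The ratio above matches the counts reported at the top of the page (331 AUX types over 5 AUX lemmas). The following is a minimal sketch of that calculation, assuming the ratio is the number of distinct word forms divided by the number of distinct lemmas; the page itself does not spell out the formula, and the helper name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas.

    `pairs` is an iterable of (form, lemma) tuples for a single POS tag.
    This definition is an assumption; it is consistent with 331 / 5 = 66.2.
    """
    forms = set()
    lemmas = set()
    for form, lemma in pairs:
        forms.add(form)
        lemmas.add(lemma)
    return len(forms) / len(lemmas)


# Counts reported on this page for AUX: 331 types, 5 lemmas.
aux_ratio = 331 / 5

# Toy input (three forms, two lemmas), just to exercise the helper.
toy_ratio = form_lemma_ratio([("있다", "있"), ("있는", "있"), ("한다", "하")])
```

The high value reflects Korean agglutination: each auxiliary stem combines with many endings, so a handful of lemmas yields hundreds of surface forms.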
The 1st highest number of forms (115) was observed with the lemma “있”: 있, 있ㄷ어요, 있거나, 있게, 있겠네요, 있겠다, 있겠습니까, 있겠습니다, 있겠어요, 있겠으나, 있겠죠, 있겠지, 있겠지만, 있고, 있기, 있기도, 있기때문에, 있기에, 있길, 있나요, 있냐고, 있냐면, 있냐면은, 있느냐고, 있는, 있는가, 있는것, 있는것이, 있는것이다, 있는게, 있는다, 있는다고, 있는데, 있는데다가, 있는데도, 있는만으로도, 있는지, 있는지가, 있는지도, 있는지를, 있는지에, 있니, 있다, 있다”, 있다고, 있다고한다, 있다는, 있다라는, 있다며, 있다면, 있단말이에요, 있답니다, 있던, 있도, 있도록, 있듯이, 있따, 있면, 있면서, 있모록, 있스니까, 있습니다, 있어, 있어도, 있어서, 있어서요, 있어야, 있어야지, 있어요, 있어지만, 있엇다고, 있었기, 있었는데, 있었다, 있었다고, 있었다는, 있었더라면, 있었던, 있었던것을, 있었습니다, 있었어, 있었어도, 있었어요, 있었으면, 있었은, 있었을, 있었을까, 있었지만, 있으나, 있으니, 있으니까, 있으며, 있으면, 있으면서, 있으므로, 있은, 있은다는, 있을, 있을가보다, 있을거야, 있을것입니다, 있을까, 있을까요, 있을지, 있을지도, 있음에, 있음을, 있읍니다, 있잖아요, 있죠, 있지, 있지마, 있지만, 있지않다, 있지요.
The 2nd highest number of forms (76) was observed with the lemma “않”: 않, 않게, 않겠는가, 않겠다, 않겠다고, 않겠습니다, 않겠지만, 않고, 않고도, 않기, 않기를, 않는, 않는가, 않는게, 않는날을, 않는다, 않는다고, 않는다면, 않는데, 않다, 않다고, 않다고하고, 않다느니, 않다는, 않다면, 않더라도, 않도록, 않도록이요, 않마십니다, 않막아요, 않면, 않습니다, 않아, 않아고, 않아도, 않아면, 않아서, 않아요, 않았나, 않았냐면, 않았는, 않았다, 않았다는, 않았더라면, 않았던, 않았습니다, 않았어, 않았어요, 않았으니까, 않았을것입니다, 않았을까라는, 않았지만, 않애요, 않였는데, 않으니까, 않으며, 않으면, 않은, 않은다, 않은다고, 않은데, 않은데요, 않은면, 않은옷을, 않은지, 않을, 않을가, 않을겠다, 않을까, 않을까라고, 않을때가, 않을면, 않있습니다, 않지만, 않지요, 않했습니다.
The 3rd highest number of forms (73) was observed with the lemma “하”: 하게, 하겠다, 하겠말이다, 하겠습니다, 하고, 하기, 하나, 하는, 하는다고, 하는데, 하는데다가, 하는지, 하는지에도, 하니까, 하다, 하다고, 하다는, 하더라도, 하도록, 하든지, 하며, 하면, 하면서, 하셨는데, 하셨다, 하시네요, 하신다, 하였다, 하지, 하지마, 하지만, 한, 한것은, 한는데, 한다, 한다고, 한다는, 한다면, 한데, 할, 할것입니다, 할까, 할때, 할수록, 할지, 할지가, 합니까라고, 합니다, 합니다만, 합다, 해, 해는데, 해도, 해라는, 해서, 해서도, 해야, 해요, 해주는, 해주셔서, 해준, 해준다고, 했고, 했는, 했는데, 했다, 했다는, 했습니다, 했어, 했어요, 했을, 했지만, 했지요.
AUX occurs with 1 feature: Typo (33; 1% instances)
AUX occurs with 1 feature-value pair: Typo=Yes
AUX occurs with 2 feature combinations.
The most frequent feature combination is _ (no features; 4566 tokens).
Examples: 있다, 있는, 싶습니다, 한다, 있습니다, 있고, 있어요, 싶은, 있다고, 싶다
Relations
AUX nodes are attached to their parents using 14 different relations: aux (2675; 58% instances), root (946; 21% instances), acl (350; 8% instances), advcl (296; 6% instances), conj (140; 3% instances), ccomp (107; 2% instances), nmod (42; 1% instances), obl (20; 0% instances), flat (8; 0% instances), obj (8; 0% instances), nsubj (4; 0% instances), csubj (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances)
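As a consistency check, the relation counts above can be summed and converted back into percentages. This is a sketch that assumes the page rounds each share to the nearest whole percent; the rounding rule is not stated in the source, and the names `relation_counts` and `share` are illustrative.

```python
# Incoming relation counts for AUX nodes, copied from the list above.
relation_counts = {
    "aux": 2675, "root": 946, "acl": 350, "advcl": 296, "conj": 140,
    "ccomp": 107, "nmod": 42, "obl": 20, "flat": 8, "obj": 8,
    "nsubj": 4, "csubj": 1, "list": 1, "parataxis": 1,
}

# Every AUX token has exactly one incoming relation, so the counts
# should add up to the AUX token total reported on this page.
total = sum(relation_counts.values())


def share(count, total):
    """Percentage of instances, rounded to a whole number (assumed rule)."""
    return round(100 * count / total)


shares = {rel: share(n, total) for rel, n in relation_counts.items()}
```

Under that assumption the counts sum to 4599, the AUX token total, and the recomputed shares for `aux` and `root` come out at 58 and 21.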
Parents of AUX nodes belong to 7 different parts of speech: VERB (2708; 59% instances), no parent, i.e. the AUX node is the root (946; 21% instances), ADJ (530; 12% instances), NOUN (254; 6% instances), AUX (131; 3% instances), ADV (29; 1% instances), PRON (1; 0% instances)
2561 (56%) AUX nodes are leaves.
715 (16%) AUX nodes have one child.
501 (11%) AUX nodes have two children.
822 (18%) AUX nodes have three or more children.
The highest child degree of an AUX node is 6.
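The child-degree figures above could be reproduced from a CoNLL-U file roughly as follows. This is a sketch, not the script that generated this page: the function name `aux_child_degrees` is hypothetical, and the sample sentence is a made-up toy analysis rather than a sentence from the treebank.

```python
from collections import Counter


def aux_child_degrees(conllu_text):
    """Return a Counter mapping child count -> number of AUX nodes."""
    histogram = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        upos = {}             # node ID -> UPOS tag
        children = Counter()  # head ID -> number of dependents
        for line in sentence.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
            if not cols[0].isdigit():
                continue
            upos[cols[0]] = cols[3]   # column 4 is UPOS
            children[cols[6]] += 1    # column 7 is HEAD
        for node_id, tag in upos.items():
            if tag == "AUX":
                histogram[children[node_id]] += 1
    return histogram


# Toy sentence (illustrative only); rows are joined with tabs as CoNLL-U requires.
rows = [
    ["1", "나는", "나", "PRON", "_", "_", "2", "nsubj", "_", "_"],
    ["2", "가고", "가", "VERB", "_", "_", "0", "root", "_", "_"],
    ["3", "싶다", "싶", "AUX", "_", "_", "2", "aux", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
]
sample = "\n".join("\t".join(r) for r in rows)
degrees = aux_child_degrees(sample)
```

In the toy sentence the single AUX node is a leaf, so the histogram has one entry at degree 0. On the full treebank, the four buckets reported above (2561, 715, 501 and 822) add up to the 4599 AUX tokens.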
Children of AUX nodes are attached using 19 different relations: nsubj (1784; 39% instances), punct (1077; 24% instances), advcl (897; 20% instances), cc (228; 5% instances), obl (168; 4% instances), conj (105; 2% instances), advmod (92; 2% instances), ccomp (90; 2% instances), dislocated (41; 1% instances), case (16; 0% instances), mark (13; 0% instances), obj (13; 0% instances), aux (11; 0% instances), goeswith (11; 0% instances), acl (9; 0% instances), nmod (3; 0% instances), csubj (2; 0% instances), flat (1; 0% instances), parataxis (1; 0% instances)
Children of AUX nodes belong to 12 different parts of speech: NOUN (1857; 41% instances), PUNCT (1077; 24% instances), VERB (846; 19% instances), CCONJ (228; 5% instances), ADJ (177; 4% instances), ADV (176; 4% instances), AUX (131; 3% instances), PRON (22; 0% instances), ADP (16; 0% instances), SCONJ (13; 0% instances), X (11; 0% instances), NUM (8; 0% instances)