Treebank Statistics: UD_Korean-KSL: POS Tags: AUX
There are 5 AUX lemmas (0%), 373 AUX types (1%) and 5506 AUX tokens (4%).
Out of 16 observed tags, the rank of AUX is: 13 in number of lemmas, 5 in number of types and 6 in number of tokens.
The 10 most frequent AUX lemmas: 있, 하, 싶, 않, 이
The 10 most frequent AUX types: 있다, 있는, 한다, 싶습니다, 있습니다, 있고, 있어요, 한다고, 있다고, 싶은
The 10 most frequent ambiguous lemmas: 있 (AUX 2861, VERB 3, ADJ 1), 하 (AUX 1036, VERB 5), 않 (AUX 776, ADV 1, VERB 1), 이 (DET 549, ADP 9, ADJ 2, AUX 2, NUM 2, NOUN 1, VERB 1)
The 10 most frequent ambiguous types: 있다 (AUX 892, VERB 419, ADJ 41), 있는 (AUX 431, VERB 253, ADJ 10), 한다 (AUX 309, VERB 107), 있습니다 (AUX 270, VERB 269, ADJ 23), 있고 (AUX 144, VERB 118, ADJ 20), 있어요 (AUX 129, VERB 81, ADJ 4), 한다고 (AUX 129, VERB 9), 있다고 (AUX 121, VERB 60, ADJ 1), 합니다 (VERB 131, AUX 99, X 5), 있기 (AUX 96, VERB 38, ADJ 4)
- 있다
- 있는
- 한다
- 있습니다
- 있고
- 있어요
- 한다고
- 있다고
- 합니다
- 있기
Morphology
The form / lemma ratio of AUX is 74.600000 (the average of all parts of speech is 1.008073).
The 1st highest number of forms (123) was observed with the lemma “있”: 있, 있ㄷ어요, 있거나, 있게, 있겠네요, 있겠다, 있겠습니까, 있겠습니다, 있겠어요, 있겠으나, 있겠죠, 있겠지, 있겠지만, 있고, 있기, 있기도, 있기때문에, 있기에, 있길, 있나요, 있냐고, 있냐면, 있냐면은, 있느냐고, 있는, 있는가, 있는것, 있는것이, 있는것이다, 있는게, 있는다, 있는다고, 있는데, 있는데다가, 있는데도, 있는만으로도, 있는지, 있는지가, 있는지도, 있는지를, 있는지에, 있는지에만, 있니, 있다, 있다”, 있다고, 있다고한다, 있다는, 있다라는, 있다며, 있다면, 있단말이에요, 있답니다, 있던, 있도, 있도록, 있듯이, 있따, 있면, 있면서, 있모록, 있스니까, 있습니다, 있어, 있어도, 있어서, 있어서다고, 있어서요, 있어야, 있어야지, 있어요, 있어지만, 있엇다고, 있었고, 있었기, 있었는데, 있었다, 있었다고, 있었다는, 있었더라면, 있었던, 있었던것을, 있었습니다, 있었어, 있었어도, 있었어요, 있었으면, 있었으면은, 있었은, 있었을, 있었을까, 있었지만, 있으나, 있으니, 있으니까, 있으니까요, 있으며, 있으면, 있으면서, 있으므로, 있은, 있은다는, 있을, 있을가보다, 있을거라, 있을거야, 있을것입니다, 있을까, 있을까요, 있을뿐, 있을지, 있을지도, 있음에, 있음을, 있읍니다, 있잖아요, 있죠, 있지, 있지마, 있지만, 있지않다, 있지요, 있토록.
The 2nd highest number of forms (93) was observed with the lemma “않”: 않, 않게, 않겠는가, 않겠다, 않겠다고, 않겠습니다, 않겠지만, 않고, 않고도, 않기, 않기때문이다, 않기를, 않는, 않는가, 않는게, 않는날을, 않는다, 않는다고, 않는다면, 않는데, 않는지가더무섭다, 않다, 않다고, 않다고하고, 않다느니, 않다는, 않다면, 않더라도, 않도록, 않도록이요, 않마십니다, 않막아요, 않면, 않습니까, 않습니다, 않아, 않아고, 않아도, 않아면, 않아서, 않아야, 않아요, 않아져요, 않아진다, 않았나, 않았냐면, 않았는, 않았는데도, 않았다, 않았다고, 않았다는, 않았다면, 않았더라면, 않았던, 않았습니다, 않았어, 않았어요, 않았으니까, 않았으면, 않았을것입니다, 않았을까, 않았을까는, 않았을까라는, 않았지만, 않애요, 않였는데, 않으니, 않으니까, 않으니까요, 않으려고, 않으며, 않으면, 않은, 않은다, 않은다고, 않은데, 않은데요, 않은면, 않은옷을, 않은지, 않은지의, 않을, 않을가, 않을겠다, 않을까, 않을까라고, 않을때가, 않을면, 않있습니다, 않지, 않지만, 않지요, 않했습니다.
The 3rd highest number of forms (87) was observed with the lemma “하”: 하게, 하겠다, 하겠말이다, 하겠습니다, 하고, 하고요, 하기, 하기때문에, 하나, 하냐고, 하는, 하는다고, 하는데, 하는데다가, 하는이유도, 하는지, 하는지에, 하는지에도, 하니까, 하니까요, 하다, 하다고, 하다는, 하더라도, 하도록, 하든지, 하라고, 하며, 하면, 하면서, 하셨는데, 하셨다, 하셨죠, 하시네요, 하신다, 하였다, 하죠, 하지, 하지마, 하지만, 한, 한것은, 한는데, 한는지, 한다, 한다고, 한다는, 한다면, 한데, 할, 할것입니다, 할까, 할때, 할수록, 할지, 할지가, 함으로써, 합니까, 합니까라고, 합니다, 합니다만, 합다, 해, 해는데, 해도, 해라는, 해서, 해서도, 해야, 해요, 해주는, 해주셔서, 해준, 해준다고, 했고, 했는, 했는데, 했다, 했다고, 했다는, 했던지를, 했습니다, 했어, 했어요, 했을, 했지만, 했지요.
AUX occurs with 1 features: Typo (41; 1% instances)
AUX occurs with 1 feature-value pairs: Typo=Yes
AUX occurs with 2 feature combinations.
The most frequent feature combination is _ (5465 tokens).
Examples: 있다, 있는, 한다, 싶습니다, 있습니다, 있고, 있어요, 한다고, 있다고, 싶은
Relations
AUX nodes are attached to their parents using 15 different relations: aux (3104; 56% instances), root (1168; 21% instances), acl (464; 8% instances), advcl (353; 6% instances), conj (165; 3% instances), ccomp (142; 3% instances), nmod (59; 1% instances), obl (25; 0% instances), flat (8; 0% instances), obj (8; 0% instances), nsubj (5; 0% instances), dislocated (2; 0% instances), csubj (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances)
Parents of AUX nodes belong to 7 different parts of speech: VERB (3158; 57% instances), (1168; 21% instances), ADJ (652; 12% instances), NOUN (329; 6% instances), AUX (163; 3% instances), ADV (34; 1% instances), PRON (2; 0% instances)
2985 (54%) AUX nodes are leaves.
878 (16%) AUX nodes have one child.
612 (11%) AUX nodes have two children.
1031 (19%) AUX nodes have three or more children.
The highest child degree of a AUX node is 6.
Children of AUX nodes are attached using 19 different relations: nsubj (2238; 39% instances), punct (1303; 23% instances), advcl (1161; 20% instances), cc (276; 5% instances), obl (208; 4% instances), conj (127; 2% instances), ccomp (109; 2% instances), advmod (106; 2% instances), dislocated (53; 1% instances), case (21; 0% instances), aux (15; 0% instances), mark (14; 0% instances), obj (14; 0% instances), goeswith (13; 0% instances), acl (11; 0% instances), nmod (4; 0% instances), csubj (3; 0% instances), flat (1; 0% instances), parataxis (1; 0% instances)
Children of AUX nodes belong to 13 different parts of speech: NOUN (2342; 41% instances), PUNCT (1303; 23% instances), VERB (1094; 19% instances), CCONJ (276; 5% instances), ADJ (215; 4% instances), ADV (205; 4% instances), AUX (163; 3% instances), PRON (23; 0% instances), ADP (21; 0% instances), SCONJ (14; 0% instances), X (13; 0% instances), NUM (8; 0% instances), DET (1; 0% instances)