Treebank Statistics: UD_Korean-KSL: POS Tags: ADJ
There are 3778 ADJ lemmas (12%), 3790 ADJ types (12%) and 14289 ADJ tokens (9%).
Out of 16 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.
The 10 most frequent ADJ lemmas: 다르+ㄴ, 것+이+ㅂ니다, 좋+은, 것+이+다, 많+은, 크+ㄴ, 아니+라, 많+다, 때문+이+다, 좋+다
The 10 most frequent ADJ types: 다른, 겁니다, 좋은, 것이다, 많은, 큰, 아니라, 많다, 때문이다, 좋다
The 10 most frequent ambiguous lemmas: 다르+ㄴ (ADJ 513, VERB 1), 좋+은 (ADJ 450, ADV 1), 많+은 (ADJ 291, ADV 6), 때문+이+다 (ADJ 188, ADP 1), 없+다 (VERB 96, ADJ 85), 없+는 (ADJ 83, VERB 52), 없+어서 (ADJ 72, VERB 2), 것+이+에요 (ADJ 70, ADV 1), 없+고 (ADJ 57, VERB 4), 많+고 (ADJ 54, ADV 1)
The 10 most frequent ambiguous types: 다른 (ADJ 513, DET 91, VERB 1), 좋은 (ADJ 450, ADV 1), 많은 (ADJ 291, ADV 6, X 1), 때문이다 (ADJ 188, ADP 1), 같은 (ADJ 118, X 2), 없다 (VERB 96, ADJ 85), 없는 (ADJ 83, VERB 53), 없어서 (ADJ 72, VERB 2), 없고 (ADJ 57, VERB 4), 많고 (ADJ 54, ADV 1)
- 다른
- 좋은
- 많은
- 때문이다
- 같은
- 없다
- 없는
- 없어서
- 없고
- 많고
Morphology
The form / lemma ratio of ADJ is 1.003176 (the average of all parts of speech is 1.008073).
The 1st highest number of forms (3) was observed with the lemma “것+이+ㅂ니다”: 겁니다, 것, 것입니다.
The 2nd highest number of forms (3) was observed with the lemma “것+이+에요”: 거에요, 거예요, 것이에요.
The 3rd highest number of forms (2) was observed with the lemma “40+명+이+었+다”: 40명이었다, 40명이였다.
ADJ occurs with 1 features: Typo (400; 3% instances)
ADJ occurs with 1 feature-value pairs: Typo=Yes
ADJ occurs with 2 feature combinations.
The most frequent feature combination is _ (13889 tokens).
Examples: 다른, 겁니다, 좋은, 것이다, 많은, 큰, 아니라, 많다, 때문이다, 좋다
Relations
ADJ nodes are attached to their parents using 17 different relations: root (5731; 40% instances), amod (3545; 25% instances), advcl (1959; 14% instances), acl (1145; 8% instances), ccomp (808; 6% instances), conj (673; 5% instances), obl (132; 1% instances), nmod (86; 1% instances), flat (84; 1% instances), obj (52; 0% instances), nsubj (37; 0% instances), parataxis (17; 0% instances), list (12; 0% instances), csubj (4; 0% instances), dislocated (2; 0% instances), appos (1; 0% instances), dep (1; 0% instances)
Parents of ADJ nodes belong to 10 different parts of speech: (5731; 40% instances), NOUN (3497; 24% instances), VERB (2302; 16% instances), ADJ (1795; 13% instances), ADV (732; 5% instances), AUX (215; 2% instances), PRON (9; 0% instances), ADP (3; 0% instances), NUM (3; 0% instances), DET (2; 0% instances)
3558 (25%) ADJ nodes are leaves.
2768 (19%) ADJ nodes have one child.
3007 (21%) ADJ nodes have two children.
4956 (35%) ADJ nodes have three or more children.
The highest child degree of a ADJ node is 8.
Children of ADJ nodes are attached using 29 different relations: nsubj (6549; 25% instances), punct (5804; 22% instances), advmod (2857; 11% instances), advcl (2709; 10% instances), obl (2280; 9% instances), acl (1767; 7% instances), cc (972; 4% instances), conj (676; 3% instances), dislocated (607; 2% instances), nmod (459; 2% instances), amod (355; 1% instances), aux (289; 1% instances), obj (257; 1% instances), mark (188; 1% instances), nmod:poss (167; 1% instances), case (163; 1% instances), list (55; 0% instances), ccomp (46; 0% instances), goeswith (35; 0% instances), det (23; 0% instances), csubj (15; 0% instances), vocative (15; 0% instances), flat (13; 0% instances), nummod (7; 0% instances), appos (6; 0% instances), parataxis (6; 0% instances), discourse (2; 0% instances), compound (1; 0% instances), dep (1; 0% instances)
Children of ADJ nodes belong to 14 different parts of speech: NOUN (7913; 30% instances), PUNCT (5804; 22% instances), ADV (4369; 17% instances), VERB (3826; 15% instances), ADJ (1795; 7% instances), CCONJ (972; 4% instances), AUX (652; 2% instances), PRON (489; 2% instances), SCONJ (188; 1% instances), ADP (176; 1% instances), NUM (66; 0% instances), DET (37; 0% instances), X (36; 0% instances), PART (1; 0% instances)