
This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: ADJ

There are 3475 ADJ lemmas (12%), 3487 ADJ types (12%) and 12548 ADJ tokens (9%). Out of 14 observed tags, the rank of ADJ is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.
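The three counts above (distinct lemmas, distinct word forms or "types", and running tokens) could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the `SAMPLE` fragment is hand-made for illustration, the helper names `read_tokens` and `pos_counts` are hypothetical, and it assumes that types are compared case-sensitively.

```python
# Hand-made CoNLL-U fragment (illustrative only, not taken from UD_Korean-KSL).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = """\
# text = illustrative sentence
1\t다른\t다르+ㄴ\tADJ\t_\t_\t2\tamod\t_\t_
2\t책\t책\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\t좋다\t좋+다\tADJ\t_\t_\t0\troot\t_\t_
4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(text, upos):
    """Return (number of lemmas, number of types, number of tokens) for one tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, tag in read_tokens(text):
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


adj_lemmas, adj_types, adj_tokens = pos_counts(SAMPLE, "ADJ")
```

Running the same function over the full treebank files would be expected to give figures comparable to those reported above, provided the counting conventions match.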

The 10 most frequent ADJ lemmas: 것+이+ㅂ니다, 다르+ㄴ, 좋+은, 것+이+다, 많+은, 크+ㄴ, 아니+라, 많+다, 때문+이+다, 좋+다

The 10 most frequent ADJ types: 겁니다, 다른, 좋은, 것이다, 많은, 큰, 아니라, 많다, 때문이다, 좋다

The 10 most frequent ambiguous lemmas: 다르+ㄴ (ADJ 401, VERB 1), 좋+은 (ADJ 345, ADV 1), 많+은 (ADJ 228, ADV 6), 때문+이+다 (ADJ 149, ADP 1), 없+다 (VERB 83, ADJ 77), 없+는 (ADJ 74, VERB 43), 것+이+에요 (ADJ 68, ADV 1), 없+어서 (ADJ 67, VERB 2), 크+고 (ADJ 51, VERB 2), 없+고 (ADJ 50, VERB 4)

The 10 most frequent ambiguous types: 다른 (ADJ 401, DET 91, VERB 1), 좋은 (ADJ 345, ADV 1), 많은 (ADJ 228, ADV 6), 때문이다 (ADJ 149, ADP 1), 같은 (ADJ 92, X 2), 없다 (VERB 83, ADJ 77), 없는 (ADJ 74, VERB 44), 없어서 (ADJ 67, VERB 2), 크고 (ADJ 51, VERB 2), 없고 (ADJ 50, VERB 4)
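A lemma or type counts as ambiguous here when it is observed with more than one part-of-speech tag. The sketch below shows one way such a list might be derived; the function name `ambiguous` and the input pairs are hypothetical and chosen only to illustrate the grouping step.

```python
from collections import Counter, defaultdict


def ambiguous(pairs):
    """Group tags by key and keep only keys seen with two or more tags.

    pairs: iterable of (key, upos), where key is a lemma or a word form.
    """
    by_key = defaultdict(Counter)
    for key, tag in pairs:
        by_key[key][tag] += 1
    return {key: tags for key, tags in by_key.items() if len(tags) > 1}


# Illustrative observations (not real treebank counts).
observations = [
    ("없다", "VERB"),
    ("없다", "ADJ"),
    ("없다", "VERB"),
    ("책", "NOUN"),
]
result = ambiguous(observations)
# Tags for each ambiguous key, most frequent first.
ranking = result["없다"].most_common()
```

Sorting the surviving keys by total frequency and taking the first ten would give a list in the same shape as the ones above.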

Morphology

The form / lemma ratio of ADJ is 1.003453 (the average of all parts of speech is 1.007876).
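The reported ratio is consistent with dividing the number of ADJ types by the number of ADJ lemmas given at the top of this page. A quick check, assuming that is indeed how the figure is derived:

```python
adj_types = 3487   # ADJ types reported on this page
adj_lemmas = 3475  # ADJ lemmas reported on this page

ratio = adj_types / adj_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # 1.003453
```

A ratio this close to 1 means that almost every ADJ lemma appears with exactly one surface form.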

The lemma with the most forms (3) is “것+이+ㅂ니다”: 겁니다, 것, 것입니다.

The lemma with the second most forms (3) is “것+이+에요”: 거에요, 거예요, 것이에요.

The lemma with the third most forms (2) is “40+명+이+었+다”: 40명이었다, 40명이였다.

ADJ occurs with 1 feature: Typo (379; 3% instances)

ADJ occurs with 1 feature-value pair: Typo=Yes

ADJ occurs with 2 feature combinations. The most frequent feature combination is _ (no features; 12169 tokens). Examples: 겁니다, 다른, 좋은, 것이다, 많은, 큰, 아니라, 많다, 때문이다, 좋다

Relations

ADJ nodes are attached to their parents using 17 different relations: root (5209; 42% instances), amod (2941; 23% instances), advcl (1721; 14% instances), acl (1017; 8% instances), ccomp (660; 5% instances), conj (626; 5% instances), obl (110; 1% instances), flat (83; 1% instances), nmod (72; 1% instances), obj (45; 0% instances), nsubj (27; 0% instances), parataxis (17; 0% instances), list (12; 0% instances), csubj (4; 0% instances), dislocated (2; 0% instances), appos (1; 0% instances), dep (1; 0% instances)

Parents of ADJ nodes belong to 9 different parts of speech: no parent, i.e. the node is the root (5209; 42% instances), NOUN (2939; 23% instances), VERB (1981; 16% instances), ADJ (1616; 13% instances), ADV (613; 5% instances), AUX (177; 1% instances), PRON (7; 0% instances), ADP (3; 0% instances), NUM (3; 0% instances)

3004 (24%) ADJ nodes are leaves.

2396 (19%) ADJ nodes have one child.

2744 (22%) ADJ nodes have two children.

4404 (35%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 8.
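The child-degree figures above can be thought of as a histogram over the number of dependents per ADJ node. The following sketch shows one possible way to build such a histogram from (ID, UPOS, HEAD) triples and then checks that the four reported counts add up to the ADJ token total. The function name `child_degree_histogram` and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(rows, upos):
    """Histogram of child counts for nodes with the given tag.

    rows: list of (id, upos, head) for a single sentence.
    Bucket 3 stands for "three or more children".
    """
    children = Counter(head for _id, _tag, head in rows)
    hist = Counter()
    for node_id, tag, _head in rows:
        if tag == upos:
            hist[min(children[node_id], 3)] += 1
    return hist


# Illustrative sentence: node 3 is the root and has two children.
sentence = [(1, "ADJ", 2), (2, "NOUN", 3), (3, "ADJ", 0), (4, "PUNCT", 3)]
hist = child_degree_histogram(sentence, "ADJ")

# Sanity check on the figures reported on this page.
reported = {0: 3004, 1: 2396, 2: 2744, 3: 4404}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
```

Here `total` equals the 12548 ADJ tokens, and the rounded shares match the percentages listed above.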

Children of ADJ nodes are attached using 29 different relations: nsubj (5813; 25% instances), punct (5284; 22% instances), advmod (2534; 11% instances), advcl (2385; 10% instances), obl (2033; 9% instances), acl (1566; 7% instances), cc (888; 4% instances), conj (634; 3% instances), dislocated (541; 2% instances), nmod (388; 2% instances), amod (317; 1% instances), aux (254; 1% instances), obj (230; 1% instances), nmod:poss (157; 1% instances), mark (137; 1% instances), case (134; 1% instances), list (55; 0% instances), goeswith (35; 0% instances), ccomp (33; 0% instances), det (20; 0% instances), vocative (15; 0% instances), csubj (14; 0% instances), flat (13; 0% instances), appos (6; 0% instances), nummod (6; 0% instances), parataxis (6; 0% instances), discourse (2; 0% instances), compound (1; 0% instances), dep (1; 0% instances)

Children of ADJ nodes belong to 13 different parts of speech: NOUN (7060; 30% instances), PUNCT (5284; 22% instances), ADV (3906; 17% instances), VERB (3369; 14% instances), ADJ (1616; 7% instances), CCONJ (888; 4% instances), AUX (530; 2% instances), PRON (445; 2% instances), ADP (146; 1% instances), SCONJ (137; 1% instances), NUM (51; 0% instances), X (36; 0% instances), DET (34; 0% instances)