
This page pertains to UD version 2.

Treebank Statistics: UD_Korean-GSD: POS Tags: NOUN

There are 19268 NOUN lemmas (53%), 19213 NOUN types (52%) and 32347 NOUN tokens (40%). Out of 16 observed tags, NOUN ranks 1st in number of lemmas, 1st in number of types and 1st in number of tokens.
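As a rough illustration of how such counts could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag from CoNLL-U text. The function name `pos_counts` and the toy `SAMPLE` rows are assumptions made for this example; they are not taken from the treebank or from the script that generated this page, which may normalize forms differently.

```python
def pos_counts(conllu_text, upos="NOUN"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:  # column 4 of CoNLL-U is UPOS
            tokens += 1
            types.add(cols[1])   # column 2: FORM
            lemmas.add(cols[2])  # column 3: LEMMA
    return len(lemmas), len(types), tokens


# Toy rows (hypothetical annotation, for illustration only).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "것이", "것+이", "NOUN", "_", "_", "4", "nsubj", "_", "_"),
    ("2", "게", "것+이", "NOUN", "_", "_", "4", "nsubj", "_", "_"),
    ("3", "수", "수", "NOUN", "_", "_", "4", "obj", "_", "_"),
    ("4", "있다", "있+다", "VERB", "_", "_", "0", "root", "_", "_"),
])

print(pos_counts(SAMPLE))
```

On the toy sample this yields 2 lemmas, 3 types and 3 tokens, because 것이 and 게 share the lemma 것+이.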

The 10 most frequent NOUN lemmas: 수, 등, 것+이, 것, 때, 길, 맛+이, 것+을, 맛+도, 등+을

The 10 most frequent NOUN types: 수, 등, 것, 때, 게, 길, 것이, 맛이, 것을, 맛도

The 10 most frequent ambiguous lemmas: 수 (NOUN 370, ADJ 1), 때 (NOUN 79, ADP 23, ADV 1), 이번 (NOUN 53, DET 1), 전 (NOUN 50, ADP 12, PRON 11, DET 3), 서울 (NOUN 45, PROPN 3), 원 (NOUN 40, ADP 1), 대전 (NOUN 36, ADV 1), 뒤 (NOUN 34, ADP 10, ADV 1), 미국 (NOUN 33, PROPN 6), 근처 (NOUN 29, ADV 2)

The 10 most frequent ambiguous types: 수 (NOUN 370, ADJ 1), 때 (NOUN 79, ADP 23, ADV 1), 이번 (NOUN 53, DET 1), 전 (NOUN 50, ADP 12, PRON 11, DET 3), 서울 (NOUN 45, PROPN 3), 원 (NOUN 40, ADP 1), 대전 (NOUN 36, ADV 1), 뒤 (NOUN 34, ADP 10, ADV 1), 미국 (NOUN 33, PROPN 6), 근처 (NOUN 29, ADV 2)

Morphology

The form / lemma ratio of NOUN is 0.997146 (the average of all parts of speech is 1.001499).
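The reported NOUN ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given above. The short check below assumes that this is how the figure was obtained; the variable names are chosen for this example only.

```python
# Counts quoted on this page for NOUN in UD_Korean-GSD.
noun_types = 19213
noun_lemmas = 19268

# Assumption: form / lemma ratio = distinct forms / distinct lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")
```

A ratio below 1 means there are slightly fewer distinct forms than distinct lemmas, i.e. some forms are shared by more than one lemma.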

The highest number of forms (2) was observed with the lemma “http:-FWS-”: http://www.khmais.net, http://www.modetour.com.

The second highest number of forms (2) was observed with the lemma “것+이”: 것이, 게.

The third highest number of forms (1) was observed with the lemma “0+시+를”: 0시를.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 22 different relations: flat (8260; 26% instances), nsubj (7222; 22% instances), obj (5473; 17% instances), obl (2537; 8% instances), nmod:poss (2275; 7% instances), conj (1887; 6% instances), nmod (1639; 5% instances), appos (790; 2% instances), root (646; 2% instances), dep (574; 2% instances), nsubj:pass (499; 2% instances), acl:relcl (174; 1% instances), advcl (114; 0% instances), iobj (102; 0% instances), ccomp (80; 0% instances), nummod (31; 0% instances), case (18; 0% instances), csubj (15; 0% instances), amod (8; 0% instances), compound (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech (counting the artificial root, which has no part of speech): VERB (15312; 47% instances), NOUN (13789; 43% instances), ADJ (1122; 3% instances), ADV (858; 3% instances), [no part of speech: sentence root] (646; 2% instances), PROPN (373; 1% instances), NUM (186; 1% instances), PRON (36; 0% instances), DET (13; 0% instances), AUX (6; 0% instances), SYM (4; 0% instances), INTJ (2; 0% instances)
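The two distributions above (incoming relations and parent parts of speech) could be collected in one pass over a CoNLL-U file. The following is a minimal sketch, not the script used to produce this page; `attachment_counts`, `make_conllu`, the `"(root)"` label and the toy sentences are all assumptions introduced for illustration.

```python
from collections import Counter


def make_conllu(sentences):
    """Build CoNLL-U text from (id, form, lemma, upos, head, deprel) tuples."""
    return "\n\n".join(
        "\n".join("\t".join([i, form, lemma, upos, "_", "_", head, rel, "_", "_"])
                  for i, form, lemma, upos, head, rel in sent)
        for sent in sentences
    )


def attachment_counts(conllu_text, upos="NOUN"):
    """Count DEPREL labels and parent UPOS tags for all nodes with the given UPOS."""
    deprels, parent_pos = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [l.split("\t") for l in block.splitlines()
                if l and not l.startswith("#")]
        words = {r[0]: r for r in rows if r[0].isdigit()}  # plain word IDs only
        for r in words.values():
            if r[3] != upos:
                continue
            deprels[r[7]] += 1  # column 8: DEPREL
            head = r[6]         # column 7: HEAD ("0" marks the sentence root)
            parent_pos[words[head][3] if head != "0" else "(root)"] += 1
    return deprels, parent_pos


# Toy sentences (hypothetical annotation, for illustration only).
SAMPLE = make_conllu([
    [("1", "맛이", "맛+이", "NOUN", "2", "nsubj"),
     ("2", "좋다", "좋+다", "ADJ", "0", "root")],
    [("1", "서울", "서울", "NOUN", "2", "nmod"),
     ("2", "근처", "근처", "NOUN", "0", "root")],
])

deprels, parent_pos = attachment_counts(SAMPLE)
print(deprels.most_common())
print(parent_pos.most_common())
```

Nodes whose HEAD is 0 have no parent word, which is why the parent list needs a separate bucket for them; in the figures above that bucket (646) matches the number of NOUN nodes attached as root.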

16153 (50%) NOUN nodes are leaves.

9496 (29%) NOUN nodes have one child.

4039 (12%) NOUN nodes have two children.

2659 (8%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 17.
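The child-degree figures above (leaves, one child, two children, three or more, maximum) amount to a histogram of the number of dependents per NOUN node. A sketch of that computation follows, under the assumption that input is CoNLL-U text with sentences separated by blank lines; `child_degrees` and the toy sentence are hypothetical names and data for this example.

```python
from collections import Counter


def child_degrees(conllu_text, upos="NOUN"):
    """Map number of children -> number of nodes with the given UPOS."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [l.split("\t") for l in block.splitlines()
                if l and not l.startswith("#")]
        words = [r for r in rows if r[0].isdigit()]  # skip ranges and empty nodes
        n_children = Counter(r[6] for r in words)    # HEAD id -> dependent count
        for r in words:
            if r[3] == upos:
                degrees[n_children[r[0]]] += 1       # 0 for nodes never used as HEAD
    return degrees


# Toy sentence (hypothetical annotation, for illustration only).
ROWS = [
    ("1", "서울", "서울", "NOUN", "2", "nmod"),
    ("2", "근처", "근처", "NOUN", "4", "obl"),
    ("3", "맛이", "맛+이", "NOUN", "4", "nsubj"),
    ("4", "좋다", "좋+다", "ADJ", "0", "root"),
]
SAMPLE = "\n".join(
    "\t".join([i, form, lemma, upos, "_", "_", head, rel, "_", "_"])
    for i, form, lemma, upos, head, rel in ROWS
)

degrees = child_degrees(SAMPLE)
print(dict(degrees), "max degree:", max(degrees))
```

In the toy sentence two NOUN nodes are leaves and one has a single child. As a sanity check on the page's own figures, the four reported buckets (16153 + 9496 + 4039 + 2659) add up to the 32347 NOUN tokens.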

Children of NOUN nodes are attached using 29 different relations: flat (10209; 37% instances), punct (3093; 11% instances), acl:relcl (2413; 9% instances), conj (2042; 7% instances), nmod:poss (1728; 6% instances), dep (1614; 6% instances), case (1187; 4% instances), amod (1118; 4% instances), appos (1059; 4% instances), nsubj (580; 2% instances), advmod (417; 1% instances), obj (404; 1% instances), det (400; 1% instances), nummod (350; 1% instances), obl (308; 1% instances), nmod (306; 1% instances), advcl (240; 1% instances), cc (168; 1% instances), det:poss (135; 0% instances), cop (49; 0% instances), ccomp (42; 0% instances), nsubj:pass (17; 0% instances), fixed (12; 0% instances), iobj (9; 0% instances), acl (6; 0% instances), csubj (3; 0% instances), mark (2; 0% instances), aux (1; 0% instances), parataxis (1; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (13789; 49% instances), VERB (4487; 16% instances), PUNCT (3093; 11% instances), ADV (2505; 9% instances), ADP (1277; 5% instances), ADJ (1091; 4% instances), NUM (466; 2% instances), DET (401; 1% instances), PROPN (216; 1% instances), PRON (187; 1% instances), CCONJ (168; 1% instances), SYM (154; 1% instances), AUX (50; 0% instances), PART (24; 0% instances), X (5; 0% instances)