
This page pertains to UD version 2.

Treebank Statistics: UD_Korean-GSD: POS Tags: NOUN

There are 19267 NOUN lemmas (52%), 19212 NOUN types (52%) and 32347 NOUN tokens (40%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: 수, 등, 것+이, 것, 때, 길, 맛+이, 것+을, 맛+도, 등+을

The 10 most frequent NOUN types: 수, 등, 것, 때, 게, 길, 것이, 맛이, 것을, 맛도

The 10 most frequent ambiguous lemmas: 수 (NOUN 370, ADJ 1), 때 (NOUN 79, ADP 23, ADV 1), 이번 (NOUN 53, DET 1), 전 (NOUN 50, ADP 12, PRON 11, DET 3), 서울 (NOUN 45, PROPN 3), 원 (NOUN 40, ADP 1), 대전 (NOUN 36, ADV 1), 뒤 (NOUN 34, ADP 10, ADV 1), 미국 (NOUN 33, PROPN 6), 근처 (NOUN 29, ADV 2)

The 10 most frequent ambiguous types: 수 (NOUN 370, ADJ 1), 때 (NOUN 79, ADP 23, ADV 1), 이번 (NOUN 53, DET 1), 전 (NOUN 50, ADP 12, PRON 11, DET 3), 서울 (NOUN 45, PROPN 3), 원 (NOUN 40, ADP 1), 대전 (NOUN 36, ADV 1), 뒤 (NOUN 34, ADP 10, ADV 1), 미국 (NOUN 33, PROPN 6), 근처 (NOUN 29, ADV 2)

Morphology

The form / lemma ratio of NOUN is 0.997145 (the average of all parts of speech is 1.000681).
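The reported ratio matches the number of NOUN types divided by the number of NOUN lemmas given at the top of this page. A minimal check, assuming that is how the ratio is defined (the function name is hypothetical):

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Ratio of distinct word forms (types) to distinct lemmas."""
    return n_types / n_lemmas

# Counts taken from this page: 19212 NOUN types, 19267 NOUN lemmas.
ratio = form_lemma_ratio(19212, 19267)
print(f"{ratio:.6f}")
```

A ratio below 1 means that there are slightly more distinct lemmas than distinct forms, which is plausible here because lemmas such as 것+이 and 것+을 encode morpheme segmentation.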

The 1st highest number of forms (2) was observed with the lemma “http:-FWS-”: http://www.khmais.net, http://www.modetour.com.

The 2nd highest number of forms (2) was observed with the lemma “것+이”: 것이, 게.

The 3rd highest number of forms (1) was observed with the lemma “0+시+를”: 0시를.

NOUN does not occur with any features.

Relations

NOUN nodes are attached to their parents using 22 different relations: flat (8261; 26% instances), nsubj (7238; 22% instances), obj (5491; 17% instances), det:poss (2276; 7% instances), advmod (2221; 7% instances), conj (1877; 6% instances), nmod (1639; 5% instances), appos (937; 3% instances), root (590; 2% instances), dep (551; 2% instances), nsubj:pass (503; 2% instances), mark (287; 1% instances), acl:relcl (143; 0% instances), iobj (102; 0% instances), advcl (86; 0% instances), ccomp (66; 0% instances), nummod (30; 0% instances), case (18; 0% instances), csubj (15; 0% instances), amod (8; 0% instances), obl (7; 0% instances), xcomp (1; 0% instances)
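A tally like the one above could be reproduced from a CoNLL-U file along the following lines. This is only a sketch: the function name and the English sample sentence are hypothetical, and it is not necessarily how the statistics on this page were generated. It relies only on the standard CoNLL-U layout of ten tab-separated columns, with UPOS in column 4 and DEPREL in column 8.

```python
from collections import Counter

def noun_relation_counts(conllu_text: str) -> Counter:
    """Count the DEPREL values of all tokens whose UPOS is NOUN."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        token_id, upos, deprel = cols[0], cols[3], cols[7]
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if "-" in token_id or "." in token_id:
            continue
        if upos == "NOUN":
            counts[deprel] += 1
    return counts

# Hypothetical toy input, not taken from UD_Korean-GSD.
SAMPLE = "\n".join([
    "# text = dogs chase cats",
    "1\tdogs\tdog\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\tchase\tchase\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tcats\tcat\tNOUN\t_\t_\t2\tobj\t_\t_",
])

counts = noun_relation_counts(SAMPLE)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```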

Parents of NOUN nodes belong to 13 different parts of speech: VERB (15548; 48% instances), NOUN (13626; 42% instances), ADJ (1126; 3% instances), ADV (870; 3% instances), none (artificial root) (590; 2% instances), PROPN (306; 1% instances), NUM (165; 1% instances), PUNCT (39; 0% instances), PRON (34; 0% instances), AUX (25; 0% instances), DET (13; 0% instances), SYM (3; 0% instances), INTJ (2; 0% instances)

15830 (49%) NOUN nodes are leaves.

9841 (30%) NOUN nodes have one child.

4097 (13%) NOUN nodes have two children.

2579 (8%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 18.
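The child-degree figures above (leaves, one child, two children, three or more, maximum) could be derived by counting, for each NOUN token, how many tokens in the same sentence name it as their HEAD. The sketch below assumes standard CoNLL-U input; the function name and the sample sentences are hypothetical and only illustrate the counting.

```python
from collections import Counter

def noun_child_degrees(conllu_text: str) -> list:
    """Return the number of children of every NOUN token, sentence by sentence."""
    degrees = []
    sentence = []  # (token_id, upos, head) triples of the current sentence

    def flush():
        # Count how often each token ID is used as a HEAD within the sentence.
        children = Counter(head for _, _, head in sentence)
        for token_id, upos, _ in sentence:
            if upos == "NOUN":
                degrees.append(children[token_id])
        sentence.clear()

    # The extra empty line guarantees that the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            flush()
            continue
        if line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges and empty nodes.
        if len(cols) != 10 or "-" in cols[0] or "." in cols[0]:
            continue
        sentence.append((cols[0], cols[3], cols[6]))
    return degrees

# Hypothetical toy input, not taken from UD_Korean-GSD.
SAMPLE = "\n".join([
    "1\tthe\tthe\tDET\t_\t_\t3\tdet\t_\t_",
    "2\told\told\tADJ\t_\t_\t3\tamod\t_\t_",
    "3\tdog\tdog\tNOUN\t_\t_\t4\tnsubj\t_\t_",
    "4\tsleeps\tsleep\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "1\tcats\tcat\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\tpurr\tpurr\tVERB\t_\t_\t0\troot\t_\t_",
])

degrees = noun_child_degrees(SAMPLE)
leaves = sum(1 for d in degrees if d == 0)
three_or_more = sum(1 for d in degrees if d >= 3)
print(degrees, leaves, three_or_more, max(degrees))
```

Token IDs restart in every sentence, which is why the children are counted per sentence and not over the whole file.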

Children of NOUN nodes are attached using 27 different relations: flat (10185; 37% instances), punct (3006; 11% instances), acl:relcl (2410; 9% instances), conj (2034; 7% instances), det:poss (1865; 7% instances), dep (1613; 6% instances), case (1175; 4% instances), amod (1117; 4% instances), appos (913; 3% instances), nsubj (543; 2% instances), advmod (526; 2% instances), det (400; 1% instances), obj (371; 1% instances), nummod (350; 1% instances), nmod (285; 1% instances), advcl (217; 1% instances), cc (169; 1% instances), obl (114; 0% instances), cop (65; 0% instances), ccomp (39; 0% instances), nsubj:pass (14; 0% instances), fixed (12; 0% instances), iobj (8; 0% instances), mark (8; 0% instances), aux (5; 0% instances), csubj (3; 0% instances), acl (2; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (13626; 50% instances), VERB (4370; 16% instances), PUNCT (3035; 11% instances), ADV (2430; 9% instances), ADP (1261; 5% instances), ADJ (1088; 4% instances), NUM (487; 2% instances), DET (401; 1% instances), PROPN (228; 1% instances), PRON (181; 1% instances), CCONJ (169; 1% instances), SYM (139; 1% instances), PART (24; 0% instances), AUX (5; 0% instances), X (5; 0% instances)