home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-Kaist: POS Tags: PROPN

There are 5871 PROPN lemmas (6%), 5868 PROPN types (6%) and 12367 PROPN tokens (4%). Out of 17 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 8 in number of tokens.

The 10 most frequent PROPN lemmas: 일본, 일본+의, 한국+의, 한국, 중국, 중국+의, 흉노, 미국+의, 미국, 영국+의

The 10 most frequent PROPN types: 일본, 일본의, 한국의, 한국, 중국, 중국의, 흉노, 미국의, 미국, 영국의

The 10 most frequent ambiguous lemmas: 한국 (PROPN 113, NOUN 1), 미국+의 (PROPN 77, NOUN 1), 남훈+은 (PROPN 59, NOUN 1), 하나님+의 (PROPN 41, NOUN 1), 만해+의 (PROPN 37, NOUN 1), 조선 (PROPN 36, NOUN 2), 한 (NUM 578, ADJ 69, NOUN 46, PROPN 32, DET 4), 노 (PROPN 30, NOUN 3), 유럽+의 (PROPN 30, NOUN 1), 장 (PROPN 23, NOUN 8)

The 10 most frequent ambiguous types: 한국 (PROPN 113, NOUN 1), 흉노 (PROPN 78, ADV 1), 미국의 (PROPN 77, NOUN 1), 남훈은 (PROPN 59, NOUN 1), 하나님의 (PROPN 41, NOUN 1), 만해의 (PROPN 37, NOUN 1), 조선 (PROPN 36, NOUN 2), 한 (NUM 577, VERB 173, ADJ 69, NOUN 46, AUX 41, PROPN 32, DET 4, PART 2), 노 (PROPN 30, NOUN 3), 유럽의 (PROPN 30, NOUN 1)

Morphology

The form / lemma ratio of PROPN is 0.999489 (the average of all parts of speech is 0.998034).

The 1st highest number of forms (2) was observed with the lemma “가이아”: 가이아, 가이야.

The 2nd highest number of forms (2) was observed with the lemma “갈릴레이+는”: 갈릴레이느, 갈릴레이는.

The 3rd highest number of forms (2) was observed with the lemma “네덜란드”: 네덜란드, 네들란드.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 19 different relations: compound (3356; 27% instances), nmod (2971; 24% instances), dislocated (2061; 17% instances), nsubj (1012; 8% instances), conj (962; 8% instances), obj (707; 6% instances), flat (496; 4% instances), appos (442; 4% instances), obl (104; 1% instances), advcl (103; 1% instances), dep (68; 1% instances), root (40; 0% instances), ccomp (16; 0% instances), csubj (9; 0% instances), acl (6; 0% instances), amod (6; 0% instances), iobj (5; 0% instances), vocative (2; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 13 different parts of speech: NOUN (4598; 37% instances), VERB (3281; 27% instances), ADV (1473; 12% instances), PROPN (1332; 11% instances), CCONJ (980; 8% instances), SCONJ (442; 4% instances), ADJ (172; 1% instances), (40; 0% instances), NUM (34; 0% instances), X (7; 0% instances), PRON (4; 0% instances), INTJ (2; 0% instances), PART (2; 0% instances)

9265 (75%) PROPN nodes are leaves.

1653 (13%) PROPN nodes have one child.

1050 (8%) PROPN nodes have two children.

399 (3%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 13.

Children of PROPN nodes are attached using 23 different relations: punct (1722; 33% instances), conj (625; 12% instances), flat (606; 12% instances), acl (595; 11% instances), compound (480; 9% instances), appos (346; 7% instances), case (276; 5% instances), nmod (221; 4% instances), amod (103; 2% instances), cop (58; 1% instances), cc (45; 1% instances), dislocated (41; 1% instances), det (17; 0% instances), obl (17; 0% instances), advmod (16; 0% instances), dep (14; 0% instances), ccomp (13; 0% instances), advcl (9; 0% instances), nummod (8; 0% instances), nsubj (6; 0% instances), obj (5; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Children of PROPN nodes belong to 16 different parts of speech: PUNCT (1722; 33% instances), PROPN (1332; 25% instances), NOUN (683; 13% instances), VERB (643; 12% instances), ADP (249; 5% instances), CCONJ (164; 3% instances), ADV (149; 3% instances), X (79; 2% instances), ADJ (71; 1% instances), AUX (58; 1% instances), NUM (23; 0% instances), DET (17; 0% instances), PRON (15; 0% instances), SCONJ (12; 0% instances), PART (5; 0% instances), SYM (3; 0% instances)