Treebank Statistics: UD_Korean-Kaist: POS Tags: PROPN
There are 5871 PROPN lemmas (6%), 5868 PROPN types (6%) and 12367 PROPN tokens (4%).
Out of 17 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 8 in number of tokens.
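As an illustration of how lemma, type and token counts of this kind could be derived from a CoNLL-U file, here is a minimal sketch. The sample rows, the helper names `tag_counts` and `rank`, and the choice to count types as case-sensitive word forms are assumptions made for the example; they are not taken from the tool that generated this page.

```python
from collections import defaultdict

# Hypothetical CoNLL-U rows (not from the treebank); written with spaces
# for readability and converted to the tab-separated format below.
ROWS = [
    "1 일본의 일본+의 PROPN _ _ 2 nmod _ _",
    "2 문화 문화 NOUN _ _ 0 root _ _",
    "",
    "1 일본 일본 PROPN _ _ 2 compound _ _",
    "2 문화는 문화+는 NOUN _ _ 3 nsubj _ _",
    "3 일본의 일본+의 PROPN _ _ 0 root _ _",
]
SAMPLE = "\n".join("\t".join(row.split()) for row in ROWS)


def tag_counts(conllu_text):
    """Return {UPOS: (distinct lemmas, distinct forms, tokens)}."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment line
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}


def rank(counts, tag, index):
    """Rank of `tag` (1 = highest) by lemmas (0), types (1) or tokens (2)."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1


counts = tag_counts(SAMPLE)
print(counts["PROPN"], rank(counts, "PROPN", 2))
```

On the real treebank the same loop would be run over all of its CoNLL-U files, and the percentages would be each count divided by the corresponding total over all tags.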
The 10 most frequent PROPN lemmas: 일본, 일본+의, 한국+의, 한국, 중국, 중국+의, 흉노, 미국+의, 미국, 영국+의
The 10 most frequent PROPN types: 일본, 일본의, 한국의, 한국, 중국, 중국의, 흉노, 미국의, 미국, 영국의
The 10 most frequent ambiguous lemmas: 한국 (PROPN 113, NOUN 1), 미국+의 (PROPN 77, NOUN 1), 남훈+은 (PROPN 59, NOUN 1), 하나님+의 (PROPN 41, NOUN 1), 만해+의 (PROPN 37, NOUN 1), 조선 (PROPN 36, NOUN 2), 한 (NUM 578, ADJ 69, NOUN 46, PROPN 32, DET 4), 노 (PROPN 30, NOUN 3), 유럽+의 (PROPN 30, NOUN 1), 장 (PROPN 23, NOUN 8)
The 10 most frequent ambiguous types: 한국 (PROPN 113, NOUN 1), 흉노 (PROPN 78, ADV 1), 미국의 (PROPN 77, NOUN 1), 남훈은 (PROPN 59, NOUN 1), 하나님의 (PROPN 41, NOUN 1), 만해의 (PROPN 37, NOUN 1), 조선 (PROPN 36, NOUN 2), 한 (NUM 577, VERB 173, ADJ 69, NOUN 46, AUX 41, PROPN 32, DET 4, PART 2), 노 (PROPN 30, NOUN 3), 유럽의 (PROPN 30, NOUN 1)
- 한국
- 흉노
- 미국의
- 남훈은
- 하나님의
- 만해의
- 조선
- 한
  - NUM 577: 김규식 선생도 그런 고아의 한 예이다 .
  - VERB 173: 한 것에 놀라지 않을 수 없었다 .
  - ADJ 69: 헤겔의 Sittlichkeit도 그 한 예거니와 , 민주주의나 공산주의 혁명이론도 또한 마찬가지다 .
  - NOUN 46: 이처럼 재벌들이 쓰러질 것을 각오하고 체질을 개선하려 하지 않는 한 개선은 불가능하다 .
  - AUX 41: 고려가요를 산출하게 한 고려사회는 중세적 질서가 한층 강화된 사회이다 .
  - PROPN 32: 한 , 일 생활문화 비교 사회자 감사합니다 .
  - DET 4: 지난 20년 동안 한 250차례 대화가 왔다갔다했습니다 .
  - PART 2: 그것에 의하면 제논이 운동의 개념에 의해 증시 ( 證示 ) 한 모순은 바로 다음과 같은 사실로서 인정되지 않으면 안 된다 .
- 노
- 유럽의
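The ambiguity lists above can be reproduced in spirit with a small tally of tags per word form. The sketch below is an assumption about the procedure, not the generator's actual code: the helper `ambiguous_types` and the sample `(form, tag)` pairs are hypothetical, and sorting by the frequency of the tag of interest is inferred from the order of the entries listed above.

```python
from collections import Counter, defaultdict


def ambiguous_types(pairs, tag):
    """Forms seen with `tag` and at least one other tag, most frequent first."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1
    ambiguous = {
        form: tags
        for form, tags in by_form.items()
        if len(tags) > 1 and tags[tag] > 0
    }
    # Order by how often the form carries the tag of interest.
    return sorted(ambiguous.items(), key=lambda item: item[1][tag], reverse=True)


# Hypothetical (form, UPOS) observations, not real treebank counts.
pairs = [
    ("한", "NUM"), ("한", "NUM"), ("한", "PROPN"),
    ("조선", "PROPN"), ("조선", "PROPN"), ("조선", "NOUN"),
    ("일본", "PROPN"),
]
result = ambiguous_types(pairs, "PROPN")
for form, tags in result:
    print(form, tags.most_common())
```

Passing `(lemma, tag)` pairs instead of `(form, tag)` pairs would give the corresponding list of ambiguous lemmas.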
Morphology
The form / lemma ratio of PROPN is 0.999489 (the average of all parts of speech is 0.998034).
The 1st highest number of forms (2) was observed with the lemma “가이아”: 가이아, 가이야.
The 2nd highest number of forms (2) was observed with the lemma “갈릴레이+는”: 갈릴레이느, 갈릴레이는.
The 3rd highest number of forms (2) was observed with the lemma “네덜란드”: 네덜란드, 네들란드.
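The reported ratio is numerically consistent with dividing the number of PROPN types by the number of PROPN lemmas given at the top of the page (5868 / 5871). The sketch below checks that arithmetic and shows one way to find the lemmas with the most distinct forms. The helper `forms_per_lemma` is hypothetical, the pair list reuses only the forms quoted above plus one extra illustrative entry, and breaking ties alphabetically by lemma is an assumption.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct word forms observed for it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


# (form, lemma) pairs; the first four come from the lists above.
pairs = [
    ("가이아", "가이아"), ("가이야", "가이아"),
    ("네덜란드", "네덜란드"), ("네들란드", "네덜란드"),
    ("일본", "일본"),
]
forms = forms_per_lemma(pairs)

# Lemmas ordered by number of forms (descending), ties broken by lemma.
top = sorted(forms, key=lambda lemma: (-len(forms[lemma]), lemma))

# Ratio check using the type and lemma counts reported for PROPN.
ratio = round(5868 / 5871, 6)
print(top[:2], ratio)
```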
PROPN does not occur with any features.
Relations
PROPN nodes are attached to their parents using 19 different relations: compound (3356; 27% instances), nmod (2971; 24% instances), dislocated (2061; 17% instances), nsubj (1012; 8% instances), conj (962; 8% instances), obj (707; 6% instances), flat (496; 4% instances), appos (442; 4% instances), obl (104; 1% instances), advcl (103; 1% instances), dep (68; 1% instances), root (40; 0% instances), ccomp (16; 0% instances), csubj (9; 0% instances), acl (6; 0% instances), amod (6; 0% instances), iobj (5; 0% instances), vocative (2; 0% instances), xcomp (1; 0% instances)
Parents of PROPN nodes belong to 13 different parts of speech: NOUN (4598; 37% instances), VERB (3281; 27% instances), ADV (1473; 12% instances), PROPN (1332; 11% instances), CCONJ (980; 8% instances), SCONJ (442; 4% instances), ADJ (172; 1% instances), [no parent: root node] (40; 0% instances), NUM (34; 0% instances), X (7; 0% instances), PRON (4; 0% instances), INTJ (2; 0% instances), PART (2; 0% instances)
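A distribution of parent tags like the one above can be obtained by following each node's HEAD index to its parent. The following is a minimal sketch under stated assumptions: the sentence is a hypothetical list of `(id, upos, head)` triples, `parent_tags` is an illustrative helper, and a root node (head 0) is recorded with an empty tag, which would explain why one of the 13 categories has no tag name and matches the 40 root attachments.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
sentence = [
    (1, "PROPN", 2),
    (2, "NOUN", 3),
    (3, "PROPN", 0),
    (4, "PROPN", 3),
]


def parent_tags(sentence, tag):
    """Count the UPOS tags of the parents of all nodes tagged `tag`."""
    upos_by_id = {tok_id: upos for tok_id, upos, _ in sentence}
    # A root node has head 0, which is not a token: record "" for it.
    return Counter(
        upos_by_id.get(head, "")
        for _, upos, head in sentence
        if upos == tag
    )


parents = parent_tags(sentence, "PROPN")
print(parents)
```

Counting the DEPREL column of the same nodes instead of the parent tag would give the relation distribution.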
9265 (75%) PROPN nodes are leaves.
1653 (13%) PROPN nodes have one child.
1050 (8%) PROPN nodes have two children.
399 (3%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 13.
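The leaf and child-degree figures can be illustrated by counting, for each node, how many other nodes name it as their head. This is a minimal sketch with a hypothetical sentence and a hypothetical helper `child_degrees`; it is not the code used to produce the figures above.

```python
from collections import Counter


def child_degrees(sentence, tag):
    """Number of children of every node tagged `tag`, in sentence order."""
    # Each non-root token contributes one child to its head.
    n_children = Counter(head for _, _, head in sentence if head != 0)
    return [n_children[tok_id] for tok_id, upos, _ in sentence if upos == tag]


# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
sentence = [
    (1, "PROPN", 3),
    (2, "PUNCT", 3),
    (3, "PROPN", 4),
    (4, "VERB", 0),
    (5, "PROPN", 4),
]
degrees = child_degrees(sentence, "PROPN")
leaves = sum(1 for d in degrees if d == 0)
print(degrees, leaves, max(degrees))
```

Here two of the three PROPN nodes are leaves and the highest child degree is 2. Over the whole treebank the per-sentence results would be pooled before the percentages are taken.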
Children of PROPN nodes are attached using 23 different relations: punct (1722; 33% instances), conj (625; 12% instances), flat (606; 12% instances), acl (595; 11% instances), compound (480; 9% instances), appos (346; 7% instances), case (276; 5% instances), nmod (221; 4% instances), amod (103; 2% instances), cop (58; 1% instances), cc (45; 1% instances), dislocated (41; 1% instances), det (17; 0% instances), obl (17; 0% instances), advmod (16; 0% instances), dep (14; 0% instances), ccomp (13; 0% instances), advcl (9; 0% instances), nummod (8; 0% instances), nsubj (6; 0% instances), obj (5; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)
Children of PROPN nodes belong to 16 different parts of speech: PUNCT (1722; 33% instances), PROPN (1332; 25% instances), NOUN (683; 13% instances), VERB (643; 12% instances), ADP (249; 5% instances), CCONJ (164; 3% instances), ADV (149; 3% instances), X (79; 2% instances), ADJ (71; 1% instances), AUX (58; 1% instances), NUM (23; 0% instances), DET (17; 0% instances), PRON (15; 0% instances), SCONJ (12; 0% instances), PART (5; 0% instances), SYM (3; 0% instances)