Statistics of NOUN in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Korean-KSL: POS Tags: `NOUN`

There are 11228 NOUN lemmas (39%), 11141 NOUN types (39%) and 40304 NOUN tokens (29%). Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: 수, 것+이, 한국어+를, 사람+들+이, 씨+는, 것, 사람+이, 것+을, 한국, 것+은

The 10 most frequent NOUN types: 수, 것이, 한국어를, 사람들이, 씨는, 것, 사람이, 것을, 한국, 것은

The 10 most frequent ambiguous lemmas: 수 (NOUN 1865, DET 2, ADV 1, NUM 1), 것+이 (NOUN 476, ADJ 1), 친구 (NOUN 103, ADV 1), 다음 (NOUN 93, ADV 1, SCONJ 1), 뿐+만 (NOUN 72, ADP 38), 아이+들+이 (NOUN 66, PRON 1), 이번 (NOUN 62, DET 1), 학교 (NOUN 53, ADV 1), 자기 (NOUN 50, PRON 49), 자신+의 (NOUN 49, PRON 1)

The 10 most frequent ambiguous types: 수 (NOUN 1865, DET 2, ADV 1, NUM 1, X 1), 것이 (NOUN 440, ADJ 1), 것 (NOUN 286, ADJ 1), 자연 (NOUN 166, ADV 1), 한국어 (NOUN 130, VERB 1), 생일 (NOUN 108, VERB 2), 친구 (NOUN 103, ADV 1), 다음 (NOUN 93, ADV 1, SCONJ 1), 사람 (NOUN 82, ADV 1), 뿐만 (NOUN 72, ADP 38)

수
- NOUN 1865: 한국어도 배울 수 있습니다 .
- DET 2: 수 만 명 사람들 중에서 1위를 차지다면 성공인라는 관점을 사회에게 분편적인 인정하는 것이다 .
- ADV 1: 아파트는 편리한 시설을 제공할 수 뿐만 아니라 안전도 좋다 .
- NUM 1: 평생에 한 두 번이나 가보고 싶은 섬이며 , 해마다 수 십만의 관광갱이 물려드는 아름다운 섬은 홍도이다 .
- X 1: 우리 생활이 편리해질 수 록 환경오염은 심각해진다 .
것이
- NOUN 440: 나는 어렸을 때 드라마를 보는 것이 좋았다 .
- ADJ 1: 그러나 자연을 이렇게 과하게 개발하면 몰론 안되는 것이 라고 생각한다 .
것
- NOUN 286: 그런데 내가 금연에 찬성하는 이유는 단순히 담배 냄새를 싫어하는 것 때문이 아니라 건강을 위해서이다 .
- ADJ 1: 조기 언어교윤은 외국어 배우기 위에 너무 중요한 것 입니다 .
자연
- NOUN 166: 저는 자연 개발보다 자연 보존이 더 중요하다고 생각합니다 .
- ADV 1: 즉 , 한국어를 한국사람처럼 자연 스럽게 하고 싶으며 한국어를 오래시간 기억하고 싶으면 한국어 오고 나서 한국어를 공부한다 .
한국어
- NOUN 130: 정말 한국어 좋아하는 사람이 꼭 오세요 .
- VERB 1: 요코라 , 걱정하지 말고 자신을 가지고 한국어 오세요 .
생일
- NOUN 108: 동대문 근처에 식당에서 제 생일 파티를 합니다 .
- VERB 2: 사람들이 이 세계의 역사를 정화긴 모르면 , 그 역사가 다시 생일 수도 있읍니다 .
친구
- NOUN 103: 우리 반 친구 유카 씨 소개합니다 .
- ADV 1: 저는 친구 같이 사직을 많이 찍었습니다 .
다음
- NOUN 93: 다음 주에 고향 친구가 한국에 올 겁니다 .
- ADV 1: 다음 나는 같이 갈다 .
- SCONJ 1: 여유있는 척해서 출연자하고 촬영내용을 토론한 다음 드디어 촬영을 시작했다 .
사람
- NOUN 82: 두 사람 같이 앉아서 재미있는 한국영화를 봤습니다 .
- ADV 1: 우리 반에서 다른 나라 사람 보다 일본 사람이 많습니다 .
뿐만
- NOUN 72: 터미네이터는 재미있을 뿐만 아니라 감동적은 장면 도 있다 .
- ADP 38: 이것 뿐만 아니라 다른 공룡들이 이 하이브리드 공룡에게 잡히고 먹힌다 .

Morphology

The form / lemma ratio of NOUN is 0.992252 (the average of all parts of speech is 1.007876).

The 1st highest number of forms (2) was observed with the lemma “3+월+부터”: 3월, 3월부터.

The 2nd highest number of forms (2) was observed with the lemma “것+을”: 걸, 것을.

The 3rd highest number of forms (2) was observed with the lemma “것+이”: 것이, 게.

NOUN occurs with 1 features: Typo (2129; 5% instances)

NOUN occurs with 1 feature-value pairs: Typo=Yes

NOUN occurs with 2 feature combinations. The most frequent feature combination is _ (38175 tokens). Examples: 수, 것이, 한국어를, 사람들이, 씨는, 것, 사람이, 것을, 한국, 것은

Relations

NOUN nodes are attached to their parents using 22 different relations: nsubj (15216; 38% instances), obj (11009; 27% instances), nmod (4279; 11% instances), obl (3152; 8% instances), nmod:poss (1816; 5% instances), conj (1405; 3% instances), dislocated (1125; 3% instances), flat (948; 2% instances), list (512; 1% instances), root (206; 1% instances), advcl (201; 0% instances), acl (94; 0% instances), compound (91; 0% instances), appos (87; 0% instances), amod (48; 0% instances), compound:lvc (41; 0% instances), vocative (31; 0% instances), ccomp (27; 0% instances), csubj (10; 0% instances), parataxis (4; 0% instances), dep (1; 0% instances), reparandum (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: VERB (21624; 54% instances), NOUN (7944; 20% instances), ADJ (7060; 18% instances), AUX (1857; 5% instances), ADV (1510; 4% instances), (206; 1% instances), PRON (44; 0% instances), NUM (26; 0% instances), DET (17; 0% instances), ADP (14; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

21479 (53%) NOUN nodes are leaves.

16132 (40%) NOUN nodes have one child.

2026 (5%) NOUN nodes have two children.

667 (2%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 9.

Children of NOUN nodes are attached using 30 different relations: acl (6190; 27% instances), nmod (3518; 16% instances), amod (2328; 10% instances), nmod:poss (1842; 8% instances), conj (1573; 7% instances), det (1429; 6% instances), case (1242; 6% instances), flat (1155; 5% instances), punct (1004; 4% instances), nsubj (426; 2% instances), list (412; 2% instances), nummod (334; 1% instances), obj (271; 1% instances), advmod (209; 1% instances), advcl (147; 1% instances), obl (99; 0% instances), appos (91; 0% instances), compound (71; 0% instances), goeswith (70; 0% instances), cc (39; 0% instances), ccomp (18; 0% instances), dislocated (18; 0% instances), mark (16; 0% instances), aux (13; 0% instances), vocative (5; 0% instances), csubj (2; 0% instances), parataxis (2; 0% instances), compound:lvc (1; 0% instances), dep (1; 0% instances), reparandum (1; 0% instances)

Children of NOUN nodes belong to 13 different parts of speech: NOUN (7944; 35% instances), VERB (5548; 25% instances), ADJ (2939; 13% instances), DET (1543; 7% instances), ADP (1247; 6% instances), PUNCT (1004; 4% instances), PRON (797; 4% instances), ADV (763; 3% instances), NUM (363; 2% instances), AUX (254; 1% instances), X (70; 0% instances), CCONJ (39; 0% instances), SCONJ (16; 0% instances)

Treebank Statistics: UD_Korean-KSL: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Korean-KSL: POS Tags: `NOUN`