Statistics of PUNCT in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Korean-GSD: POS Tags: `PUNCT`

There are 102 PUNCT lemmas (0%), 105 PUNCT types (0%) and 10411 PUNCT tokens (13%). Out of 16 observed tags, the rank of PUNCT is: 10 in number of lemmas, 10 in number of types and 4 in number of tokens.

The 10 most frequent PUNCT lemmas: ., ,, ‘, (, ), “, %, ?, !, •

The 10 most frequent PUNCT types: ., ,, ‘, (, ), “, %, ?, !, •

The 10 most frequent ambiguous lemmas: % (PUNCT 137, SYM 45), ? (PUNCT 134, SYM 1), ~ (PUNCT 69, SYM 1), 이+다 (PUNCT 19, NOUN 1, VERB 1), ㎡ (PUNCT 13, SYM 4), ㎞ (SYM 7, PUNCT 5), ^ (PUNCT 3, SYM 1), ℓ (PUNCT 3, SYM 1), ㎢ (PUNCT 3, SYM 3), ㎝ (PUNCT 2, SYM 1)

The 10 most frequent ambiguous types: % (PUNCT 137, SYM 45), ? (PUNCT 134, SYM 1), ~ (PUNCT 69, SYM 1), 이다 (PUNCT 14, AUX 1, NOUN 1, VERB 1), ㎡ (PUNCT 13, SYM 4), ㎞ (SYM 7, PUNCT 5), 다 (ADV 46, PUNCT 5, NOUN 3), ^ (PUNCT 3, SYM 1), ℓ (PUNCT 3, SYM 1), ㎢ (PUNCT 3, SYM 3)

%
- PUNCT 137: 협상과 교섭은 100 % 란 게 없다 .
- SYM 45: 구글맵에 10 % 쿠폰이 있어서 휴대폰에서 보여주고 구매했습니다 .
?
- PUNCT 134: 일반 삼계닭이 아닌가 ?
- SYM 1: 단도제 ( 檀道濟 , ? ~ 436년 ) 는 중국 남북조시대 송나라의 장군이다 .
~
- PUNCT 69: 디자이너 분이 친절하고 디자인이 굳입니다 ~
- SYM 1: 또 이후 노사합의에 따라 대출금 지원 등의 명목으로 성과급이 지급됐지만 세금 ( 소득세 ) 등을 감안하면 실질적으로 실제 대출금의 50 ~ 60 % 만 지원된 것으로 알려진다 .
이다
- PUNCT 14: 역할은 사채업자들에게 쫓기는 고집 센 가수 지망생 ‘ 소연 ‘ 이다 .
- AUX 1: 직거래를 하면 그들은 비싸게 팔고 우리는 싸게 사니 , ‘ 윈윈 ‘ 이다 “ 고 말했다 .
- NOUN 1: 그 중에서 153 힐다 , 216 클레오파트라 , 243 이다 , 253 마틸데 , 324 밤베르가 , 719 알베르트 등이 유명하다 .
- VERB 1: 최고 속도는 워프 8.8 이다 .
㎡
- PUNCT 13: 주목할 사항은 분양가가 3.3 ㎡ 당 소형면적의 경우 2100만원부터 시작한다는 점이다 .
- SYM 4: 전용 85 ㎡ 이하로만 구성됐다 .
㎞
- SYM 7: 도쿄는 후쿠시마 원전에서 남서쪽으로 240 ㎞ 정도 떨어져 있다 .
- PUNCT 5: 총 연장길이는 14.3 ㎞ 로 , 영동고속도로 , 서울외곽순환고속도로 등과 연결된다 .
다
- ADV 46: 모든 음식이 다 깔끔하고 최고입니다 .
- PUNCT 5: 김 감독이 이날 취재진과 인터뷰에서 유독 강조했던 것은 ‘ 정정당당한 야구 ‘ 다 .
- NOUN 3: “ 다 모이면 웃느라고 정신이 없다 “ 는 것이다 .
- PUNCT 3: 회도 양이 짱 많아서 넘넘 좋아요 ^ 0 ^
- SYM 1: 회도 양이 짱 많아서 넘넘 좋아요 ^ 0 ^
ℓ
- PUNCT 3: 연비 또한 ℓ 당 10.6 ㎞ 로 ES350 ( ℓ 당 9.6 ㎞ ) 보다 연료효율이 높다 .
- SYM 1: 전 모델 10기통 5.2 ℓ 엔진을 탑재 최고출력 550~570마력의 힘과 최고시속 320~325 ㎞ 의 속도를 자랑한다 .
㎢
- PUNCT 3: 넓이는 627 ㎢ 이고 , 인구는 2007년 기준으로 370,000명이다 .
- SYM 3: 네르케 ( ) 는 스웨덴 중부 스베알란드 지역을 구성하는 지방 가운데 하나로 , 면적은 4,122 ㎢ , 인구는 195,414명 ( 2009년 기준 ) 이다 .

Morphology

The form / lemma ratio of PUNCT is 1.029412 (the average of all parts of speech is 1.001499).

The 1st highest number of forms (2) was observed with the lemma “<”: <, <.

The 2nd highest number of forms (2) was observed with the lemma “이+다”: 다, 이다.

The 3rd highest number of forms (2) was observed with the lemma “이+었+다”: 였다, 이었다.

PUNCT occurs with 1 features: NumType (16; 0% instances)

PUNCT occurs with 1 feature-value pairs: NumType=Card

PUNCT occurs with 2 feature combinations. The most frequent feature combination is _ (10395 tokens). Examples: ., ,, ‘, (, ), “, %, ?, !, •

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (10411; 100% instances)

Parents of PUNCT nodes belong to 15 different parts of speech: VERB (4842; 47% instances), NOUN (3093; 30% instances), ADJ (824; 8% instances), SYM (428; 4% instances), NUM (371; 4% instances), PROPN (371; 4% instances), ADV (251; 2% instances), ADP (144; 1% instances), AUX (25; 0% instances), DET (22; 0% instances), PRON (18; 0% instances), PUNCT (8; 0% instances), CCONJ (6; 0% instances), INTJ (6; 0% instances), PART (2; 0% instances)

10407 (100%) PUNCT nodes are leaves.

0 (0%) PUNCT nodes have one child.

4 (0%) PUNCT nodes have two children.

The highest child degree of a PUNCT node is 2.

Children of PUNCT nodes are attached using 1 different relations: punct (8; 100% instances)

Children of PUNCT nodes belong to 1 different parts of speech: PUNCT (8; 100% instances)

Treebank Statistics: UD_Korean-GSD: POS Tags: PUNCT

Morphology

Relations

Treebank Statistics: UD_Korean-GSD: POS Tags: `PUNCT`