Statistics of NUM in UD

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-Kaist: POS Tags: `NUM`

There are 1372 NUM lemmas (1%), 1353 NUM types (1%) and 4848 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 11 in number of tokens.
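As a rough illustration of how such counts can be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. It assumes the standard 10-column CoNLL-U layout; the helper names `row` and `upos_counts` and the toy sentences are hypothetical and are not taken from the treebank or from the script that generated this page.

```python
def row(i, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (unused columns set to '_')."""
    return "\t".join([str(i), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])

def upos_counts(conllu_text, upos="NUM"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        # Skip malformed lines, multiword token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(types), tokens

# Toy data for illustration only; not real treebank annotation.
SAMPLE = "\n".join([
    "# toy sentence 1",
    row(1, "한", "한", "NUM", 2, "nummod"),
    row(2, "예", "예", "NOUN", 0, "root"),
    "",
    "# toy sentence 2",
    row(1, "두", "두", "NUM", 2, "nummod"),
    row(2, "학교", "학교", "NOUN", 0, "root"),
    "",
    "# toy sentence 3",
    row(1, "한", "한", "NUM", 2, "nummod"),
    row(2, "사람", "사람", "NOUN", 0, "root"),
])

print(upos_counts(SAMPLE))
```

Whether the published figures normalize case or treat forms in some other way is not stated here, so this sketch compares forms exactly as written.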

The 10 most frequent NUM lemmas: 한, 두, 1, 하나+의, 세, 2, 3, 10, 5, 하나+는

The 10 most frequent NUM types: 한, 두, 1, 하나의, 세, 2, 3, 10, 5, 하나는

The 10 most frequent ambiguous lemmas: 한 (NUM 578, ADJ 69, NOUN 46, PROPN 32, DET 4), 두 (NUM 375, ADJ 17), 하나+의 (NUM 104, NOUN 71), 세 (NUM 101, NOUN 40, ADJ 7), 하나+는 (NUM 56, NOUN 28), 첫 (NUM 35, ADJ 13), 네 (NUM 32, PRON 28, ADJ 3), 만 (NUM 25, NOUN 8, ADJ 1, ADP 1), 하나 (NUM 23, NOUN 11, CCONJ 1), 3+천 (NUM 21, NOUN 1)

The 10 most frequent ambiguous types: 한 (NUM 577, VERB 173, ADJ 69, NOUN 46, AUX 41, PROPN 32, DET 4, PART 2), 두 (NUM 375, ADJ 17), 1 (NUM 132, NOUN 1), 하나의 (NUM 104, NOUN 71), 세 (NUM 101, NOUN 40, ADJ 7), 2 (NUM 95, NOUN 1), 10 (NUM 67, NOUN 1), 하나는 (NUM 56, NOUN 28, ADV 1), 둘째 (NUM 40, NOUN 2), 첫째 (NUM 36, NOUN 2)

한
- NUM 577: 김규식 선생도 그런 고아의 한 예이다 .
- VERB 173: 한 것에 놀라지 않을 수 없었다 .
- ADJ 69: 헤겔의 Sittlichkeit도 그 한 예거니와 , 민주주의나 공산주의 혁명이론도 또한 마찬가지다 .
- NOUN 46: 이처럼 재벌들이 쓰러질 것을 각오하고 체질을 개선하려 하지 않는 한 개선은 불가능하다 .
- AUX 41: 고려가요를 산출하게 한 고려사회는 중세적 질서가 한층 강화된 사회이다 .
- PROPN 32: 한 , 일 생활문화 비교 사회자 감사합니다 .
- DET 4: 지난 20년 동안 한 250차례 대화가 왔다갔다했습니다 .
- PART 2: 그것에 의하면 제논이 운동의 개념에 의해 증시 ( 證示 ) 한 모순은 바로 다음과 같은 사실로서 인정되지 않으면 안 된다 .
두
- NUM 375: 이리하여 게일 목사에 의하여 남녀 두 학교가 연못골에서 밀러 학교의 맥을 이어 다시 일어난 것이다 .
- ADJ 17: 소모사가르시아의 두 아들 , 루이스소모사와 아니스타시오 ( 타치오 ) 소모사는 모두 미국에서 대학교육을 받았다 .
1
- NUM 132: 러시아 10 월 혁명이 발발한 지 꼭 1 년 만에 유럽에서 혁명이 일어났다 .
- NOUN 1: 그 타도대상이 1 차봉기에는 민씨정권이었다면 , 2 차봉기에는 내적으로는 갑오정권이었고 , 외적으로는 일본이었다 .
하나의
- NUM 104: 즉 하나의 책을 놓고 볼 때 도서는 하나의 단일한 주제에 대하여 깊이 있는 내용을 담고 있다 .
- NOUN 71: 하나의 아이디어가 떠오르면 이것이 구체적인 상품으로 만들어져 상품가치를 지닐 것인가에 대한 판단을 하여야 한다 .
세
- NUM 101: 르쁠레는 가족유형을 다음과 같이 세 가지로 분류하였다 .
- NOUN 40: 14 세기인이었던 그에게는 그의 17 세 때 사망한 단테가 그랬듯이 필생토록 자신을 사로잡았던 여인이 있었습니다 .
- ADJ 7: 1961년 7월 온두라스의 수도인 테구라갈파 ( Tegucigalpa ) 에서 세 명의 니카라과인이 만났다 .
2
- NUM 95: 파피루스는 2 세기경 양피지가 사용되며 경쟁이 시작되었으나 , 출판의 공급이 수요에 딸려 한계성에 부딪치게 되었다 .
- NOUN 1: 그 타도대상이 1 차봉기에는 민씨정권이었다면 , 2 차봉기에는 내적으로는 갑오정권이었고 , 외적으로는 일본이었다 .
10
- NUM 67: 이어 10 월 혁명은 러시아인 이외의 각 민족에 대한 억압과 불평등을 마감했다 .
- NOUN 1: 또 칭기즈칸의 10 대조인 보포차르의 어머니다 빛을 받아 세 아들을 낳았다는 신화는 우리네 부여나 고구려 신화와 일치하여 흥미를 끈다 .
하나는
- NUM 56: 오늘날 출판인들이 입을 모아 하는 말들 중의 하나는 출판의 방향을 예측하기도 어렵고 전문적인 출판이 자리를 잡기가 어렵다고 한다 .
- NOUN 28: 하나는 장기적으로 볼 때 우리 당이나 사회주의운동이 승리할 수 있다는 가능성을 믿지 못하기 때문이다 .
- ADV 1: 하나는 이른바 대륙의 이성주의고 , 또 하나는 영국에서 발전한 경험주의다 .
둘째
- NUM 40: 둘째 , 언론상품의 경우 일정량의 자본투하가 된 후의 상품은 아주 값싸게 재생산이 가능하다 .
- NOUN 2: 이렇게 볼 때 혁명과 해방이라는 자코뱅당의 이상도 정신적 진화의 둘째 단계인 형이상학적 추상적인 신화일 뿐이다 .
첫째
- NUM 36: 첫째 , 노동을 지향하는 인간의 합목적적인 행위 , 즉 노동 그 자체를 말한다 .
- NOUN 2: 이것이 첫째 문제의 답입니다 .
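The ambiguity lists above can be approximated by grouping tag counts per word form and keeping the forms that occur as NUM and under at least one other tag. The following is a minimal sketch under that assumption; the function name `ambiguous_forms` and the toy input are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous_forms(tagged, upos="NUM"):
    """tagged: iterable of (form, upos) pairs.

    Returns (form, tag counts) for forms seen with `upos` and at least one
    other tag, ordered by how often the form carries `upos`.
    """
    by_form = defaultdict(Counter)
    for form, tag in tagged:
        by_form[form][tag] += 1
    ambiguous = [(f, c) for f, c in by_form.items() if upos in c and len(c) > 1]
    return sorted(ambiguous, key=lambda item: -item[1][upos])

# Toy input for illustration only; the counts are not the treebank's.
pairs = (
    [("한", "NUM")] * 3 + [("한", "VERB")]
    + [("두", "NUM")] * 2 + [("두", "ADJ")]
    + [("세", "NUM")]  # unambiguous, so it is filtered out
)

for form, counts in ambiguous_forms(pairs):
    print(form, dict(counts))
```

The same function applies to lemmas if `(lemma, upos)` pairs are passed instead of `(form, upos)` pairs.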

Morphology

The form / lemma ratio of NUM is 0.986152 (the average of all parts of speech is 0.998034).
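The reported ratio is numerically consistent with dividing the number of NUM types by the number of NUM lemmas given at the top of this page. That reading is an assumption, since the page does not define the ratio; the check below only shows that the figures agree.

```python
num_types = 1353   # NUM types reported on this page
num_lemmas = 1372  # NUM lemmas reported on this page

# Assumed definition: distinct forms divided by distinct lemmas.
ratio = num_types / num_lemmas
print(f"{ratio:.6f}")  # prints 0.986152
```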

The highest number of forms observed for a single NUM lemma is 2. The top three lemmas by this measure are:

- “60+대+가”: 60년대가, 60대가
- “첫째”: 저체, 첫째
- “한”: 한, 한마디로

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (3303; 68% instances), compound (550; 11% instances), nmod (385; 8% instances), dislocated (230; 5% instances), nsubj (130; 3% instances), obj (122; 3% instances), conj (72; 1% instances), obl (23; 0% instances), csubj (16; 0% instances), advcl (6; 0% instances), root (6; 0% instances), dep (4; 0% instances), xcomp (1; 0% instances)
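The percentages can be reproduced from the raw counts. The short check below, offered as a sketch, confirms that the 13 relation counts sum to the 4848 NUM tokens and recomputes each share by rounding to the nearest whole percent; the rounding rule is an assumption that happens to match the published figures.

```python
relations = {
    "nummod": 3303, "compound": 550, "nmod": 385, "dislocated": 230,
    "nsubj": 130, "obj": 122, "conj": 72, "obl": 23, "csubj": 16,
    "advcl": 6, "root": 6, "dep": 4, "xcomp": 1,
}

total = sum(relations.values())
# Share of each relation, rounded to whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in relations.items()}

print(total)
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

Relations shown as 0% are therefore not absent; they simply account for less than half a percent of NUM tokens.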

Parents of NUM nodes belong to 13 different parts of speech, counting the artificial root, which has no tag, as one category: NOUN (2506; 52% instances), ADV (824; 17% instances), VERB (596; 12% instances), NUM (364; 8% instances), SYM (246; 5% instances), CCONJ (118; 2% instances), SCONJ (107; 2% instances), ADJ (42; 1% instances), PROPN (23; 0% instances), X (14; 0% instances), no tag, i.e. the root (6; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)

3740 (77%) NUM nodes are leaves.

832 (17%) NUM nodes have one child.

218 (4%) NUM nodes have two children.

58 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 6.
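The four child-count figures above add up to the 4848 NUM tokens (3740 + 832 + 218 + 58). One way to obtain such a distribution is to count, per sentence, how many tokens name each NUM token as their head. The sketch below assumes sentences are already parsed into `(id, upos, head)` tuples; the function name `child_degree_histogram` and the toy sentences are hypothetical.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="NUM"):
    """sentences: list of sentences, each a list of (id, upos, head) tuples.

    Returns a Counter mapping number of children -> number of `upos` nodes.
    """
    hist = Counter()
    for sent in sentences:
        # How many tokens point to each head id within this sentence.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1  # missing ids count as 0 (leaf)
    return hist

# Toy sentences for illustration only; head 0 marks the root.
toy = [
    [(1, "NUM", 2), (2, "NOUN", 0)],                   # NUM is a leaf
    [(1, "NOUN", 2), (2, "NUM", 0), (3, "PUNCT", 2)],  # NUM has two children
]

hist = child_degree_histogram(toy)
print(dict(hist))
print(3740 + 832 + 218 + 58)  # page figures: should equal the NUM token count
```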

Children of NUM nodes are attached using 21 different relations: compound (483; 33% instances), punct (228; 16% instances), case (145; 10% instances), nummod (125; 9% instances), nmod (105; 7% instances), amod (93; 6% instances), conj (87; 6% instances), acl (33; 2% instances), advmod (32; 2% instances), obl (28; 2% instances), det (22; 2% instances), dislocated (22; 2% instances), cop (15; 1% instances), advcl (11; 1% instances), nsubj (11; 1% instances), cc (8; 1% instances), appos (4; 0% instances), ccomp (2; 0% instances), obj (2; 0% instances), clf (1; 0% instances), dep (1; 0% instances)

Children of NUM nodes belong to 16 different parts of speech: NOUN (413; 28% instances), NUM (364; 25% instances), PUNCT (228; 16% instances), ADP (139; 10% instances), ADJ (90; 6% instances), ADV (62; 4% instances), VERB (36; 2% instances), PROPN (34; 2% instances), DET (22; 2% instances), SYM (21; 1% instances), AUX (15; 1% instances), CCONJ (13; 1% instances), X (8; 1% instances), PRON (6; 0% instances), PART (5; 0% instances), SCONJ (2; 0% instances)
