Treebank Statistics: UD_Korean-KSL: POS Tags: NUM
There are 71 NUM lemmas (0%), 68 NUM types (0%) and 572 NUM tokens (0%).
Out of 14 observed tags, the rank of NUM is: 8 in number of lemmas, 10 in number of types and 12 in number of tokens.
The 10 most frequent NUM lemmas: 한, 두, 세, 둘째, 첫, 첫째, 하나+는, 하나, 둘, 네
The 10 most frequent NUM types: 한, 두, 세, 둘째, 첫, 첫째, 하나는, 하나, 둘, 네
The 10 most frequent ambiguous lemmas: 한 (NUM 166, DET 25, SCONJ 3), 두 (NUM 112, DET 7, ADP 2, ADV 1), 세 (NUM 27, DET 1), 둘째 (NUM 26, DET 5, ADV 1), 첫 (NUM 24, ADV 1, DET 1), 첫째 (NUM 22, ADV 2, NOUN 1), 하나 (NUM 17, NOUN 1), 둘 (NUM 16, ADV 1), 네 (NUM 13, PRON 5, INTJ 2, DET 1), 수+십+만+의 (NUM 12, NOUN 1)
The 10 most frequent ambiguous types: 한 (NUM 167, VERB 64, DET 25, AUX 11, X 10, SCONJ 3, ADV 2, NOUN 1), 두 (NUM 112, DET 7, ADP 2, ADV 1), 세 (NUM 27, DET 1), 둘째 (NUM 26, DET 5, ADV 1), 첫 (NUM 25, ADV 1, DET 1), 첫째 (NUM 22, ADV 2, NOUN 1), 하나 (NUM 17, AUX 1, NOUN 1, VERB 1), 둘 (NUM 16, ADV 1, VERB 1), 네 (NUM 13, PRON 5, INTJ 2, DET 1), 수십만의 (NUM 12, NOUN 1)
- 한
- NUM 167: 매일 매일 한 시간이라도 운동합니다 .
- VERB 64: 그 가수는 너무 댄스를 잘 한 가수입니다 .
- DET 25: 왜냐하면 우리 가 한 목표 있어요 .
- AUX 11: 몇 년 전에 저도 고향에서 친구랑 같이 한국어를 한찬통안 배웠는데 부끄러워서 친구끼리 한국어로 이야기를 하지 못 한 거예요 .
- X 10: 어느 어린이 사탕을 먹을 때 너무 맛이 있게 먹어 보이고 정말 행복 한 것 같지 않다 .
- SCONJ 3: 살 수 있는 한 이 순간을 잊지 않을겠다 .
- ADV 2: 한 편 동시에 모국어와 외국어를 배우는 것으로 둘 다 잘 모르고 실수가 많은 아이들도 많다 .
- NOUN 1: 한 국 말 할 줄 알아면 더 변리하지만 영어할 수 만 있으면 괜찮아요 .
- 두
- 세
- 둘째
- 첫
- 첫째
- 하나
- 둘
- 네
- 수십만의
Morphology
The form / lemma ratio of NUM is 0.957746 (the average of all parts of speech is 1.007876).
The 1st highest number of forms (1) was observed with the lemma “0”: 0.
The 2nd highest number of forms (1) was observed with the lemma “02+-+2200+-+7788”: 02-2200-7788.
The 3rd highest number of forms (1) was observed with the lemma “1”: 1.
NUM occurs with 1 features: Typo (3; 1% instances)
NUM occurs with 1 feature-value pairs: Typo=Yes
NUM occurs with 2 feature combinations.
The most frequent feature combination is _ (569 tokens).
Examples: 한, 두, 세, 둘째, 첫, 첫째, 하나는, 하나, 둘, 네
Relations
NUM nodes are attached to their parents using 11 different relations: nummod (403; 70% instances), obl (83; 15% instances), nsubj (41; 7% instances), nmod:poss (16; 3% instances), nmod (10; 2% instances), dislocated (5; 1% instances), flat (5; 1% instances), obj (4; 1% instances), amod (2; 0% instances), appos (2; 0% instances), root (1; 0% instances)
Parents of NUM nodes belong to 7 different parts of speech: NOUN (363; 63% instances), VERB (84; 15% instances), ADV (61; 11% instances), ADJ (51; 9% instances), AUX (8; 1% instances), NUM (4; 1% instances), (1; 0% instances)
489 (85%) NUM nodes are leaves.
71 (12%) NUM nodes have one child.
8 (1%) NUM nodes have two children.
4 (1%) NUM nodes have three or more children.
The highest child degree of a NUM node is 3.
Children of NUM nodes are attached using 12 different relations: punct (45; 45% instances), flat (17; 17% instances), case (12; 12% instances), nmod (10; 10% instances), amod (5; 5% instances), goeswith (2; 2% instances), nmod:poss (2; 2% instances), obl (2; 2% instances), acl (1; 1% instances), cc (1; 1% instances), dislocated (1; 1% instances), nummod (1; 1% instances)
Children of NUM nodes belong to 11 different parts of speech: PUNCT (45; 45% instances), NOUN (26; 26% instances), ADP (12; 12% instances), NUM (4; 4% instances), ADJ (3; 3% instances), ADV (2; 2% instances), DET (2; 2% instances), X (2; 2% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), VERB (1; 1% instances)