home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: NUM

There are 77 NUM lemmas (0%), 74 NUM types (0%) and 655 NUM tokens (0%). Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 12 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: 한, 두, 세, 첫, 하나+는, 둘째, 둘, 첫째, 하나, 하나+도

The 10 most frequent NUM types: 한, 두, 세, 첫, 하나는, 둘째, 둘, 첫째, 하나, 하나도

The 10 most frequent ambiguous lemmas: 한 (NUM 193, DET 26, SCONJ 3), 두 (NUM 129, DET 8, ADP 2, ADV 1), 세 (NUM 33, DET 1), 첫 (NUM 28, DET 7, ADV 1), 하나+는 (NUM 28, PUNCT 1), 둘째 (NUM 26, DET 5, ADV 1), 둘 (NUM 22, ADV 1), 첫째 (NUM 22, ADV 2, NOUN 1), 하나 (NUM 17, NOUN 1), 네 (NUM 13, PRON 5, INTJ 2, DET 1)

The 10 most frequent ambiguous types: 한 (NUM 195, VERB 74, DET 26, X 12, AUX 11, SCONJ 3, ADV 2, NOUN 1), 두 (NUM 129, DET 8, ADP 2, ADV 1), 세 (NUM 33, DET 1), 첫 (NUM 29, DET 7, ADV 1), 하나는 (NUM 28, PUNCT 1), 둘째 (NUM 26, DET 5, ADV 1), 둘 (NUM 22, ADV 1, VERB 1), 첫째 (NUM 22, ADV 2, NOUN 1), 하나 (NUM 17, AUX 1, NOUN 1, VERB 1), 네 (NUM 13, PRON 5, INTJ 2, DET 1)

Morphology

The form / lemma ratio of NUM is 0.961039 (the average of all parts of speech is 1.008073).

The 1st highest number of forms (1) was observed with the lemma “0”: 0.

The 2nd highest number of forms (1) was observed with the lemma “02+-+2200+-+7788”: 02-2200-7788.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

NUM occurs with 1 features: Typo (3; 0% instances)

NUM occurs with 1 feature-value pairs: Typo=Yes

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (652 tokens). Examples: 한, 두, 세, 첫, 하나는, 둘째, 둘, 첫째, 하나, 하나도

Relations

NUM nodes are attached to their parents using 11 different relations: nummod (463; 71% instances), obl (90; 14% instances), nsubj (55; 8% instances), nmod:poss (17; 3% instances), nmod (10; 2% instances), dislocated (5; 1% instances), flat (5; 1% instances), obj (5; 1% instances), amod (2; 0% instances), appos (2; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (416; 64% instances), VERB (94; 14% instances), ADJ (66; 10% instances), ADV (66; 10% instances), AUX (8; 1% instances), NUM (4; 1% instances), (1; 0% instances)

571 (87%) NUM nodes are leaves.

72 (11%) NUM nodes have one child.

8 (1%) NUM nodes have two children.

4 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 11 different relations: punct (45; 45% instances), flat (18; 18% instances), case (12; 12% instances), nmod (12; 12% instances), amod (5; 5% instances), goeswith (2; 2% instances), nmod:poss (2; 2% instances), acl (1; 1% instances), cc (1; 1% instances), dislocated (1; 1% instances), nummod (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: PUNCT (45; 45% instances), NOUN (27; 27% instances), ADP (12; 12% instances), NUM (4; 4% instances), ADJ (3; 3% instances), ADV (2; 2% instances), DET (2; 2% instances), X (2; 2% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), VERB (1; 1% instances)