home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-KSL: POS Tags: NUM

There are 71 NUM lemmas (0%), 68 NUM types (0%) and 572 NUM tokens (0%). Out of 14 observed tags, the rank of NUM is: 8 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: 한, 두, 세, 둘째, 첫, 첫째, 하나+는, 하나, 둘, 네

The 10 most frequent NUM types: 한, 두, 세, 둘째, 첫, 첫째, 하나는, 하나, 둘, 네

The 10 most frequent ambiguous lemmas: 한 (NUM 166, DET 25, SCONJ 3), 두 (NUM 112, DET 7, ADP 2, ADV 1), 세 (NUM 27, DET 1), 둘째 (NUM 26, DET 5, ADV 1), 첫 (NUM 24, ADV 1, DET 1), 첫째 (NUM 22, ADV 2, NOUN 1), 하나 (NUM 17, NOUN 1), 둘 (NUM 16, ADV 1), 네 (NUM 13, PRON 5, INTJ 2, DET 1), 수+십+만+의 (NUM 12, NOUN 1)

The 10 most frequent ambiguous types: 한 (NUM 167, VERB 64, DET 25, AUX 11, X 10, SCONJ 3, ADV 2, NOUN 1), 두 (NUM 112, DET 7, ADP 2, ADV 1), 세 (NUM 27, DET 1), 둘째 (NUM 26, DET 5, ADV 1), 첫 (NUM 25, ADV 1, DET 1), 첫째 (NUM 22, ADV 2, NOUN 1), 하나 (NUM 17, AUX 1, NOUN 1, VERB 1), 둘 (NUM 16, ADV 1, VERB 1), 네 (NUM 13, PRON 5, INTJ 2, DET 1), 수십만의 (NUM 12, NOUN 1)

Morphology

The form / lemma ratio of NUM is 0.957746 (the average of all parts of speech is 1.007876).

The 1st highest number of forms (1) was observed with the lemma “0”: 0.

The 2nd highest number of forms (1) was observed with the lemma “02+-+2200+-+7788”: 02-2200-7788.

The 3rd highest number of forms (1) was observed with the lemma “1”: 1.

NUM occurs with 1 features: Typo (3; 1% instances)

NUM occurs with 1 feature-value pairs: Typo=Yes

NUM occurs with 2 feature combinations. The most frequent feature combination is _ (569 tokens). Examples: 한, 두, 세, 둘째, 첫, 첫째, 하나는, 하나, 둘, 네

Relations

NUM nodes are attached to their parents using 11 different relations: nummod (403; 70% instances), obl (83; 15% instances), nsubj (41; 7% instances), nmod:poss (16; 3% instances), nmod (10; 2% instances), dislocated (5; 1% instances), flat (5; 1% instances), obj (4; 1% instances), amod (2; 0% instances), appos (2; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (363; 63% instances), VERB (84; 15% instances), ADV (61; 11% instances), ADJ (51; 9% instances), AUX (8; 1% instances), NUM (4; 1% instances), (1; 0% instances)

489 (85%) NUM nodes are leaves.

71 (12%) NUM nodes have one child.

8 (1%) NUM nodes have two children.

4 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 12 different relations: punct (45; 45% instances), flat (17; 17% instances), case (12; 12% instances), nmod (10; 10% instances), amod (5; 5% instances), goeswith (2; 2% instances), nmod:poss (2; 2% instances), obl (2; 2% instances), acl (1; 1% instances), cc (1; 1% instances), dislocated (1; 1% instances), nummod (1; 1% instances)

Children of NUM nodes belong to 11 different parts of speech: PUNCT (45; 45% instances), NOUN (26; 26% instances), ADP (12; 12% instances), NUM (4; 4% instances), ADJ (3; 3% instances), ADV (2; 2% instances), DET (2; 2% instances), X (2; 2% instances), CCONJ (1; 1% instances), PRON (1; 1% instances), VERB (1; 1% instances)