home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Korean-GSD: POS Tags: NUM

There are 431 NUM lemmas (1%), 428 NUM types (1%) and 848 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 6 in number of lemmas, 6 in number of types and 7 in number of tokens.

The 10 most frequent NUM lemmas: 한, 두, 첫, 10, 하+ㄴ, 1, 세, 50, 3, 30

The 10 most frequent NUM types: 한, 두, 첫, 10, 1, 세, 하나, 50, 3, 30

The 10 most frequent ambiguous lemmas: 한 (NUM 121, VERB 14, NOUN 5, DET 4, ADV 2), 두 (NUM 66, DET 1), 첫 (NUM 25, NOUN 1), 하+ㄴ (VERB 25, NUM 13, ADJ 2, ADV 2, NOUN 1), 1 (NUM 11, NOUN 2), 3 (NUM 8, NOUN 3), 40 (NUM 6, NOUN 1), 4 (NUM 5, NOUN 1), 5 (NUM 5, NOUN 1), 하+나 (NUM 5, VERB 2)

The 10 most frequent ambiguous types: 한 (NUM 134, VERB 39, NOUN 6, ADV 4, DET 4, ADJ 2), 두 (NUM 66, DET 1), 첫 (NUM 25, NOUN 1), 1 (NUM 11, NOUN 2), 하나 (NUM 10, ADV 2, VERB 2, NOUN 1), 3 (NUM 8, NOUN 3), 40 (NUM 6, NOUN 1), 4 (NUM 5, NOUN 1), 5 (NUM 5, NOUN 1), 2010 (NUM 4, NOUN 1)

Morphology

The form / lemma ratio of NUM is 0.993039 (the average of all parts of speech is 1.001499).

The 1st highest number of forms (1) was observed with the lemma “0”: 0.

The 2nd highest number of forms (1) was observed with the lemma “0.09”: 0.09.

The 3rd highest number of forms (1) was observed with the lemma “0.1”: 0.1.

NUM occurs with 1 features: NumType (531; 63% instances)

NUM occurs with 1 feature-value pairs: NumType=Card

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (531 tokens). Examples: 한, 두, 첫, 세, 하나, 1, 다섯, 하나는, 하나의, 네

Relations

NUM nodes are attached to their parents using 14 different relations: nummod (509; 60% instances), appos (89; 10% instances), flat (57; 7% instances), obl (46; 5% instances), conj (30; 4% instances), nmod (28; 3% instances), nsubj (28; 3% instances), obj (17; 2% instances), nmod:poss (15; 2% instances), root (13; 2% instances), advcl (8; 1% instances), acl:relcl (3; 0% instances), dep (3; 0% instances), nsubj:pass (2; 0% instances)

Parents of NUM nodes belong to 9 different parts of speech: NOUN (466; 55% instances), VERB (127; 15% instances), ADV (93; 11% instances), SYM (65; 8% instances), NUM (41; 5% instances), PROPN (31; 4% instances), (13; 2% instances), ADJ (11; 1% instances), ADP (1; 0% instances)

499 (59%) NUM nodes are leaves.

148 (17%) NUM nodes have one child.

92 (11%) NUM nodes have two children.

109 (13%) NUM nodes have three or more children.

The highest child degree of a NUM node is 10.

Children of NUM nodes are attached using 20 different relations: punct (371; 48% instances), flat (96; 13% instances), case (65; 8% instances), nsubj (44; 6% instances), appos (33; 4% instances), conj (33; 4% instances), nmod (19; 2% instances), nmod:poss (18; 2% instances), obj (14; 2% instances), obl (13; 2% instances), advmod (12; 2% instances), dep (12; 2% instances), acl:relcl (10; 1% instances), cop (8; 1% instances), advcl (7; 1% instances), nummod (5; 1% instances), det (3; 0% instances), cc (2; 0% instances), amod (1; 0% instances), det:poss (1; 0% instances)

Children of NUM nodes belong to 14 different parts of speech: PUNCT (371; 48% instances), NOUN (186; 24% instances), ADP (85; 11% instances), NUM (41; 5% instances), VERB (32; 4% instances), ADV (30; 4% instances), AUX (8; 1% instances), PROPN (4; 1% instances), DET (3; 0% instances), ADJ (2; 0% instances), CCONJ (2; 0% instances), PART (1; 0% instances), PRON (1; 0% instances), SYM (1; 0% instances)