Treebank Statistics: UD_Thai-PUD: POS Tags: NUM
There are 216 NUM lemmas (5%), 216 NUM types (5%) and 581 NUM tokens (3%).
Out of 16 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 12 in number of tokens.
The 10 most frequent NUM lemmas: หนึ่ง, สอง, ล้าน, สาม, 1, 3, สิบ, 10, 2, สี่
The 10 most frequent NUM types: หนึ่ง, สอง, ล้าน, สาม, 1, 3, สิบ, 10, 2, สี่
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.001399).
The 1st highest number of forms (1) was observed with the lemma “1”: 1.
The 2nd highest number of forms (1) was observed with the lemma “1,200”: 1,200.
The 3rd highest number of forms (1) was observed with the lemma “1.165”: 1.165.
NUM does not occur with any features.
Relations
NUM nodes are attached to their parents using 14 different relations: nummod (372; 64% instances), appos (139; 24% instances), nmod (47; 8% instances), nsubj (5; 1% instances), obl (4; 1% instances), obl:tmod (3; 1% instances), conj (2; 0% instances), obj (2; 0% instances), root (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), obl:poss (1; 0% instances), xcomp (1; 0% instances)
Parents of NUM nodes belong to 9 different parts of speech: NOUN (479; 82% instances), NUM (53; 9% instances), VERB (22; 4% instances), SYM (12; 2% instances), PROPN (10; 2% instances), (2; 0% instances), ADJ (1; 0% instances), ADP (1; 0% instances), ADV (1; 0% instances)
354 (61%) NUM nodes are leaves.
171 (29%) NUM nodes have one child.
38 (7%) NUM nodes have two children.
18 (3%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
Children of NUM nodes are attached using 17 different relations: clf (97; 31% instances), advmod (52; 17% instances), nummod (38; 12% instances), nmod (35; 11% instances), punct (21; 7% instances), obl:tmod (20; 6% instances), det (12; 4% instances), case (10; 3% instances), cop (7; 2% instances), nsubj (4; 1% instances), acl:relcl (3; 1% instances), amod (3; 1% instances), cc (2; 1% instances), compound (2; 1% instances), conj (2; 1% instances), mark (1; 0% instances), xcomp (1; 0% instances)
Children of NUM nodes belong to 13 different parts of speech: NOUN (136; 44% instances), NUM (53; 17% instances), ADV (51; 16% instances), PUNCT (21; 7% instances), DET (13; 4% instances), ADP (11; 4% instances), AUX (7; 2% instances), ADJ (6; 2% instances), SYM (5; 2% instances), VERB (3; 1% instances), CCONJ (2; 1% instances), PRON (1; 0% instances), PROPN (1; 0% instances)