home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Thai-PUD: POS Tags: NUM

There are 1 NUM lemmas (6%), 216 NUM types (5%) and 581 NUM tokens (3%). Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 5 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: หนึ่ง, สอง, ล้าน, สาม, 1, 3, สิบ, 10, 2, สี่

The 10 most frequent ambiguous lemmas: _ (NOUN 6052, VERB 4361, ADP 3134, PROPN 1491, AUX 1449, DET 1026, ADJ 969, ADV 951, PRON 683, PART 608, CCONJ 606, NUM 581, PUNCT 272, SYM 134, X 4, SCONJ 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of NUM is 216.000000 (the average of all parts of speech is 269.375000).

The 1st highest number of forms (216) was observed with the lemma “_”: 1, 1,200, 1.165, 1.335, 1.365, 1.4, 1.5, 10, 10,000, 10.00, 100, 100,000, 1000, 103.7, 1072, 1075, 11, 12, 12,000, 120, 125, 13, 1340, 1350, 137, 1399, 14, 1415, 1492, 15, 15,000, 15,001, 15.5, 1519, 1530, 1538, 1563, 1566, 16, 16,000, 16,500, 1600, 1610, 1632, 168,000, 17, 1770, 1777, 1794, 18, 1820, 1832, 1839, 1842, 1856, 1858, 1860, 1879, 1882, 1886, 1887, 1896, 19, 19,999, 1900, 1903, 1904, 1911, 1912, 1913, 1914, 1916, 1917, 1918, 1925, 1926, 1927, 1928, 1933, 1945, 1947, 1948, 1950, 1952, 1954, 1955, 1960, 1961, 1962, 1969, 1970, 1973, 1975, 1976, 1977, 1979, 1980, 1981, 1984, 1987, 1988, 1990, 1991, 1992, 1993, 1994, 1996, 1997, 1998, 2, 20, 200, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2019, 2020, 2035, 2050, 21, 221,000, 23.45, 24, 25, 25,000, 27, 28, 29, 2900, 3, 3,000, 30, 31, 328, 33, 330, 330,000, 3300, 34, 35,000, 352, 36, 360, 363, 367, 393, 4, 40, 400, 42, 45, 49, 5, 5,000, 5.7, 50, 500, 511, 512, 53, 550, 56, 6, 6,000, 6.30, 60, 600,000, 62, 66, 7, 7.5, 70, 700, 71, 760, 8, 80, 830, 833, 84, 846, 9, 90, 96, พัน, ยี่, ร้อย, ล้าน, สอง, สาม, สิบ, สี่, หก, หนึ่ง, หมื่น, ห้า, เก้า, เจ็ด, แปด, ไนน์, ไฟฟ์.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 14 different relations: nummod (372; 64% instances), appos (139; 24% instances), obl:tmod (36; 6% instances), nmod (14; 2% instances), nsubj (5; 1% instances), obl (4; 1% instances), conj (2; 0% instances), obj (2; 0% instances), root (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), obl:poss (1; 0% instances), xcomp (1; 0% instances)

Parents of NUM nodes belong to 8 different parts of speech: NOUN (482; 83% instances), NUM (52; 9% instances), SYM (17; 3% instances), VERB (17; 3% instances), PROPN (9; 2% instances), (2; 0% instances), ADP (1; 0% instances), ADV (1; 0% instances)

433 (75%) NUM nodes are leaves.

118 (20%) NUM nodes have one child.

22 (4%) NUM nodes have two children.

8 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 14 different relations: advmod (52; 27% instances), nummod (38; 20% instances), nmod (29; 15% instances), obl:tmod (20; 10% instances), punct (19; 10% instances), case (10; 5% instances), cop (7; 4% instances), det (5; 3% instances), nsubj (4; 2% instances), cc (2; 1% instances), conj (2; 1% instances), amod (1; 1% instances), mark (1; 1% instances), xcomp (1; 1% instances)

Children of NUM nodes belong to 12 different parts of speech: NUM (52; 27% instances), ADV (51; 27% instances), NOUN (37; 19% instances), PUNCT (19; 10% instances), ADP (11; 6% instances), AUX (7; 4% instances), DET (6; 3% instances), ADJ (3; 2% instances), CCONJ (2; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)