
This page pertains to UD version 2.

Treebank Statistics: UD_Maltese-MUDT: POS Tags: NUM

There is 1 NUM lemma (6%), 159 NUM types (2%) and 600 NUM tokens (1%). Out of 17 observed tags, NUM ranks 9th in number of lemmas, 6th in number of types and 13th in number of tokens.
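As a rough illustration of how per-tag lemma, type and token counts and their ranks could be derived, here is a minimal sketch. The `TOKENS` list, the placeholder forms such as `noun_a`, and the helper names `tag_stats` and `rank` are hypothetical and are not taken from the treebank or from the tool that generated this page. How that tool breaks ties between tags is not stated here, so the tie handling below is an assumption.

```python
from collections import defaultdict

# Hypothetical (form, lemma, upos) triples; "_" is the CoNLL-U placeholder
# for an unspecified lemma. These are NOT real treebank counts.
TOKENS = [
    ("żewġ", "_", "NUM"),
    ("tliet", "_", "NUM"),
    ("żewġ", "_", "NUM"),
    ("noun_a", "_", "NOUN"),
    ("noun_b", "_", "NOUN"),
    ("noun_a", "_", "NOUN"),
    ("noun_c", "_", "NOUN"),
    ("verb_a", "_", "VERB"),
]


def tag_stats(tokens):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    counts = defaultdict(int)
    for form, lemma, upos in tokens:
        lemmas[upos].add(lemma)   # distinct lemmas seen with this tag
        types[upos].add(form)     # distinct word forms seen with this tag
        counts[upos] += 1         # every occurrence counts as a token
    return {t: (len(lemmas[t]), len(types[t]), counts[t]) for t in counts}


def rank(stats, tag, index):
    """1-based rank of `tag` by the statistic at `index`.

    Assumption: tied tags share the best rank.
    """
    value = stats[tag][index]
    return 1 + sum(1 for s in stats.values() if s[index] > value)


stats = tag_stats(TOKENS)
print(stats["NUM"])            # (lemmas, types, tokens) for NUM
print(rank(stats, "NUM", 1))   # rank of NUM by number of types
```

Because every token in the toy data carries the lemma `_`, each tag ends up with exactly one lemma, which mirrors the single NUM lemma reported above.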

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: wieħed, ewwel, waħda, żewġ, tliet, tieni, 1, tnejn, nofs, 2

The 10 most frequent ambiguous lemmas: _ (NOUN 8525, VERB 6386, ADP 5372, PUNCT 4729, DET 3602, SCONJ 2647, PRON 2321, ADJ 2313, ADV 1787, AUX 1690, PROPN 1545, CCONJ 1525, NUM 600, PART 473, X 341, SYM 241, INTJ 65)

The 10 most frequent ambiguous types: miljun (NUM 4, NOUN 1), eluf (NUM 3, NOUN 2), ħadd (PRON 33, NUM 1)
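An ambiguous type is a word form that occurs with more than one tag. The sketch below shows one way such a list could be built. The `OBSERVATIONS` data is constructed by hand to reproduce the "miljun (NUM 4, NOUN 1)" entry above, plus an unambiguous form for contrast; the helper name `ambiguous_types` is hypothetical.

```python
from collections import Counter, defaultdict

# Hand-built (form, upos) observations. The miljun counts follow the entry
# above; the count for "tliet" is invented for contrast.
OBSERVATIONS = (
    [("miljun", "NUM")] * 4
    + [("miljun", "NOUN")]
    + [("tliet", "NUM")] * 2
)


def ambiguous_types(observations, tag):
    """Return forms seen with `tag` and at least one other tag."""
    per_form = defaultdict(Counter)
    for form, upos in observations:
        per_form[form][upos] += 1
    return {
        form: tags.most_common()  # tags sorted by descending frequency
        for form, tags in per_form.items()
        if tag in tags and len(tags) > 1
    }


result = ambiguous_types(OBSERVATIONS, "NUM")
print(result)
```

The form "tliet" is filtered out because it only ever appears as NUM in the sample.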

Morphology

The form / lemma ratio of NUM is 159.000000 (the average of all parts of speech is 474.705882).
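The ratio is the number of distinct forms divided by the number of distinct lemmas. Since NUM has a single lemma (the placeholder `_`), the ratio equals the number of NUM types. A minimal sketch of the arithmetic, with the hypothetical helper name `form_lemma_ratio`:

```python
def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Distinct forms per distinct lemma for one part of speech."""
    if n_lemmas == 0:
        raise ValueError("no lemmas observed for this tag")
    return n_forms / n_lemmas


# Figures reported on this page for NUM: 159 types, 1 lemma.
ratio = form_lemma_ratio(159, 1)
formatted = f"{ratio:.6f}"
print(formatted)
```

The average over all parts of speech is not recomputed here because the form counts of the other tags are not given on this page.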

The highest number of forms (159) was observed with the lemma “_”: 0, 00, 1, 1,300, 10, 100000, 12, 13827, 14, 14-il, 150, 156, 16, 16-il, 17, 17-il, 18, 1815, 1864, 1878, 1880, 19, 1907, 1918, 1922, 1927, 1930, 1933, 1934, 1941, 1946, 1949, 1955, 1963, 1976, 1981, 1983, 1984, 1987, 1990, 1994, 1995, 1997, 1998, 1999, 2, 2,000, 20, 20,000, 200, 2001, 2002, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013, 2014, 2015, 2019, 2020, 21, 22, 220,000, 228-, 23, 24, 25, 26, 27, 296, 3, 30, 300, 31, 31093, 32739, 35, 350, 3C, 4, 4,000, 40,000, 400, 41, 44, 5, 50, 500, 500,000, 51, 57, 58, 59, 6, 6,000, 60, 62, 65, 68, 7, 74, 80, 81, 86, 9, 90, TNAX-IL, disgħa, elef, elf, elfejn, eluf, erba’, erbat, erbatax-il, erbgħa, ewwel, għaxar, għoxrin, mija, miljun, mitt, nofs, nofsu, raba’, seba’, sebgħin, sitt, sittax-il, terz, tielet, tieni, tletin, tlett, tliet, tlieta, tmenin, tmien, tmienja, tnejn, tużżana, uħud, waħda, wieħed, wħud, ħadd, ħames, ħamest, ħamsa, ħmistax, żewg, żewġ, żewġt.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 21 different relations: nummod (252; 42% instances), obl (77; 13% instances), amod (62; 10% instances), nsubj (47; 8% instances), list (38; 6% instances), conj (28; 5% instances), nmod (20; 3% instances), nmod:poss (18; 3% instances), root (15; 3% instances), obj (13; 2% instances), appos (12; 2% instances), compound (4; 1% instances), acl (2; 0% instances), ccomp (2; 0% instances), flat (2; 0% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances), dep (1; 0% instances), obl:agent (1; 0% instances), obl:arg (1; 0% instances), parataxis (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (296; 49% instances), VERB (139; 23% instances), PROPN (50; 8% instances), NUM (48; 8% instances), SYM (36; 6% instances), [no parent; root] (15; 3% instances), ADJ (8; 1% instances), PRON (4; 1% instances), ADV (2; 0% instances), AUX (1; 0% instances), X (1; 0% instances)
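The two distributions above (incoming relation and parent part of speech) can be read off the HEAD and DEPREL columns of a CoNLL-U file. The following is a minimal sketch under stated assumptions: the `SAMPLE` sentence uses placeholder forms and is not from the treebank, and `read_sentences` and `parent_profile` are hypothetical helper names. Nodes whose head is 0 have no parent, so their parent tag is recorded as an empty string.

```python
from collections import Counter

# Toy CoNLL-U sentence with placeholder forms (hypothetical data).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "# text = placeholder",
    "1\tżewġ\t_\tNUM\t_\t_\t2\tnummod\t_\t_",
    "2\tnoun_a\t_\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tverb_a\t_\tVERB\t_\t_\t0\troot\t_\t_",
    "4\ttliet\t_\tNUM\t_\t_\t3\tobl\t_\t_",
    "5\tnoun_b\t_\tNOUN\t_\t_\t4\tnmod\t_\t_",
    "",
])


def read_sentences(text):
    """Yield sentences as lists of (id, form, upos, head, deprel)."""
    sentence = []
    for line in text.splitlines():
        if not line.strip():          # blank line ends a sentence
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):      # sentence-level comment
            continue
        cols = line.split("\t")
        if not cols[0].isdigit():     # skip ranges (1-2) and empty nodes (1.1)
            continue
        sentence.append((int(cols[0]), cols[1], cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence


def parent_profile(text, tag):
    """Count incoming relations and parent tags for nodes tagged `tag`."""
    relations, parent_tags = Counter(), Counter()
    for sentence in read_sentences(text):
        upos_by_id = {tid: upos for tid, _, upos, _, _ in sentence}
        for _, _, upos, head, deprel in sentence:
            if upos != tag:
                continue
            relations[deprel] += 1
            # head 0 is the artificial root and has no tag
            parent_tags[upos_by_id.get(head, "")] += 1
    return relations, parent_tags


relations, parent_tags = parent_profile(SAMPLE, "NUM")
print(relations.most_common())
print(parent_tags.most_common())
```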

363 (61%) NUM nodes are leaves.

122 (20%) NUM nodes have one child.

56 (9%) NUM nodes have two children.

59 (10%) NUM nodes have three or more children.

The highest child degree of a NUM node is 15.
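The four leaf and child-count figures above partition all NUM nodes by child degree (363 + 122 + 56 + 59 = 600). A minimal sketch of how such buckets could be computed for one sentence follows; the `NODES` data and the helper name `degree_buckets` are hypothetical, and a real implementation would have to repeat this per sentence because node IDs restart in each one.

```python
from collections import Counter

# Hypothetical (id, upos, head) triples for a single sentence; head 0 is the root.
NODES = [
    (1, "NUM", 2),
    (2, "NOUN", 3),
    (3, "VERB", 0),
    (4, "NUM", 3),
    (5, "ADP", 4),
    (6, "NOUN", 4),
]


def degree_buckets(nodes, tag):
    """Bucket nodes tagged `tag` by number of children (0, 1, 2, 3+)."""
    children = Counter(head for _, _, head in nodes)   # head id -> child count
    degrees = [children[nid] for nid, upos, _ in nodes if upos == tag]
    buckets = Counter(min(d, 3) for d in degrees)      # key 3 means "three or more"
    return buckets, max(degrees, default=0)


buckets, highest = degree_buckets(NODES, "NUM")
print(buckets[0], buckets[1], buckets[2], buckets[3], highest)
```

In the toy sentence, node 1 is a leaf and node 4 has two children, so the highest child degree is 2.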

Children of NUM nodes are attached using 26 different relations: case:det (73; 15% instances), punct (65; 13% instances), det (46; 9% instances), nmod (45; 9% instances), nmod:poss (45; 9% instances), case (44; 9% instances), list (31; 6% instances), cc (27; 5% instances), conj (21; 4% instances), cop (17; 3% instances), nsubj (17; 3% instances), amod (10; 2% instances), compound (9; 2% instances), acl (8; 2% instances), obl (7; 1% instances), advmod (6; 1% instances), appos (6; 1% instances), nummod (4; 1% instances), aux:neg (3; 1% instances), mark (3; 1% instances), advcl (2; 0% instances), discourse (2; 0% instances), flat (2; 0% instances), advmod:neg (1; 0% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)

Children of NUM nodes belong to 15 different parts of speech: ADP (117; 24% instances), NOUN (71; 14% instances), PUNCT (65; 13% instances), NUM (48; 10% instances), DET (45; 9% instances), PROPN (35; 7% instances), CCONJ (27; 5% instances), PRON (20; 4% instances), ADJ (17; 3% instances), SYM (17; 3% instances), AUX (11; 2% instances), VERB (10; 2% instances), ADV (9; 2% instances), SCONJ (3; 1% instances), PART (1; 0% instances)
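The percentages above appear to be each count divided by the total number of children of NUM nodes, rounded to a whole percent. That reading is an assumption, not something the page states. The sketch below checks it against the counts listed above; `CHILD_POS` simply copies those counts.

```python
# Counts of children of NUM nodes by part of speech, copied from this page.
CHILD_POS = {
    "ADP": 117, "NOUN": 71, "PUNCT": 65, "NUM": 48, "DET": 45,
    "PROPN": 35, "CCONJ": 27, "PRON": 20, "ADJ": 17, "SYM": 17,
    "AUX": 11, "VERB": 10, "ADV": 9, "SCONJ": 3, "PART": 1,
}

total = sum(CHILD_POS.values())

# Assumed rounding: nearest whole percent.
shares = {tag: round(100 * n / total) for tag, n in CHILD_POS.items()}

print(total)
for tag, n in CHILD_POS.items():
    print(f"{tag} ({n}; {shares[tag]}% instances)")
```

The total is 496, which matches the sum of the relation counts listed for children of NUM nodes, so the two breakdowns describe the same set of child nodes.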