home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Uyghur: POS Tags: NUM

There are 1 NUM lemmas (7%), 74 NUM types (1%) and 449 NUM tokens (3%). Out of 14 observed tags, the rank of NUM is: 9 in number of lemmas, 7 in number of types and 7 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: بىر، ئىككى، تۆت، ئون، يەتتە، بىرى، مىڭ، ئۈچ، 20، بىرلا

The 10 most frequent ambiguous lemmas: _ (NOUN 4872, VERB 3255, PUNCT 2934, PRON 1262, ADJ 1123, AUX 481, NUM 449, ADV 405, CCONJ 305, ADP 200, PART 123, DET 54, INTJ 41, X 5)

The 10 most frequent ambiguous types: يۈز (NUM 5, NOUN 2), ھەممەيلەن (NUM 5, PRON 2), ئۇياق (NOUN 2, NUM 2, VERB 1), ئۈزۈپ (VERB 6, NUM 1), بىلىنەر (NOUN 1, NUM 1), بۇلدۇق (NOUN 1, NUM 1), تىزىق (NUM 1, VERB 1), جۈپ (DET 3, INTJ 1, NUM 1), شەھرىزادنىڭ (NOUN 1, NUM 1), قاينام (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 74.000000 (the average of all parts of speech is 429.142857).

The 1st highest number of forms (74) was observed with the lemma “_”: 1-, 10, 100, 1025, 15, 16, 160نەچچە, 18, 20, 2000, 21, 3, 30, 40, 5, 70, ئالتىنچى, ئالتە, ئوتتۇز, ئون, ئىككى, ئىككىسى, ئىككىسىنى, ئىككىسىنىڭ, ئىككىمىز, ئىككىمىزنىڭ, ئىككىنچى, ئىككىيلەن, ئۇياق, ئۈزۈپ, ئۈچ, ئۈچەيلەن, بىر, بىردىن, بىردەك, بىرلا, بىرنى, بىرنەچچە, بىرى, بىرىدە, بىرىنى, بىرىنىڭ, بىرىنچى, بىرىگە, بىرگە, بىرەر, بىلىنمەس, بىلىنەر, بۇلدۇق, بەش, توققۇز, تومۇزغا, تىزىق, تۆت, تۆتىنچى, تۇنجى, تۈمەنلىگەن, جۈپ, سەكسەن, سەككىز, شەھرىزادنىڭ, غال, قانچىلىغان, قاينام, كۆپلىگەن, مىليون, مىڭ, مىڭلىغان, مۈڭگۈز, نەچچىسى, يىگىرمە, يۈز, يەتتە, ھەممەيلەن.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 22 different relations: nummod (256; 57% instances), det (52; 12% instances), compound (26; 6% instances), advmod (20; 4% instances), nsubj (20; 4% instances), compound:redup (13; 3% instances), amod (9; 2% instances), obl (9; 2% instances), conj (6; 1% instances), nmod:poss (6; 1% instances), dep (5; 1% instances), fixed (5; 1% instances), obj (5; 1% instances), nmod (4; 1% instances), nmod:tmod (3; 1% instances), ccomp (2; 0% instances), root (2; 0% instances), xcomp (2; 0% instances), advmod:emph (1; 0% instances), appos (1; 0% instances), compound:lvc (1; 0% instances), nmod:cau (1; 0% instances)

Parents of NUM nodes belong to 11 different parts of speech: NOUN (287; 64% instances), VERB (56; 12% instances), DET (40; 9% instances), NUM (25; 6% instances), ADJ (20; 4% instances), PRON (14; 3% instances), ADV (2; 0% instances), (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances), X (1; 0% instances)

342 (76%) NUM nodes are leaves.

84 (19%) NUM nodes have one child.

18 (4%) NUM nodes have two children.

5 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 24 different relations: punct (30; 22% instances), fixed (19; 14% instances), compound (13; 9% instances), nmod (13; 9% instances), compound:redup (10; 7% instances), conj (9; 7% instances), nmod:poss (8; 6% instances), det (7; 5% instances), cop (5; 4% instances), nsubj (5; 4% instances), nummod (3; 2% instances), acl (2; 1% instances), appos (2; 1% instances), nmod:part (2; 1% instances), advcl (1; 1% instances), advmod (1; 1% instances), advmod:emph (1; 1% instances), amod (1; 1% instances), aux (1; 1% instances), case (1; 1% instances), cc (1; 1% instances), compound:lvc (1; 1% instances), dep (1; 1% instances), nmod:abl (1; 1% instances)

Children of NUM nodes belong to 13 different parts of speech: NOUN (31; 22% instances), PUNCT (30; 22% instances), NUM (25; 18% instances), PRON (19; 14% instances), ADV (8; 6% instances), VERB (8; 6% instances), AUX (6; 4% instances), CCONJ (4; 3% instances), ADJ (2; 1% instances), PART (2; 1% instances), ADP (1; 1% instances), DET (1; 1% instances), INTJ (1; 1% instances)