home en/pos edit page issue tracker

NUM: numeral

The English NUM corresponds exactly to the PTB CD.


Treebank Statistics (UD_English)

There are 1241 NUM lemmas (6%), 1241 NUM types (5%) and 4913 NUM tokens (2%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 13 in number of tokens.

The 10 most frequent NUM lemmas: one, two, 2, 3, 5, 1, 10, 4, three, 20

The 10 most frequent NUM types: one, two, 2, 3, 5, 1, 10, 4, three, 20

The 10 most frequent ambiguous lemmas: one (NUM 451, NOUN 146, PRON 26, VERB 1), 2 (NUM 145, X 30, PROPN 2, ADP 1, PART 1), 3 (NUM 122, X 17, NOUN 1), 5 (NUM 112, X 4, PROPN 1), 1 (NUM 111, X 31), 10 (NUM 99, X 2), 4 (NUM 97, X 13, ADP 1, SCONJ 1), 20 (NUM 66, NOUN 5), 6 (NUM 64, X 2), m (NUM 46, NOUN 17, PROPN 3)

The 10 most frequent ambiguous types: one (NUM 398, NOUN 105, PRON 22), 2 (NUM 145, X 30, PROPN 2, ADP 1, PART 1), 3 (NUM 122, X 17), 5 (NUM 112, X 4, PROPN 1), 1 (NUM 111, X 31), 10 (NUM 99, X 2), 4 (NUM 97, X 13, ADP 1, SCONJ 1), 20 (NUM 66, NOUN 3), 6 (NUM 64, X 2), m (NUM 41, VERB 14, NOUN 11, AUX 8, PROPN 3)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.173588).

The 1st highest number of forms (1) was observed with the lemma “’02”: ‘02.

The 2nd highest number of forms (1) was observed with the lemma “’05”: ‘05.

The 3rd highest number of forms (1) was observed with the lemma “’07”: ‘07.

NUM occurs with 2 features: en-feat/NumType (4912; 100% instances), en-feat/Number (1; 0% instances)

NUM occurs with 2 feature-value pairs: NumType=Card, Number=Sing

NUM occurs with 2 feature combinations. The most frequent feature combination is NumType=Card (4912 tokens). Examples: one, two, 2, 3, 5, 1, 10, 4, three, 20

Relations

NUM nodes are attached to their parents using 26 different relations: en-dep/nummod (2895; 59% instances), en-dep/nmod (539; 11% instances), en-dep/root (423; 9% instances), en-dep/compound (256; 5% instances), en-dep/appos (212; 4% instances), en-dep/list (115; 2% instances), en-dep/dobj (105; 2% instances), en-dep/nsubj (103; 2% instances), en-dep/conj (93; 2% instances), en-dep/nmod:tmod (68; 1% instances), en-dep/amod (19; 0% instances), en-dep/parataxis (18; 0% instances), en-dep/advcl (9; 0% instances), en-dep/advmod (9; 0% instances), en-dep/nmod:npmod (9; 0% instances), en-dep/xcomp (9; 0% instances), en-dep/remnant (8; 0% instances), en-dep/ccomp (7; 0% instances), en-dep/nsubjpass (4; 0% instances), en-dep/acl:relcl (3; 0% instances), en-dep/case (2; 0% instances), en-dep/det (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/iobj (1; 0% instances), en-dep/nmod:poss (1; 0% instances), en-dep/vocative (1; 0% instances)

Parents of NUM nodes belong to 13 different parts of speech: NOUN (2388; 49% instances), PROPN (787; 16% instances), NUM (443; 9% instances), VERB (435; 9% instances), ROOT (423; 9% instances), SYM (355; 7% instances), ADJ (41; 1% instances), X (15; 0% instances), ADV (14; 0% instances), DET (5; 0% instances), PRON (5; 0% instances), AUX (1; 0% instances), PUNCT (1; 0% instances)

3201 (65%) NUM nodes are leaves.

1032 (21%) NUM nodes have one child.

278 (6%) NUM nodes have two children.

402 (8%) NUM nodes have three or more children.

The highest child degree of a NUM node is 13.

Children of NUM nodes are attached using 34 different relations: en-dep/punct (704; 23% instances), en-dep/case (557; 18% instances), en-dep/nmod (359; 12% instances), en-dep/advmod (214; 7% instances), en-dep/nmod:tmod (197; 6% instances), en-dep/appos (174; 6% instances), en-dep/compound (157; 5% instances), en-dep/conj (101; 3% instances), en-dep/cc (93; 3% instances), en-dep/cop (92; 3% instances), en-dep/nummod (92; 3% instances), en-dep/nsubj (89; 3% instances), en-dep/det (57; 2% instances), en-dep/parataxis (47; 2% instances), en-dep/amod (27; 1% instances), en-dep/acl:relcl (20; 1% instances), en-dep/mark (15; 0% instances), en-dep/nmod:npmod (13; 0% instances), en-dep/aux (11; 0% instances), en-dep/advcl (9; 0% instances), en-dep/remnant (7; 0% instances), en-dep/neg (6; 0% instances), en-dep/discourse (5; 0% instances), en-dep/acl (4; 0% instances), en-dep/nmod:poss (2; 0% instances), en-dep/reparandum (2; 0% instances), en-dep/cc:preconj (1; 0% instances), en-dep/ccomp (1; 0% instances), en-dep/csubj (1; 0% instances), en-dep/det:predet (1; 0% instances), en-dep/dobj (1; 0% instances), en-dep/goeswith (1; 0% instances), en-dep/list (1; 0% instances), en-dep/xcomp (1; 0% instances)

Children of NUM nodes belong to 17 different parts of speech: PUNCT (694; 23% instances), NOUN (573; 19% instances), ADP (481; 16% instances), NUM (443; 14% instances), ADV (190; 6% instances), VERB (165; 5% instances), SYM (114; 4% instances), CONJ (90; 3% instances), ADJ (85; 3% instances), PRON (80; 3% instances), DET (70; 2% instances), PROPN (47; 2% instances), AUX (11; 0% instances), SCONJ (8; 0% instances), PART (5; 0% instances), INTJ (3; 0% instances), X (3; 0% instances)


NUM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]