home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-CTeTex: POS Tags: NUM

There are 1 NUM lemmas (6%), 109 NUM types (5%) and 317 NUM tokens (3%). Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: 1, 2, 3, 4, 5, two, 50, 8, a, b

The 10 most frequent ambiguous lemmas: _ (NOUN 2649, PUNCT 1455, DET 936, ADP 781, VERB 721, ADJ 647, AUX 492, NUM 317, PROPN 293, CCONJ 267, ADV 185, PART 165, SCONJ 163, SYM 98, PRON 83, X 17, INTJ 4)

The 10 most frequent ambiguous types: a (DET 141, NUM 9), one (NUM 9, PRON 7), g (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 109.000000 (the average of all parts of speech is 125.235294).

The 1st highest number of forms (109) was observed with the lemma “_”: +55, -20, -95, -98, 0, 0.0, 0.25, 0x1103, 0x80, 1, 1,000,000, 1,048,576, 1.5, 1.6, 10, 10,000, 100, 10165-4, 105, 11, 12, 1200, 13,000, 14, 15, 16, 160, 19, 1993, 1995, 2, 2,097,152, 2.0, 2.5, 20, 200, 2000, 2006, 2007, 2010, 204, 212-1300, 220, 24, 25, 250, 3, 3.2.5, 3.3, 3.5, 3.7.1, 30, 30,000, 300, 330, 332, 38.5, 4, 4.5, 40, 41.5, 5, 5.10, 5.10.2, 5.10.4, 5.2, 5.7.1, 5.7.2, 5.8, 50, 6, 60, 60.6, 60950, 64, 69, 7, 7.5, 71, 75, 8, 80, 802.3, 84, 85, 9, 9.0, 95, TBD, a, b, c, d, e, eight, f, five, four, g, n, one, six, t, three, twelve, two, xxxx, yyyy, zero.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (225; 71% instances), flat (35; 11% instances), conj (24; 8% instances), nmod (14; 4% instances), parataxis (8; 3% instances), obj (3; 1% instances), appos (2; 1% instances), compound (2; 1% instances), acl:relcl (1; 0% instances), nsubj (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (232; 73% instances), NUM (28; 9% instances), VERB (26; 8% instances), SYM (18; 6% instances), PROPN (7; 2% instances), ADJ (5; 2% instances), (1; 0% instances)

158 (50%) NUM nodes are leaves.

134 (42%) NUM nodes have one child.

19 (6%) NUM nodes have two children.

6 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 13 different relations: punct (112; 54% instances), advmod (36; 17% instances), conj (25; 12% instances), case (9; 4% instances), nmod (7; 3% instances), flat (6; 3% instances), cc (3; 1% instances), amod (2; 1% instances), cop (2; 1% instances), nsubj (2; 1% instances), acl (1; 0% instances), det (1; 0% instances), mark (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: PUNCT (112; 54% instances), ADV (29; 14% instances), NUM (28; 14% instances), ADJ (9; 4% instances), ADP (8; 4% instances), SYM (8; 4% instances), NOUN (6; 3% instances), AUX (2; 1% instances), CCONJ (2; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances), VERB (1; 0% instances)