home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-ESLSpok: POS Tags: NUM

There are 1 NUM lemmas (6%), 27 NUM types (1%) and 228 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 15 in number of tokens.

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: one, two, three, five, four, six, ten, seven, nine, twenty

The 10 most frequent ambiguous lemmas: _ (PUNCT 3316, NOUN 3083, PRON 2869, VERB 2552, ADV 1444, AUX 1302, DET 1271, ADP 1136, CCONJ 1124, ADJ 1032, PART 891, PROPN 490, SCONJ 267, INTJ 235, NUM 228, X 72)

The 10 most frequent ambiguous types: one (NUM 58, NOUN 17, PRON 1)

Morphology

The form / lemma ratio of NUM is 27.000000 (the average of all parts of speech is 146.187500).

The 1st highest number of forms (27) was observed with the lemma “_”: Ninety, XXX02, eight, eighteen, eighty, eleven, fifteen, fifty, five, forty, four, hundred, nine, nineteen, one, seven, seventeen, six, sixteen, sixty, ten, thirty, thousand, three, twelve, twenty, two.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 13 different relations: nummod (170; 75% instances), root (14; 6% instances), compound (13; 6% instances), conj (9; 4% instances), nsubj (6; 3% instances), obl (5; 2% instances), ccomp (3; 1% instances), obj (3; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), nmod (1; 0% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (172; 75% instances), NUM (21; 9% instances), VERB (15; 7% instances), (14; 6% instances), ADJ (5; 2% instances), ADV (1; 0% instances)

154 (68%) NUM nodes are leaves.

45 (20%) NUM nodes have one child.

13 (6%) NUM nodes have two children.

16 (7%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 18 different relations: advmod (25; 19% instances), punct (19; 15% instances), cc (11; 8% instances), compound (11; 8% instances), conj (10; 8% instances), cop (10; 8% instances), nsubj (10; 8% instances), case (8; 6% instances), det (8; 6% instances), nmod (8; 6% instances), acl:relcl (2; 2% instances), mark (2; 2% instances), nummod (2; 2% instances), acl (1; 1% instances), amod (1; 1% instances), discourse (1; 1% instances), obl:tmod (1; 1% instances), parataxis (1; 1% instances)

Children of NUM nodes belong to 14 different parts of speech: ADV (24; 18% instances), NUM (21; 16% instances), PUNCT (19; 15% instances), CCONJ (11; 8% instances), NOUN (11; 8% instances), AUX (10; 8% instances), DET (8; 6% instances), ADP (7; 5% instances), PRON (7; 5% instances), VERB (6; 5% instances), ADJ (3; 2% instances), SCONJ (2; 2% instances), INTJ (1; 1% instances), PROPN (1; 1% instances)