home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-CTeTex: POS Tags: NUM

There are 109 NUM lemmas (6%), 109 NUM types (5%) and 317 NUM tokens (3%). Out of 17 observed tags, the rank of NUM is: 5 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent NUM lemmas: 1, 2, 3, 4, 5, two, 50, 8, a, b

The 10 most frequent NUM types: 1, 2, 3, 4, 5, two, 50, 8, a, b

The 10 most frequent ambiguous lemmas: a (DET 150, NUM 9, NOUN 1), b (NUM 9, NOUN 1), one (NUM 9, PRON 7), n (NOUN 2, NUM 2), g (NOUN 1, NUM 1)

The 10 most frequent ambiguous types: a (DET 141, NUM 9), one (NUM 9, PRON 7), g (NOUN 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.138503).

The 1st highest number of forms (1) was observed with the lemma “+55”: +55.

The 2nd highest number of forms (1) was observed with the lemma “-20”: -20.

The 3rd highest number of forms (1) was observed with the lemma “-95”: -95.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 12 different relations: nummod (225; 71% instances), flat (35; 11% instances), conj (24; 8% instances), nmod (14; 4% instances), parataxis (8; 3% instances), obj (3; 1% instances), appos (2; 1% instances), compound (2; 1% instances), acl:relcl (1; 0% instances), nsubj (1; 0% instances), nsubj:pass (1; 0% instances), root (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (232; 73% instances), NUM (28; 9% instances), VERB (26; 8% instances), SYM (18; 6% instances), PROPN (7; 2% instances), ADJ (5; 2% instances), (1; 0% instances)

158 (50%) NUM nodes are leaves.

134 (42%) NUM nodes have one child.

19 (6%) NUM nodes have two children.

6 (2%) NUM nodes have three or more children.

The highest child degree of a NUM node is 7.

Children of NUM nodes are attached using 13 different relations: punct (112; 54% instances), advmod (36; 17% instances), conj (25; 12% instances), case (9; 4% instances), nmod (7; 3% instances), flat (6; 3% instances), cc (3; 1% instances), amod (2; 1% instances), cop (2; 1% instances), nsubj (2; 1% instances), acl (1; 0% instances), det (1; 0% instances), mark (1; 0% instances)

Children of NUM nodes belong to 12 different parts of speech: PUNCT (112; 54% instances), ADV (29; 14% instances), NUM (28; 14% instances), ADJ (9; 4% instances), ADP (8; 4% instances), SYM (8; 4% instances), NOUN (6; 3% instances), AUX (2; 1% instances), CCONJ (2; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances), VERB (1; 0% instances)