
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-PatentChar: POS Tags: NUM

There is 1 NUM lemma (7%), 23 NUM types (3%) and 185 NUM tokens (4%). Out of 15 observed tags, NUM ranks 8th in number of lemmas, 6th in number of types and 7th in number of tokens.
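Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the `SAMPLE` sentences are hypothetical and not taken from the treebank, the helper names `read_tokens` and `tag_counts` are made up for illustration, and it assumes that a "type" is a distinct word form and that unannotated lemmas appear as `_`.

```python
# Hypothetical two-sentence CoNLL-U sample (not taken from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = """\
1\t第一\t_\tNUM\t_\t_\t2\tnummod\t_\t_
2\t装置\t_\tNOUN\t_\t_\t0\troot\t_\t_

1\t一\t_\tNUM\t_\t_\t2\tnummod\t_\t_
2\t种\t_\tNOUN\t_\t_\t0\troot\t_\t_
"""


def read_tokens(text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence separators and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3]


def tag_counts(text, tag):
    """Return (number of lemmas, number of types, number of tokens) for a tag."""
    lemmas, types, tokens = set(), set(), 0
    for form, lemma, upos in read_tokens(text):
        if upos == tag:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


print(tag_counts(SAMPLE, "NUM"))  # (1, 2, 2)
```

Because every lemma in the sample is `_`, the lemma count is 1 regardless of how many forms occur, which mirrors the single NUM lemma reported above.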

The 10 most frequent NUM lemmas: _

The 10 most frequent NUM types: 第一, 一, 第二, 1, 一种, 两, 第三, 所述, 种, 1.3

The 10 most frequent ambiguous lemmas: _ (NOUN 1661, VERB 948, PUNCT 560, ADJ 474, PART 346, ADP 259, NUM 185, CCONJ 106, ADV 68, PROPN 60, PRON 48, DET 39, X 14, SCONJ 10, AUX 6)

The 10 most frequent ambiguous types: 第一 (NUM 67, VERB 2, DET 1, NOUN 1), 一 (NUM 38, NOUN 1), 第二 (NUM 29, VERB 4), 1 (NUM 22, NOUN 2), 一种 (DET 6, NUM 5), 所述 (ADJ 258, VERB 10, NOUN 2, NUM 2), 种 (NOUN 14, NUM 2), 1.3 (NUM 1, PUNCT 1), 2.1 (NUM 1, PUNCT 1), 2.2 (NUM 1, PUNCT 1)

Morphology

The form / lemma ratio of NUM is 23.000000 (the average of all parts of speech is 50.400000).

The highest number of forms (23) was observed with the lemma “_”: 1, 1.3, 10M/100M/1000M, 2, 2.1, 2.2, 4, 5, 6, 一, 一种, 两, 个, 千, 多, 多个, 多条, 所述, 百, 种, 第一, 第三, 第二.
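The form / lemma ratio for NUM can be checked against the forms listed above. This is a sketch under the assumption that the ratio is the number of distinct forms divided by the number of distinct lemmas; the function name `form_lemma_ratio` is hypothetical, and the exact definition used by the statistics generator is not stated on this page.

```python
from collections import defaultdict

# The 23 NUM forms listed above, all sharing the placeholder lemma "_".
FORMS = (
    "1 1.3 10M/100M/1000M 2 2.1 2.2 4 5 6 一 一种 两 个 千 多 多个 多条 "
    "所述 百 种 第一 第三 第二"
).split()


def form_lemma_ratio(pairs):
    """Distinct forms per distinct lemma, given (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


ratio = form_lemma_ratio((form, "_") for form in FORMS)
print(ratio)  # 23.0
```

With a single lemma, the ratio is simply the number of forms, so the reported value mostly reflects the absence of lemma annotation rather than any morphological property of numerals.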

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 5 different relations: nummod (146; 79% of instances), nmod (15; 8% of instances), obl (15; 8% of instances), dep (5; 3% of instances), appos (4; 2% of instances)
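The percentages follow from the raw counts. As a small check, the sketch below recomputes them from the relation counts given above; the helper name `percent_shares` is hypothetical, and rounding to the nearest whole percent is an assumption about how the page was generated.

```python
# Relation counts for NUM nodes, copied from the statistics above.
RELATION_COUNTS = {"nummod": 146, "nmod": 15, "obl": 15, "dep": 5, "appos": 4}


def percent_shares(counts):
    """Return each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}


total = sum(RELATION_COUNTS.values())
shares = percent_shares(RELATION_COUNTS)
print(total)   # 185
print(shares)  # {'nummod': 79, 'nmod': 8, 'obl': 8, 'dep': 3, 'appos': 2}
```

The total of 185 matches the number of NUM tokens, as expected when every NUM node has exactly one incoming relation.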

Parents of NUM nodes belong to 4 different parts of speech: NOUN (165; 89% of instances), VERB (16; 9% of instances), ADJ (2; 1% of instances), NUM (2; 1% of instances)

179 (97%) NUM nodes are leaves.

5 (3%) NUM nodes have one child.

1 (1%) NUM node has two children.

The highest child degree of a NUM node is 2.
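The child-degree figures can be derived from the HEAD column of each sentence. The sketch below illustrates the idea on a hypothetical four-node sentence that is not taken from the treebank; the name `child_degrees` and the `(id, upos, head)` triple layout are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
SENTENCE = [
    (1, "NUM", 3),
    (2, "PART", 1),
    (3, "NOUN", 0),
    (4, "NUM", 3),
]


def child_degrees(sentence, tag):
    """Map the id of each node with the given tag to its number of children."""
    n_children = Counter(head for _, _, head in sentence)
    return {
        node_id: n_children[node_id]
        for node_id, upos, _ in sentence
        if upos == tag
    }


degrees = child_degrees(SENTENCE, "NUM")
histogram = Counter(degrees.values())
print(degrees)                # {1: 1, 4: 0}
print(max(degrees.values()))  # 1
```

Applied to the figures above, 179 leaves, 5 nodes with one child and 1 node with two children account for all 185 NUM nodes and for 5 + 2 = 7 children in total.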

Children of NUM nodes are attached using 5 different relations: case (2; 29% of instances), nummod (2; 29% of instances), advcl (1; 14% of instances), dep (1; 14% of instances), nmod (1; 14% of instances)

Children of NUM nodes belong to 5 different parts of speech: NUM (2; 29% of instances), PART (2; 29% of instances), ADJ (1; 14% of instances), DET (1; 14% of instances), NOUN (1; 14% of instances)