
This page pertains to UD version 2.

Treebank Statistics: UD_Chinese-CFL: POS Tags: NUM

There are 22 NUM lemmas (1%), 22 NUM types (1%) and 143 NUM tokens (2%). Out of 15 observed tags, the rank of NUM is: 10 in number of lemmas, 10 in number of types and 11 in number of tokens.
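The lemma, type and token counts above can in principle be recomputed from the treebank's CoNLL-U file. The following is a minimal sketch, assuming the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, ...). The two-sentence `SAMPLE` and the helper names `read_words` and `pos_counts` are hypothetical illustrations, not data from UD_Chinese-CFL and not the script that generated this page.

```python
# Hypothetical two-sentence CoNLL-U sample (not taken from the treebank).
SAMPLE = """\
# text = 三本书
1\t三\t三\tNUM\t_\t_\t3\tnummod\t_\t_
2\t本\t本\tNOUN\t_\t_\t1\tclf\t_\t_
3\t书\t书\tNOUN\t_\t_\t0\troot\t_\t_

# text = 一个人
1\t一\t一\tNUM\t_\t_\t3\tnummod\t_\t_
2\t个\t个\tNOUN\t_\t_\t1\tclf\t_\t_
3\t人\t人\tNOUN\t_\t_\t0\troot\t_\t_
"""


def read_words(conllu_text):
    """Yield the 10 columns of every regular word line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) == 10 and cols[0].isdigit():
            yield cols


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct word forms, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for cols in read_words(conllu_text):
        if cols[3] == upos:
            types.add(cols[1])
            lemmas.add(cols[2])
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "NUM"))  # (2, 2, 2) for the sample above
```

Applied to the full treebank file instead of `SAMPLE`, the same function would be expected to return the three NUM figures reported on this page, provided the counting conventions match.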

The 10 most frequent NUM lemmas: 一、 两、 三、 第一、 几、 10、 20、 之一、 仨、 半

The 10 most frequent NUM types: 一、 两、 三、 第一、 几、 10、 20、 之一、 仨、 半

The 10 most frequent ambiguous lemmas: 一 (NUM 93, ADV 3, SCONJ 2), 第一 (ADJ 7, NUM 6), 几 (DET 4, NUM 4), 一下 (ADV 1, NUM 1)

The 10 most frequent ambiguous types: 一 (NUM 93, ADV 3, SCONJ 2), 第一 (ADJ 7, NUM 6), 几 (DET 4, NUM 4), 一下 (ADV 1, NUM 1)
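A lemma or type counts as ambiguous here when it occurs with more than one POS tag. One way to find such items is sketched below, assuming the words have already been reduced to (lemma, UPOS) pairs. The `WORDS` list and the function name `ambiguous_lemmas` are hypothetical, and the counts in `WORDS` are made up for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical (lemma, UPOS) pairs standing in for treebank words.
WORDS = [
    ("一", "NUM"), ("一", "NUM"), ("一", "ADV"),
    ("三", "NUM"),
    ("几", "DET"), ("几", "NUM"),
    ("很", "ADV"),
]


def ambiguous_lemmas(words, upos):
    """Map each lemma seen with `upos` and at least one other tag to its tag counts."""
    tags_by_lemma = defaultdict(Counter)
    for lemma, tag in words:
        tags_by_lemma[lemma][tag] += 1
    return {
        lemma: dict(tags)
        for lemma, tags in tags_by_lemma.items()
        if upos in tags and len(tags) > 1
    }


print(ambiguous_lemmas(WORDS, "NUM"))
```

Passing word forms instead of lemmas as the first element of each pair gives the corresponding list of ambiguous types.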

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.001198).

The 1st highest number of forms (1) was observed with the lemma “10”: 10.

The 2nd highest number of forms (1) was observed with the lemma “14”: 14.

The 3rd highest number of forms (1) was observed with the lemma “20”: 20.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 4 different relations: nummod (136; 95% of instances), amod (3; 2% of instances), appos (2; 1% of instances), conj (2; 1% of instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (133; 93% of instances), VERB (4; 3% of instances), NUM (2; 1% of instances), PRON (2; 1% of instances), ADJ (1; 1% of instances), DET (1; 1% of instances)
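The relation and parent-POS distributions above come from following each NUM node's HEAD column to its parent within the same sentence. A minimal sketch of that lookup follows, assuming standard CoNLL-U columns (UPOS in column 4, HEAD in column 7, DEPREL in column 8). `SAMPLE`, `sentences` and `attachment_stats` are hypothetical names and data used only for illustration.

```python
from collections import Counter

# Hypothetical two-sentence CoNLL-U sample (not taken from the treebank).
SAMPLE = """\
# text = 三本书
1\t三\t三\tNUM\t_\t_\t3\tnummod\t_\t_
2\t本\t本\tNOUN\t_\t_\t1\tclf\t_\t_
3\t书\t书\tNOUN\t_\t_\t0\troot\t_\t_

# text = 一个人
1\t一\t一\tNUM\t_\t_\t3\tnummod\t_\t_
2\t个\t个\tNOUN\t_\t_\t1\tclf\t_\t_
3\t人\t人\tNOUN\t_\t_\t0\troot\t_\t_
"""


def sentences(conllu_text):
    """Yield each sentence as a dict mapping word ID to its 10 columns."""
    for block in conllu_text.strip().split("\n\n"):
        words = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if not line.startswith("#") and len(cols) == 10 and cols[0].isdigit():
                words[cols[0]] = cols
        if words:
            yield words


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for words in sentences(conllu_text):
        for cols in words.values():
            if cols[3] != upos:
                continue
            relations[cols[7]] += 1
            head = cols[6]
            if head in words:  # HEAD "0" is the artificial root and has no POS
                parent_pos[words[head][3]] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(SAMPLE, "NUM")
print(relations.most_common())   # [('nummod', 2)]
print(parent_pos.most_common())  # [('NOUN', 2)]
```

Dividing each count by the total number of NUM nodes gives the percentages shown in parentheses.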

52 (36%) NUM nodes are leaves.

89 (62%) NUM nodes have one child.

2 (1%) NUM nodes have two children.

The highest child degree of a NUM node is 2.
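The leaf, one-child and two-children figures are a histogram of child degree, that is, of how many words name a given NUM node as their head. The sketch below shows the idea on a single sentence reduced to (ID, UPOS, HEAD) triples. The `SENTENCE` data and the name `child_degrees` are hypothetical.

```python
from collections import Counter

# Hypothetical sentence as (ID, UPOS, HEAD) triples; HEAD 0 marks the root.
SENTENCE = [
    (1, "NUM", 3),   # numeral with one child (the classifier below)
    (2, "NOUN", 1),  # classifier attached to the numeral
    (3, "NOUN", 0),  # root noun
    (4, "NUM", 3),   # a second numeral with no children (a leaf)
]


def child_degrees(sentence, upos):
    """Return a histogram {number of children: number of nodes} for one UPOS tag."""
    # Count how many words point at each ID as their head.
    n_children = Counter(head for _, _, head in sentence)
    # Counter returns 0 for IDs that never appear as a head, i.e. leaves.
    return Counter(n_children[word_id] for word_id, tag, _ in sentence if tag == upos)


degrees = child_degrees(SENTENCE, "NUM")
print(dict(degrees))  # one leaf and one node with a single child
print(max(degrees))   # highest child degree in this sample: 1
```

For a whole treebank, the per-sentence histograms would be summed, and the largest key gives the highest child degree.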

Children of NUM nodes are attached using 3 different relations: clf (90; 97% of instances), conj (2; 2% of instances), advmod (1; 1% of instances)

Children of NUM nodes belong to 3 different parts of speech: NOUN (90; 97% of instances), NUM (2; 2% of instances), ADV (1; 1% of instances)