home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-Modern: POS Tags: NUM

There are 30 NUM lemmas (1%), 30 NUM types (1%) and 213 NUM tokens (1%). Out of 17 observed tags, the rank of NUM is: 9 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent NUM lemmas: 一, 三, 二, 十, 七, 百, 六, 五十, 四, 數

The 10 most frequent NUM types: 一, 三, 二, 十, 七, 百, 六, 五十, 四, 數

The 10 most frequent ambiguous lemmas: 數 (NOUN 8, NUM 6, ADV 1)

The 10 most frequent ambiguous types: 數 (NOUN 8, NUM 6, ADV 1)

Morphology

The form / lemma ratio of NUM is 1.000000 (the average of all parts of speech is 1.139839).

The 1st highest number of forms (1) was observed with the lemma “一”: 一.

The 2nd highest number of forms (1) was observed with the lemma “一二”: 一二.

The 3rd highest number of forms (1) was observed with the lemma “七”: 七.

NUM does not occur with any features.

Relations

NUM nodes are attached to their parents using 8 different relations: nummod (160; 75% instances), dep (20; 9% instances), root (14; 7% instances), nmod (7; 3% instances), obj (5; 2% instances), obl (4; 2% instances), nsubj (2; 1% instances), iobj (1; 0% instances)

Parents of NUM nodes belong to 7 different parts of speech: NOUN (144; 68% instances), VERB (26; 12% instances), PART (24; 11% instances), (14; 7% instances), NUM (3; 1% instances), AUX (1; 0% instances), PROPN (1; 0% instances)

154 (72%) NUM nodes are leaves.

26 (12%) NUM nodes have one child.

24 (11%) NUM nodes have two children.

9 (4%) NUM nodes have three or more children.

The highest child degree of a NUM node is 4.

Children of NUM nodes are attached using 9 different relations: compound (31; 29% instances), case (21; 20% instances), nmod (21; 20% instances), aux (20; 19% instances), iobj (5; 5% instances), cc (3; 3% instances), nummod (3; 3% instances), advmod (1; 1% instances), obl (1; 1% instances)

Children of NUM nodes belong to 9 different parts of speech: NOUN (48; 45% instances), ADP (21; 20% instances), AUX (20; 19% instances), PRON (8; 8% instances), CCONJ (3; 3% instances), NUM (3; 3% instances), ADV (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)