This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ja/pos issue tracker

X: other

The Japanese tag X is used for zenkanku space (IDEOGRAPHIC SPACE U+3000 in Unicode) tagged with whitespace / 空白 in UniDic.


Treebank Statistics (UD_Japanese)

There are 1 X lemmas (8%), 14 X types (0%) and 18 X tokens (0%). Out of 12 observed tags, the rank of X is: 12 in number of lemmas, 10 in number of types and 10 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: まあ, #205。, (^3^)/, *, _, d=(^o^)=b, ……。, ※世帯ごとに, ♂の, そらあ

The 10 most frequent ambiguous lemmas: _ (NOUN 50340, VERB 18567, PUNCT 10247, NUM 4184, ADJ 3393, ADV 3061, PRON 1113, DET 925, CONJ 180, X 18, PART 3, ADP 2)

The 10 most frequent ambiguous types: まあ (X 5, ADV 1), _ (NOUN 21, VERB 7, ADJ 2, NUM 1, X 1)

Morphology

The form / lemma ratio of X is 14.000000 (the average of all parts of speech is 4757.166667).

The 1st highest number of forms (14) was observed with the lemma “_”: #205。, (^3^)/, *, _, d=(^o^)=b, ……。, ※世帯ごとに, ♂の, そらあ, とんでもない。, なんだよなんだよぉ, ま, まぁ, まあ.

X does not occur with any features.

Relations

X nodes are attached to their parents using 3 different relations: ja-dep/dep (11; 61% instances), ja-dep/root (5; 28% instances), ja-dep/nmod (2; 11% instances)

Parents of X nodes belong to 6 different parts of speech: NOUN (5; 28% instances), ROOT (5; 28% instances), VERB (5; 28% instances), ADJ (1; 6% instances), NUM (1; 6% instances), PRON (1; 6% instances)

7 (39%) X nodes are leaves.

11 (61%) X nodes have one child.

The highest child degree of a X node is 1.

Children of X nodes are attached using 4 different relations: ja-dep/punct (7; 64% instances), ja-dep/dep (2; 18% instances), ja-dep/acl:relcl (1; 9% instances), ja-dep/nmod (1; 9% instances)

Children of X nodes belong to 4 different parts of speech: PUNCT (7; 64% instances), NOUN (2; 18% instances), ADJ (1; 9% instances), VERB (1; 9% instances)


X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]