
This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSDLUW: POS Tags: DET

There are 8 DET lemmas (0%), 10 DET types (0%) and 987 DET tokens (1%). Out of 17 observed tags, the rank of DET is: 16 in number of lemmas, 16 in number of types and 12 in number of tokens.

All 8 DET lemmas, most frequent first: 其の, 此の, 或る, 何の, 彼の, あらゆる, 我が, とある

The 10 most frequent DET types: その, この, ある, どの, あの, あらゆる, わが, とある, 我が, 或る

The 10 most frequent ambiguous lemmas:

Ambiguous types: ある (VERB 414, DET 33)

Morphology

The form / lemma ratio of DET is 1.250000 (the average of all parts of speech is 1.095294).

The 1st highest number of forms (2) was observed with the lemma “我が”: わが, 我が.

The 2nd highest number of forms (2) was observed with the lemma “或る”: ある, 或る.

The 3rd highest number of forms (1) was observed with the lemma “あらゆる”: あらゆる.
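
The form / lemma ratio reported above can be reproduced from the lemma and type lists on this page. The following is a minimal sketch, assuming the ratio is simply the number of distinct DET word forms divided by the number of distinct DET lemmas; the variable names are illustrative and do not come from the UD tooling.

```python
# DET lemmas and types (word forms) as listed on this page.
det_lemmas = ["其の", "此の", "或る", "何の", "彼の", "あらゆる", "我が", "とある"]
det_types = ["その", "この", "ある", "どの", "あの", "あらゆる", "わが", "とある", "我が", "或る"]

# Assumption: ratio = distinct forms / distinct lemmas.
ratio = len(set(det_types)) / len(set(det_lemmas))
print(f"{ratio:.6f}")  # 1.250000
```

The two extra forms come from the lemmas 我が and 或る, each of which is attested in both a kana and a kanji spelling.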

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 1 relation: det (987; 100% instances)

Parents of DET nodes belong to 7 different parts of speech: NOUN (960; 97% instances), PROPN (15; 2% instances), NUM (7; 1% instances), PRON (2; 0% instances), ADJ (1; 0% instances), ADV (1; 0% instances), VERB (1; 0% instances)
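
A count like the one above could be recomputed from the treebank's CoNLL-U files. The sketch below is not the script that generated this page; it is a minimal illustration that assumes the standard 10-column CoNLL-U layout (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The function name `det_parent_upos` and the two-token sample sentence are hypothetical.

```python
from collections import Counter

def det_parent_upos(conllu_text):
    """Count the UPOS tag of the parent of every DET token in CoNLL-U text."""
    counts = Counter()
    sentence = {}  # token ID -> (UPOS, HEAD) for the sentence being read

    def flush():
        # Look up the parent of each DET token; HEAD "0" (root) has no parent entry.
        for upos, head in sentence.values():
            if upos == "DET" and head in sentence:
                counts[sentence[head][0]] += 1
        sentence.clear()

    for line in conllu_text.splitlines():
        if not line.strip():
            flush()  # a blank line ends a sentence
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Keep plain word lines only; skip ranges (1-2) and empty nodes (1.1).
            if cols[0].isdigit():
                sentence[cols[0]] = (cols[3], cols[6])
    flush()  # handle a final sentence without a trailing blank line
    return counts

# Hypothetical sample, not taken from the treebank.
sample = "\n".join([
    "# text = この本",
    "1\tこの\t此の\tDET\t_\t_\t2\tdet\t_\t_",
    "2\t本\t本\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
])
print(det_parent_upos(sample))  # Counter({'NOUN': 1})
```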

975 (99%) DET nodes are leaves.

11 (1%) DET nodes have one child.

1 (0%) DET node has two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 2 different relations: punct (10; 77% instances), fixed (3; 23% instances)

Children of DET nodes belong to 2 different parts of speech: PUNCT (10; 77% instances), ADP (3; 23% instances)
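
The child statistics in this section are mutually consistent, which can be checked with a few lines of arithmetic. This is a small sketch using only the counts reported on this page; the dictionary names are illustrative.

```python
# Number of DET nodes by child count (0 = leaf), as reported on this page.
degree_counts = {0: 975, 1: 11, 2: 1}
# Relations attaching children to DET nodes, as reported on this page.
child_relations = {"punct": 10, "fixed": 3}

total_nodes = sum(degree_counts.values())
total_children = sum(degree * n for degree, n in degree_counts.items())

# Percentage share of each child relation, rounded to whole percent.
shares = {rel: round(100 * n / total_children) for rel, n in child_relations.items()}

print(total_nodes)     # 987, the number of DET tokens
print(total_children)  # 13, equal to 10 punct + 3 fixed
print(shares)          # {'punct': 77, 'fixed': 23}
```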