home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Japanese-GSD: POS Tags: DET

There are 8 DET lemmas (0%), 10 DET types (0%) and 987 DET tokens (1%). Out of 16 observed tags, the rank of DET is: 16 in number of lemmas, 16 in number of types and 14 in number of tokens.

The 10 most frequent DET lemmas: 其の, 此の, 或る, 何の, 彼の, あらゆる, 我が, とある

The 10 most frequent DET types: その, この, ある, どの, あの, あらゆる, わが, とある, 我が, 或る

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: ある (VERB 1008, DET 33)

Morphology

The form / lemma ratio of DET is 1.250000 (the average of all parts of speech is 1.115220).

The 1st highest number of forms (2) was observed with the lemma “我が”: わが, 我が.

The 2nd highest number of forms (2) was observed with the lemma “或る”: ある, 或る.

The 3rd highest number of forms (1) was observed with the lemma “あらゆる”: あらゆる.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 1 different relations: det (987; 100% instances)

Parents of DET nodes belong to 6 different parts of speech: NOUN (966; 98% instances), PROPN (9; 1% instances), ADJ (8; 1% instances), PRON (2; 0% instances), NUM (1; 0% instances), VERB (1; 0% instances)

974 (99%) DET nodes are leaves.

12 (1%) DET nodes have one child.

1 (0%) DET nodes have two children.

The highest child degree of a DET node is 2.

Children of DET nodes are attached using 3 different relations: punct (10; 71% instances), case (3; 21% instances), nmod (1; 7% instances)

Children of DET nodes belong to 3 different parts of speech: PUNCT (10; 71% instances), ADP (3; 21% instances), NOUN (1; 7% instances)