home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Hittite-HitTB: POS Tags: DET

There are 6 DET lemmas (1%), 12 DET types (2%) and 16 DET tokens (1%). Out of 15 observed tags, the rank of DET is: 12 in number of lemmas, 11 in number of types and 12 in number of tokens.

The 10 most frequent DET lemmas: kuiš, apā-, kā-, mekki, ḫūmant-, aši

The 10 most frequent DET types: ke-e, ku-e-da-aš, ku-e-da-ni, me-ek-ki, a-pé-e-da-aš, a-pé-e-da-ni, a-pí-ya, a-ši, ku-iš, ku-u-uš

The 10 most frequent ambiguous lemmas: kuiš (PRON 9, DET 5), apā- (PRON 4, DET 3), kā- (DET 3, PRON 2), ḫūmant- (DET 2, PRON 2)

The 10 most frequent ambiguous types: ku-iš (PRON 2, DET 1)

Morphology

The form / lemma ratio of DET is 2.000000 (the average of all parts of speech is 1.571106).

The 1st highest number of forms (3) was observed with the lemma “apā-”: a-pé-e-da-aš, a-pé-e-da-ni, a-pí-ya.

The 2nd highest number of forms (3) was observed with the lemma “kuiš”: ku-e-da-aš, ku-e-da-ni, ku-iš.

The 3rd highest number of forms (2) was observed with the lemma “kā-”: ke-e, ku-u-uš.

DET occurs with 4 features: Case (13; 81% instances), Number (13; 81% instances), Gender (5; 31% instances), PronType (1; 6% instances)

DET occurs with 8 feature-value pairs: Case=Acc, Case=Dat, Case=Nom, Gender=Com, Gender=Neut, Number=Plur, Number=Sing, PronType=Tot

DET occurs with 8 feature combinations. The most frequent feature combination is Case=Dat|Number=Sing (4 tokens). Examples: ku-e-da-ni, a-pé-e-da-ni, a-pí-ya

Relations

DET nodes are attached to their parents using 2 different relations: det (15; 94% instances), obj (1; 6% instances)

Parents of DET nodes belong to 5 different parts of speech: NOUN (12; 75% instances), ADJ (1; 6% instances), NUM (1; 6% instances), PROPN (1; 6% instances), VERB (1; 6% instances)

14 (88%) DET nodes are leaves.

2 (13%) DET nodes have one child.

The highest child degree of a DET node is 1.

Children of DET nodes are attached using 2 different relations: discourse (1; 50% instances), nmod (1; 50% instances)

Children of DET nodes belong to 2 different parts of speech: NOUN (1; 50% instances), PART (1; 50% instances)