home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Kadiweu-Unicamp: POS Tags: DET

There are 8 DET lemmas (10%), 12 DET types (12%) and 33 DET tokens (10%). Out of 11 observed tags, the rank of DET is: 4 in number of lemmas, 3 in number of types and 4 in number of tokens.

The 10 most frequent DET lemmas: ica, ijo, niGini, adi, niGida, niGijo, niGina, eliodi

The 10 most frequent DET types: ica, ajo, NaGani, NiGida, adi, ijo, NaGajo, NiGijo, NiGinoa, eliodi

The 10 most frequent ambiguous lemmas: niGida (DET 2, PRON 2), niGijo (DET 2, PRON 2), niGina (DET 2, PRON 1), eliodi (ADV 3, DET 1)

The 10 most frequent ambiguous types: NaGajo (DET 1, PRON 1), eliodi (ADV 3, DET 1)

Morphology

The form / lemma ratio of DET is 1.500000 (the average of all parts of speech is 1.209877).

The 1st highest number of forms (2) was observed with the lemma “ijo”: ajo, ijo.

The 2nd highest number of forms (2) was observed with the lemma “niGijo”: NaGajo, NiGijo.

The 3rd highest number of forms (2) was observed with the lemma “niGina”: NiGinoa, naGana.

DET occurs with 3 features: PronType (33; 100% instances), Number (32; 97% instances), Gender (31; 94% instances)

DET occurs with 6 feature-value pairs: Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PronType=Dem, PronType=Ind

DET occurs with 4 feature combinations. The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Dem (19 tokens). Examples: ica, NiGida, ijo, NiGijo

Relations

DET nodes are attached to their parents using 2 different relations: det (32; 97% instances), nsubj (1; 3% instances)

Parents of DET nodes belong to 2 different parts of speech: NOUN (32; 97% instances), VERB (1; 3% instances)

33 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.