home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Xavante-XDT: POS Tags: DET

There are 4 DET lemmas (1%), 5 DET types (1%) and 65 DET tokens (4%). Out of 15 observed tags, the rank of DET is: 11 in number of lemmas, 10 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: hã, ã, tahata, õ

The 10 most frequent DET types: hã, Tahata, Ãhã, Õhõ, ã

The 10 most frequent ambiguous lemmas: (DET 61, PART 27), õ (PART 15, ADV 4, DET 1)

The 10 most frequent ambiguous types: (DET 61, PART 27), Ãhã (DET 1, PRON 1)

Morphology

The form / lemma ratio of DET is 1.250000 (the average of all parts of speech is 1.294461).

The 1st highest number of forms (2) was observed with the lemma “ã”: Ãhã, ã.

The 2nd highest number of forms (1) was observed with the lemma “hã”: .

The 3rd highest number of forms (1) was observed with the lemma “tahata”: Tahata.

DET occurs with 2 features: Deixis (3; 5% instances), Emph (2; 3% instances)

DET occurs with 3 feature-value pairs: Deixis=Prox, Deixis=Remt, Emph=Yes

DET occurs with 4 feature combinations. The most frequent feature combination is _ (62 tokens). Examples: hã, Tahata

Relations

DET nodes are attached to their parents using 2 different relations: det (64; 98% instances), nsubj (1; 2% instances)

Parents of DET nodes belong to 3 different parts of speech: NOUN (61; 94% instances), VERB (3; 5% instances), PROPN (1; 2% instances)

65 (100%) DET nodes are leaves.

The highest child degree of a DET node is 0.