Treebank Statistics: UD_Xavante-XDT: POS Tags: DET
There are 5 DET
lemmas (1%), 7 DET
types (1%) and 71 DET
tokens (3%).
Out of 15 observed tags, the rank of DET
is: 11 in number of lemmas, 10 in number of types and 10 in number of tokens.
The 10 most frequent DET
lemmas: hã, õ, ã, o, tahata
The 10 most frequent DET
types: hã, õ, Tahata, o, Ãhã, Õhõ, ã
The 10 most frequent ambiguous lemmas: hã (DET 61, PART 48, PRON 2), õ (PART 22, DET 6, ADV 4, PRON 1), o (DET 1, PRON 1)
The 10 most frequent ambiguous types: hã (DET 61, PART 50, PRON 3), õ (PART 22, ADV 4, DET 4), Ãhã (DET 1, PRON 1)
- hã
- õ
- Ãhã
Morphology
The form / lemma ratio of DET
is 1.400000 (the average of all parts of speech is 1.232409).
The 1st highest number of forms (2) was observed with the lemma “ã”: Ãhã, ã.
The 2nd highest number of forms (2) was observed with the lemma “õ”: Õhõ, õ.
The 3rd highest number of forms (1) was observed with the lemma “hã”: hã.
DET
occurs with 2 features: Deixis (4; 6% instances), Emph (2; 3% instances)
DET
occurs with 3 feature-value pairs: Deixis=Prox
, Deixis=Remt
, Emph=Yes
DET
occurs with 5 feature combinations.
The most frequent feature combination is _
(67 tokens).
Examples: hã, õ, Tahata, o
Relations
DET
nodes are attached to their parents using 4 different relations: det (63; 89% instances), nsubj (5; 7% instances), dep (2; 3% instances), obl (1; 1% instances)
Parents of DET
nodes belong to 4 different parts of speech: NOUN (61; 86% instances), VERB (8; 11% instances), AUX (1; 1% instances), PROPN (1; 1% instances)
65 (92%) DET
nodes are leaves.
4 (6%) DET
nodes have one child.
0 (0%) DET
nodes have two children.
2 (3%) DET
nodes have three or more children.
The highest child degree of a DET
node is 4.
Children of DET
nodes are attached using 3 different relations: dep (9; 82% instances), case (1; 9% instances), punct (1; 9% instances)
Children of DET
nodes belong to 6 different parts of speech: PART (4; 36% instances), NOUN (2; 18% instances), PRON (2; 18% instances), ADP (1; 9% instances), PUNCT (1; 9% instances), X (1; 9% instances)