home edit page issue tracker

This page pertains to UD version 2.

UD for Xavante

Tokenization and Word Segmentation

Mapping UPOS to XPOS Xavante

UPOS XPOS
ADJ adj
ADV adv
INTJ intj
NOUN n
PROPN ppn
VERB v, vi, vt
ADP pp
AUX aux
CCONJ cc
DET det
NUM num
PART pcl
PRON pro
SCONJ sc
PUNCT punct
SYM sym
X x

Morphology

Tags

Features

NOMINAL FEATURE

Person indexes

Xavante has a complicated system of indexation, using many different sets of markers. These are given in the tables below.

VERBAL FEATURE

Wh-words

Xavante wh-words are built from words such as wa ‘who’, marĩ ‘what’ (man’s speech), tiha ‘what’ (woman’s speech), mamɛ ‘where’, mahãta ‘where is’, and momo ‘where to’. These are, in questions, predeced by the particle e, which indicates that the speaker requires new information.

UPOS XPOS
who e wa
what (man) e marĩ
what (woman) e tiha
por quê? (man) e marĩ bə
por quê? (woman) e tiha bə
where e mamɛ
where to e momo
where (is) e mahãta

Syntax


Treebanks

There are N Xavante UD treebanks:


Instruction: Treebank-specific pages are generated automatically from the README file in the treebank repository and from the data in the latest release. Link to the respective *-index.html page in the treebanks folder, using the language code and the treebank code in the file name.