UD for Makurap
Tokenization and Word Segmentation
- Words are delimited by whitespace characters
- According to typographical rules, many punctuation marks are attached to a neighboring word. They are given as separate tokens (words);
- There are no adjectives in Makurap. Modification is made by composition, juxtaposing lexical roots, so when a lexical root is modified by another a new word appears as in kito moke ‘the thief man’ (kito ‘man’ + moke ‘thief’). Such words are treated sometimes as multiword tokens.
Morphology
Tags
- Makurap uses 16 of the 17 universal POS categories.
ADJ
is not used since there is no separate class of adjectives.
Mapping UPOS to XPOS Makurap
UPOS | XPOS |
---|---|
ADJ | adj |
ADV | adv |
INTJ | intj |
NOUN | n |
PROPN | ppn |
VERB | v, vi, vt |
ADP | pp |
AUX | aux |
CCONJ | cc |
DET | det |
NUM | num |
PART | pcl |
PRON | pro |
SCONJ | sc |
PUNCT | punct |
SYM | sym |
X | x |
Nominal Features
- Makurap nouns are not marked for gender. Number is optionally marked by the lexical root yã .
- Nous can take the following Cases:
Gen
. - NOUN, PROPN and PRON, are not marked for Gender.
- Personal Pronouns and Person Markers distinguish Number(Singular or Plural). They also distinguish Clusivity in the 1st person plural.
- The relational markers
Rel
, which indicate contiguity or non-contiguity between a head and its dependent, take respectively the following features:Rel=Cont
andRel=NCont
. The reflexive/correferential morpheme et is associated with the feature-valueReflex=Yes
. - Makurap is reach in nominalizations. Lexical roots can be nominalized by suffixes that receive the following features: nominalization of circunstance
Nomzr=Circ
(-ap),Nomzr=Ag
(-ret),Nomzr=Obj
(-yĩ). - Nouns may also be reduplicated in both ways denoting: plurality, collectivity, superlativity, and other semantic nuances. Numerals may also be reduplicated in order to indicate distribution.
Verbal Features
- Verbs have a lexical Aspect:
Imp
(Imperfective),Perf
(Perfective) ,Iter
(Iterative),Compl
(Completive). - Makurap is a head marking language
- Lexical roots may be reduplicated in two differentways:
monosylabic reduplication (
Red=Mo
). The modify the aspect of the verb in different ways: disylabic reduplication indicate the repetition or duration of an action; monosylabic reduplication indicates iteration of the action.
Syntax
- As a head-marking language, core arguments, except oblique core arguments are cross-referenced on the predicate.
- The order of arguments cross-referenced on the predicate is SOV. Full NPs associated with core arguments may appear in any order. The NPs may be marked as
obl
‘ obliques’, since they are not the core arguments. - Particles may indicate
Tense
(Tense),Mood
(Mood), andEvident
(Evidentiality).
Treebanks
There is 1 Makurap UD treebank: