UD for Veps 
Tokenization and Word Segmentation
- The main tokenization is the standard white-space delimited approach with punctuations separated.
- Punctuation marks are treated as separate tokens; the exceptions include apostrophes, what mark palatalizations; ordinary numbers (23. sügüz’ku) abbreviations (they can be written with and without period).
Morphology
Tags
- Veps uses 13 universal POS categories
- DET (determiner), INT (interjection), SYM (symbol), X (other) categories are currently not used
- Veps has the following auxiliary verbs:
- “olda” (to be, also to own etc.)
- “ei” (negation of verb)
- modals: “voida”, “sada” (can), “pidada” (must). (list can be extended as the corpora gets larger)
Features
Verbal Features
- There are five main verbal forms, distinguished by the value of the VerbForm feature:
- Mood has two values:
CndorInd.- values
ImpandPotcan be added as the corpora gets larger
- values
- Tense has two values:
PastorPres. - Voice has two values:
ActandPass. - Person has three values,
1,2and3. - Number has two values
SingorPlur.
Nominal Features
- Veps does not have Gender feature
- Number feature has two possible values:
SingandPlur - Case has 15 possible values:
Abl,Ade,All,Apr,Com,Ela,Ess,Gen,Ill,Ine,Nom,Par,Pro,Ter,Tra- values
AbeandEgrcan be added as the corpora gets larger
- values
Degree and Polarity
- Degree applies to adjectives (ADJ) and has one of two possible values:
Pos,Cmp- value
Supcan be added as the corpora gets larger - value
Dimcan be added and applied to nouns as the corpora gets larger
- value
- Polarity has only value
Neg, and applies to auxiliarie ‘ei’ - Connegative has only value
Yesand applies to verbs which have been negated by ‘ei’.
Pronouns, Determiners, Quantifiers
- PronType is used with pronouns (PRON).
- NumType is used with numerals (NUM) and adjectives (ADJ).
- The Reflex feature marks reflexive pronouns (ičeze).
In Veps it is always used together with
PronType=Prs. - Person is a lexical feature of personal pronouns (PRON) and has three values,
1,2and3. - PronType is a list based feature of pronouns and determiners and it has the following values:
- “PronType=Dem” for demonstrative pronouns
- “PronType=Int” for interrogative pronouns
- “PronType=Prs” for personal pronouns
- “PronType=Tot” for total (collective) pronouns (kaik)
### Other Features
- Veps treebank has the following language-specific features:
Syntax
- Nominal subject (nsubj) is typically a nominal in the nominative,
genitive or partitive case, without preposition.
- An infinitive verb may serve as the subject and is labeled as clausal subject, csubj.
- Objects (obj) can be nominals in nominative, genitive or partitive case.
- The copula verb olda (be) is used in equational, attributional, locative, possessive and benefactory nonverbal clauses.
Relations Overview
- The following relation subtypes are used in Veps:
Treebanks
There is 1 Veps UD treebanks: