
This page pertains to UD version 2.

UD for Umbrian

Introduction

Umbrian is an Indo-European language of the Italic branch. As such, it shares a number of characteristics with classical IE languages, and especially with Latin. The main similarities between Umbrian and Latin lie in their declension and conjugation systems. The main difference, besides phonology, is the extensive use of cliticised postpositions in Umbrian where Latin has plain prepositions.

Tokenization and Word Segmentation

The Iguvine tablets use a word separator (: in the Umbrian script and ⋅ in the Latin script). We thus follow native word segmentation as much as possible. The main exceptions are:

Since there is no sentence boundary marker, we segment the text so that each sentence contains one finite verb, unless there are clear signs of subordination (subordinators, relativisers…). We follow this principle as much as possible while maintaining parallel structures in the original text.
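As a rough illustration of the two conventions above, the following Python sketch splits a transcribed line at the native word separators and counts finite verbs in a CoNLL-U sentence. It is not the tooling used to build the treebank. The function names `split_words` and `count_finite_verbs` are hypothetical, the input tokens are placeholders rather than real Umbrian forms, and the sketch assumes that finite verbs carry the feature `VerbForm=Fin` in the FEATS column.

```python
import re

# Separators named in the text: ":" (Umbrian script) and U+22C5 (Latin script).
WORD_SEPARATORS = (":", "\u22c5")


def split_words(line, separators=WORD_SEPARATORS):
    """Split a transcribed line into words at the native word separators."""
    pattern = "|".join(re.escape(s) for s in separators)
    # Drop surrounding whitespace and any empty pieces between separators.
    return [w.strip() for w in re.split(pattern, line) if w.strip()]


def count_finite_verbs(conllu_sentence):
    """Count tokens whose FEATS column contains VerbForm=Fin (assumed annotation)."""
    count = 0
    for row in conllu_sentence.splitlines():
        if not row.strip() or row.startswith("#"):
            continue  # skip blank lines and comment lines
        cols = row.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip malformed rows, multiword ranges and empty nodes
        if "VerbForm=Fin" in cols[5].split("|"):
            count += 1
    return count


# Placeholder tokens, not real Umbrian word forms.
print(split_words("w1:w2:w3"))  # ['w1', 'w2', 'w3']
```

A sentence for which `count_finite_verbs` returns more than one is not necessarily wrong: it may contain a subordinate clause or preserve a parallel structure, so such a count is best read as a prompt for manual review.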

Morphology

Tags

Features

Syntax

Treebanks

There is 1 Umbrian UD treebank: