
This page pertains to UD version 2.

UD for Old French

Sentence segmentation

Because sentence boundaries in the original texts are usually not clearly marked by punctuation, we keep only one main clause per tree, regardless of the punctuation added by modern transcriptions. See, for instance, this passage from La queste del saint Graal:

Certes, fet li rois, Keus, vos dites voir, ceste costume ai je toz jorz tenue et la tendrai tant com je porrai mes je avoie si grant joie de Lancelot et de ses cousins qui estoient venu a cort sain et haitié qu’il ne me sovenoit de la costume.

This passage is split into four trees:
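A necessary condition of this convention can be checked mechanically by counting root tokens per tree in a CoNLL-U file. The sketch below is a minimal illustration, assuming standard ten-column CoNLL-U input. The function name `roots_per_tree` and the placeholder rows are hypothetical and are not taken from the treebank. It only verifies that each tree has a single root. It cannot tell whether coordinated main clauses were left inside one tree.

```python
def roots_per_tree(conllu_text):
    """Return the number of root tokens (HEAD == 0) found in each tree."""
    counts = []
    current = None  # root count of the tree being read, None between trees
    for line in conllu_text.splitlines():
        line = line.strip()
        if not line:
            # A blank line closes the current tree, if one was open.
            if current is not None:
                counts.append(current)
                current = None
            continue
        if line.startswith("#"):
            continue  # sentence-level comments carry no tokens
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip malformed rows, multiword ranges and empty nodes
        if current is None:
            current = 0
        if cols[6] == "0":  # column 7 (HEAD) is 0 for the root token
            current += 1
    if current is not None:
        counts.append(current)
    return counts


def row(idx, form, head, deprel):
    """Build one CoNLL-U row, leaving unused columns as underscores."""
    return "\t".join([str(idx), form, "_", "_", "_", "_", str(head), deprel, "_", "_"])


# Placeholder data (hypothetical, not real Old French annotation): two trees.
TOY = "\n".join([
    row(1, "tok1", 2, "nsubj"),
    row(2, "tok2", 0, "root"),
    row(3, "tok3", 2, "obj"),
    "",
    row(1, "tok4", 0, "root"),
    row(2, "tok5", 1, "obj"),
    "",
])

print(roots_per_tree(TOY))
```

Any tree whose count differs from 1 would need to be inspected by hand.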

Tokenization and Word Segmentation

Morphology

The following custom morphological features are used:

Syntax

The following relation subtypes are used:

Treebanks

There is 1 Old French UD treebank: