home edit page issue tracker

This page pertains to UD version 2.

UD Middle French ALTM

Language: Middle French (code: frm)
Family: IE

This treebank has been part of Universal Dependencies since the UD v2.17 release.

The following people have contributed to making this treebank part of UD: Natalia Romanova, Rayan Ziane, Khensa Daoudi, Théo Brillet.

Repository: UD_Middle_French-ALTM
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17

License: CC BY-SA 4.0

Genre: legal

Questions, comments? General annotation questions (either Middle French-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [natalia • romanova (æt) unicaen • fr]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas annotated manually
UPOS annotated manually, natively in UD style
XPOS not available
Features annotated manually, natively in UD style
Relations annotated manually, natively in UD style

Description

Middle-French ALTM (AUTOMATED Legal Texts Medieval) is a treebank of medieval legal French from Normandy. Currently in contains one text, an extract from Coutume, style et usage au temps des Échiquiers de Normandie, dated 1425.

The text of the first ten chapters of Coutume, style et usage au temps des Échiquiers de Normandie was digitised from the following edition: A.-J. Marnier (ed). 1847. Coutume, style et usage au temps des Échiquiers de Normandie. Mémoires de la Société des antiquaires de Normandie. 2nd series, vol 8. Caen: A. Hardel, pp. 1-7.

The text was first annotated in PoS, lemmatised and automatically parsed as part of the Franco-German MICLE project (2021-2024) led by Professor Pierre Larrivée (University of Caen) and Professor Cecilia Poletto (University of Frankfurt). An earlier version, annotated with HT-CRISCO workflow incorporating the use of HOPS parser, can be consulted on CRISCO Lab’s TXM server and via the website.

The treebank currently contains 217 sentences and 7,301 tokens.

As part of AUTOMATED project led by Professor Larrivée at the University of Caen (2023-2025), the text was reannotated with BertForDeprel parser and manually corrected using bootstrapping methodology (Peng et al 2022) on ArboratorGrew software.

Annotation in syntactic functions was conducted following the guidelines for Old French developed by the (Profiterole project).

Wherever possible, lemmata used in the corpus are modern French or lemmata of the (Dictionnaire du Moyen Français).

Sentences written completely in Latin were excluded. If Latin words occur in French sentences, the token contains the tag Lang=la and is lemmatised with a Latin lemma.

Where morpological features are concerned, verbs and auxiliaries are annotated in verb forms (VerbForm): Inf (infinitive), Fin (conjugated) and Part (participle). Congujated forms are annotated in Person and Number. Pronouns are annotated in type (PronType: Dem for demonstrative, Ind for indefinite, Prs for personal and Rel for relative). Reflexive and possessive pronouns are also tagged (Reflexive=Yes and Poss=Yes). Determiners are annotated using PronType feature (Art for articles, Dem for demonstratives, Ind for indefinite). Possessive determiners have are annotated Poss=Yes.

Please note that Middle_French-ALTM treebank is still under development and new material will be added to the collection in future UD releases. Please do not hesitate to contact us is you have any questions, suggestions or comments.

Acknowledgments

This work was funded by ANR-DFG and Normandy Region grants and took place under the direction of Professor Pierre Larrivée (University of Caen). Mathieu Goux conducted initial PoS annotation and lemmatisation. Natasha Romanova is responsible for the revision of the annotation and for syntactic parsing. Théo Brillet assisted with syntactic annotation and annotated Latin tokens. Rayan Ziane and Khensa Daoudi provided technical support.

References

Statistics of UD Middle French ALTM

POS Tags

ADJADPADVAUXCCONJDETNOUNNUMPRONPROPNPUNCTSCONJVERB

Features

DefiniteExtPosNumberNumTypePersonPossPronTypeTenseVerbForm

Relations

aclacl:relcladvcladvmodamodapposauxaux:passcaseccccompconjcopcsubjdetdiscoursedislocatedexplfixedflatiobjmarknmodnsubjnummodobjoblorphanpunctrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview