UD Old French ALTM
Language: Old French (code: fro)
Family: IE
This treebank has been part of Universal Dependencies since the UD v2.17 release.
The following people have contributed to making this treebank part of UD: Natalia Romanova, Rayan Ziane, Mathieu Goux, Khensa Daoudi, Pierre Larrivée.
Repository: UD_Old_French-ALTM
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY-SA 4.0
Genre: legal
Questions, comments? General annotation questions (either Old French-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [natalia • romanova (æt) unicaen • fr]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually |
| UPOS | annotated manually, natively in UD style |
| XPOS | not available |
| Features | annotated manually, natively in UD style |
| Relations | annotated manually, natively in UD style |
Description
Old-French ALTM (AUTOMATED Legal Texts Medieval) is a treebank of medieval legal French from Normandy. Currently in contains one text, Atiremens et jugiés d’eschequiers, dated 1314.
The text of Atiremens et jugiés d’eschequiers was digitised from the following edition: R. Génestal & E.-J. Tardif (eds.) 1921. Atiremenz et jugiés d’eschequiers. Caen: A. Olivier, pp. 1-75.
The text was first annotated in PoS, lemmatised and automatically parsed as part of the Franco-German MICLE project (2021-2024) led by Professor Pierre Larrivée (University of Caen) and Professor Cecilia Poletto (University of Frankfurt). An earlier version, annotated with HT-CRISCO workflow incorporating the use of HOPS parser, can be consulted on CRISCO Lab’s TXM server and via the website.
As part of AUTOMATED project led by Professor Larrivée at the University of Caen (2023-2025), the text was reannotated with BertForDeprel parser and manually corrected using bootstrapping methodology (Peng et al 2022) on ArboratorGrew software.
Annotation in syntactic functions was conducted following the guidelines for Old French developed by the (Profiterole project).
Where morpological features are concerned, verbs and auxiliaries are annotated in verb forms (VerbForm): Inf (infinitive), Fin (conjugated) and Part (participle). Congujated forms are annotated in Person and Number. Pronouns are annotated in type (PronType: Dem for demonstrative, Ind for indefinite, Prs for personal and Rel for relative). Reflexive and possessive pronouns are also tagged (Reflexive=Yes and Poss=Yes). Determiners are annotated using PronType feature (Art for articles, Dem for demonstratives, Ind for indefinite). Possessive determiners have are annotated Poss=Yes.
Wherever possible, lemmata used in the corpus are modern French or lemmata of the (Dictionnaire du Moyen Français).
Please note that Old_French-ALTM treebank is still under development and new material will be added to the collection in future UD releases. Please do not hesitate to contact us is you have any questions, suggestions or comments.
Acknowledgments
This work was funded by ANR-DFG and Normandy Region grants and took place under the direction of Professor Pierre Larrivée (University of Caen). Mathieu Goux conducted initial PoS annotation and lemmatisation. Natasha Romanova is responsible for the revision of the annotation and for syntactic parsing. Rayan Ziane and Khensa Daoudi provided technical support.
References
- Forthcoming
Statistics of UD Old French ALTM
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – NUM – PRON – PROPN – PUNCT – SCONJ – VERB
Features
Definite – ExtPos – Number – NumType – Person – Polarity – Poss – PronType – Tense – VerbForm
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – aux:pass – case – cc – ccomp – conj – cop – csubj – det – dislocated – expl – fixed – flat – iobj – mark – nmod – nsubj – nummod – obj – obl – orphan – parataxis – punct – root – xcomp
Tokenization and Word Segmentation
- This corpus contains 553 sentences, 15076 tokens and 15285 syntactic words.
- This corpus contains 1278 tokens (8%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 62 types of words that contain both letters and punctuation. Examples: l', n', d', s', ij., m', iij., vj., c', entr', [que], iiij., x., c., xxj., aag[i]é, xx., [se], espousa[i]lles, gag[i]er, ix., vi., xij., xl., [brebis], [conneü, [des]truis, [et], [in]conneües, [jour], [l'], [n', [on], [que, [savoir], [son, [sont], [terminer], [un], [x]viij., aid[i]er, as[s]eoir, comme], contre[s]tant, el[e], en], est], i[l], j', jug[ement]
- This corpus contains 209 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
- There are 9 types of multi-word tokens. Examples: du, des, au, as, u, es, eu, el, auquel.
Morphology
Tags
- This corpus uses 13 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PRON, PROPN, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: PART, INTJ, SYM, X
- This corpus contains 32 lemmas tagged as pronouns (PRON): aucun, autre, autrui, ce, chacun, cil, cist, dont, en, il, je, lequel, nous, nul, néant, on, où, que, quel, qui, quiconque, quoi, rien, se, sien, soi, tel, tout, un, vous, vôtre, y
- This corpus contains 21 lemmas tagged as determiners (DET): aucun, autre, ce, chacun, cil, cist, de, l', la, le, leur, ma, mon, notre, nul, plusieurs, quel, son, tout, un, votre
- Out of the above, 10 lemmas occurred sometimes as PRON and sometimes as DET: aucun, autre, ce, chacun, cil, cist, nul, quel, tout, un
- This corpus contains 5 lemmas tagged as auxiliaries (AUX): avoir, devoir, pouvoir, souloir, être
- Out of the above, 4 lemmas occurred sometimes as AUX and sometimes as VERB: avoir, devoir, pouvoir, être
- There are 3 (de)verbal forms:
- Fin
- AUX: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- VERB: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
- Inf
- AUX: estre, avoir
- VERB: savoir, avoir, fere, respondre, aler, metre, prendre, connoistre, demander, semondre
- Part
- AUX: esté
- VERB: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
Nominal Features
- Plur
- AUX-Fin: avoient, estoient, avez, sont, devez, furent, doivent, fussent, avon, devoient
- NOUN: nans, resons, parties, enfans, hommes, deffautes, tesmoins, deniers, livres, ans
- VERB-Fin: sont, mistrent, distrent, disoient, voulon, aroient, avoient, dison, firent, vindrent
- Sing
- AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- NOUN: homme, droit, fame, veüe, brief, terre, jugement, marchié, court, heritage
- PROPN: Normendie, Diex, Robert, Roen, Bosc, Dieu, France, Sainne
- VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
- VERB-Part: fete
- Def
- ADP: de
- DET: le, l', la, les, li
- Ind
- DET: un, une, tous, tout, de, toute, aucune, autre, nul, toutes
Degree and Polarity
- Neg
- ADV: ne, n', pas, point, mie, [n'
Verbal Features
- Past
- AUX-Part: esté
- VERB-Fin: departi
- VERB-Part: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
- Pres
- VERB-Part: pendant, fesant, metant, vouchant, abatant, contre[s]tant, contrebatant, contrestant, demourant, disant
Pronouns, Determiners, Quantifiers
- Art
- DET: le, l', la, les, un, une, li, uns, [un], des
- Dem
- DET: cest, cele, cel, ce, ces, ceste, ses, celi, cen, cil
- PRON: ce, cil, celi, c', cels, celui, cen, ceus, cele, celes
- Ind
- PRON: autre, riens, els, aucun, nul, un, tout, aucuns, nus, autrui
- Int
- ADV: comment
- DET: quel
- Prs
- ADP: a
- PRON: il, en, li, vous, je, l', le, se, ele, i
- Rel
- PRON: qui, que, quoi, ou, donc, lequel, quele, dont, quel
- Card
- NUM: ij., iij., vj., deus, iiij., x., c., xxj., xx., ix.
- Ord
- ADJ: premier, premiere, premieres, première, segont, tiers
- Yes
- ADJ: soen
- DET: son, sa, lor, ses, vostre, ma, mon, leur, nostre, mes
- PRON: soen, soens, vos, vostre
- 1
- AUX-Fin: sui, ai, avon, doi, aroie, avion, eüsse, aie, doie, fui
- VERB-Fin: di, veul, voulon, dison, oï, pris, sieus, vi, vueil, ai
- 2
- AUX-Fin: avez, devez, pouez, fussiez, estiez, eussiez, eüssiez, peüssiez, poez, porriez
- VERB-Fin: feïstes, meïstes, aportastes, connoissiez, criez, devez, dites, estes, eüssiez, faciez
- 3
- AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, fet, prist
Other Features
- ExtPos
- ADP
- ADP: juques, jusques
- ADV: hors, Non, ains, quant
- ADV
- ADP: de
- PRON: c'
- PRON
- DET: la
- SCONJ
- ADP: por, devant, par, puis, sans, depuis, fors, avant, des
- ADV: si, tant, anciés, puis, Anceis, Ausi, ainceis, ains, ainçois, combien
- PRON: tout
- ADP
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: être.
- This corpus uses 5 lemmas as auxiliaries (aux). Examples: avoir, devoir, pouvoir, être, souloir.
- This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: être.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Fin--NOUN (366)
- VERB-Fin--NOUN-ADP(à) (1)
- VERB-Fin--PRON (514)
- VERB-Inf--NOUN (42)
- VERB-Inf--PRON (122)
- VERB-Part--NOUN (168)
- VERB-Part--PRON (223)
- obj
- VERB-Fin--NOUN (329)
- VERB-Fin--NOUN-ADP(de) (2)
- VERB-Fin--NOUN-ADP(depuis) (1)
- VERB-Fin--NOUN-ADP(à) (1)
- VERB-Fin--PRON (151)
- VERB-Inf--NOUN (144)
- VERB-Inf--PRON (49)
- VERB-Part--NOUN (74)
- VERB-Part--NOUN-ADP(à) (1)
- VERB-Part--PRON (49)
- iobj
- VERB-Fin--PRON (53)
- VERB-Inf--PRON (15)
- VERB-Inf--PRON-ADP(à) (1)
- VERB-Part--PRON (36)