UD_Old_French-ALTM
|
UD_Old_French-PROFITEROLE
|
Tokenization and Word Segmentation
|
Tokenization and Word Segmentation
|
- This corpus contains 553 sentences, 15076 tokens and 15285 syntactic words.
|
- This corpus contains 20350 sentences, 237750 tokens and 240335 syntactic words.
|
- This corpus contains 1278 tokens (8%) that are not followed by a space.
|
- This corpus contains 33501 tokens (14%) that are not followed by a space.
|
- This corpus does not contain words with spaces.
|
- This corpus does not contain words with spaces.
|
- This corpus contains 62 types of words that contain both letters and punctuation. Examples: l', n', d', s', ij., m', iij., vj., c', entr', [que], iiij., x., c., xxj., aag[i]é, xx., [se], espousa[i]lles, gag[i]er, ix., vi., xij., xl., [brebis], [conneü, [des]truis, [et], [in]conneües, [jour], [l'], [n', [on], [que, [savoir], [son, [sont], [terminer], [un], [x]viij., aid[i]er, as[s]eoir, comme], contre[s]tant, el[e], en], est], i[l], j', jug[ement]
|
- This corpus contains 157 types of words that contain both letters and punctuation. Examples: l', qu', s', n', d', m', .i., t', c', j', jusqu', .ii., l'en, entr', .iiii., .iii., g', q', .xx., .xii., .c., .vii., ch', ensembl', un', ·l, quanqu', .v., .xxx., c., tresqu', .x., k', .c.m., entresqu', .xv., .l., .vi., .xxiiii., .ix., josqu', .viii., an.ii., cest', ·s, .XL., .iiij.m., .lx., .xxxvi.m., jesqu'
|
- This corpus contains 209 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
- There are 9 types of multi-word tokens. Examples: du, des, au, as, u, es, eu, el, auquel.
|
- This corpus contains 2585 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
- There are 22 types of multi-word tokens. Examples: au, des, as, del, el, al, du, dou, es, ou, u, aus, dels, als, jel, nel, ás, ell, eu, nes, os, ál.
|
Morphology
Tags
- This corpus uses 13 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PRON, PROPN, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: PART, INTJ, SYM, X
|
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: PART, SYM
|
|
|
|
- This corpus contains 32 lemmas tagged as pronouns (PRON): aucun, autre, autrui, ce, chacun, cil, cist, dont, en, il, je, lequel, nous, nul, néant, on, où, que, quel, qui, quiconque, quoi, rien, se, sien, soi, tel, tout, un, vous, vôtre, y
|
- This corpus contains 167 lemmas tagged as pronouns (PRON): @card@, _, _mot_nombre, alquant, ambedeux, ambore, aucun, auquant, autel, autre, autrui, cant, cantque, cart, ce, ce1, ce2, cel, cent, cest, chacun, chascun, cil, cinq, cinquante, cist, coi, cui, deux, disme, dix, donc, dont, douze, dîme, el, el1, elle, elle1, en, en2, eux, huit, huitième, ice, icel, il, je, je+le, je.en2, je.le, le, lequel, leur1, leur2, li, lor, lui, maint, me, mien, mille1, moi, mon1, moult, même, ne+le, ne1.le, ne1.les, ne1.me, ne1.se1, ne1.te, neuf2, neuvième, nos, nostre, notre, nous, nul, néant, nïent, o4, ol1, on, où, où.le, petit, peu, plusieurs, poi, premerain, premier, premier1, quanque, quant, quarante, quart, quatre, que, que+il, que.le, que.me, que.se, que.se1, que_que, quel, queux1, qui, qui1, qui1.en2, qui1.le, qui1.se1, quint, quinze, quoi, riant, rien, se1, sept, septième, sexte, si, si+il, si3.le, si3.me, si4, si4.en2, si4.le, si4.me, si4.te, sien, six, soi, soixante, son4, suen, tant, te, tel, tien, tierce, tiers, toi, tot, tous, tout, tout1, tout2, treizième, trente, trestot, trois, trèstous, trèstout, tu, tu.me, tuen, un, vingt, voloir, vos, vostre, votre, vous, y2, ça, çà
|
- This corpus contains 21 lemmas tagged as determiners (DET): aucun, autre, ce, chacun, cil, cist, de, l', la, le, leur, ma, mon, notre, nul, plusieurs, quel, son, tout, un, votre
|
- This corpus contains 65 lemmas tagged as determiners (DET): @card@, ADJ, _, alquant, aucun, autre, ce, ce1, ce2, cel, cest, chacun, chascun, cil, cinquante, cist, divers, es, icest, il, itel, le, lequel, leur2, li, lor, maint, mi1, mien, mon, mon1, moult, nostre, notre, nul, plusieurs, plusor, premier, quel, quelque, sien, son, son4, souvent, suen, tant, tel, tien, tierz, ton, ton2, tot, tous, tout, tout2, trestot, trèstous, trèstout, un, uns, vingt-quatre, vos, vostre, votre, à.le
|
- Out of the above, 10 lemmas occurred sometimes as PRON and sometimes as DET: aucun, autre, ce, chacun, cil, cist, nul, quel, tout, un
|
- Out of the above, 48 lemmas occurred sometimes as PRON and sometimes as DET: @card@, _, alquant, aucun, autre, ce, ce1, ce2, cel, cest, chacun, chascun, cil, cinquante, cist, il, le, lequel, leur2, li, lor, maint, mien, mon1, moult, nostre, notre, nul, plusieurs, premier, quel, sien, son4, suen, tant, tel, tien, tot, tous, tout, tout2, trestot, trèstous, trèstout, un, vos, vostre, votre
|
- This corpus contains 5 lemmas tagged as auxiliaries (AUX): avoir, devoir, pouvoir, souloir, être
|
- This corpus contains 10 lemmas tagged as auxiliaries (AUX): _, avoir, devoir, estre, pouvoir, pöoir, restre, savoir, souloir, voloir
|
- Out of the above, 4 lemmas occurred sometimes as AUX and sometimes as VERB: avoir, devoir, pouvoir, être
|
- Out of the above, 9 lemmas occurred sometimes as AUX and sometimes as VERB: _, avoir, devoir, estre, pouvoir, pöoir, restre, savoir, souloir
|
|
|
|
- Fin
- AUX: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- VERB: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
|
- Fin
- AUX: est, fu, a, estoit, avoit, ad, furent, ai, sont, ert
- VERB: dist, fist, fet, est, fait, a, avoit, vint, ad, ot
|
- Inf
- AUX: estre, avoir
- VERB: savoir, avoir, fere, respondre, aler, metre, prendre, connoistre, demander, semondre
|
- Inf
- AUX: estre, avoir, estra, aveir, estr', iestre
- NOUN: avoir, estre, pooir, saveir, voloir, mangier, plesir, departir, penser, savoir
- VERB: aler, dire, faire, venir, parler, avoir, prendre, veoir, estre, fere
|
- Part
- AUX: esté
- VERB: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
|
- Part
- ADJ: dolanz, clére, cuintes, demenies, enceinte, barbee, dolantes, hardiz, quarré
- AUX: esté, estet, fais
- NOUN: amant, mesfait, dormanz, esliz, junie, mort, sacrefises, senblant, morz, Conplainz
- VERB: fait, dit, mis, mort, fet, venuz, pris, morz, prise, oï
|
Nominal Features
|
Nominal Features
|
|
|
|
|
|
|
|
|
|
- Plur
- AUX-Fin: avoient, estoient, avez, sont, devez, furent, doivent, fussent, avon, devoient
- NOUN: nans, resons, parties, enfans, hommes, deffautes, tesmoins, deniers, livres, ans
- VERB-Fin: sont, mistrent, distrent, disoient, voulon, aroient, avoient, dison, firent, vindrent
|
|
- Sing
- AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- NOUN: homme, droit, fame, veüe, brief, terre, jugement, marchié, court, heritage
- PROPN: Normendie, Diex, Robert, Roen, Bosc, Dieu, France, Sainne
- VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
- VERB-Part: fete
|
|
|
|
|
|
|
|
|
|
|
- Def
- ADP: de
- DET: le, l', la, les, li
|
- Def
- ADP: del, au, al, dou, ou, Des, an, le
- AUX: es
- DET: le, li, la, les, l', lo, lu, lé, lí, lis
|
- Ind
- DET: un, une, tous, tout, de, toute, aucune, autre, nul, toutes
|
- Ind
- DET: un, une, .i., uns, unes, un', úne, u·, ún, U
|
Degree and Polarity
|
Degree and Polarity
|
|
|
|
|
|
|
- Neg
- ADV: ne, n', pas, point, mie, [n'
|
- Neg
- ADV: ne, n', mie, pas, non, point, nen, nun, nes, nient
- CCONJ: n'
- PRON: nel, nes, nu, nen, nul
|
|
|
|
Verbal Features
|
Verbal Features
|
|
|
|
|
|
|
|
|
|
- Past
- AUX-Part: esté
- VERB-Fin: departi
- VERB-Part: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
|
- Past
- ADJ: hardi, hardiz, barbee, quarré
- ADJ-Part: barbee, hardiz, quarré
- AUX: esté, este, éste, estet, fais
- AUX-Part: esté, estet, fais
- NOUN-Part: morz, Seignurs, adubez, asolue, comandet, guariz, loee, maufé, parjurez, pechét
- VERB: fait, dit, mis, mort, venuz, fet, pris, morz, prise, oï
- VERB-Part: fait, dit, mis, mort, fet, venuz, pris, morz, prise, oï
|
- Pres
- VERB-Part: pendant, fesant, metant, vouchant, abatant, contre[s]tant, contrebatant, contrestant, demourant, disant
|
- Pres
- ADJ-Part: dolanz, dolantes
- VERB: querant, curant, plorant, recreant, fuiant, parlant, recreanz, trenchant, veant, curanz
- VERB-Part: querant, curant, plorant, fuiant, parlant, recreant, recreanz, trenchant, veant, curanz
|
|
|
|
|
|
|
Pronouns, Determiners, Quantifiers
|
Pronouns, Determiners, Quantifiers
|
|
|
|
- Art
- DET: le, l', la, les, un, une, li, uns, [un], des
|
- Art
- ADP: del, des, au, al, dou, ou, an, le
- AUX: es
- DET: le, li, la, les, l', un, une, .i., uns, unes
|
- Dem
- DET: cest, cele, cel, ce, ces, ceste, ses, celi, cen, cil
- PRON: ce, cil, celi, c', cels, celui, cen, ceus, cele, celes
|
- Dem
- ADP: Ches, an, en
- ADV: en, i, an, í, em, u, o, ent, enn, ·n
- DET: ceste, cest, cele, cel, ces, ches, cil, chele, cez, cist
- PRON: ce, cil, ço, chou, çó, chil, celui, che, cele, cels
|
- Ind
- PRON: autre, riens, els, aucun, nul, un, tout, aucuns, nus, autrui
|
- Ind
- ADJ: tel, autre, autres, meïsmes, altre, tex, tele, nule, meïsme, meesme
- ADV: tout, tot, tut, tant, tous, po, alques, toute, tuit, Tel
- DET: toz, tel, nule, nul, tote, tout, autre, tuit, tot, toutes
- PRON: on, autre, tuit, nus, rien, uns, autres, un, l'en, en
- SCONJ: quant, que
|
|
|
- Int
- ADV: cum, comant, purquei, con, coment, que, Cument, porqoi, ou, conment
- DET: quel, qel, quels, quele, Qanz, itels, quex
- PRON: que, qui, coi, ou, qu', quoi, quei, ki, liquels, q'
|
- Prs
- ADP: a
- PRON: il, en, li, vous, je, l', le, se, ele, i
|
- Prs
- ADV: nen, s', la, ne, sil, nel
- DET: les, l', le, li, me, la, lor
- PRON: il, li, vos, s', le, l', je, se, ele, lui
- SCONJ: s', se
|
|
|
- Prs,Rel
- ADP: ou
- ADV: Don
- CCONJ: que, c', Ou, U, qu'
- DET: laquele
- PRON: qui, ki, que, qu', ou, cui, dunt, don, u, dont
- SCONJ: que, qu', q', c', k'
|
- Rel
- PRON: qui, que, quoi, ou, donc, lequel, quele, dont, quel
|
- Rel
- ADV: Dun, dont, que, u
- CCONJ: ou
- DET: quel, quele, quelque, quiex, qel, quels, qual, quex, laquele, ques
- PRON: qui, que, qu', ou, donc, quoi, dont, dom, ki, cui
- SCONJ: que, qu', queque, quanque, Quequ', ke
|
|
|
|
- Card
- NUM: ij., iij., vj., deus, iiij., x., c., xxj., xx., ix.
|
- Card
- ADJ: premereins, dui, .iii., .vii., ambesdous, anbedui, premer, premerein, troi
- DET: .I., .XXIIII., .l., ambdui
- NUM: deus, .ii., trois, quatre, dous, cent, dis, dui, set, .iiii.
- PRON: milie, trois, dui, andui, .ii., deus, troi, un, uns, quatre
|
- Ord
- ADJ: premier, premiere, premieres, première, segont, tiers
|
- Ord
- ADJ: premiers, premiere, premier, quarte, tierche, premeraine, premeraines, primiers, tier, tierz
- DET: tierz, premiere
- NUM: tierce
- PRON: tierz, quarte, terce, disme, quarz, sedme, noefme, premere, quart, quinte
|
|
|
|
- Yes
- ADJ: soen
- DET: son, sa, lor, ses, vostre, ma, mon, leur, nostre, mes
- PRON: soen, soens, vos, vostre
|
- Yes
- ADJ: mien, vostre, suen, sue, men, nostre, meie, moie, soe, miens
- DET: sa, son, ses, sun, vostre, nostre, lor, ma, mon, mes
- PRON: suen, mien, suens, siens, noz, sien, vostre, leur, lur, soe
|
|
|
|
|
|
|
- 1
- AUX-Fin: sui, ai, avon, doi, aroie, avion, eüsse, aie, doie, fui
- VERB-Fin: di, veul, voulon, dison, oï, pris, sieus, vi, vueil, ai
|
|
- 2
- AUX-Fin: avez, devez, pouez, fussiez, estiez, eussiez, eüssiez, peüssiez, poez, porriez
- VERB-Fin: feïstes, meïstes, aportastes, connoissiez, criez, devez, dites, estes, eüssiez, faciez
|
|
- 3
- AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, fet, prist
|
|
|
|
|
|
|
|
|
|
|
Other Features
|
Other Features
|
- ExtPos
- ADP
- ADP: juques, jusques
- ADV: hors, Non, ains, quant
- ADV
- PRON
- SCONJ
- ADP: por, devant, par, puis, sans, depuis, fors, avant, des
- ADV: si, tant, anciés, puis, Anceis, Ausi, ainceis, ains, ainçois, combien
- PRON: tout
|
|
|
|
- Foreign
- Yes
- ADP: in, en
- ADV: illo
- NOUN: corpus, domini, damno, verbe
- X: Explycit
|
|
|
- Morph
- VFin
- ADJ: asuage
- ADP: a, ad
- ADV: oi
- CCONJ: Et
- INTJ: Os
- NOUN: acorde, aiüe, chastie, curt, dreit, duinst, esrages, estencele, façon, freint
- PROPN: cuntredie
- VInf
- ADJ: droiturier, ácustumiers
- VPar
- ADP: voiant, oiant
- ADV: errant
- PROPN: Flurit, Perdut, Sevree
|
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: être.
|
Syntax
Auxiliary Verbs and Copula
- This corpus uses 3 lemmas as copulas (cop). Examples: estre, _, restre.
|
- This corpus uses 5 lemmas as auxiliaries (aux). Examples: avoir, devoir, pouvoir, être, souloir.
- This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: être.
|
- This corpus uses 8 lemmas as auxiliaries (aux). Examples: _, avoir, estre, pouvoir, devoir, pöoir, souloir, savoir.
- This corpus uses 2 lemmas as passive auxiliaries (aux:pass). Examples: estre, _.
|
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Fin--NOUN (366)
- VERB-Fin--NOUN-ADP(à) (1)
- VERB-Fin--PRON (514)
- VERB-Inf--NOUN (42)
- VERB-Inf--PRON (122)
- VERB-Part--NOUN (168)
- VERB-Part--PRON (223)
|
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (47)
- VERB--PRON (71)
- VERB-Fin--NOUN (3484)
- VERB-Fin--NOUN-ADP(_) (1)
- VERB-Fin--NOUN-ADP(de) (1)
- VERB-Fin--NOUN-ADP(fors) (1)
- VERB-Fin--PRON (8740)
- VERB-Fin--PRON-ADP(_) (2)
- VERB-Fin--PRON-ADP(de) (1)
- VERB-Fin--PRON-ADP(pour) (1)
- VERB-Inf--NOUN (139)
- VERB-Inf--PRON (653)
- VERB-Part--NOUN (681)
- VERB-Part--NOUN-ADP(_) (1)
- VERB-Part--PRON (1515)
- VERB-Part--PRON-ADP(de) (1)
|
- obj
- VERB-Fin--NOUN (329)
- VERB-Fin--NOUN-ADP(de) (2)
- VERB-Fin--NOUN-ADP(depuis) (1)
- VERB-Fin--NOUN-ADP(à) (1)
- VERB-Fin--PRON (151)
- VERB-Inf--NOUN (144)
- VERB-Inf--PRON (49)
- VERB-Part--NOUN (74)
- VERB-Part--NOUN-ADP(à) (1)
- VERB-Part--PRON (49)
|
- obj
- VERB--NOUN (75)
- VERB--PRON (62)
- VERB-Fin--NOUN (5349)
- VERB-Fin--NOUN-ADP(_) (69)
- VERB-Fin--NOUN-ADP(dalez) (1)
- VERB-Fin--NOUN-ADP(de) (46)
- VERB-Fin--NOUN-ADP(en) (1)
- VERB-Fin--NOUN-ADP(en1) (1)
- VERB-Fin--NOUN-ADP(par) (1)
- VERB-Fin--PRON (5086)
- VERB-Fin--PRON-ADP(_) (9)
- VERB-Fin--PRON-ADP(de) (5)
- VERB-Fin--PRON-ADP(por) (1)
- VERB-Inf--NOUN (1042)
- VERB-Inf--NOUN-ADP(_) (16)
- VERB-Inf--NOUN-ADP(de) (8)
- VERB-Inf--NOUN-ADP(en) (1)
- VERB-Inf--NOUN-ADP(en1) (1)
- VERB-Inf--NOUN-ADP(pour) (1)
- VERB-Inf--NOUN-ADP(à) (1)
- VERB-Inf--PRON (947)
- VERB-Inf--PRON-ADP(_) (5)
- VERB-Inf--PRON-ADP(de) (1)
- VERB-Inf--PRON-ADP(por) (1)
- VERB-Part--NOUN (774)
- VERB-Part--NOUN-ADP(_) (17)
- VERB-Part--NOUN-ADP(de) (9)
- VERB-Part--NOUN-ADP(fors) (1)
- VERB-Part--PRON (965)
- VERB-Part--PRON-ADP(_) (3)
- VERB-Part--PRON-ADP(de) (1)
- VERB-Part--PRON-ADP(por) (1)
|
- iobj
- VERB-Fin--PRON (53)
- VERB-Inf--PRON (15)
- VERB-Inf--PRON-ADP(à) (1)
- VERB-Part--PRON (36)
|
- iobj
- VERB--PRON (19)
- VERB--PRON-ADP(_) (1)
- VERB-Fin--NOUN (1)
- VERB-Fin--NOUN-ADP(de) (1)
- VERB-Fin--NOUN-ADP(en1) (1)
- VERB-Fin--PRON (2514)
- VERB-Fin--PRON-ADP(_) (11)
- VERB-Fin--PRON-ADP(dalez) (1)
- VERB-Fin--PRON-ADP(de) (3)
- VERB-Fin--PRON-ADP(devant) (2)
- VERB-Fin--PRON-ADP(encontre2) (1)
- VERB-Fin--PRON-ADP(entre) (1)
- VERB-Fin--PRON-ADP(vers) (1)
- VERB-Fin--PRON-ADP(à) (2)
- VERB-Inf--PRON (228)
- VERB-Inf--PRON-ADP(_) (2)
- VERB-Inf--PRON-ADP(à) (1)
- VERB-Part--PRON (443)
- VERB-Part--PRON-ADP(_) (1)
- VERB-Part--PRON-ADP(de) (1)
- VERB-Part--PRON-ADP(par) (1)
|
|
|
|
|
|
Reflexive Passive
- This corpus contains 4 lemmas that occur at least once with an expl:pass child. Examples: _ il, démontrer il, parler il, réprouver1 il
|
|
|
|
Relations Overview
|
Relations Overview
- This corpus uses 19 relation subtypes: acl:relcl, advmod:obl, aux:pass, case:det, cc:nc, csubj:pass, expl:pass, flat:name, mark:advmod, nsubj:advmod, nsubj:obj, nsubj:outer, nsubj:pass, obj:advmod, obj:advneg, obj:obl, obl:agent, obl:arg, obl:mod
- The following 4 relation types are not used in this corpus at all: clf, list, goeswith, reparandum
|