home edit page issue tracker

This page pertains to UD version 2.

UD_Old_French-ALTM

UD_Old_French-PROFITEROLE

Tokenization and Word Segmentation

Tokenization and Word Segmentation

  • This corpus contains 553 sentences, 15076 tokens and 15285 syntactic words.
  • This corpus contains 20359 sentences and 237822 tokens.
  • This corpus contains 1278 tokens (8%) that are not followed by a space.
  • This corpus contains 33512 tokens (14%) that are not followed by a space.
  • This corpus does not contain words with spaces.
  • This corpus does not contain words with spaces.
  • This corpus contains 62 types of words that contain both letters and punctuation. Examples: l', n', d', s', ij., m', iij., vj., c', entr', [que], iiij., x., c., xxj., aag[i]é, xx., [se], espousa[i]lles, gag[i]er, ix., vi., xij., xl., [brebis], [conneü, [des]truis, [et], [in]conneües, [jour], [l'], [n', [on], [que, [savoir], [son, [sont], [terminer], [un], [x]viij., aid[i]er, as[s]eoir, comme], contre[s]tant, el[e], en], est], i[l], j', jug[ement]
  • This corpus contains 157 types of words that contain both letters and punctuation. Examples: l', qu', s', n', d', m', .i., t', c', j', jusqu', .ii., l'en, entr', .iiii., .iii., g', q', .xx., .xii., .c., .vii., ch', ensembl', un', ·l, quanqu', .v., .xxx., c., tresqu', .x., k', .c.m., entresqu', .xv., .l., .vi., .xxiiii., .ix., josqu', .viii., an.ii., cest', ·s, .XL., .iiij.m., .lx., .xxxvi.m., jesqu'
  • This corpus contains 209 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
  • There are 9 types of multi-word tokens. Examples: du, des, au, as, u, es, eu, el, auquel.

Morphology

Tags

Morphology

Tags

  • This corpus contains 32 lemmas tagged as pronouns (PRON): aucun, autre, autrui, ce, chacun, cil, cist, dont, en, il, je, lequel, nous, nul, néant, on, où, que, quel, qui, quiconque, quoi, rien, se, sien, soi, tel, tout, un, vous, vôtre, y
  • This corpus contains 1 lemmas tagged as pronouns (PRON): _
  • This corpus contains 21 lemmas tagged as determiners (DET): aucun, autre, ce, chacun, cil, cist, de, l', la, le, leur, ma, mon, notre, nul, plusieurs, quel, son, tout, un, votre
  • This corpus contains 1 lemmas tagged as determiners (DET): _
  • Out of the above, 10 lemmas occurred sometimes as PRON and sometimes as DET: aucun, autre, ce, chacun, cil, cist, nul, quel, tout, un
  • Out of the above, 1 lemmas occurred sometimes as PRON and sometimes as DET: _
  • This corpus contains 5 lemmas tagged as auxiliaries (AUX): avoir, devoir, pouvoir, souloir, être
  • This corpus contains 1 lemmas tagged as auxiliaries (AUX): _
  • Out of the above, 4 lemmas occurred sometimes as AUX and sometimes as VERB: avoir, devoir, pouvoir, être
  • Out of the above, 1 lemmas occurred sometimes as AUX and sometimes as VERB: _
  • This corpus does not use the VerbForm feature.
  • Fin
    • AUX: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
    • VERB: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
  • Inf
    • AUX: estre, avoir
    • VERB: savoir, avoir, fere, respondre, aler, metre, prendre, connoistre, demander, semondre
  • Part
    • AUX: esté
    • VERB: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret

Nominal Features

Nominal Features

  • Plur
    • AUX-Fin: avoient, estoient, avez, sont, devez, furent, doivent, fussent, avon, devoient
    • NOUN: nans, resons, parties, enfans, hommes, deffautes, tesmoins, deniers, livres, ans
    • VERB-Fin: sont, mistrent, distrent, disoient, voulon, aroient, avoient, dison, firent, vindrent
  • Sing
    • AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
    • NOUN: homme, droit, fame, veüe, brief, terre, jugement, marchié, court, heritage
    • PROPN: Normendie, Diex, Robert, Roen, Bosc, Dieu, France, Sainne
    • VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
    • VERB-Part: fete
  • Def
    • ADP: de
    • DET: le, l', la, les, li
  • Ind
    • DET: un, une, tous, tout, de, toute, aucune, autre, nul, toutes

Degree and Polarity

Degree and Polarity

  • Neg
    • ADV: ne, n', pas, point, mie, [n'

Verbal Features

Verbal Features

  • Past
    • AUX-Part: esté
    • VERB-Fin: departi
    • VERB-Part: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
  • Pres
    • VERB-Part: pendant, fesant, metant, vouchant, abatant, contre[s]tant, contrebatant, contrestant, demourant, disant

Pronouns, Determiners, Quantifiers

Pronouns, Determiners, Quantifiers

  • Art
    • DET: le, l', la, les, un, une, li, uns, [un], des
  • Dem
    • DET: cest, cele, cel, ce, ces, ceste, ses, celi, cen, cil
    • PRON: ce, cil, celi, c', cels, celui, cen, ceus, cele, celes
  • Ind
    • PRON: autre, riens, els, aucun, nul, un, tout, aucuns, nus, autrui
  • Int
    • ADV: comment
    • DET: quel
  • Prs
    • ADP: a
    • PRON: il, en, li, vous, je, l', le, se, ele, i
  • Rel
    • PRON: qui, que, quoi, ou, donc, lequel, quele, dont, quel
  • Card
    • NUM: ij., iij., vj., deus, iiij., x., c., xxj., xx., ix.
  • Ord
    • ADJ: premier, premiere, premieres, première, segont, tiers
  • Yes
    • ADJ: soen
    • DET: son, sa, lor, ses, vostre, ma, mon, leur, nostre, mes
    • PRON: soen, soens, vos, vostre
  • 1
    • AUX-Fin: sui, ai, avon, doi, aroie, avion, eüsse, aie, doie, fui
    • VERB-Fin: di, veul, voulon, dison, oï, pris, sieus, vi, vueil, ai
  • 2
    • AUX-Fin: avez, devez, pouez, fussiez, estiez, eussiez, eüssiez, peüssiez, poez, porriez
    • VERB-Fin: feïstes, meïstes, aportastes, connoissiez, criez, devez, dites, estes, eüssiez, faciez
  • 3
    • AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
    • VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, fet, prist

Other Features

Other Features

  • ExtPos
    • ADP
      • ADP: juques, jusques
      • ADV: hors, Non, ains, quant
    • ADV
      • ADP: de
      • PRON: c'
    • PRON
      • DET: la
    • SCONJ
      • ADP: por, devant, par, puis, sans, depuis, fors, avant, des
      • ADV: si, tant, anciés, puis, Anceis, Ausi, ainceis, ains, ainçois, combien
      • PRON: tout

Syntax

Auxiliary Verbs and Copula

  • This corpus uses 1 lemmas as copulas (cop). Examples: être.

Syntax

Auxiliary Verbs and Copula

  • This corpus uses 1 lemmas as copulas (cop). Examples: _.
  • This corpus uses 5 lemmas as auxiliaries (aux). Examples: avoir, devoir, pouvoir, être, souloir.
  • This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: être.
  • This corpus uses 1 lemmas as auxiliaries (aux). Examples: _.
  • This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: _.

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
  • nsubj
    • VERB-Fin--NOUN (366)
    • VERB-Fin--NOUN-ADP(à) (1)
    • VERB-Fin--PRON (514)
    • VERB-Inf--NOUN (42)
    • VERB-Inf--PRON (122)
    • VERB-Part--NOUN (168)
    • VERB-Part--PRON (223)

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
  • nsubj
    • VERB--NOUN (4683)
    • VERB--NOUN-ADP(_) (4)
    • VERB--PRON (11348)
    • VERB--PRON-ADP(_) (5)
  • obj
    • VERB-Fin--NOUN (329)
    • VERB-Fin--NOUN-ADP(de) (2)
    • VERB-Fin--NOUN-ADP(depuis) (1)
    • VERB-Fin--NOUN-ADP(à) (1)
    • VERB-Fin--PRON (151)
    • VERB-Inf--NOUN (144)
    • VERB-Inf--PRON (49)
    • VERB-Part--NOUN (74)
    • VERB-Part--NOUN-ADP(à) (1)
    • VERB-Part--PRON (49)
  • obj
    • VERB--NOUN (7299)
    • VERB--NOUN-ADP(_) (130)
    • VERB--PRON (7090)
    • VERB--PRON-ADP(_) (14)
  • iobj
    • VERB-Fin--PRON (53)
    • VERB-Inf--PRON (15)
    • VERB-Inf--PRON-ADP(à) (1)
    • VERB-Part--PRON (36)
  • iobj
    • VERB--NOUN (3)
    • VERB--NOUN-ADP(_) (4)
    • VERB--PRON (3223)
    • VERB--PRON-ADP(_) (285)
    • VERB--PRON-ADP(_)-ADP(_) (8)

Relations Overview

Relations Overview