UD_Old_French-ALTM
|
UD_Old_French-PROFITEROLE
|
Tokenization and Word Segmentation
|
Tokenization and Word Segmentation
|
- This corpus contains 553 sentences, 15076 tokens and 15285 syntactic words.
|
- This corpus contains 20359 sentences and 237822 tokens.
|
- This corpus contains 1278 tokens (8%) that are not followed by a space.
|
- This corpus contains 33512 tokens (14%) that are not followed by a space.
|
- This corpus does not contain words with spaces.
|
- This corpus does not contain words with spaces.
|
- This corpus contains 62 types of words that contain both letters and punctuation. Examples: l', n', d', s', ij., m', iij., vj., c', entr', [que], iiij., x., c., xxj., aag[i]é, xx., [se], espousa[i]lles, gag[i]er, ix., vi., xij., xl., [brebis], [conneü, [des]truis, [et], [in]conneües, [jour], [l'], [n', [on], [que, [savoir], [son, [sont], [terminer], [un], [x]viij., aid[i]er, as[s]eoir, comme], contre[s]tant, el[e], en], est], i[l], j', jug[ement]
|
- This corpus contains 157 types of words that contain both letters and punctuation. Examples: l', qu', s', n', d', m', .i., t', c', j', jusqu', .ii., l'en, entr', .iiii., .iii., g', q', .xx., .xii., .c., .vii., ch', ensembl', un', ·l, quanqu', .v., .xxx., c., tresqu', .x., k', .c.m., entresqu', .xv., .l., .vi., .xxiiii., .ix., josqu', .viii., an.ii., cest', ·s, .XL., .iiij.m., .lx., .xxxvi.m., jesqu'
|
- This corpus contains 209 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
- There are 9 types of multi-word tokens. Examples: du, des, au, as, u, es, eu, el, auquel.
|
|
Morphology
Tags
- This corpus uses 13 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PRON, PROPN, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: PART, INTJ, SYM, X
|
Morphology
Tags
- This corpus uses 15 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: PART, SYM
|
|
|
|
- This corpus contains 32 lemmas tagged as pronouns (PRON): aucun, autre, autrui, ce, chacun, cil, cist, dont, en, il, je, lequel, nous, nul, néant, on, où, que, quel, qui, quiconque, quoi, rien, se, sien, soi, tel, tout, un, vous, vôtre, y
|
- This corpus contains 1 lemmas tagged as pronouns (PRON): _
|
- This corpus contains 21 lemmas tagged as determiners (DET): aucun, autre, ce, chacun, cil, cist, de, l', la, le, leur, ma, mon, notre, nul, plusieurs, quel, son, tout, un, votre
|
- This corpus contains 1 lemmas tagged as determiners (DET): _
|
- Out of the above, 10 lemmas occurred sometimes as PRON and sometimes as DET: aucun, autre, ce, chacun, cil, cist, nul, quel, tout, un
|
- Out of the above, 1 lemmas occurred sometimes as PRON and sometimes as DET: _
|
- This corpus contains 5 lemmas tagged as auxiliaries (AUX): avoir, devoir, pouvoir, souloir, être
|
- This corpus contains 1 lemmas tagged as auxiliaries (AUX): _
|
- Out of the above, 4 lemmas occurred sometimes as AUX and sometimes as VERB: avoir, devoir, pouvoir, être
|
- Out of the above, 1 lemmas occurred sometimes as AUX and sometimes as VERB: _
|
|
|
- This corpus does not use the VerbForm feature.
|
- Fin
- AUX: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- VERB: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
|
|
- Inf
- AUX: estre, avoir
- VERB: savoir, avoir, fere, respondre, aler, metre, prendre, connoistre, demander, semondre
|
|
- Part
- AUX: esté
- VERB: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
|
|
Nominal Features
|
Nominal Features
|
|
|
|
|
|
|
|
|
|
- Plur
- AUX-Fin: avoient, estoient, avez, sont, devez, furent, doivent, fussent, avon, devoient
- NOUN: nans, resons, parties, enfans, hommes, deffautes, tesmoins, deniers, livres, ans
- VERB-Fin: sont, mistrent, distrent, disoient, voulon, aroient, avoient, dison, firent, vindrent
|
|
- Sing
- AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- NOUN: homme, droit, fame, veüe, brief, terre, jugement, marchié, court, heritage
- PROPN: Normendie, Diex, Robert, Roen, Bosc, Dieu, France, Sainne
- VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, di, fet
- VERB-Part: fete
|
|
|
|
|
|
|
|
|
|
|
- Def
- ADP: de
- DET: le, l', la, les, li
|
|
- Ind
- DET: un, une, tous, tout, de, toute, aucune, autre, nul, toutes
|
|
Degree and Polarity
|
Degree and Polarity
|
|
|
|
|
|
|
- Neg
- ADV: ne, n', pas, point, mie, [n'
|
|
|
|
|
Verbal Features
|
Verbal Features
|
|
|
|
|
|
|
|
|
|
- Past
- AUX-Part: esté
- VERB-Fin: departi
- VERB-Part: jugié, fete, tenu, ataint, fet, pris, rendu, mis, justicié, tret
|
|
- Pres
- VERB-Part: pendant, fesant, metant, vouchant, abatant, contre[s]tant, contrebatant, contrestant, demourant, disant
|
|
|
|
|
|
|
|
Pronouns, Determiners, Quantifiers
|
Pronouns, Determiners, Quantifiers
|
|
|
|
- Art
- DET: le, l', la, les, un, une, li, uns, [un], des
|
|
- Dem
- DET: cest, cele, cel, ce, ces, ceste, ses, celi, cen, cil
- PRON: ce, cil, celi, c', cels, celui, cen, ceus, cele, celes
|
|
- Ind
- PRON: autre, riens, els, aucun, nul, un, tout, aucuns, nus, autrui
|
|
|
|
|
- Prs
- ADP: a
- PRON: il, en, li, vous, je, l', le, se, ele, i
|
|
- Rel
- PRON: qui, que, quoi, ou, donc, lequel, quele, dont, quel
|
|
|
|
|
- Card
- NUM: ij., iij., vj., deus, iiij., x., c., xxj., xx., ix.
|
|
- Ord
- ADJ: premier, premiere, premieres, première, segont, tiers
|
|
|
|
|
- Yes
- ADJ: soen
- DET: son, sa, lor, ses, vostre, ma, mon, leur, nostre, mes
- PRON: soen, soens, vos, vostre
|
|
|
|
|
|
|
|
- 1
- AUX-Fin: sui, ai, avon, doi, aroie, avion, eüsse, aie, doie, fui
- VERB-Fin: di, veul, voulon, dison, oï, pris, sieus, vi, vueil, ai
|
|
- 2
- AUX-Fin: avez, devez, pouez, fussiez, estiez, eussiez, eüssiez, peüssiez, poez, porriez
- VERB-Fin: feïstes, meïstes, aportastes, connoissiez, criez, devez, dites, estes, eüssiez, faciez
|
|
- 3
- AUX-Fin: est, fu, doit, avoit, estoit, puet, devoit, eüst, a, soit
- VERB-Fin: dist, disoit, vouloit, a, demandoit, estoit, vout, est, fet, prist
|
|
|
|
|
|
|
|
|
|
|
Other Features
|
Other Features
|
- ExtPos
- ADP
- ADP: juques, jusques
- ADV: hors, Non, ains, quant
- ADV
- PRON
- SCONJ
- ADP: por, devant, par, puis, sans, depuis, fors, avant, des
- ADV: si, tant, anciés, puis, Anceis, Ausi, ainceis, ains, ainçois, combien
- PRON: tout
|
|
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: être.
|
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: _.
|
- This corpus uses 5 lemmas as auxiliaries (aux). Examples: avoir, devoir, pouvoir, être, souloir.
- This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: être.
|
- This corpus uses 1 lemmas as auxiliaries (aux). Examples: _.
- This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: _.
|
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Fin--NOUN (366)
- VERB-Fin--NOUN-ADP(à) (1)
- VERB-Fin--PRON (514)
- VERB-Inf--NOUN (42)
- VERB-Inf--PRON (122)
- VERB-Part--NOUN (168)
- VERB-Part--PRON (223)
|
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN (4683)
- VERB--NOUN-ADP(_) (4)
- VERB--PRON (11348)
- VERB--PRON-ADP(_) (5)
|
- obj
- VERB-Fin--NOUN (329)
- VERB-Fin--NOUN-ADP(de) (2)
- VERB-Fin--NOUN-ADP(depuis) (1)
- VERB-Fin--NOUN-ADP(à) (1)
- VERB-Fin--PRON (151)
- VERB-Inf--NOUN (144)
- VERB-Inf--PRON (49)
- VERB-Part--NOUN (74)
- VERB-Part--NOUN-ADP(à) (1)
- VERB-Part--PRON (49)
|
- obj
- VERB--NOUN (7299)
- VERB--NOUN-ADP(_) (130)
- VERB--PRON (7090)
- VERB--PRON-ADP(_) (14)
|
- iobj
- VERB-Fin--PRON (53)
- VERB-Inf--PRON (15)
- VERB-Inf--PRON-ADP(à) (1)
- VERB-Part--PRON (36)
|
- iobj
- VERB--NOUN (3)
- VERB--NOUN-ADP(_) (4)
- VERB--PRON (3223)
- VERB--PRON-ADP(_) (285)
- VERB--PRON-ADP(_)-ADP(_) (8)
|
|
|
|
|
|
|
|
|
|
Relations Overview
|
Relations Overview
- This corpus uses 13 relation subtypes: acl:relcl, advmod:obl, aux:pass, case:det, cc:nc, mark:advmod, nsubj:advmod, nsubj:obj, nsubj:outer, obj:advmod, obj:advneg, obj:obl, obl:mod
- The following 4 relation types are not used in this corpus at all: clf, list, goeswith, reparandum
|