UD Low Saxon LSDC
Language: Low Saxon (code: nds
)
Family: Indo-European, Germanic
This treebank has been part of Universal Dependencies since the UD v2.8 release.
The following people have contributed to making this treebank part of UD: Janine Siewert.
Repository: UD_Low_Saxon-LSDC
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13
License: CC BY-SA 4.0
Genre: fiction, nonfiction
Questions, comments? General annotation questions (either Low Saxon-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [janine • siewert (æt) helsinki • fi]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually, natively in UD style |
XPOS | annotated manually |
Features | annotated manually, natively in UD style |
Relations | annotated manually, natively in UD style |
Description
The UD Low Saxon LSDC dataset consists of sentences in 8 major Low Saxon dialect groups from both Germany and the Netherlands. These sentences are (or are to become) part of the LSDC dataset and represent the language from mostly the 19th and early 20th century in genres such as short stories, novels, speeches, letters and fairytales.
The first version of the UD Low Saxon LSDC dataset contained 18 Low Saxon (sub-)dialects from both Germany and the Netherlands represented by 2 sentences each and belonging to the domains of short stories, novels, speeches, letters and fairytales. Each sentence was chosen from a different text to present some of the variation within the different dialect groups. In the second version, 40 sentences from four Westphalian dialects, two from Germany and two from the Netherlands, were added. The coverage of other dialect groups will be improved in future releases. For the third version, we have raised the number of sentences to 190 and made slight modifications to the subgrouping of the dialects. The major dialect group is shown as the third segment of the sentence ID. The following dialects are included:
- BRA = Brandenburgish
- DNS = German North Saxon
- DWF = German Westphalian
- MVP = Mecklenburgish – West Pomeranian
- NNS = Dutch North Saxon
- NPR = Low Prussian
- NWF = Dutch Westphalian
- OFL = Eastphalian
Since there is no official interregional spelling, the interregional spelling suggestion used by e.g. the Dutch Low Saxon Wikipedia (Nysassiske Skryvwyse, described in more detail here: https://skryvwyse.eu/ (only in Low Saxon)) is used as a compromise for normalisation, but the original spelling of the source is included in the line “text_orig =” and a Middle Low Saxon lemma is added in the tenth column (“lemma_gml=xxx”) in order to make the Modern Low Saxon data more easily comparable with the Middle Low Saxon data in the reference corpus “Referenzkorpus Mittelniederdeutsch/Niederrheinisch”. For this reason, the Middle Low Saxon lemma forms largely follow the “Mittelniederdeutsches Handwörterbuch” by Agathe Lasch et al. like in the reference corpus. Middle Low Saxon lemmata are only added in the cases where there is an attestation in Middle Low Saxon, i.e. the word is either listed in the Handwörterbuch or is found in the reference corpus. Middle Low Saxon lemmata are still included if the word’s meaning has changed, an in addition, we have done our best to create new complex word lemmata from known simplex words and reconstruct potential Middle Low Saxon forms for words which have not yet been attested at that stage of the language.
The first version of the dataset contained only sentences from copyright-free material from the 19th and early 20th century. Part of the sentences are already included in the first release of the LSDC dataset found here: https://github.com/Helsinki-NLP/LSDC/ See there for further information on the origin of the data. The other sentences originate mostly from Joh. A. Leopold’s work ‘Van de Schelde tot te Weichsel’, a digitised version of which is accessible here: https://www.dbnl.org/titels/titel.php?id=leop008sche00 An exception constitutes the text ‘Krisjaon Klaover’ to be found in the Twentse Taalbank: http://www.twentsetaalbank.nl/docs/TWA.1894-Heinink-Krisjaon_Klaover-150.pdf These other sentences will be added to the next release of the LSDC dataset. The third version of the dataset also includes a few sentences from works by modern authors from which we have received permission to include small parts of their work in annotated corpora.
Due to the small size of the dataset, it has not yet been split into training, development and test sets.
Acknowledgments
The following people were involved in the creation of this dataset:
- Janine Siewert (data collection, selection and annotation)
- Jack Michael Rueter (annotation-related advice)
References
If you use this treebank, please cite this paper:
@inproceedings{siewert-etal-2021-towards,
title = "Towards a balanced annotated Low {S}axon dataset for diachronic investigation of dialectal variation",
author = {Siewert, Janine and
Scherrer, Yves and
Tiedemann, J{\"o}rg},
booktitle = "Proceedings of the 17th Conference on Natural Language Processing (KONVENS 2021)",
month = "6--9 " # sep,
year = "2021",
address = {D{\"u}sseldorf, Germany},
publisher = "KONVENS 2021 Organizers",
url = "https://aclanthology.org/2021.konvens-1.25",
pages = "242--246",
}
References used for the creation of this dataset:
- Lasch, Agathe et al. 1928 ff. Mittelniederdeutsches Handwörterbuch. Neumünster: Wachholtz.
- ReN-Team. 2019. Referenzkorpus Mittelniederdeutsch/Niederrheinisch (1200-1650). Archived in Hamburger Zentrum für Sprachkorpora. Version 1.0. Publication date 2019-08-14. http://hdl.handle.net/11022/0000-0007-D829-8.
Statistics of UD Low Saxon LSDC
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB – X
Features
AdpType – Aspect – Case – Definite – Degree – Foreign – Gender – Gender[psor] – Mood – Number – Number[psor] – NumType – PartType – Person – Person[psor] – Polite – Poss – PronType – Reflex – Tense – VerbForm – VerbType
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – aux:pass – case – cc – ccomp – compound – compound:prt – conj – cop – csubj – det – det:poss – discourse – dislocated – expl – expl:pv – fixed – flat – iobj – mark – nmod – nmod:poss – nsubj – nsubj:pass – nummod – obj – obl – obl:agent – orphan – parataxis – punct – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 189 sentences, 4659 tokens and 4683 syntactic words.
- This corpus contains 683 tokens (15%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 11 types of words that contain both letters and punctuation. Examples: 'e, 'n, 't, E., G., Luoden-heide, Röm., St., ao., gir-af-geskigde, vyv-
- This corpus contains 15 multi-word tokens. On average, one multi-word token consists of 2.60 syntactic words.
- There are 10 types of multi-word tokens. Examples: to'm, im, to'n, Kumste, am, bym, in'en, in't, ten, van'er.
Morphology
Tags
- This corpus uses 16 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB, X
- This corpus does not use the following tags: SYM
- This corpus contains 6 word types tagged as particles (PART): en, neet, nich, nit, te, to
- This corpus contains 35 lemmas tagged as pronouns (PRON): Hiärmen, al, alle, alles, ander, dat, de, dee, dit, du, dår, dê, elk, enander, et, eyn, eyner, eynig, hee, ichts, ik, jeyde, jy, keynen, man, mekare, my, niks, see, sik, wat, wee, wekker, wer, wy
- This corpus contains 20 lemmas tagged as determiners (DET): al, alle, allerley, de, dee, den, disse, dyn, ear, en, et, geyn, höär, juw, keyn, myn, neyn, syn, un, uns
- Out of the above, 5 lemmas occurred sometimes as PRON and sometimes as DET: al, alle, de, dee, et
- This corpus contains 9 lemmas tagged as auxiliaries (AUX): doon, hebben, künnen, möten, möägen, sköälen, weasen, werden, willen
- Out of the above, 8 lemmas occurred sometimes as AUX and sometimes as VERB: doon, hebben, künnen, möten, möägen, weasen, werden, willen
- There are 3 (de)verbal forms:
- Fin
- AUX: is, was, het, sint, weer, wil, hadde, kan, skölde, willet
- VERB: leyt, hadde, hebben, hevt, höyrt, segt, sit, smit, stund, Haal
- Inf
- AUX: syn, weasen, hebben, künnen, warden, werden
- VERB: seen, seggen, doon, holden, koupen, geaven, vrågen, drägen, hangen, hebben
- Part
- ADJ: vorlåten, vorvreaten
- AUX: weasd, west, worden
- VERB: giaven, gån, worden, ebracht, maked, Beskreaven, afemaked, afklopped, ankündigd, anskriaden
Nominal Features
- Fem
- ADJ: ander, junge, akademiske, andere, düslike, eyrste, gladde, goden, grout, hougen
- DET: de, der, en, syne, myn, ne, dear, den, syn, alle
- NOUN: vrouwe, stad, sake, tyd, syde, hand, tåfel, vroide, arbeid, dumheid
- NUM: eyne
- PRON: dee, see, ear, höär
- PROPN: Hente, Havel, Luoden-heide, Marigge, Nicolaikarke, St., Trina
- Fem,Masc
- ADJ: olden
- DET: de, dee, des, en, gyn
- NOUN: gek, nachts, noud, tyden
- PRON: dee, eyne
- Fem,Masc,Neut
- DET: syn
- Masc
- ADJ: anderen, armen, leven, lutherske, 31., Weynige, anseenliksten, belangryke, beste, besten
- DET: de, den, en, dem, m, eynen, dee, syn, synen, mynen
- NOUN: heyre, man, buur, god, küänig, dage, junge, kearl, apostel, böyme
- PRON: hee, dee, em, den, man, en, hum, eyne, he, wel
- PROPN: Hiärmen, Andrees, Bennad, Claus, Friedrich, Harms, Hein, Henrick, Krisjaon, Lulef
- Masc,Neut
- NOUN: bast, minske, minsken, noorden, vlas, westen
- Neut
- ADJ: eyrste, leve, vöärneame, aardig, ander, anders, beiden, beste, besten, drüdde
- ADJ-Part: vorvreaten
- DET: en, dat, de, et, syn, myn, juwe, n, 'n, 't
- NOUN: lüde, mål, broud, kinder, lüdens, pår, woord, wöörde, bittyn, ding
- NUM: eyn, eynen
- PRON: et, dat, wat, niks, al, det, alles, allens, dit, dát
- Plur
- ADJ: olden, anderen, goden, Leve, Weynige, anseenliksten, beiden, belangryke, düslike, düütske
- AUX: sint, weren, willet, hadden, konnen, köänet, sküllet, hevvet, könden, können
- AUX-Fin: sint, willet, hadden, konnen, sullen, warden, weren, wullen
- DET: de, syne, den, dyne, eare, en, juwe, Dat, alle, allen
- NOUN: lüde, kinder, wöörde, Slaumayers, böyme, dage, handsken, minsken, pearde, smartsen
- PRON: see, dee, wy, alle, juw, uus, hee, jy, nen, se
- VERB: saeten, hebben, Höyrt, anhebbet, atten, brennen, deaden, denket, drådnägelen, döppen
- VERB-Fin: hebben, drådnägelen, hove, koamet, kwaemen, loupen, maket, sadelet, saeten, seaden
- Plur,Sing
- PRON: y, jy, uw
- VERB: gå, hebbet, weat
- Sing
- ADJ: ander, eyrste, leve, anderen, armen, beste, besten, grout, halve, junge
- ADJ-Part: vorvreaten
- AUX: is, was, het, hadde, wil, kon, wol, hevt, kan, wardt
- AUX-Fin: is, was, het, weer, wil, hadde, kan, skölde, hevt, kun
- DET: de, en, den, dat, der, et, syn, myn, syne, dem
- NOUN: heyre, vrouwe, stad, man, sake, tyd, buur, god, küänig, syde
- NUM: eyne, eynen
- PRON: ik, et, hee, dee, dat, my, wat, sik, man, em
- PROPN: Marigge, Daniel, Jesus, Josua, Jüsken, Kiel, Marleen, Wiesken, Adolf, Anton
- VERB: hadde, leyt, sea, segge, wus, het, höyrt, kaem, kam, keyk
- VERB-Fin: leyt, hadde, hevt, höyrt, segt, sit, smit, stund, Haal, Kumst
- Acc
- ADJ: aardig, andere, anderen, beiden, drüdden, düchtige, düütske, enkele, eygene, eyrste
- ADJ-Part: vorvreaten
- DET: de, en, den, dat, syn, et, eynen, syne, ne, unsen
- NOUN: woord, bittyn, broud, dumheid, gek, handsken, korv, last, pearde, pår
- PRON: et, dat, wat, niks, en, see, dee, den, hum, nist
- PROPN: Garrelt, weag
- Acc,Dat
- ADJ: ander, eyrste, leve, vöärneame, akademiske, anseenliksten, armen, belangryke, gemeynen, goden
- DET: de, den, en, dat, myn, et, syne, synen, alle, dyn
- NOUN: man, stad, buur, syde, vroidenvest, Strauß, Vogel, achterhöörn, aerde, auwe
- NUM: hunderd
- PRON: sik, my, em, dy, myn, juw, uus, al, enander, ow
- PROPN: Havel, Luoden-heide, St., Trina, Zütphen
- Dat
- ADJ: besten, goden, olden, 31., anderen, eyrste, eyrsten, glönnigen, grynenden, leven
- DET: der, dem, m, n, dear, den, me, mynen, 'n, En
- NOUN: ougen, ende, gemeynde, geslächt, houpe, stad, tyd, volke, baanhoave, barge
- NUM: eyne, eynen
- PRON: em, mik, my, nen, allen, deane, eame, ear
- PROPN: Marigge, Nicolaikarke
- Gen
- DET: des, deas, en
- NOUN: nachts, åvends, moders, sündags, vadders
- PROPN: Reinekens, Winkels
- Nom
- ADJ: ander, anderen, beste, junge, leve, lutherske, olden, völle, armen, böyse
- DET: de, en, dat, myn, syn, et, syne, dee, dyne, ne
- NOUN: heyre, vrouwe, God, küänig, lüde, arbeid, buur, ding, gelouve, junge
- NUM: tein, eyn
- PRON: ik, dee, et, hee, see, dat, man, wy, wat, y
- PROPN: Hiärmen, Andrees, Hein, Henrick, Jouke, Krisjaon, Lulef, Röyverbarg, oktober
- Def
- DET: de, den, dat, der, et, dem, m, en, dee, n
- Ind
- DET: en, ne, eynen, nen, den, eyne, eyner
Degree and Polarity
- Cmp
- ADJ: naeger, Later, duurliker, eernsthafter, meyr, seaker, smaller, swäkker, vröer, wyder
- ADV: leverst
- Pos
- ADJ: good, byster, veal, eigenlik, gans, grout, houg, kold, leve, möde
- ADJ-Part: vorlåten
- Sup
- ADJ: eyrste, best, besten, anseenliksten, beste, ryksten, sköynsten, wennigste, äldsten
Verbal Features
- Perf
- AUX-Part: weasd
- VERB-Part: ebracht, gån, maked, Beskreaven, afemaked, ankündigd, antwoorded, anvungen, beknütted, bewysd
- Imp
- VERB: Sü, Haal, Maak, Süg, geavet, gelöyvet, kom, låt, skenket, slutet
- VERB-Fin: Haal, Süg, geavet, skenket
- Ind
- AUX: is, was, het, hadde, sint, wil, kon, willet, hevt, kan
- AUX-Fin: is, was, het, sint, weer, wil, hadde, kan, skölde, willet
- VERB: hadde, leyt, sea, segge, höyrt, saeten, wus, hebben, het, kam
- VERB-Fin: leyt, hadde, hebben, hevt, höyrt, segt, sit, smit, stund, Kumst
- Ind,Sub
- AUX: hadde, kon, künst, mus, mussen, sol, wol, wul, wöör
- VERB: sat, wol, wüs
- Sub
- AUX: möchte, können, künne, möcht, weer, wörde
- AUX-Fin: künne, möcht
- VERB: Bestünde, hädde, kaem, kaeme, leyt, make, miat, setten, tröyste
- VERB-Fin: hädde, miat
- Past
- ADJ-Part: vorlåten, vorvreaten
- AUX: was, hadde, kon, wol, weer, weren, wöör, had, hadden, konnen
- AUX-Fin: was, weer, hadde, skölde, hadden, konnen, kun, mochte, möcht, skul
- AUX-Part: west, worden
- VERB: hadde, leyt, giaven, saeten, sat, sea, worden, wus, kaem, kam
- VERB-Fin: leyt, hadde, stund, bedüdde, gröäl, hädde, höyld, höyrde, kaem, kam
- VERB-Part: giaven, worden, afklopped, anskriaden, ansmeard, antoagen, betaald, bliaven, dån, döärchmaked
- Pres
- AUX: is, het, sint, wil, willet, hevt, kan, hebbe, hev, bis
- AUX-Fin: is, het, sint, wil, kan, willet, hevt, künne, mut, müchte
- VERB: segge, höyrt, hebben, het, kumt, let, segt, smit, blivt, do
- VERB-Fin: hebben, hevt, höyrt, segt, sit, smit, Kumst, blivt, blyvet, drådnägelen
Pronouns, Determiners, Quantifiers
- Art
- DET: de, en, den, dat, der, et, dem, m, ne, dee
- PRON: et
- Dem
- DET: dease, dee, disse, dissen, düäse
- PRON: dee, dat, deane, det, dit, dát
- Ind
- PRON: man, eyne, anderen, eyn, wat, eynder, eynen, eyner, eynige, yts
- Ind,Int
- PRON: wat
- Ind,Neg,Tot
- DET: gin
- Int
- PRON: wat, wel, Wee, Wekke, Wer, hwat
- Int,Rel
- PRON: wat
- Neg
- DET: keyne, gin, gyn, keyn, kyn, kyne, kynen, ninne
- PRON: niks, nist, keynen
- Prs
- DET: syn, myn, syne, mynen, synen, juwe, unsen, dyne, ear, eare
- PRON: ik, et, hee, see, my, sik, dat, em, wy, y
- Rcp
- PRON: enander, mekare, sik
- Rel
- PRON: dee, den, wat, dat, hwekken
- Tot
- DET: alle, allen
- PRON: alle, al, alles, allen, allens, elk, jeydet
- Card
- NUM: eyn, dree, acht, twey, vyv
- Ord
- ADJ: eyrste, eyrsten, tweyde
- Yes
- DET: syn, syne, myn, mynen, synen, juwe, unsen, dyne, ear, eare
- Yes
- PRON: sik, sy
- 1
- AUX: wil, hev, was, had, hadde, hebbe, sin, skölde, sküllet, wul
- AUX-Fin: skölde, wil, müchte, sin, sint, sul, wul
- PRON: ik, my, wy, myn, uus, mik, uns, et, hee, ikke
- VERB: segge, hadde, dacht, hevve, las, vråge, bedanke, do, dors, dröyp
- VERB-Fin: hadde, hebben, kryge
- 2
- AUX: bis, büst, hes, künnen, künst, skalst, willet
- PRON: y, du, dy, jy, juw, See, ow, uw, dik, e
- VERB: Sü, Haal, Kumst, Süg, bruukst, denket, do, geavet, gelöyvst, gå
- VERB-Fin: Haal, Kumst, Süg, geavet, hebben, skenket, skynst
- 3
- AUX: is, was, het, hadde, sint, kon, wol, hevt, wardt, weer
- AUX-Fin: is, was, het, weer, hadde, kan, willet, hadden, hevt, konnen
- PRON: et, hee, see, sik, dat, em, dee, en, hum, det
- VERB: hadde, leyt, höyrt, saeten, sea, wus, het, kaem, kam, keyk
- VERB-Fin: leyt, hadde, hevt, höyrt, segt, sit, smit, stund, bedüdde, blivt
- Form
- AUX: künnen, willet
- PRON: Jy, See, sik
- VERB: geavet, setten, skenket
- VERB-Fin: geavet, skenket
- Fem
- DET: Syn, höären
- Masc
- DET: syn, syne, ear, synen
- Plur
- DET: unsen, eare, Unse, ear
- Sing
- DET: syn, syne, myn, myne, mynen, dyn, höären, synen, uw
Other Features
- AdpType
- Post
- ADP: an
- Prep
- ADP: in, van, mid, to, an, up, by, vöär, nå, uut
- Post
- Foreign
- Yes
- X: decipi, mundus, vult, Amicorum, Batavorum, De, Iovivat, Prosaluut, arum, bedrogen
- Yes
- PartType
- Inf
- PART: te, to
- Neg
- PART: nich, neet, nit, en
- Inf
- Person[psor]
- 1
- DET: myn, unsen, myne, mynen, Unse
- 2
- DET: dyn, juwen, ouw, uw
- 3
- DET: syn, syne, eare, ear, höären, synen
- 1
- VerbType
- Aux
- AUX: het, is, wil, hadde, kan, skölde, willet, hadden, hebben, hevt
- AUX-Fin: het, is, wil, hadde, kan, skölde, willet, hadden, hevt, konnen
- AUX-Inf: hebben
- Cop
- AUX-Fin: is, was, weer, sint, weren
- Mod
- AUX-Fin: kun, künne, möcht, sul, wul
- Aux
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: weasen.
- This corpus uses 9 lemmas as auxiliaries (aux). Examples: hebben, künnen, willen, weasen, sköälen, möten, möägen, doon, werden.
- This corpus uses 2 lemmas as passive auxiliaries (aux:pass). Examples: werden, weasen.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB--NOUN-Dat (1)
- VERB--NOUN-Nom (50)
- VERB--PRON (1)
- VERB--PRON-Acc (1)
- VERB--PRON-Acc,Dat (1)
- VERB--PRON-Nom (142)
- VERB-Fin--NOUN (1)
- VERB-Fin--NOUN-Nom (21)
- VERB-Fin--PRON (1)
- VERB-Fin--PRON-Nom (33)
- VERB-Inf--NOUN-Dat (1)
- VERB-Inf--NOUN-Nom (5)
- VERB-Inf--PRON (1)
- VERB-Inf--PRON-Nom (54)
- VERB-Part--NOUN-Nom (20)
- VERB-Part--PRON-Nom (36)
- obj
- VERB--NOUN-Acc (57)
- VERB--NOUN-Acc,Dat (1)
- VERB--NOUN-Acc,Dat-ADP(mid) (1)
- VERB--NOUN-Acc-ADP(dale) (1)
- VERB--NOUN-Dat (1)
- VERB--NOUN-Dat-ADP(in) (1)
- VERB--NOUN-Gen (1)
- VERB--NOUN-Nom (3)
- VERB--PRON (2)
- VERB--PRON-Acc (30)
- VERB--PRON-Acc,Dat (28)
- VERB--PRON-Nom (1)
- VERB-Fin--NOUN (2)
- VERB-Fin--NOUN-Acc (20)
- VERB-Fin--PRON-Acc (8)
- VERB-Fin--PRON-Acc,Dat (6)
- VERB-Fin--PRON-Dat (1)
- VERB-Inf--NOUN-Acc (30)
- VERB-Inf--NOUN-Acc,Dat-ADP(mid) (1)
- VERB-Inf--NOUN-Acc-ADP(vöär) (1)
- VERB-Inf--NOUN-Dat (1)
- VERB-Inf--PRON (1)
- VERB-Inf--PRON-Acc (11)
- VERB-Inf--PRON-Acc,Dat (9)
- VERB-Inf--PRON-Dat (1)
- VERB-Inf--PRON-Nom (1)
- VERB-Part--NOUN-Acc (19)
- VERB-Part--NOUN-Dat (1)
- VERB-Part--NOUN-Nom (1)
- VERB-Part--PRON-Acc (11)
- VERB-Part--PRON-Acc,Dat (4)
- VERB-Part--PRON-Dat (1)
- VERB-Part--PRON-Nom (1)
- iobj
- VERB--NOUN-Acc (2)
- VERB--NOUN-Acc,Dat (2)
- VERB--NOUN-Dat (1)
- VERB--PRON-Acc,Dat (6)
- VERB--PRON-Dat (2)
- VERB-Fin--NOUN-Acc (1)
- VERB-Fin--PRON-Acc,Dat (6)
- VERB-Inf--NOUN-Acc,Dat (1)
- VERB-Inf--NOUN-Dat (1)
- VERB-Inf--PRON-Acc,Dat (5)
- VERB-Inf--PRON-Dat (2)
- VERB-Part--NOUN-Acc (1)
- VERB-Part--PRON-Acc,Dat (7)
- VERB-Part--PRON-Dat (2)
Reflexive Verbs
- This corpus contains 5 lemmas that occur at least once with an expl:pv child. Examples: besteaden sik, eaten sik, lägeren sik, setten sik, vorwunderen sik
Verbs with Reflexive Core Objects
- This corpus contains 11 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: låten sik, maken sik, setten sik, entsluten sik, geaven sik, koaken sik, köypen sy, sitten sik, weaten sik, wunderen sik, öäverdenken sik
- Out of those, 1 lemmas occurred more than once, but never without a reflexive dependent. Examples: setten
Relations Overview
- This corpus uses 8 relation subtypes: acl:relcl, aux:pass, compound:prt, det:poss, expl:pv, nmod:poss, nsubj:pass, obl:agent
- The following 5 relation types are not used in this corpus at all: clf, list, goeswith, reparandum, dep