home edit page issue tracker

This page still pertains to UD version 1.

Introduction

There are now three Norwegian UD treebanks. The Norwegian Bokmål and Nynorsk treebanks are conversions from the Norwegian Dependency Treebank (NDT), which is a syntactic treebank of Norwegian. The Norwegian Nynorsk LIA treebank is based on the LIA treebank of transcribed spoken Norwegian dialects.

NDT was developed 2011-2014 at the National Library of Norway in collaboration with the Text Laboratory and the Department of Informatics at the University of Oslo. NDT has been automatically converted to the UD scheme by Lilja Øvrelid at the University of Oslo.

The Norwegian LIA treebank consists of dialect recordings made in the period between 1950–1990, which have been digitised, transcribed, and subsequently annotated with morphological and dependency-style syntactic analysis as part of the LIA (Language Infrastructure made Accessible) project at the University of Oslo. It has been automatically converted to the UD scheme by Lilja Øvrelid at the University of Oslo.

Acknowledgements

Thanks also to the annotators and other contributors to the original NDT treebank: Per Erik Solberg, Kari Kinn, Pål Kristian Eriksen, Arne Skjærholt, Kristin Hagen, Janne Bondi Johannessen. Thanks also to the annotators and contributors to the original LIA treebank: Andre Kaasen, Laura Moquin, Per Erik Solberg, Kristin Hagen, Janne Bondi Johannessen.

References

Kristin Hagen, Janne Bondi Johannessen and Anders Nøklestad: “A Constraint-based Tagger for Norwegian”. 2000. Proceedings of the 17th Scandinavian Conference in Linguistics.

Kari Kinn, Per Erik Solberg and Pål Kristian Eriksen. “NDT Guidelines for Morphological Annotation”. National Library Tech Report.

Per Erik Solberg, Arne Skjærholt, Lilja Øvrelid, Kristin Hagen and Janne Bondi Johannessen. 2014. “The Norwegian Dependency Treebank”, Proceedings of LREC 2014, Reykjavik

Lilja Øvrelid & Petter Hohle (2016). “Universal Dependencies for Norwegian”, In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’16)