home edit page issue tracker

This page pertains to UD version 2.

UD Gorontalo BungoLoLombi

Language: Gorontalo (code: gor)
Family: Austronesian

This treebank has been part of Universal Dependencies since the UD v2.18 release.

The following people have contributed to making this treebank part of UD: Andrew Thomas Dyer, Colleen Alena O’Brien.

Repository: UD_Gorontalo-BungoLoLombi
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.18

License: CC BY-SA 4.0

Genre: grammar-examples

Questions, comments? General annotation questions (either Gorontalo-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [andrew • dyer (æt) uni-saarland • de]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas annotated manually
UPOS annotated manually, natively in UD style
XPOS not available
Features annotated manually, natively in UD style
Relations annotated manually, natively in UD style

Description

Bungo lo Lombi is a Universal Dependencies parsed corpus of modern spoken Gorontalo as spoken in Gorontalo City, Gorontalo Province, Indonesia. It comprises fieldwork samples obtained by Colleen Alena O’Brien.

Bungo lo Lombi is a corpus of modern spoken Gorontalo as spoken in Gorontalo City, Gorontalo Province, Indonesia. It comprises fieldwork samples obtained by Colleen Alena O’Brien. The complete data contains elicited examples and monologue and dialogue. At the moment, only elicited examples have been parsed.

The parsed data is different from other Austronesian languages in Universal Dependencies in the following ways:

The name Bungo lo Lombi means “banana tree” in Gorontalo: a very useful, very versatile tree that provides a valuable fruit.

Acknowledgments

References

Cite as:

@inproceedings{dyer-obrien-2025-towards,
title = "Towards better annotation practices for symmetrical voice in {U}niversal {D}ependencies",
author = "Dyer, Andrew Thomas and
O{'}Brien, Colleen Alena",
editor = {Bouma, Gosse and
{\c{C}}{\"o}ltekin, {\c{C}}a{\u{g}}r{\i}},
booktitle = "Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025)",
month = aug,
year = "2025",
address = "Ljubljana, Slovenia",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.udw-1.15/",
pages = "137--142",
ISBN = "979-8-89176-292-3",
abstract = "Austronesian languages exhibit features that are challenging for Universal Dependencies: most notably, the symmetric voice system, whereby agent, patient, and instrumental arguments (among others) can be the pivot of a transitive structure {--} complicating the usual assumption that subjects of transitive sentences are semantic agents, and objects semantic patients. To showcase our ideas of how to address the representation of such systems in Universal Dependencies, we introduce a small treebank of sentences from texts and elicitation sessions in Gorontalo, an Austronesian language of Sulawesi (Indonesia), which exhibits a Philippine-type voice system. We discuss the annotation guidelines for this language, and the extensions of the Universal Dependencies guidelines that are needed to accommodate this and other Austronesian languages."
}

Andrew Thomas Dyer and Colleen Alena O’Brien. 2025. Towards better annotation practices for symmetrical voice in Universal Dependencies. In Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 137–142, Ljubljana, Slovenia. Association for Computational Linguistics.

Statistics of UD Gorontalo BungoLoLombi

POS Tags

ADJADPADVCCONJDETNOUNNUMPARTPRONPROPNVERB

Features

AspectCaseGenderMoodNumberPersonPronTypeVoice

Relations

advmodamodcaseccclfcompound:redupconjdepdetiobj:instrumentiobj:patientnmodnmod:possnsubjnsubj:agentnsubj:instrumentnsubj:patientnummodobj:agentobj:patientoblrootxcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview