home edit page issue tracker

This page pertains to UD version 2.

UD Turkish FrameNet

Language: Turkish (code: tr)
Family: Turkic

This treebank has been part of Universal Dependencies since the UD v2.8 release.

The following people have contributed to making this treebank part of UD: Neslihan Cesur, Aslı Kuzgun, Olcay Taner Yıldız, Büşra Marşan, Oğuzhan Kuyrukçu, Bilge Nas Arıcan, Ezgi Sanıyar, Neslihan Kara, Merve Özçelik.

Repository: UD_Turkish-FrameNet
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15

License: CC BY-SA 4.0

Genre: grammar-examples

Questions, comments? General annotation questions (either Turkish-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [nesli (æt) starlangyazilim • com / olcay • yildiz (æt) ozyegin • edu • tr]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually in non-UD style, automatically converted to UD
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually in non-UD style, automatically converted to UD
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually in non-UD style, automatically converted to UD

Description

Turkish FrameNet consists of 2,700 manually annotated example sentences and 19,221 tokens. Its data consists of the sentences taken from the Turkish FrameNet Project. The annotated sentences can be filtered according to the semantic frame category of the root of the sentence.

Turkish FrameNet consists of 2,700 manually annotated sentences and 19,221 tokens. The corpus is taken from the Turkish FrameNet Project where the Turkish verbs were annotated according to their semantic arguments. The sentences are like dictionary examples, they include made-up full sentences created by native speakers of Turkish.

The corpus is divided into 139 semantic frames according to the verb meaning. The number of sentences in frames varies according to how many verbs Turkish language has in that category. Each frame has its own dev, test, and training files.

Acknowledgments

We wish to thank the Starlang Software for funding and supporting this work.

References

*The Turkish FrameNet:

Marşan, B., Kara, N., Özçelik, M., Arıcan, B. N., Cesur, N., Kuzgun, A., … & Yıldız, O. T. (2021, January). Building the Turkish FrameNet. In Proceedings of the 11th Global Wordnet Conference (pp. 118-125).

Statistics of UD Turkish FrameNet

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPRONPROPNPUNCTSCONJVERBX

Features

AspectCaseDefiniteDegreeMoodNumberNumber[psor]NumTypePersonPerson[psor]PolarityPronTypeReflexTenseVerbFormVoice

Relations

acladvcladvmodamodapposauxcaseccccompcompoundconjcsubjdetdiscoursefixedflatiobjlistmarknmodnsubjnummodobjoblorphanparataxispunctrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview