home edit page issue tracker

This page pertains to UD version 2.

UD Pomak Philotis

Language: Pomak (code: qpm)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v2.10 release.

The following people have contributed to making this treebank part of UD: Ritván Karahóǧa, Vivian Stamou, Stella Markantonatou.

Repository: UD_Pomak-Philotis
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13

License: CC BY-NC-SA 3.0

Genre: news, grammar-examples, poetry, fiction

Questions, comments? General annotation questions (either Pomak-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [marks (æt) athenarc • gr]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas assigned by a program, with some manual corrections, but not a full manual verification
UPOS assigned by a program, with some manual corrections, but not a full manual verification
XPOS not available
Features assigned by a program, with some manual corrections, but not a full manual verification
Relations annotated manually in non-UD style, automatically converted to UD

Description

The Pomak UD treebank is derived from the Pomak Dependency Treebank, a resource developed and maintained by researchers at the Institute for Language and Speech Processing/Athena R.C. (http://www.ilsp.gr).

The Pomak UD treebank consists of 6351 sentences (86782 tokens). The data in the current release derive from primary texts that will be made available soon on the repositories of the Philotis project (https://www.ilsp.gr/en/projects/filotis-en/). The treebank is licensed under the terms of Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) .

The morphological annotation of the Pomak UD treebank was originally created by applying the morphological database Rodopsky to the texts and then by extensive manual correction by two annotators. The syntactic annotation of the 1.1 release was generated automatically using a Bulgarian model. A detailed revision of the automatic syntactic annotation is due at the end of 2022.

Acknowledgments

We wish to thank all contributors to the original annotation efforts. Morphological annotation was carried out by Ritvan Karahoǧa and Nicolaos Constantinides. Panagiotis Krimpas supported the annotation with expertise in Slavic languages and Stella Markantonatou with expertise in formal grammatical frameworks. Nicolaos Kokkas contributed to the collection of Pomak texts.

References

Statistics of UD Pomak Philotis

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJVERBX

Features

AbbrAdpTypeAnimacyAspectCaseDefiniteDegreeDeixisDeixisRefForeignGenderMoodNumberNumber[psor]NumTypePartTypeQpmPersonPolarityPossPronTypeReflexTenseVerbFormVoice

Relations

acladvcladvmodamodauxcaseccccompconjcopcsubjdepdetdiscourseexplfixedflatiobjmarknmodnsubjnummodobjoblparataxispunctrootvocativexcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview