
This page pertains to UD version 2.

UD Pomak Philotis

Language: Pomak (code: qpm)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v2.10 release.

The following people have contributed to making this treebank part of UD: Ritván Karahóǧa, Vivian Stamou, Stella Markantonatou.

Repository: UD_Pomak-Philotis
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.14

License: CC BY-NC-SA 3.0

Genre: news, grammar-examples, poetry, fiction

Questions, comments? General annotation questions (either Pomak-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [marks (æt) athenarc • gr]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas assigned by a program, with some manual corrections, but not a full manual verification
UPOS assigned by a program, with some manual corrections, but not a full manual verification
XPOS not available
Features assigned by a program, with some manual corrections, but not a full manual verification
Relations annotated manually in non-UD style, automatically converted to UD


The Pomak UD treebank is derived from the Pomak Dependency Treebank, a resource developed and maintained by researchers at the Institute for Language and Speech Processing/Athena R.C. (http://www.ilsp.gr).

The Pomak UD treebank consists of 6351 sentences (86782 tokens). The data in the current release derive from primary texts that will be made available soon on the repositories of the Philotis project (https://www.ilsp.gr/en/projects/filotis-en/). The treebank is licensed under the terms of Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
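The sentence and token counts can be checked against the released CoNLL-U files. The following is a minimal sketch, not the official UD statistics script. The function name `count_conllu` is hypothetical, and the code assumes the standard CoNLL-U conventions: a blank line ends a sentence, lines starting with `#` are comments, IDs of the form `a-b` mark multiword tokens, and IDs of the form `a.b` mark empty nodes. This page does not say whether its figure counts surface tokens or syntactic words, so the sketch reports both.

```python
def count_conllu(text):
    """Return (sentences, tokens, words) for CoNLL-U formatted text.

    tokens: surface tokens (a multiword token range counts once).
    words:  syntactic words (lines with a plain integer ID).
    """
    sentences = tokens = words = 0
    in_sentence = False
    covered_until = 0  # highest word ID covered by the latest multiword token

    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence, if any.
            if in_sentence:
                sentences += 1
            in_sentence = False
            covered_until = 0
            continue
        if line.startswith("#"):
            continue  # sentence-level comment

        in_sentence = True
        tok_id = line.split("\t", 1)[0]
        if "." in tok_id:
            continue  # empty node: neither a token nor a word
        if "-" in tok_id:
            # Multiword token range, e.g. "1-2": one surface token.
            covered_until = int(tok_id.split("-")[1])
            tokens += 1
            continue

        words += 1
        if int(tok_id) > covered_until:
            tokens += 1  # word that is not part of a multiword token

    if in_sentence:
        sentences += 1  # file did not end with a blank line
    return sentences, tokens, words
```

To apply it to a release file, read the file as UTF-8 text and pass the contents, for example `count_conllu(open(path, encoding="utf-8").read())`, where `path` is a placeholder for a `.conllu` file from the repository.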

The morphological annotation of the Pomak UD treebank was created by applying the morphological database Rodopsky to the texts, followed by extensive manual correction by two annotators. The syntactic annotation of the 1.1 release was generated automatically using a Bulgarian model. A detailed revision of the automatic syntactic annotation is due at the end of 2022.


We wish to thank all contributors to the original annotation efforts. Morphological annotation was carried out by Ritvan Karahoǧa and Nicolaos Constantinides. Panagiotis Krimpas supported the annotation with expertise in Slavic languages and Stella Markantonatou with expertise in formal grammatical frameworks. Nicolaos Kokkas contributed to the collection of Pomak texts.


Statistics of UD Pomak Philotis

POS Tags

Tokenization and Word Segmentation



Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features


Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
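The restriction to verb parents and noun or pronoun children can be illustrated with a short filter over CoNLL-U data. This is a sketch under stated assumptions, not the script that produces the UD statistics: the function name `verb_nominal_relations` and the `child_upos` parameter are hypothetical, sentences are assumed to be separated by a single blank line, and the default child set uses only the UPOS tags `NOUN` and `PRON`. Whether the official statistics also include `PROPN` is not stated here; it can be added through the parameter.

```python
from collections import Counter


def verb_nominal_relations(conllu_text, child_upos=("NOUN", "PRON")):
    """Count dependency relations whose parent is a VERB and whose
    child has one of the UPOS tags in child_upos (an assumed default)."""
    counts = Counter()
    for block in conllu_text.strip().split("\n\n"):
        # Index the basic words of one sentence by their ID column.
        rows = {}
        for line in block.splitlines():
            cols = line.split("\t")
            # Skip comments, multiword ranges, empty nodes, malformed lines.
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            rows[cols[0]] = cols

        for cols in rows.values():
            parent = rows.get(cols[6])  # HEAD column; "0" (root) has no row
            if parent is None:
                continue
            if parent[3] == "VERB" and cols[3] in child_upos:
                counts[cols[7]] += 1  # DEPREL column
    return counts
```

The returned `Counter` maps each relation label (for example `nsubj`, `obj`, `obl`) to the number of verb-to-nominal edges carrying it.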

Reflexive Verbs

Verbs with Reflexive Core Objects

Relations Overview