home edit page issue tracker

This page pertains to UD version 2.

UD Soi AHA

Language: Soi (code: soj)
Family: IE

This treebank has been part of Universal Dependencies since the UD v2.7 release.

The following people have contributed to making this treebank part of UD: AmirHossein Mojiri Foroushani, Hamid Aghaei, Amir Ahmadi.

Repository: UD_Soi-AHA
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15

License: CC BY-SA 4.0

Genre: grammar-examples, spoken

Questions, comments? General annotation questions (either Soi-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [amojiry (æt) gmail • com]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas annotated manually
UPOS annotated manually, natively in UD style
XPOS annotated manually
Features annotated manually, natively in UD style
Relations annotated manually, natively in UD style

Description

The AHA Soi Treebank is a small treebank for contemporary Soi. Its corpus is collected and annotated manually. We have prepared this treebank based on interviews with Soi speakers.

Soi treebank consist of 8 sentences of this stage. We are trying to make this corpus bigger day by day. AHA is a small group, tries to analyze Iranian language and find their similarities and differences.

Acknowledgments

Theses sentences were prepared with the help of Delijan people. On behalf of the AHA group, Delijan people is thanked. Also, Ms. Hanieh Mashayekhi sincerely helped us to translate the sentences. First, we used the sentences suggested by APLL (Academy of Persian Language and Literature) to collect Iranian languages. This project is a research project by AmirHossein, Hamid and Amir (AHA).

You can use this structure to refer to this project:

Statistics of UD Soi AHA

POS Tags

ADPADVAUXNOUNNUMPRONPUNCTVERB

Features

CaseMoodNumberNumTypePersonPolarityPronTypeTenseVerbForm

Relations

advcladvmodauxcaseccompcompound:lvcflatnmodnmod:possnsubjnummodobjoblpunctroot

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Relations Overview