home edit page issue tracker

This page pertains to UD version 2.

UD Spanish GSD

Language: Spanish (code: es)
Family: Indo-European, Romance

This treebank has been part of Universal Dependencies since the UD v1.0 release.

The following people have contributed to making this treebank part of UD: Miguel Ballesteros, Héctor Martínez Alonso, Ryan McDonald, Elena Pascual, Natalia Silveira, Daniel Zeman, Joakim Nivre.

Repository: UD_Spanish-GSD
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.5

License: CC BY-SA 4.0

Genre: blog, news, reviews, wiki

Questions, comments? General annotation questions (either Spanish-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [zeman (æt) ufal • mff • cuni • cz]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.

Annotation Source
Lemmas assigned by a program, not checked manually
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS not available
Features assigned by a program, not checked manually
Relations annotated manually in non-UD style, automatically converted to UD

Description

The Spanish UD is converted from the content head version of the universal dependency treebank v2.0 (legacy).

In addition to converting dependencies from the legacy UD treebank, token level morphology features have been added automatically using the parsers/taggers in Bohnet et al 2014* and Bohnet et al. 2015** trained on the Ancora*** treebank and converted automatically to UD standards.

Various heuristics have been added to improve the output of the tagger, fix obvious errors and add features that the tagger did not supply. The changes for v1.2 (November 2015) were done by Miguel Ballesteros, Dan Zeman, and Héctor Martínez Alonso.

The Spanish UD conforms to the UD guidelines, but there are some exceptions.

Acknowledgments

Statistics of UD Spanish GSD

POS Tags

ADJADPADVAUXCCONJDETINTJNOUNNUMPARTPRONPROPNPUNCTSCONJSYMVERBX

Features

CaseDefiniteDegreeForeignGenderMoodNumberNumTypePersonPolarityPolitePossPrepCasePronTypeReflexTenseVerbForm

Relations

aclacl:relcladvcladvmodamodapposauxaux:passcaseccccompcompoundconjcopcsubjcsubj:passdepdetfixedflatiobjmarknmodnsubjnsubj:passnummodobjoblorphanparataxispunctrootxcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Verbs with Reflexive Core Objects

Relations Overview