home edit page issue tracker

This page pertains to UD version 2.

UD Czech CLTT

Language: Czech (code: cs)
Family: Indo-European, Slavic

This treebank has been part of Universal Dependencies since the UD v1.3 release.

The following people have contributed to making this treebank part of UD: Barbora Hladká, Daniel Zeman, Martin Popel.

Repository: UD_Czech-CLTT
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.13

License: CC BY-SA 4.0

Genre: legal

Questions, comments? General annotation questions (either Czech-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [zeman (æt) ufal • mff • cuni • cz]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas annotated manually in non-UD style, automatically converted to UD
UPOS annotated manually in non-UD style, automatically converted to UD
XPOS annotated manually
Features annotated manually in non-UD style, automatically converted to UD
Relations annotated manually in non-UD style, automatically converted to UD

Description

The UD_Czech-CLTT treebank is based on the Czech Legal Text Treebank 2.0, created at the Charles University in Prague.

CLTT is a collection of 1121 manually annotated dependency trees. CLTT consists of two legal documents: The Accounting Act (563/1991 Coll., as amended; Czech: “Zákon o účetnictví”; in sentence ids: “zakon.iso”) and Decree on Double-entry Accounting for undertakers (500/2002 Coll., as amended; Czech: “Vyhláška, kterou se provádějí některá ustanovení zákona č. 563/1991 Sb., o účetnictví, ve znění pozdějších předpisů, pro účetní jednotky, které jsou podnikateli účtujícími v soustavě podvojného účetnictví”; in sentence ids: “vyhlaska.iso”).

See the following websites for more information on CLTT 2.0:

Acknowledgments

We wish to thank all of the contributors to the original CLTT annotation effort, including Barbora Hladká, Vincent Kríž and Zdeňka Urešová.

References

Statistics of UD Czech CLTT

POS Tags

ADJADPADVAUXCCONJDETNOUNNUMPARTPRONPUNCTSCONJSYMVERBX

Features

AbbrAdpTypeAnimacyAspectCaseDegreeGenderGender[psor]HyphMoodNumberNumber[psor]NumFormNumTypePersonPolarityPossPrepCasePronTypeReflexStyleTenseVariantVerbFormVoice

Relations

aclacl:relcladvcladvmodadvmod:emphamodapposauxaux:passcaseccccompcompoundconjcopcsubjcsubj:passdepdetdet:nummodexpl:passexpl:pvfixedmarknmodnsubjnsubj:passnummodnummod:govobjoblobl:argorphanparataxispunctrootxcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).

Reflexive Verbs

Reflexive Passive

Verbs with Reflexive Core Objects

Relations Overview