
This page pertains to UD version 2.

UD Classical Chinese Kyoto

Language: Classical Chinese (code: lzh)
Family: Sino-Tibetan

This treebank has been part of Universal Dependencies since the UD v2.4 release.

The following people have contributed to making this treebank part of UD: Koichi Yasuoka, Christian Wittern, Tomohiko Morioka, Takumi Ikeda, Naoki Yamazaki, Yoshihiro Nikaido, Shingo Suzuki, Shigeki Moro, Yuan Li, Hiroyuki Shirasu, Kazunori Fujita.

Repository: UD_Classical_Chinese-Kyoto
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15

License: PD

Genre: nonfiction, poetry

Questions, comments? General annotation questions (either Classical Chinese-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on GitHub. If you want to collaborate, please contact [yasuoka (æt) kanji • zinbun • kyoto-u • ac • jp]. Development of the treebank happens outside the UD repository, so any bug must be fixed either in the original data source or in the conversion procedure. Do not submit pull requests against the UD repository.

Annotation Source
Lemmas: annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
UPOS: annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
XPOS: annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
Features: annotated manually in non-UD style, automatically converted to UD, with some manual corrections of the conversion
Relations: annotated manually, natively in UD style

Description

A Classical Chinese Universal Dependencies treebank, annotated and converted by the Institute for Research in Humanities, Kyoto University.

This treebank is drawn from the full text of 論語, 孟子, 禮記, 十八史略, 楚辭, 戰國策, and others. Classical Chinese was written without spaces or punctuation between words or sentences, so the treebank files do not include any spaces or punctuation.
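Because the files carry no spaces or punctuation, the surface text of a sentence can be rebuilt by joining the token forms directly. The following is a minimal sketch of that idea, assuming the standard 10-column, tab-separated CoNLL-U layout. The fragment `SAMPLE` is hand-made for illustration (only the ID and FORM columns are filled in) and is not copied from the treebank files; the function names `read_forms` and `surface_text` are hypothetical.

```python
# Hand-made CoNLL-U fragment for illustration only: the segmentation is an
# assumption of this sketch, and every column except ID and FORM is left as "_".
SAMPLE = """\
# text = 學而時習之
1\t學\t_\t_\t_\t_\t_\t_\t_\t_
2\t而\t_\t_\t_\t_\t_\t_\t_\t_
3\t時\t_\t_\t_\t_\t_\t_\t_\t_
4\t習\t_\t_\t_\t_\t_\t_\t_\t_
5\t之\t_\t_\t_\t_\t_\t_\t_\t_
"""


def read_forms(conllu):
    """Return the FORM column of every plain token line, in order."""
    forms = []
    for line in conllu.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Keep only well-formed rows with a plain integer ID
        # (this skips multiword-token ranges such as "1-2").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        forms.append(cols[1])
    return forms


def surface_text(conllu):
    """Join the forms with no separator, since the text has no spaces."""
    return "".join(read_forms(conllu))


print(surface_text(SAMPLE))
```

Joining with an empty string rather than a space is the only choice that matters here: for a language written with spaces, the same reader would need to consult the MISC column before deciding whether to insert one.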

Acknowledgments

Statistics of UD Classical Chinese Kyoto

POS Tags

ADP, ADV, AUX, CCONJ, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, SYM, VERB

Features

AdvType, Aspect, Case, Degree, Mood, NameType, NounType, NumType, Person, Polarity, PronType, Reflex, Tense, VerbForm, VerbType, Voice

Relations

acl, advcl, advmod, amod, appos, aux, case, cc, ccomp, clf, compound, compound:redup, conj, cop, csubj, csubj:outer, csubj:pass, det, discourse, discourse:sp, dislocated, expl, fixed, flat, flat:foreign, flat:vv, iobj, list, mark, nmod, nsubj, nsubj:outer, nsubj:pass, nummod, obj, obl, obl:lmod, obl:tmod, orphan, parataxis, root, vocative, xcomp

Tokenization and Word Segmentation

Morphology

Tags

Nominal Features

Degree and Polarity

Verbal Features

Pronouns, Determiners, Quantifiers

Other Features

Syntax

Auxiliary Verbs and Copula

Core Arguments, Oblique Arguments and Adjuncts

Here we consider only relations between verbs (parent) and nouns or pronouns (child).
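The restriction to verb parents and noun or pronoun children can be expressed as a simple filter over CoNLL-U rows. The sketch below assumes the standard 10-column CoNLL-U layout (UPOS in column 4, HEAD in column 7, DEPREL in column 8). The fragment `SAMPLE_DEPS` and its annotation are hand-made for illustration and are not copied from the treebank; `verb_nominal_relations` is a hypothetical helper name.

```python
# Hand-made fragment for illustration only: the UPOS, HEAD and DEPREL values
# are assumptions of this sketch, not the treebank's actual annotation.
SAMPLE_DEPS = """\
# text = 學而時習之
1\t學\t_\tVERB\t_\t_\t0\troot\t_\t_
2\t而\t_\tCCONJ\t_\t_\t4\tcc\t_\t_
3\t時\t_\tNOUN\t_\t_\t4\tobl:tmod\t_\t_
4\t習\t_\tVERB\t_\t_\t1\tconj\t_\t_
5\t之\t_\tPRON\t_\t_\t4\tobj\t_\t_
"""


def verb_nominal_relations(conllu, child_upos=("NOUN", "PRON")):
    """Return (parent form, relation, child form) for each VERB -> nominal edge."""
    rows = {}
    for line in conllu.splitlines():
        cols = line.split("\t")
        # Keep only well-formed token rows with integer ID and HEAD;
        # comment lines and blank lines fail the column-count check.
        if len(cols) != 10 or not cols[0].isdigit() or not cols[6].isdigit():
            continue
        rows[int(cols[0])] = {
            "form": cols[1],
            "upos": cols[3],
            "head": int(cols[6]),
            "deprel": cols[7],
        }

    edges = []
    for token_id in sorted(rows):
        child = rows[token_id]
        parent = rows.get(child["head"])  # None for the root (HEAD = 0)
        if parent is None:
            continue
        if parent["upos"] == "VERB" and child["upos"] in child_upos:
            edges.append((parent["form"], child["deprel"], child["form"]))
    return edges


for parent, relation, child in verb_nominal_relations(SAMPLE_DEPS):
    print(parent, relation, child)
```

The child tag set is a parameter so that the same filter can be widened (for example to include PROPN) without changing the function body.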

Verbs with Reflexive Core Objects

Relations Overview