UD Esperanto Cairo
Language: Esperanto (code: eo)
Family: Constructed
This treebank has been part of Universal Dependencies since the UD v2.16 release.
The following people have contributed to making this treebank part of UD: Masanori Oya.
Repository: UD_Esperanto-Cairo
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.17
License: CC BY-SA 4.0
Genre: grammar-examples
Questions, comments? General annotation questions (either Esperanto-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [masanori_oya2019 (æt) meiji • ac • jp]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
| Annotation | Source |
|---|---|
| Lemmas | annotated manually |
| UPOS | annotated manually, natively in UD style |
| XPOS | not available |
| Features | annotated manually, natively in UD style |
| Relations | annotated manually, natively in UD style |
Description
This is an example treebank made to ilustrate UD annotation choices made for Esperanto based on the Cairo sample sentences.
This treebank contains the 20 Cairo example sentences and is meant to be a quick reference on how various syntactic constructions are annotated in UD.
Acknowledgments
Statistics of UD Esperanto Cairo
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB
Features
Case – Degree – Gender – Mood – Number – Number[psor] – Person – Poss – PronType – Reflex – Tense – VerbForm – Voice
Relations
acl:relcl – advcl – advmod – amod – appos – aux – aux:pass – case – cc – ccomp – compound – conj – cop – det – mark – nmod – nmod:poss – nsubj – nsubj:pass – obj – orphan – parataxis – punct – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 20 sentences and 177 tokens.
- This corpus contains 28 tokens (16%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus does not contain words that contain both letters and punctuation.
Morphology
Tags
- This corpus uses 14 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: INTJ, SYM, X
- This corpus contains 2 word types tagged as particles (PART): Ĉi, Ĉu
- This corpus contains 9 lemmas tagged as pronouns (PRON): il, kio, kiu, li, mi, si, vi, ĝi, ŝi
- This corpus contains 2 lemmas tagged as determiners (DET): la, tiu
- This corpus contains 1 lemmas tagged as auxiliaries (AUX): esti
- There are 3 (de)verbal forms:
- Fin
- AUX: estas, estis
- VERB: pensas, skribis, aĉetis, brakumis, devus, farbis, farus, forlasis, gajnis, havas
- Inf
- AUX: esti
- VERB: daŭrigi, fumi, iri, lavi, trinki, veni, ĉesi
- Part
- VERB: elektitaj, faranta, transdonita
Nominal Features
- Fem
- PRON: ŝi
- Masc
- PRON: li
- Plur
- PRON: Ili
- VERB-Part: elektitaj
- Sing
- ADJ: alian, granda, malgranda, moda
- DET: tiu, tiun
- NOUN: aŭton, Francio, amiko, arĝenton, barilon, biciklon, bronzon, edzon, fenestron, frato
- PRON: vi, ŝi, li, Mi, sian, Kion, Mia, kiu, lia, sia
- PROPN: Petro, Maria, Iguazu, Jane, Parizo
- VERB-Part: faranta, transdonita
- Acc
- ADJ: alian
- DET: tiun
- NOUN: aŭton, arĝenton, barilon, biciklon, bronzon, edzon, fenestron, hararon, ideon, leteron
- PRON: sian, Kion, ĝin
- Nom
- ADJ: granda, malgranda, moda
- DET: tiu
- NOUN: Francio, amiko, frato, knabino, letero, najbaro, patro, ĉefurbo
- PRON: vi, ŝi, li, Mi, Ili, Mia, kiu, lia, sia, via
- PROPN: Petro, Maria, Iguazu, Jane, Parizo
- VERB-Part: elektitaj, transdonita
Degree and Polarity
- Pos
- ADJ: alian, granda, malgranda
Verbal Features
- Imp
- VERB-Fin: malfermu
- Ind
- AUX-Fin: estas, estis
- VERB-Fin: pensas, skribis, aĉetis, brakumis, farbis, forlasis, gajnis, havas, igis, kreskis
- VERB-Part: faranta
- Sub
- VERB-Fin: devus, farus, povus
- Past
- AUX-Fin: estis
- VERB-Fin: skribis, aĉetis, brakumis, farbis, forlasis, gajnis, igis, kreskis, kuris, povis
- VERB-Part: elektitaj, transdonita
- Pres
- AUX-Fin: estas
- VERB-Fin: pensas, havas, pluvas, povas, rigardas, volas
- VERB-Part: faranta
- Act
- VERB-Part: faranta
- Pass
- VERB-Part: elektitaj, transdonita
Pronouns, Determiners, Quantifiers
- Dem
- DET: tiu
- PRON: ĝi, ĝin
- Int
- PRON: Kion
- Prs
- PRON: vi, ŝi, li, Mi, sian, Ili, Mia, lia, sia, via
- Rel
- PRON: kiu
- Yes
- PRON: sian, Mia, lia, sia, via
- Yes
- PRON: sia, sian
- 1
- PRON: Mi, Mia
- 2
- PRON: vi, via
- 3
- PRON: li, sian, Ŝi, Ili, lia, sia, ĝi
- Sing
- PRON: sian, Mia, lia, sia, via
Other Features
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: esti.
- This corpus uses 1 lemmas as auxiliaries (aux). Examples: esti.
- This corpus uses 1 lemmas as passive auxiliaries (aux:pass). Examples: esti.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Fin--NOUN-Nom (2)
- VERB-Fin--PRON-Nom (15)
- obj
- VERB-Fin--NOUN-Acc (9)
- VERB-Fin--PRON-Acc (2)
- VERB-Inf--NOUN-Acc (1)
- VERB-Part--NOUN-Acc (1)
Relations Overview
- This corpus uses 4 relation subtypes: acl:relcl, aux:pass, nmod:poss, nsubj:pass
- The following 1 main types are not used alone, they are always subtyped: acl
- The following 14 relation types are not used in this corpus at all: iobj, csubj, obl, expl, dislocated, discourse, nummod, clf, fixed, flat, list, goeswith, reparandum, dep