UD Neapolitan RB
Language: Neapolitan (code: nap
)
Family: IE
This treebank has been part of Universal Dependencies since the UD v2.9 release.
The following people have contributed to making this treebank part of UD: Rodolfo Basile.
Repository: UD_Neapolitan-RB
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.16
License: CC BY-SA 4.0
Genre: grammar-examples
Questions, comments? General annotation questions (either Neapolitan-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [rodolfo • basile (æt) ut • ee]. Development of the treebank happens directly in the UD repository, so you may submit bug fixes as pull requests against the dev branch.
Annotation | Source |
---|---|
Lemmas | annotated manually |
UPOS | annotated manually, natively in UD style |
XPOS | not available |
Features | annotated manually, natively in UD style |
Relations | annotated manually, natively in UD style |
Description
This treebank contains example sentences in Neapolitan, translated by a native speaker.
The example sentences have been translated from Italian. Since Neapolitan orthography is not standardized, a new way of writing reduced vowels is proposed, to avoid italianization (Cerruti 2016). Reduced vowels are transcribed with a breve diacritic. Neapolitan reduced vowels are hence /ă/, /ĕ/ and /ŏ/, all representing the schwa sound.
Acknowledgments
…
References
Cerruti, Massimo. 2016. L’italianizzazione dei dialetti: una rassegna. Quaderns d’Italià 21, 63–74.
Statistics of UD Neapolitan RB
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – DET – NOUN – PRON – PROPN – PUNCT – SCONJ – VERB
Features
Definite – Degree – Gender – Mood – Number – Person – Polarity – PronType – Reflex – Tense – VerbForm
Relations
acl – advcl – advmod – amod – appos – aux – case – cc – ccomp – conj – cop – det – expl – flat – iobj – mark – nmod – nsubj – obj – obl – orphan – punct – root – vocative – xcomp
Tokenization and Word Segmentation
- This corpus contains 20 sentences, 197 tokens and 199 syntactic words.
- This corpus contains 34 tokens (17%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 11 types of words that contain both letters and punctuation. Examples: 'e, 'a, 'o, l', 'na, s', 'Sta, 'nu, c', n'ată, ppo'
- This corpus contains 2 multi-word tokens. On average, one multi-word token consists of 2.00 syntactic words.
- There are 2 types of multi-word tokens. Examples: all', ă'o.
Morphology
Tags
- This corpus uses 12 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, DET, NOUN, PRON, PROPN, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: NUM, PART, INTJ, SYM, X
- This corpus contains 11 lemmas tagged as pronouns (PRON): che, chi, ci, egli, ella, il, lo, ne, si, ti, un'altra
- This corpus contains 4 lemmas tagged as determiners (DET): il, quello, questa, un
- Out of the above, 1 lemmas occurred sometimes as PRON and sometimes as DET: il
- This corpus contains 2 lemmas tagged as auxiliaries (AUX): avé, essĕ
- There are 4 (de)verbal forms:
- Fin
- AUX: è, so, afossĕ, avia, avissĕ, ha, hannĕ, sta, stajĕ
- VERB: Penzĕ, accattajĕ, arapĕ, criscettĕ, currevĕ, facettĕ, pienzĕ, pittajĕ, putettĕrĕ, scrivettĕ
- Ger
- VERB: chiuvennĕ, guaddannĕ
- Inf
- VERB: accuncià, fummà, jí, lavà, scegliĕ, stà, vvevĕrĕ, vĕnì
- Part
- AUX: avutŏ
- VERB: distribbuită, fattŏ, scrittŏ, abbracciatĕ, asciutĕ, pruvatĕ, riuscitŏ
Nominal Features
- Fem
- ADJ: francesĕ, sojă
- DET: 'a, 'na, 'Sta, la, na
- NOUN: lettĕră, machină, amica, biciclettă, cammără, capitalĕ, casă, foră, fĕnestă, ideă
- PRON: Essă, n'ată
- PROPN: Mariă, Giuvannă, Marronĕ
- Masc
- ADJ: ccalmŏ, gruossŏ, piccirillŏ, russŏ, suojŏ, velocĕ
- DET: 'o, l', 'e, 'nu, chillu
- NOUN: Patĕmŏ, argientŏ, bbrunzŏ, capillĕ, fratĕ, juornŏ, maritŏ, orŏ, paisĕ, sticcatŏ
- PRON: issŏ, l'
- PROPN: Pietrŏ, Ferrarŏ, Iguazy
- Plur
- DET: 'e
- NOUN: capillĕ
- Sing
- ADJ: ccalmŏ, francesĕ, gruossŏ, piccirillŏ, russŏ, sojă, suojŏ, velocĕ
- DET: 'a, 'o, 'na, l', 'Sta, 'nu, chillu, la, na
- NOUN: lettĕră, machină, Patĕmŏ, amica, argientŏ, bbrunzŏ, biciclettă, cammără, capitalĕ, casă
- PRON: Essă, issŏ, Tĕ, l', n'ată
- PROPN: Pietrŏ, Mariă, Ferrarŏ, Giuvannă, Iguazy, Marronĕ
- VERB-Fin: arapĕ
- Def
- DET: 'a, 'o, l', 'e, la
- Ind
- DET: 'na, 'nu, na
Degree and Polarity
- Cmp
- ADV: cchiù, assajĕ
- Neg
- ADV: nun
Verbal Features
- Imp
- VERB-Fin: arapĕ
- Ind
- AUX-Fin: è, so, avia, ha, hannĕ, sta, stajĕ
- VERB-Fin: Penzĕ, accattajĕ, criscettĕ, currevĕ, facettĕ, pienzĕ, pittajĕ, putettĕrĕ, scrivettĕ, tenĕnĕ
- Sub
- AUX-Fin: afossĕ, avissĕ
- Imp
- AUX-Fin: avia
- VERB-Fin: currevĕ
- Past
- AUX-Fin: afossĕ, avissĕ, ha
- VERB-Fin: accattajĕ, criscettĕ, facettĕ, pittajĕ, putettĕrĕ, scrivettĕ, vincettĕ, vvĕnettĕ
- VERB-Part: abbracciatĕ, asciutĕ, pruvatĕ, riuscitŏ
- Pres
- AUX-Fin: è, so, hannĕ, sta, stajĕ
- VERB-Fin: Penzĕ, pienzĕ, tenĕnĕ, vuò
Pronouns, Determiners, Quantifiers
- Art
- DET: 'a, 'o, 'na, l', 'e, 'nu, la, na
- Dem
- DET: 'Sta, chillu
- Ind
- PRON: n'ată
- Int
- ADV: Quannŏ
- PRON: Chĕ, chi
- Prs
- PRON: Essă, issŏ, Tĕ, l'
- Yes
- PRON: s'
- 1
- VERB-Fin: Penzĕ
- 2
- AUX-Fin: stajĕ
- PRON: Tĕ
- VERB-Fin: arapĕ, pienzĕ, vuò
- 3
- AUX-Fin: è, so, afossĕ, avia, avissĕ, ha, hannĕ, sta
- PRON: Essă, issŏ, l'
- VERB-Fin: accattajĕ, criscettĕ, currevĕ, facettĕ, pittajĕ, putettĕrĕ, scrivettĕ, tenĕnĕ, vincettĕ, vvĕnettĕ
Other Features
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: essĕ.
- This corpus uses 2 lemmas as auxiliaries (aux). Examples: avé, essĕ.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Fin--NOUN (2)
- VERB-Fin--PRON (4)
- VERB-Inf--PRON (1)
- VERB-Part--PRON (2)
- obj
- VERB-Fin--NOUN (6)
- VERB-Ger--PRON (1)
- VERB-Inf--NOUN (2)
- VERB-Part--PRON (2)
- iobj
- VERB-Fin--PRON (1)
Verbs with Reflexive Core Objects
- This corpus contains 1 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: abbracciare s'