UD North Sami Giella
Language: North Sami (code: sme
)
Family: Uralic
This treebank has been part of Universal Dependencies since the UD v2.1 release.
The following people have contributed to making this treebank part of UD: Trond Trosterud, Lene Antonsen, Francis Tyers.
Repository: UD_North_Sami-Giella
Search this treebank on-line: PML-TQ
Download all treebanks: UD 2.15
License: CC BY-SA 4.0
Genre: nonfiction, news
Questions, comments? General annotation questions (either North Sami-specific or cross-linguistic) can be raised in the main UD issue tracker. You can report bugs in this treebank in the treebank-specific issue tracker on Github. If you want to collaborate, please contact [ftyers (æt) prompsit • com]. Development of the treebank happens outside the UD repository. If there are bugs, either the original data source or the conversion procedure must be fixed. Do not submit pull requests against the UD repository.
Annotation | Source |
---|---|
Lemmas | annotated manually in non-UD style, automatically converted to UD |
UPOS | annotated manually in non-UD style, automatically converted to UD |
XPOS | annotated manually |
Features | annotated manually in non-UD style, automatically converted to UD |
Relations | annotated manually in non-UD style, automatically converted to UD |
Description
This is a North Sámi treebank based on a manually disambiguated and function-labelled gold-standard corpus of North Sámi produced by the Giellatekno team at UiT Norgga árktalaš universitehta.
The corpus was first analysed using a finite-state morphological analyser for North Sámi, and then disambiguated using a constraint-grammar-based disambiguator. The constraint grammar disambiguator also annotated syntactic function labels. The analyses and the function labels were manually corrected to produce a gold standard, and then a rule-based dependency parser was run on top of the gold data. On top of those parsers a series of tree-rewrite rules were used to convert the corpus to Universal Dependencies. Please see the paper below for details.
Acknowledgments
We are immensely grateful to the Giellatekno team, and especially to Trond Trosterud and Lene Antonsen for annotating the original data and for producing the rule-based parser on which the treebank is based. Their comments and help were invaluable.
If you use this data in your work, please cite:
@inproceedings{sheyanova:2017, author = {Mariya Sheyanova and Francis M. Tyers}, title = {Annotation schemes in North Sámi dependency parsing}, booktitle = {Proceedings of the 3rd International Workshop for Computational Linguistics of Uralic Languages}, pages = {66–75}, year = 2017 }
Statistics of UD North Sami Giella
POS Tags
ADJ – ADP – ADV – AUX – CCONJ – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB
Features
Aspect – Case – Connegative – Degree – Mood – Number – Number[psor] – NumType – Person – Person[psor] – Polarity – PronType – Reflex – Tense – VerbForm – Voice
Relations
acl – acl:relcl – advcl – advmod – amod – appos – aux – aux:neg – case – cc – cc:preconj – ccomp – compound – compound:nn – conj – cop – csubj – det – discourse – flat – mark – nmod – nmod:poss – nsubj – nummod – obj – obl – parataxis – punct – root – vocative – xcomp – xcomp:obj – xcomp:pred
Tokenization and Word Segmentation
- This corpus contains 3122 sentences and 26845 tokens.
- This corpus contains 4085 tokens (15%) that are not followed by a space.
- This corpus does not contain words with spaces.
- This corpus contains 173 types of words that contain both letters and punctuation. Examples: dan_dihte, ovdal_go, danne_go, M., Spider-Man, dearvvašvuođa-, A., dalle_go, Biret-Elle, Joneš-bojá, Mr., Sámi_Jienat, seamma_láhkai, 1982:s, 1995:s, Davvi-Romssas, Harry_Potter-girjji, IL_Nordlys, Oarje-Finnmárkkus, Soagŋu-girji, danin_go, das_go, e-boasta, e-boastačujuhusa, giella-, jna., máná-guovttos, seamma_ládje, skuvla-, 1600-logu, 1700-jagiin, 1700-logu, 1700-lohkui, 1800-logu, 1834:s, 1877:s, 1898:s, 1899:s, 1912:s, 1926:s, 1936:s, 1944:s, 1947:s, 1948:s, 1949:s, 1951:s, 1960:s, 1968:s, 1970-logu, 1970:s
Morphology
Tags
- This corpus uses 14 UPOS tags out of 17 possible: ADJ, ADP, ADV, AUX, CCONJ, INTJ, NOUN, NUM, PART, PRON, PROPN, PUNCT, SCONJ, VERB
- This corpus does not use the following tags: DET, SYM, X
- This corpus contains 16 word types tagged as particles (PART): Almma, Amma, Na, bat, dat, ge, ges, gis, go, goit, goittot, goittotge, han, mat, nai, son
- This corpus contains 56 lemmas tagged as pronouns (PRON): buohkat, buot, dakkár, dat, diekkár, diet, don, dot, duot, dákkár, dát, eanaš, eanebut, earrása, eará, eatnagat, eatnat, gait, gii, gii_nu, giige, goabbat, goabbá, goappaš, goappašagat, goappašat, guhte, guhtege, guktot, guoibmi, ieš, iešguhtege, iešguhtet, iežá, juoga, juohke, juohkehaš, makkár, makkárge, mihkkege, mii, mii_nu, muhtin, muhtun, mun, nubbi, oktage, ollugat, olus, seammá, soames, son, uhccán, unnán, veháš, visot
- This corpus contains 0 lemmas tagged as determiners (DET):
- This corpus contains 24 lemmas tagged as auxiliaries (AUX): beassat, berret, boahtit, bállet, dáidit, fertet, galgat, gillet, gártat, háliidit, ii, leat, lávet, máhttit, nagodit, orrut, sihtat, soaitit, sáhttit, veadjit, viggat, áigut, álgit, šaddat
- Out of the above, 19 lemmas occurred sometimes as AUX and sometimes as VERB: beassat, boahtit, dáidit, fertet, galgat, gillet, gártat, háliidit, leat, lávet, máhttit, orrut, sihtat, sáhttit, veadjit, viggat, áigut, álgit, šaddat
- There are 5 (de)verbal forms:
- Fin
- AUX: lea, leat, ii, lei, eai, ledje, galgá, lean, sáhttá, in
- VERB: lea, leat, lei, ledje, bođii, boahtá, manai, dieđe, lohká, lean
- Ger
- AUX: leamen, áigume, áigumin
- VERB: beroškeahttá, boahtimin, manadettiin, orodettiin, ráhkadettiin, vácci, fárremin, leamen, čierastallame, čuoigga
- Inf
- AUX: leat, beassat, sáhttit, leahkit, máhttit, álgit
- VERB: leat, vuolgit, boahtit, bargat, mannat, geahččat, oastit, oažžut, dahkat, ráhkadit
- Part
- AUX: leamaš, beassan, berren, gártan, leamašan, sáhttán
- VERB: oaidnán, ožžon, boahtán, leamaš, mannan, oahppan, čállán, bargan, dahkan, váldán
- Sup
- AUX: amadet, amas, amaset, amat
- VERB: vuoššažit
Nominal Features
- Dual
- AUX-Fin: leaba, leahppi, eaba, ean, fertebeahtti, letne, leigga, eahppi, ferteba, fertiiga
- PRON: soai, moai, doai, munno, dudno, sudno, sudnos, ieža, dudnos, munnos
- VERB-Fin: boahtiba, attiiga, leaba, lohkaba, vácciiga, Boahtti, Gárvodeahkku, Leahppi, Manni, bohte
- Plur
- ADJ: buorit, čeahpit, duhtavaččat, stuorrát, buriid, viššalat, bivnnuhat, boarrásepmosat, buoremusaid, dehálaččat
- AUX-Fin: leat, eai, ledje, eat, galget, lehpet, sáhttet, fertejit, sáhtte, eaige
- AUX-Sup: amadet, amaset
- NOUN: olbmot, mánát, mánáid, oahppit, olbmuid, nieiddat, sápmelaččat, biktasiid, sámiid, sámit
- NUM: Galliid, golmmain, guovttit, Galliin, Gallit, golmmaide, golmmaiguin, guvttiid, njealjit, ovttaid
- PRON: mii, sii, min, sin, daid, geat, dii, dat, mat, mis
- PROPN: Sámi_Jienat, Davviriikkaid
- VERB-Fin: leat, ledje, bohte, ožžot, bidjat, orro, bohtet, manne, šaddet, vulget
- Sing
- ADJ: buorre, váttis, nuppi, vejolaš, veara, boaris, dehálaš, suohtas, buori, divrras
- AUX-Fin: lea, ii, lei, galgá, sáhttá, in, lean, leat, áiggun, it
- AUX-Sup: amas, amat
- NOUN: sámi, jagi, sámegiela, eadni, beaivvi, gánda, oahpaheaddji, olmmoš, stállu, olbmo
- NUM: guokte, golbma, ovtta, okta, moadde, máŋga, golmma, vihtta, guovtti, máŋgga
- PRON: son, mun, dan, dat, dán, mu, don, mii, maid, su
- PROPN: Gállá, Liná, Norgga, Kárášjogas, Máret, Finnmárkku, Guovdageainnus, Máhtte, Sámi, Kárášjoga
- VERB-Fin: lea, lei, bođii, boahtá, manai, lohká, šattai, šaddá, oidnen, manná
- Abe
- VERB-Ger: beroškeahttá, eahpitkeahttá, logakeahttá, bážikeahttá, dieđikeahttá, mávssekeahtes
- Acc
- ADJ: buori, buriid, ollu, Goalmmáda, baháid, buhtismeahttumiid, doloža, guoskevačča, sullasačča, suohttasiiddiska
- NOUN: sámegiela, veahki, bierggu, biktasiid, mánáid, reivve, girjji, girjjiid, gáfe, barggu
- NUM: guokte, moadde, golbma, máŋga, ovtta, vihtta, Galliid, guhtta, njeallje, 1300
- PRON: dan, maid, su, iežas, daid, dán, iežaset, du, mu, maidege
- PROPN: Sarvva, Liná, Divvuma, Máhte, Sámedikki, Antonsen, Beckham, Buolbmága, Busi, Efraima
- Com
- ADJ: buriin
- NOUN: biillain, mánáiguin, mánáin, vugiin, beatnagiiddisguin, beatnagiin, biillaiguin, bissuin, boazodoaluin, borramušain
- NUM: ovttain, golmmain, guvttiin, viđain, galliin, golmmaiguin, čuđiin
- PRON: dainna, daiguin, dáinna, iežainis, nuppiin, suinna, duinna, iežainan, iežaineaskka, maiguin
- PROPN: Sámedikkiin, Birehiin, Hanseniin, Iŋggáin, Juffáin, Máhte-Iŋggáin, Márehiin, Nilut_Cupain, Rihtáin, Riibmagállásiin
- Ess
- ADJ: duhtavažžan, nubbin, seavdnjadin, nuorran, ruoksadin, bassin, bivnnuhin, boarisin, buhtisin, buhtismeahttumin
- AUX-Ger: leamen, áigume, áigumin
- NOUN: oahpaheaddjin, lassin, veahkkin, ovdamearkan, vuođđun, buohccedivššárin, nuorran, Eurohpameašttirin, bassin, buohccin
- NUM: guoktin, oktan
- PRON: danin, dákkárin, iehčaneame
- PROPN: Gállábárdnin, Jesusin, Mihkkalažžan, Márehažžan, Smierrun
- VERB-Ger: boahtimin, fárremin, leamen, čierastallame, bargame, bargamin, bassaladdame, bassame, boahtime, oađđimin
- Gen
- ADJ: nuppi, jagáš, buoremusaid, buori, 7-jahkásačča, buriid, doloža, parlamentáralaččaid, ráhkkásis
- NOUN: sámi, jagi, beaivvi, áigge, olbmo, sámegiela, máná, áiggi, skuvlla, sámiid
- NUM: golmma, viđa, máŋgga, ovtta, 12, guovtti, 1.8.2001, moatti, 05.01.00, 12.03.2010
- PRON: mu, dan, dán, min, su, iežas, sin, du, daid, iežaset
- PROPN: Norgga, Sámi, Finnmárkku, Kárášjoga, Romssa, Sámedikki, Ipmila, Guovdageainnu, Deanu, Ruoŧa
- VERB-Ger: vácci, čuoigga, gudnejahttin, ráhkistan, Mearkkašan, Suga, bora, fuopmášan, namahan, njága
- Ill
- ADJ: sullásaččaide
- NOUN: mánáide, skuvlii, gávpogii, meahccái, sámegillii, internáhttii, mollii, bargui, heajaide, siidii
- NUM: golmma, beannot, guovtti, čuohtái, moatti, máŋgga, njealji, ovtta, golmmaide
- PRON: munnje, dasa, sutnje, dutnje, sidjiide, alccesis, dán, midjiide, dan, earáide
- PROPN: Kárášjohkii, Sápmái, Ellii, Finnmárkkuopmodahkii, Gáivutnii, Hámmárfestii, Trosterudii, Aarbortii, Abbai, Arnii
- Loc
- ADJ: nuppi, Nuorabuin, Nuoramusain, doložis, ráhkkásisttán
- NOUN: skuvllas, internáhtas, guovllus, viesus, oasis, oktavuođas, olbmuin, barggus, goađis, gávpogis
- NUM: ovtta, guovtti, 1982:s, 1995:s, golmmain, máŋgga, 1834:s, 1877:s, 1898:s, 1899:s
- PRON: mus, dus, das, mis, sis, sus, dán, dan, mas, dain
- PROPN: Kárášjogas, Guovdageainnus, Finnmárkkus, Deanus, Gáivuonas, Norggas, Romssas, Máhtes, Olmmáivákkis, Oslos
- VERB-Ger: goargŋumis, juhkamis, bargamis, borgguheames, botkemis, deaivvadeamis, gođđimis, guldaleames, jáhkkimis, vuostáváldimis
- Nom
- ADJ: buorre, váttis, vejolaš, veara, buorit, boaris, dehálaš, suohtas, divrras, duohta
- NOUN: olbmot, mánát, eadni, gánda, olmmoš, stállu, oahppit, mánná, nieida, oahpaheaddji
- NUM: okta, guokte, golbma, máŋga, njeallje, vihtta, moadde, 1971, 2005, 50
- PRON: son, mun, mii, dat, sii, don, dát, soai, moai, geat
- PROPN: Gállá, Máret, Máhtte, Liná, Ánde, Sámediggi, Ánne, Biret, Ipmil, Finnmárkkuopmodat
Degree and Polarity
- Cmp
- ADJ: buoret, eanet, stuorát, guhkit, ovddit, vuolit, heajut, stuorit, dárkilet, geahppaset
- ADV: eanet, eambbo, unnit, buorebut, geahppaseabbot, lagat, viidáseappot, viidáseppot, árabuš, árat
- Sup
- ADJ: buoremus, boarráseamos, maŋimuš, maŋimus, nuoramus, eanemus, maŋemus, ođđaseamos, riggámus, Máttimus
- ADV: buoremusat, millosepmosit, unnimustá
- Neg
- AUX-Fin: ii, eai, in, eat, it, ale, eaba, iige, eaige, ean
- AUX-Sup: amadet, amas, amaset, amat
Verbal Features
- Perf
- AUX-Part: leamaš, beassan, berren, gártan, leamašan, sáhttán
- VERB-Part: oaidnán, ožžon, boahtán, leamaš, mannan, oahppan, čállán, bargan, dahkan, váldán
- Cnd
- AUX-Fin: livččii, livčče, galggaše, galggašii, sáhtášii, berrešii, livččen, áiggošin, Sáhtášeidde, Sáhtášeigga
- VERB-Fin: boađášii, livččii, Dieđálin, Gillešeiddet, Gorošii, adnojuvvošii, barggašeimme, barggašii, bisošedje, boađáše
- Imp
- AUX-Fin: ale, allet, Leage, Lehket
- VERB-Fin: boađe, mana, váldde, bija, geahča, Gula, Oahpa, Addet, Atte, Bidjet
- Ind
- AUX-Fin: lea, leat, ii, lei, eai, ledje, galgá, lean, sáhttá, in
- VERB-Fin: lea, leat, lei, ledje, bođii, boahtá, manai, lohká, dieđe, lean
- Pot
- AUX-Fin: leažžá, leaččan, sáhtežan, Leaččat, leažžat, ležže, sáhtežetne, sáhtežit
- VERB-Fin: bođeža, leažžá, bođežit, Bođežehpet, Leaččan, bođežeaba, dagažit, eležat, Boraža, Boražeaba
- Past
- AUX-Fin: lei, ledje, galggai, lean, fertii, ledjen, sáhtte, gillen, leimmet, lávejedje
- VERB-Fin: lei, ledje, bođii, manai, šattai, lean, bohte, oidnen, oinnii, válddii
- Pres
- AUX-Fin: lea, leat, galgá, sáhttá, lean, galget, áiggun, leaba, ferte, lehpet
- VERB-Fin: lea, leat, boahtá, lohká, dieđe, šaddá, ožžot, manná, oažžu, bidjat
- VERB-Part: orru, dábuhahtti, gođđi, johtti, Leahkki, buolli, ealli, fátmmasteaddji, juolludeaddji, vahágahtti
- Pass
- VERB-Fin: adnojuvvo, álggahuvvui, addojuvvo, dárbbašuvvojit, gáibiduvvo, mearriduvvo, biddjojuvvo, bálkestuvvo, daddjojuvvo, geavahuvvo
- VERB-Inf: adnojuvvot, árvvoštallojuvvot, čuovvoluvvot, addojuvvot, bisuhuvvot, dahkkot, dohkkehuvvot, dubmejuvvot, gávnnahuvvot, hábmejuvvot
- VERB-Part: filbmejuvvon, ráddjejuvvon, ráhkaduvvon, biddjon, bovdejuvvon, dahkkojuvvon, gildojuvvon, gorrojuvvon, hábmejuvvon, mearriduvvon
Pronouns, Determiners, Quantifiers
- Dem
- PRON: dat, dan, dán, dát, dakkár, daid, dasa, das, dákkár, dainna
- Ind
- PRON: buot, juohke, eará, muhtun, unnán, muhtin, seamma, oktage, nubbi, makkárge
- Int
- PRON: makkár, maid, Mii, gii, Gean, Goabbá, maidba
- Prs
- PRON: son, mun, mii, sii, mu, don, su, iežas, soai, min
- Rcp
- PRON: guhtet, goabbat, guimmiideaset, guimmiideasetguin, guoimmiska, nubbi, nuppiin
- Rel
- PRON: mii, maid, geat, mat, gii, mas, gean, guhte, man, geain
- Card
- NUM: guokte, golbma, ovtta, okta, moadde, máŋga, golmma, vihtta, guovtti, máŋgga
- Coll
- NOUN: máŋggas, guovttis, Máŋgasat, golbmasa, golbmasis, golmmas, guovttos, máná-guovttos, viđas, Biera-guovttos
- Yes
- PRON: iežas, iežaset, ieš, ieža, iežan, iežat, alccesis, alddis, iežadet, iežamet
- 1
- AUX-Fin: in, lean, eat, áiggun, leat, ledjen, ean, ferten, fertet, leimmet
- PRON: mun, mii, mu, min, moai, munnje, mus, mis, mon, midjiide
- VERB-Fin: oidnen, bidjat, muittán, oainnán, attán, lean, orun, boađán, vuolggán, čállen
- 2
- AUX-Fin: leat, it, lehpet, ale, leahppi, fertet, galggat, sáhtát, allet, fertebeahtti
- AUX-Sup: amadet, amat
- PRON: don, dii, du, dus, doai, din, dutnje, dis, dudno, dudnos
- VERB-Fin: boađe, váldde, dovddat, mana, Bija, boađát, manat, oaččut, Gula, Máhtát
- 3
- AUX-Fin: lea, ii, leat, lei, eai, ledje, galgá, sáhttá, galget, leaba
- AUX-Sup: amas, amaset
- PRON: son, sii, su, soai, sin, sis, sus, sutnje, sidjiide, sudno
- VERB-Fin: lea, leat, lei, ledje, bođii, boahtá, manai, lohká, šattai, bohte
- Dual
- ADJ: suohttasiiddiska, suohttasiiddáme
- NOUN: beatnagasaska, mánáidasame, Oappásteame, basttiideaskka, beatnageatte, beatnagiiddiska, botnjiideaskka, bártnisteatte, girjjiideatte, gusade
- PRON: iežade, iežaska, alcceseame, iežaineaskka, alcceseaskka, alcceseatte, alddiska, alddáde, guoimmiska, iehčaneame
- Plur
- ADJ: suohttasiiddámet
- ADV: gaskaneaset
- NOUN: mánáideaset, Oabbámet, beatnagasaset, biergasiiddiset, biillaideattetguin, dávviriiddádet, eatnigielaset, elliideaset, fulkkiideaset, gieđaideaset
- PRON: iežaset, iežadet, iežamet, alcceseamet, alcceseaset, alcceseattet, alddiset, alddámet, guimmiideaset, guimmiideasetguin
- Sing
- ADJ: ráhkkásis, ráhkkásisttán, suohttasiiddán
- ADV: badjelasas
- NOUN: vielljan, eatnis, áhčis, dahkamušaidis, namas, beatnagiiddisguin, beatnagiiddásis, beatnagis, bártniidis, bártnážan
- PRON: iežas, iežan, iežat, alccesis, alddis, iežainis, alcces, alccesan, alccesat, iežainan
Other Features
- Connegative
- Yes
- AUX-Fin: leat, sáhte, lean, galgga, gillen, sáhttán, nagot, galggaše, háliit, hálit
- VERB-Fin: leat, dieđe, lean, boađe, beasa, diehtán, daga, oaččo, dovdda, liikon
- Yes
- Person[psor]
- 1
- ADJ: ráhkkásisttán, suohttasiiddáme, suohttasiiddámet, suohttasiiddán
- NOUN: vielljan, bártnážan, heaggan, mánáidasame, mánážan, vielljasan, áhkkán, Oabbámet, Oappásteame, beatnagan
- PRON: iežan, iežamet, alcceseamet, alccesan, alcceseame, iežainan, alddámet, alddán, iehčaneame, iežaineame
- 2
- NOUN: mánát, áhččát, beatnagat, beatnageatte, beatnagiiddát, biergasiiddát, biiggáinat, biillaideattetguin, bálvaleddjiidat, bártnisteatte
- PRON: iežat, iežadet, iežade, alccesat, alcceseattet, alcceseatte, alddáde, iežaineatte
- 3
- ADJ: ráhkkásis, suohttasiiddiska
- ADV: badjelasas, gaskaneaset
- NOUN: eatnis, áhčis, dahkamušaidis, mánáideaset, namas, beatnagasaska, beatnagiiddisguin, beatnagiiddásis, beatnagis, bártniidis
- PRON: iežas, iežaset, alccesis, alddis, iežainis, iežaska, alcces, alcceseaset, alddiset, iežaineaskka
- 1
Syntax
Auxiliary Verbs and Copula
- This corpus uses 1 lemmas as copulas (cop). Examples: leat.
- This corpus uses 23 lemmas as auxiliaries (aux). Examples: leat, sáhttit, galgat, fertet, áigut, lávet, háliidit, beassat, máhttit, orrut, álgit, berret, nagodit, dáidit, gillet, boahtit, šaddat, soaitit, gártat, viggat, bállet, sihtat, veadjit.
Core Arguments, Oblique Arguments and Adjuncts
Here we consider only relations between verbs (parent) and nouns or pronouns (child).
- nsubj
- VERB-Fin--NOUN (3)
- VERB-Fin--NOUN-Gen (44)
- VERB-Fin--NOUN-Nom (1049)
- VERB-Fin--PRON-Nom (810)
- VERB-Ger--NOUN-Acc (5)
- VERB-Ger--NOUN-Gen (2)
- VERB-Ger--NOUN-Nom (16)
- VERB-Ger--PRON-Acc (2)
- VERB-Ger--PRON-Gen (2)
- VERB-Ger--PRON-Nom (12)
- VERB-Inf--NOUN-Acc (19)
- VERB-Inf--NOUN-Gen (2)
- VERB-Inf--NOUN-Nom (142)
- VERB-Inf--PRON-Acc (12)
- VERB-Inf--PRON-Nom (137)
- VERB-Part--NOUN-Acc (4)
- VERB-Part--NOUN-Gen (6)
- VERB-Part--NOUN-Nom (128)
- VERB-Part--PRON (1)
- VERB-Part--PRON-Acc (9)
- VERB-Part--PRON-Nom (124)
- obj
- VERB-Fin--NOUN-Acc (632)
- VERB-Fin--NOUN-Gen (23)
- VERB-Fin--PRON (2)
- VERB-Fin--PRON-Acc (170)
- VERB-Ger--NOUN-Acc (24)
- VERB-Ger--PRON-Acc (5)
- VERB-Inf--NOUN-Acc (340)
- VERB-Inf--NOUN-Gen (6)
- VERB-Inf--PRON (2)
- VERB-Inf--PRON-Acc (56)
- VERB-Part--NOUN-Acc (123)
- VERB-Part--NOUN-Gen (4)
- VERB-Part--PRON-Acc (44)
- VERB-Sup--NOUN-Acc (1)
Verbs with Reflexive Core Objects
- This corpus contains 16 lemmas that occur at least once with a reflexive core object (obj or iobj). Examples: lohkat iežas, lohkat iežaset, atnit iežas, bargat iežaset, dahkat iežaset, doalahit iežas, doallat iežas, dovdat iežas, dovdat iežaset, dovdat iežat, geahččat iežas, geažuhit iežas, heivehallat iežaset, kvalifiseret iežaset, ovddidit iežas, rábmot iežat
Relations Overview
- This corpus uses 7 relation subtypes: acl:relcl, aux:neg, cc:preconj, compound:nn, nmod:poss, xcomp:obj, xcomp:pred
- The following 10 relation types are not used in this corpus at all: iobj, expl, dislocated, clf, fixed, list, orphan, goeswith, reparandum, dep