
This page pertains to UD version 2.

UD for Assamese

Tokenization and Word Segmentation

Morphology

Universal Parts of Speech (UPOS)

Features

The nominal and verbal inflectional features used in the treebank are listed in the table below:

| Category | Universal feature | Values observed |
|---|---|---|
| Nominal | Case | Nom; Acc/Dat, e.g. -ক -k (in most cases); Gen, e.g. -ৰ -or; Loc, e.g. -ত -t; Abl, e.g. -পৰা -pora; Erg, e.g. -ে -e; All, e.g. -লৈ -loi |
| Nominal | Number | Sing, Plur |
| Nominal | Gender | Fem, Masc |
| Nominal | Person | 1, 2, 3 |
| Nominal | Definite | Def |
| Nominal | PronType | Prs, Ind, Int, Dem, Tot |
| Verbal | VerbForm | Fin, Part, Conv, Inf, Vnoun |
| Verbal | Tense | Past, Pres, Fut |
| Verbal | Polarity | Neg |
| Verbal | Mood | Ind, Imp |
| Verbal | Aspect | Perf, Prog |
| Adposition | AdpType | Post (for postpositions) |

Table 1: Nominal and verbal inflectional features and the values of these features observed in the treebank corpus.
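
As an illustration of how the feature inventory above might be used, the following minimal sketch parses a CoNLL-U FEATS string and flags any feature or value that is not listed in Table 1. The names `OBSERVED_FEATURES`, `parse_feats`, and `unexpected_features` are hypothetical helpers, not part of any UD tool. The sketch assumes that the combined Acc/Dat entry corresponds to the values `Acc` and `Dat`; how the treebank actually encodes it is not stated on this page.

```python
# Feature inventory transcribed from Table 1 (hypothetical helper, not a UD tool).
# "Masc" is the UD v2 spelling of the masculine gender value.
# Assumption: the table's "Acc/Dat" entry is covered by the values Acc and Dat.
OBSERVED_FEATURES = {
    "Case": {"Nom", "Acc", "Dat", "Gen", "Loc", "Abl", "Erg", "All"},
    "Number": {"Sing", "Plur"},
    "Gender": {"Fem", "Masc"},
    "Person": {"1", "2", "3"},
    "Definite": {"Def"},
    "PronType": {"Prs", "Ind", "Int", "Dem", "Tot"},
    "VerbForm": {"Fin", "Part", "Conv", "Inf", "Vnoun"},
    "Tense": {"Past", "Pres", "Fut"},
    "Polarity": {"Neg"},
    "Mood": {"Ind", "Imp"},
    "Aspect": {"Perf", "Prog"},
    "AdpType": {"Post"},
}


def parse_feats(feats):
    """Parse a CoNLL-U FEATS string such as 'Case=Gen|Number=Sing' into a dict."""
    if feats == "_":  # "_" marks an empty FEATS column in CoNLL-U
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def unexpected_features(feats):
    """Return the (feature, value) pairs that do not appear in Table 1."""
    problems = []
    for name, value in parse_feats(feats).items():
        # CoNLL-U allows comma-separated multi-values, e.g. Case=Acc,Dat
        for single in value.split(","):
            if single not in OBSERVED_FEATURES.get(name, set()):
                problems.append((name, single))
    return problems
```

Calling `unexpected_features` on the FEATS column of each token gives a quick consistency check of an annotated file against the table; an empty list means every feature and value on that token is one of those listed above.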

Syntax

Dependency Relations

Of the relations defined in Universal Dependencies v2, this treebank uses the following main relations and subtypes:

| Relation type | Observed relations |
|---|---|
| Core clausal | root, nsubj, obj, iobj, cop, aux, ccomp, xcomp |
| Adverbial | advcl, advmod, mark |
| Oblique | obl |
| Nominal modifiers | appos, amod, acl, nmod, nmod:poss, nummod, case, det |
| Coordination and punctuation | cc, conj, parataxis, discourse, punct, vocative |
| Compound | compound:lvc, compound:svc, compound:redup, compound:nummod, fixed |

Table 2: Dependency relations observed in the annotated Assamese sentences.
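
The relation inventory above can likewise be turned into a small check on the DEPREL column of a CoNLL-U file. The sketch below is only illustrative: `OBSERVED_RELATIONS`, `split_relation`, and `is_observed` are hypothetical names, and the leading "compound" in the last table row is read as a row label rather than as an observed relation, which is an assumption.

```python
# Relation inventory transcribed from Table 2 (hypothetical helper, not a UD tool).
# "nsubj" is the UD v2 name of the nominal subject relation.
# Assumption: plain "compound" is a row label in the table, so it is not listed here.
OBSERVED_RELATIONS = {
    "root", "nsubj", "obj", "iobj", "cop", "aux", "ccomp", "xcomp",
    "advcl", "advmod", "mark",
    "obl",
    "appos", "amod", "acl", "nmod", "nmod:poss", "nummod", "case", "det",
    "cc", "conj", "parataxis", "discourse", "punct", "vocative",
    "compound:lvc", "compound:svc", "compound:redup", "compound:nummod", "fixed",
}


def split_relation(deprel):
    """Split a relation such as 'nmod:poss' into (universal relation, subtype or None)."""
    base, _, subtype = deprel.partition(":")
    return base, subtype or None


def is_observed(deprel):
    """Return True if the relation label is one of those listed in Table 2."""
    return deprel in OBSERVED_RELATIONS
```

For example, `split_relation("compound:lvc")` separates the universal relation `compound` from the subtype `lvc`, which is convenient when comparing against treebanks that use only the universal relations.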

Core arguments and obliques:

Nominal modification:

Adverbials and subordination:

Complementation:

Coordination:

Light and serial verbs:

Reduplication:

Treebanks

Corpus Annotated by Kaushik Sengupta