home edit page issue tracker

This page pertains to UD version 2.

UD for Georgian

This is a work-in-progress overview of the UD annotation for Georgian.

Tokenization and Word Segmentation


Lemmatization

Lemmatization Strategies

Nominals: The lemma is consistently represented as the nominative singular form, providing a straightforward and standard approach.

Verbs: Georgian verbs lack an infinitive form, resulting in two lemmatization strategies:

Lemmatization in UD Treebanks

For Universal Dependency (UD) treebanks, lemmatization practices typically reflect a hybrid approach, influenced by the diverse strategies used for Georgian verbs.


Morphology

Tags


Features

Lexical Features

Inflectional Features

Nominal Features
Verbal Features

Instruction: Describe inherent and inflectional features for major word classes (at least NOUN and VERB). Describe other noteworthy features. Include links to language-specific feature definitions if any.


Syntax


Treebanks

There are the following treebanks for Old Georgian: