home edit page issue tracker

This page pertains to UD version 2.

SYM: symbol


A symbol is a word-like entity that differs from ordinary words by form, function, or both. Symbols are distinct from punctuation that delimit linguistic units in printed text and do not have any semantic function. What makes them different from punctuation is that they can be substituted by normal words, as in currency symbols (e.g. ) and mathematical operators (e.g. +). Symbols also include emoticons and emoji, URL and e-mail addresses, hasthags and similar.

Abbreviations for single words are not symbols but are assigned the part of speech of the full form (e.g. NOUN for dr.). Similarly, acronyms for proper nouns (e.g. NATO) or common nouns (e.g. USB) are annotated as PROPN and NOUN, respectively.


Conversion from JOS

The list of characters in ssj500k treebank has been manually divided into subgroups of PUNCT and SYM. Note that some characters display characteristics of both POS categories, such as asterisk or dash-like characters that can either function as mathematical operators (SYM) or bullets in itemized lists (PUNCT). In case of such ambiguity, the more common function was chosen.

SYM in other languages: [cs] [cy] [da] [en] [et] [fi] [fr] [ga] [grc] [hy] [it] [ja] [kk] [no] [pt] [ru] [sl] [sv] [tr] [uk] [u] [urj] [yue] [zh]