home edit page issue tracker

This page pertains to UD version 2.

SYM: symbol


A symbol is a word-like entity that differs from ordinary words by form, function, or both.

Many symbols are or contain special non-alphanumeric characters, similarly to punctuation. What makes them different from punctuation is that they can be substituted by normal words. This involves all currency symbols, e.g. ֏ 75 is identical to յոթանասունհինգ դրամ “seventy-five armenian drams”.

Mathematical operators form another group of symbols.

Another group of symbols is emoticons and emoji.

Strings that consists entirely of alphanumeric characters are not symbols but they may be proper nouns: 130XE, DC10; others may be tagged PROPN (rather than SYM) even if they contain special characters: ՏՈՒ-154Մ (“Tu-154M”). Similarly, abbreviations for single words are not symbols but are assigned the part of speech of the full form. For example, պրն (պարոն “Mr.; Mister”), կգ (կիլոգրամ “kg; kilogramm”), կմ (կիլոմետր “km; kilometer”) should be tagged nouns. Acronyms for proper names such as ՄԱԿ “UN” and ՆԱՏՕ “NATO” should be tagged as proper nouns.

Characters used as bullets in itemized lists (•, ‣) are not symbols, they are punctuation.


SYM in other languages: [cs] [cy] [da] [en] [et] [fi] [fr] [ga] [grc] [hy] [it] [ja] [kk] [no] [pt] [ru] [sl] [sv] [tr] [uk] [u] [urj] [yue] [zh]