home edit page issue tracker

This page still pertains to UD version 1.

X: other

Definition

The X tag is used for words that for some reason cannot be assigned a real part-of-speech category.

In Slovenian UD Treebank, this tag is mostly used for cases of code-switching where it was not meaningful to analyze the intervening language, such as Europe of knowledge, La connaissance de soi, Bundesvereinigung det Deutschen Arbeitgeberverbände. In cases where foreign-language sequences include both foreign and loan words, only foreign words are assigned the X tag, as in The Life of Brian, where both Life and Brian are marked as NOUN and PROPN respectively.

Other subcategories marked with X include abbreviations with dots (dr.), URL addresses (www.radenska.si), news author abbreviations (sta) and tokens with alpha-numerical combinations (6230i).

Conversion from JOS

All tokens with tag Residual are converted to X. Additionally, all abreviations are also converted to X.


X in other languages: [cs] [da] [en] [et] [fi] [fr] [ga] [grc] [hy] [it] [ja] [kpv] [myv] [no] [ru] [sl] [sv] [tr] [uk] [u] [urj] [yue] [zh]