
This page pertains to UD version 2.

X: other

Definition

The X tag is used for words that for some reason cannot be assigned a real part-of-speech category.

In the Slovenian UD Treebank, this tag is mostly used for cases of code-switching where it was not meaningful to analyze the intervening language, such as Europe of knowledge, La connaissance de soi, Bundesvereinigung der Deutschen Arbeitgeberverbände. In cases where foreign-language sequences include both foreign and loan words, only the foreign words are assigned the X tag, as in The Life of Brian, where Life and Brian are marked as NOUN and PROPN, respectively.

Other subcategories marked with X include abbreviations with dots (dr.), URLs (www.radenska.si), news author abbreviations (sta) and tokens with alphanumeric combinations (6230i).

Conversion from JOS

All tokens with the tag Residual are converted to X. All abbreviations are also converted to X.
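
The conversion rule above can be sketched as a small lookup. This is only an illustrative sketch, not the actual conversion script: the function name jos_to_upos_x is hypothetical, and the category label "Abbreviation" is an assumption (only Residual is named on this page).

```python
from typing import Optional

# JOS categories that convert to the UD tag X.
# "Residual" is named on this page; "Abbreviation" is an assumed label
# for the JOS category that covers abbreviations.
JOS_CATEGORIES_TO_X = {"Residual", "Abbreviation"}


def jos_to_upos_x(jos_category: str) -> Optional[str]:
    """Return "X" if the given JOS category converts to X, else None.

    Hypothetical helper: other JOS categories are converted by rules
    that this page does not describe, so they are left undecided here.
    """
    if jos_category in JOS_CATEGORIES_TO_X:
        return "X"
    return None
```

Under these assumptions, jos_to_upos_x("Residual") and jos_to_upos_x("Abbreviation") both return "X", while any other category returns None and is left to the remaining conversion rules.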


X in other languages: [bej] [cs] [cy] [da] [el] [en] [es] [ess] [et] [fi] [fr] [ga] [grc] [hy] [it] [ja] [ka] [kpv] [ky] [myv] [no] [qpm] [ru] [sl] [sv] [tr] [tt] [uk] [u] [urj] [yue] [zh]