This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home it/pos issue tracker

X: other

Definition

The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.

Note that the universal guidelines recommend the usage of X for cases of code-switching where it is not possible (or meaningful) to analyze the intervening language grammatically (and where the dependency relation foreign is typically used in the syntactic analysis). The PoS tag for Italian in such cases is SW (foreign noun) and is mapped into X in the conversion.

This usage does not extend to ordinary loan words where it is assigned a normal part-of-speech.

Examples


Treebank Statistics (UD_Italian)

There are 134 X lemmas (1%), 133 X types (0%) and 184 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 10 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: a, b, c, f, ad, damage, de, home, Come, Habemus

The 10 most frequent X types: a, b, c, f, Damage, ad, de, home, la, Come

The 10 most frequent ambiguous lemmas: a (ADP 7523, NOUN 10, X 8, PROPN 3, DET 3, ADV 2, CONJ 1), b (NOUN 11, X 8), c (NOUN 7, X 3), f (NOUN 2, X 2), ad (ADP 12, X 3, NOUN 3), de (ADP 38, PROPN 6, X 3, DET 3), home (X 2, NOUN 2), Come (X 2, ADP 1), Habemus (X 2, PROPN 2), electronic (ADJ 3, X 2)

The 10 most frequent ambiguous types: a (ADP 6766, NOUN 10, X 8, PROPN 2, ADV 2), b (NOUN 11, X 8), c (NOUN 7, X 3), f (NOUN 2, X 2), ad (ADP 439, NOUN 3, X 3, PROPN 1), de (ADP 43, PROPN 6, DET 2, X 2), home (X 2, NOUN 2), la (DET 8654, PRON 126, X 2, PROPN 2), Come (ADV 174, SCONJ 25, ADP 24, X 2, VERB 1), Electronic (X 2, ADJ 1)

Morphology

The form / lemma ratio of X is 0.992537 (the average of all parts of speech is 1.488836).

The 1st highest number of forms (2) was observed with the lemma “lo”: I, la.

The 2nd highest number of forms (1) was observed with the lemma “(!),”: (!),.

The 3rd highest number of forms (1) was observed with the lemma “Attiva”: Attiva.

X occurs with 2 features: it-feat/Gender (1; 1% instances), it-feat/Number (1; 1% instances)

X occurs with 2 feature-value pairs: Gender=Masc, Number=Sing

X occurs with 3 feature combinations. The most frequent feature combination is _ (182 tokens). Examples: a, b, c, f, Damage, ad, de, home, la, Come

Relations

X nodes are attached to their parents using 15 different relations: it-dep/foreign (74; 40% instances), it-dep/nummod (32; 17% instances), it-dep/nmod (29; 16% instances), it-dep/compound (8; 4% instances), it-dep/xcomp (7; 4% instances), it-dep/appos (6; 3% instances), it-dep/mwe (6; 3% instances), it-dep/advmod (4; 2% instances), it-dep/conj (4; 2% instances), it-dep/name (4; 2% instances), it-dep/root (4; 2% instances), it-dep/dobj (2; 1% instances), it-dep/nsubj (2; 1% instances), it-dep/amod (1; 1% instances), it-dep/discourse (1; 1% instances)

Parents of X nodes belong to 8 different parts of speech: X (86; 47% instances), NOUN (48; 26% instances), VERB (28; 15% instances), NUM (8; 4% instances), PROPN (7; 4% instances), ROOT (4; 2% instances), PRON (2; 1% instances), ADJ (1; 1% instances)

92 (50%) X nodes are leaves.

46 (25%) X nodes have one child.

11 (6%) X nodes have two children.

35 (19%) X nodes have three or more children.

The highest child degree of a X node is 13.

Children of X nodes are attached using 20 different relations: it-dep/foreign (73; 32% instances), it-dep/punct (72; 31% instances), it-dep/case (17; 7% instances), it-dep/det (13; 6% instances), it-dep/conj (8; 3% instances), it-dep/name (7; 3% instances), it-dep/cc (6; 3% instances), it-dep/nmod (6; 3% instances), it-dep/appos (5; 2% instances), it-dep/acl:relcl (4; 2% instances), it-dep/amod (4; 2% instances), it-dep/cop (3; 1% instances), it-dep/nummod (3; 1% instances), it-dep/nsubj (2; 1% instances), it-dep/acl (1; 0% instances), it-dep/advmod (1; 0% instances), it-dep/auxpass (1; 0% instances), it-dep/compound (1; 0% instances), it-dep/nsubjpass (1; 0% instances), it-dep/parataxis (1; 0% instances)

Children of X nodes belong to 12 different parts of speech: X (86; 38% instances), PUNCT (72; 31% instances), ADP (17; 7% instances), DET (13; 6% instances), NOUN (13; 6% instances), VERB (9; 4% instances), CONJ (6; 3% instances), PROPN (6; 3% instances), ADJ (3; 1% instances), NUM (2; 1% instances), ADV (1; 0% instances), AUX (1; 0% instances)


X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]