This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home sv/pos issue tracker

X: other

Definition

The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.

Note: Version 1 of the Swedish UD treebank used X for foreign words (corresponding to the language-specific tag UO), but from version 1.1 these words are given a proper linguistic category and X is currently not used in Swedish. However, it still exists as a potentially permissible tag.


Treebank Statistics (UD_Swedish-LinES)

There are 1 X lemmas (6%), 13 X types (0%) and 17 X tokens (0%). Out of 17 observed tags, the rank of X is: 17 in number of lemmas, 15 in number of types and 16 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: W3C, SA, TSQL, .adp, .lpk, .mdb, .odc, CNS, EEG, MSDE

The 10 most frequent ambiguous lemmas: _ (NOUN 14002, VERB 11274, ADP 8898, PUNCT 8656, PRON 8194, ADV 6016, ADJ 5522, DET 4283, CONJ 3016, PROPN 2703, SCONJ 2587, AUX 2238, PART 1778, NUM 440, INTJ 179, X 17, SYM 9)

The 10 most frequent ambiguous types: SA (PROPN 3, X 2)

Morphology

The form / lemma ratio of X is 13.000000 (the average of all parts of speech is 794.764706).

The 1st highest number of forms (13) was observed with the lemma “_”: .adp, .lpk, .mdb, .odc, CNS, EEG, MSDE, SA, SAP, TSQL, VBA, W3C, udl.

X does not occur with any features.

Relations

X nodes are attached to their parents using 4 different relations: sv-dep/appos (13; 76% instances), sv-dep/amod (2; 12% instances), sv-dep/conj (1; 6% instances), sv-dep/nsubj (1; 6% instances)

Parents of X nodes belong to 4 different parts of speech: NOUN (13; 76% instances), PROPN (2; 12% instances), NUM (1; 6% instances), VERB (1; 6% instances)

1 (6%) X nodes are leaves.

2 (12%) X nodes have one child.

12 (71%) X nodes have two children.

2 (12%) X nodes have three or more children.

The highest child degree of a X node is 4.

Children of X nodes are attached using 7 different relations: sv-dep/punct (27; 82% instances), sv-dep/advmod (1; 3% instances), sv-dep/appos (1; 3% instances), sv-dep/case (1; 3% instances), sv-dep/cc (1; 3% instances), sv-dep/conj (1; 3% instances), sv-dep/nummod (1; 3% instances)

Children of X nodes belong to 6 different parts of speech: PUNCT (27; 82% instances), NOUN (2; 6% instances), ADP (1; 3% instances), ADV (1; 3% instances), CONJ (1; 3% instances), NUM (1; 3% instances)


Treebank Statistics (UD_Swedish_Sign_Language)

There are 1 X lemmas (9%), 37 X types (11%) and 59 X tokens (9%). Out of 11 observed tags, the rank of X is: 11 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent X lemmas: _

The 10 most frequent X types: LÅTA-VARA, AVGRÄNS, GÅ(N), GLOSA:(PD)@z, HEJ-DÅ@g@z, AJABAJA@g@z, CHOCKA, GLOSA:(PF)@z, KLÄ-PÅ.HUVUDDEL, MÖSSA(G)

The 10 most frequent ambiguous lemmas: _ (VERB 318, NOUN 149, X 59, PRON 45, ADV 35, DET 19, INTJ 14, ADJ 14, NUM 8, ADP 8, CONJ 3)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of X is 37.000000 (the average of all parts of speech is 29.545455).

The 1st highest number of forms (37) was observed with the lemma “_”: AJABAJA@g@z, AVGRÄNS, AVGRÄNS@z, CHOCKA, ENTITET(A)+FÖRFLYTTA, ENTITET(A)+RÖRELSE, GLOSA:(?)@z, GLOSA:(JÄSA)@z, GLOSA:(PD)@z, GLOSA:(PF)@z, GROVLEK(JJ), GROVLEK(JJv), GÅ(N), HEJ-DÅ@g@z, HUNGRIG, KLÄ-PÅ.HUVUDDEL, KLÄ-PÅ.NEDERDEL, KLÄ-PÅ.ÖVERDEL, KLÄ-PÅ:ÖVERDEL, KORTVARIG(J), LISTBOJ.EN, LISTBOJ.TVÅ, LÅTA-VARA, LÅTA-VARA@z, MÖSSA(G), PU(L)@g@z, PÅKALLA-UPPMÄRKSAMHET@g, SITTA(Vb).FL, STÅ(N), TITTA-FRAM, TITTA-FRAM@hd, TOMAT, VÄNTA@g, glosa@&, hund@&, lukta@&, tp@&.

X does not occur with any features.

Relations

X nodes are attached to their parents using 12 different relations: sv-dep/conj (20; 34% instances), sv-dep/root (10; 17% instances), sv-dep/discourse (7; 12% instances), sv-dep/dep (5; 8% instances), sv-dep/reparandum (4; 7% instances), sv-dep/amod (3; 5% instances), sv-dep/nmod (3; 5% instances), sv-dep/cc (2; 3% instances), sv-dep/dobj (2; 3% instances), sv-dep/acl (1; 2% instances), sv-dep/compound (1; 2% instances), sv-dep/nsubj (1; 2% instances)

Parents of X nodes belong to 4 different parts of speech: VERB (32; 54% instances), NOUN (11; 19% instances), ROOT (10; 17% instances), X (6; 10% instances)

43 (73%) X nodes are leaves.

9 (15%) X nodes have one child.

2 (3%) X nodes have two children.

5 (8%) X nodes have three or more children.

The highest child degree of a X node is 5.

Children of X nodes are attached using 8 different relations: sv-dep/conj (20; 61% instances), sv-dep/nsubj (5; 15% instances), sv-dep/dobj (2; 6% instances), sv-dep/nmod (2; 6% instances), sv-dep/advcl (1; 3% instances), sv-dep/advmod (1; 3% instances), sv-dep/dep (1; 3% instances), sv-dep/discourse (1; 3% instances)

Children of X nodes belong to 7 different parts of speech: VERB (15; 45% instances), NOUN (6; 18% instances), X (6; 18% instances), ADJ (2; 6% instances), PRON (2; 6% instances), ADV (1; 3% instances), INTJ (1; 3% instances)


X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]