X
: other
Definition
The tag X
is used for words that for some reason cannot be assigned
a real part-of-speech category.
Note: Version 1 of the Swedish UD treebank used X for foreign words (corresponding to the language-specific tag UO), but from version 1.1 these words are given a proper linguistic category and X is currently not used in Swedish. However, it still exists as a potentially permissible tag.
Treebank Statistics (UD_Swedish-LinES)
There are 1 X
lemmas (6%), 13 X
types (0%) and 17 X
tokens (0%).
Out of 17 observed tags, the rank of X
is: 17 in number of lemmas, 15 in number of types and 16 in number of tokens.
The 10 most frequent X
lemmas: _
The 10 most frequent X
types: W3C, SA, TSQL, .adp, .lpk, .mdb, .odc, CNS, EEG, MSDE
The 10 most frequent ambiguous lemmas: _ (NOUN 14002, VERB 11274, ADP 8898, PUNCT 8656, PRON 8194, ADV 6016, ADJ 5522, DET 4283, CONJ 3016, PROPN 2703, SCONJ 2587, AUX 2238, PART 1778, NUM 440, INTJ 179, X 17, SYM 9)
The 10 most frequent ambiguous types: SA (PROPN 3, X 2)
- SA
Morphology
The form / lemma ratio of X
is 13.000000 (the average of all parts of speech is 794.764706).
The 1st highest number of forms (13) was observed with the lemma “_”: .adp, .lpk, .mdb, .odc, CNS, EEG, MSDE, SA, SAP, TSQL, VBA, W3C, udl.
X
does not occur with any features.
Relations
X
nodes are attached to their parents using 4 different relations: sv-dep/appos (13; 76% instances), sv-dep/amod (2; 12% instances), sv-dep/conj (1; 6% instances), sv-dep/nsubj (1; 6% instances)
Parents of X
nodes belong to 4 different parts of speech: NOUN (13; 76% instances), PROPN (2; 12% instances), NUM (1; 6% instances), VERB (1; 6% instances)
1 (6%) X
nodes are leaves.
2 (12%) X
nodes have one child.
12 (71%) X
nodes have two children.
2 (12%) X
nodes have three or more children.
The highest child degree of a X
node is 4.
Children of X
nodes are attached using 7 different relations: sv-dep/punct (27; 82% instances), sv-dep/advmod (1; 3% instances), sv-dep/appos (1; 3% instances), sv-dep/case (1; 3% instances), sv-dep/cc (1; 3% instances), sv-dep/conj (1; 3% instances), sv-dep/nummod (1; 3% instances)
Children of X
nodes belong to 6 different parts of speech: PUNCT (27; 82% instances), NOUN (2; 6% instances), ADP (1; 3% instances), ADV (1; 3% instances), CONJ (1; 3% instances), NUM (1; 3% instances)
Treebank Statistics (UD_Swedish_Sign_Language)
There are 1 X
lemmas (9%), 37 X
types (11%) and 59 X
tokens (9%).
Out of 11 observed tags, the rank of X
is: 11 in number of lemmas, 3 in number of types and 3 in number of tokens.
The 10 most frequent X
lemmas: _
The 10 most frequent X
types: LÅTA-VARA, AVGRÄNS, GÅ(N), GLOSA:(PD)@z, HEJ-DÅ@g@z, AJABAJA@g@z, CHOCKA, GLOSA:(PF)@z, KLÄ-PÅ.HUVUDDEL, MÖSSA(G)
The 10 most frequent ambiguous lemmas: _ (VERB 318, NOUN 149, X 59, PRON 45, ADV 35, DET 19, INTJ 14, ADJ 14, NUM 8, ADP 8, CONJ 3)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of X
is 37.000000 (the average of all parts of speech is 29.545455).
The 1st highest number of forms (37) was observed with the lemma “_”: AJABAJA@g@z, AVGRÄNS, AVGRÄNS@z, CHOCKA, ENTITET(A)+FÖRFLYTTA, ENTITET(A)+RÖRELSE, GLOSA:(?)@z, GLOSA:(JÄSA)@z, GLOSA:(PD)@z, GLOSA:(PF)@z, GROVLEK(JJ), GROVLEK(JJv), GÅ(N), HEJ-DÅ@g@z, HUNGRIG, KLÄ-PÅ.HUVUDDEL, KLÄ-PÅ.NEDERDEL, KLÄ-PÅ.ÖVERDEL, KLÄ-PÅ:ÖVERDEL, KORTVARIG(J), LISTBOJ.EN, LISTBOJ.TVÅ, LÅTA-VARA, LÅTA-VARA@z, MÖSSA(G), PU(L)@g@z, PÅKALLA-UPPMÄRKSAMHET@g, SITTA(Vb).FL, STÅ(N), TITTA-FRAM, TITTA-FRAM@hd, TOMAT, VÄNTA@g, glosa@&, hund@&, lukta@&, tp@&.
X
does not occur with any features.
Relations
X
nodes are attached to their parents using 12 different relations: sv-dep/conj (20; 34% instances), sv-dep/root (10; 17% instances), sv-dep/discourse (7; 12% instances), sv-dep/dep (5; 8% instances), sv-dep/reparandum (4; 7% instances), sv-dep/amod (3; 5% instances), sv-dep/nmod (3; 5% instances), sv-dep/cc (2; 3% instances), sv-dep/dobj (2; 3% instances), sv-dep/acl (1; 2% instances), sv-dep/compound (1; 2% instances), sv-dep/nsubj (1; 2% instances)
Parents of X
nodes belong to 4 different parts of speech: VERB (32; 54% instances), NOUN (11; 19% instances), ROOT (10; 17% instances), X (6; 10% instances)
43 (73%) X
nodes are leaves.
9 (15%) X
nodes have one child.
2 (3%) X
nodes have two children.
5 (8%) X
nodes have three or more children.
The highest child degree of a X
node is 5.
Children of X
nodes are attached using 8 different relations: sv-dep/conj (20; 61% instances), sv-dep/nsubj (5; 15% instances), sv-dep/dobj (2; 6% instances), sv-dep/nmod (2; 6% instances), sv-dep/advcl (1; 3% instances), sv-dep/advmod (1; 3% instances), sv-dep/dep (1; 3% instances), sv-dep/discourse (1; 3% instances)
Children of X
nodes belong to 7 different parts of speech: VERB (15; 45% instances), NOUN (6; 18% instances), X (6; 18% instances), ADJ (2; 6% instances), PRON (2; 6% instances), ADV (1; 3% instances), INTJ (1; 3% instances)
X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]