X: other
The tag X is used for words that for some reason cannot be assigned a real part-of-speech category.
Foreign words appearing inside native text are tagged X (see also Foreign).
Examples
- [fi] Uskoo ken tahtsssszzt brrrzzzt.
- [fi] Opimme fyysikoiden “Let’s assume a spherical cow” -lähestymistavan.
Treebank Statistics (UD_Finnish)
There are 228 X lemmas (1%), 237 X types (0%) and 285 X tokens (0%).
Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 14 in number of tokens.
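Counts and ranks of this kind can be derived from a treebank in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the function names `upos_counts` and `rank`, the toy rows, and the choice to count types case-sensitively are assumptions made for illustration.

```python
from collections import defaultdict


def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive word forms
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}


def rank(counts, tag, index):
    """1-based rank of `tag` when tags are sorted by counts[tag][index], descending."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1


# Toy data (hypothetical, not taken from the treebank): ID, FORM, LEMMA, UPOS.
rows = [
    ("1", "DIY", "DIY", "X"),
    ("2", "dollyn", "dolly", "X"),
    ("3", "dolly", "dolly", "X"),
    ("4", "on", "olla", "VERB"),
    ("5", ".", ".", "PUNCT"),
]
sample = "\n".join(
    "\t".join([i, form, lemma, upos, "_", "_", "_", "_", "_", "_"])
    for i, form, lemma, upos in rows
)

counts = upos_counts(sample)
print(counts["X"])           # (lemmas, types, tokens) for X
print(rank(counts, "X", 2))  # rank of X by number of tokens
```

In the toy data, X has two lemmas, three types and three tokens, so it ranks first by token count among the three tags present.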
The 10 most frequent X lemmas: metal, common, death, dolly, eHealth, a, and, api, fun, it
The 10 most frequent X types: metal, common, death, a, and, eHealth, fun, it, pic, DIY
The 10 most frequent ambiguous lemmas: a (NOUN 30, X 3, PROPN 1), and (PROPN 10, X 3), I (ADJ 34, PROPN 5, X 2), Diàoyútái (X 1, PROPN 1), Do (PROPN 5, X 1), Don’t (X 1, PROPN 1), Finnish (PROPN 2, X 1), Grey (PROPN 1, X 1), Life (PROPN 2, X 1), Yourself (PROPN 1, X 1)
The 10 most frequent ambiguous types: a (NOUN 30, X 3, PROPN 1), and (PROPN 10, X 3), I (ADJ 26, PROPN 3, X 2), Do (PROPN 5, X 1), Don’t (X 1, PROPN 1), Finnish (PROPN 2, X 1), Life (PROPN 1, X 1), On (VERB 92, AUX 13, PROPN 5, X 1), Yourself (X 1, PROPN 1), by (X 1, PROPN 1)
- Yourself
- X 1: Kuten tuli tuossa ensimmäisessä Arduinoa käsitelleessä postauksessa luvattua , suunnitelmissani on rakentaa DIY ( Do It Yourself , Tee Se Itse ) intervalliajastin sekä moottoroitu alusta kameralle .
- PROPN 1: Go Chuck Yourself ( Euroopassa ja Pohjois-Amerikassa ) tai Happy Live Surprise ( Japanissa ) on yhtyeen Sum 41 livealbumi , joka nauhoitettiin Lontoossa , Ontariossa huhtikuussa 2005 .
Morphology
The form / lemma ratio of X is 1.039474 (the average of all parts of speech is 2.036755).
The 1st highest number of forms (3) was observed with the lemma “dolly”: dolly, dollyja, dollyn.
The 2nd highest number of forms (2) was observed with the lemma “API”: API, APIn.
The 3rd highest number of forms (2) was observed with the lemma “eHealth”: eHealth, eHealthin.
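The reported ratio matches the number of X types divided by the number of X lemmas (237 / 228 for UD_Finnish, 269 / 270 for UD_Finnish-FTB). This reading is inferred from the figures on this page, not from a documented definition. A small sketch of that arithmetic, together with a forms-per-lemma ranking, follows; the helper names are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(n_types, n_lemmas):
    """Assumed definition: distinct word forms divided by distinct lemmas."""
    return n_types / n_lemmas


def forms_per_lemma(pairs):
    """Group (form, lemma) pairs by lemma, sorted by number of distinct forms."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    # Most forms first; ties are broken alphabetically by lemma.
    return sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))


# Counts quoted on this page.
print(f"{form_lemma_ratio(237, 228):.6f}")  # 1.039474 (UD_Finnish)
print(f"{form_lemma_ratio(269, 270):.6f}")  # 0.996296 (UD_Finnish-FTB)

# Forms and lemmas quoted on this page.
pairs = [
    ("dolly", "dolly"), ("dollyja", "dolly"), ("dollyn", "dolly"),
    ("API", "API"), ("APIn", "API"),
    ("eHealth", "eHealth"), ("eHealthin", "eHealth"),
]
top = forms_per_lemma(pairs)
for lemma, forms in top:
    print(lemma, len(forms), sorted(forms))
```

A ratio close to 1 indicates that X words are rarely inflected, which is consistent with most of them being foreign material.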
X occurs with 1 feature: Foreign (276; 97% instances)
X occurs with 2 feature-value pairs: Foreign=Foreign, Foreign=Fscript
X occurs with 3 feature combinations.
The most frequent feature combination is Foreign=Foreign (249 tokens).
Examples: metal, common, death, a, and, eHealth, fun, it, pic, DIY
Relations
X nodes are attached to their parents using 22 different relations: foreign (93; 33% instances), appos (48; 17% instances), compound:nn (32; 11% instances), name (21; 7% instances), conj (14; 5% instances), root (14; 5% instances), nmod (13; 5% instances), nsubj (11; 4% instances), discourse (10; 4% instances), dobj (10; 4% instances), nmod:gobj (3; 1% instances), amod (2; 1% instances), cc (2; 1% instances), nmod:poss (2; 1% instances), parataxis (2; 1% instances), remnant (2; 1% instances), advmod (1; 0% instances), ccomp (1; 0% instances), csubj:cop (1; 0% instances), goeswith (1; 0% instances), mwe (1; 0% instances), nsubj:cop (1; 0% instances)
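A relation breakdown of this form (count and rounded share per relation) can be computed from the DEPREL values of all X tokens. The sketch below uses a hypothetical helper `relation_shares` and a toy list of relations; rounding to whole percentages with Python's `round` is an assumption about how the page's figures were produced.

```python
from collections import Counter


def relation_shares(deprels):
    """Return [(relation, count, percent)] sorted by count, most frequent first."""
    counts = Counter(deprels)
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Toy input (hypothetical): the DEPREL of four X tokens.
shares = relation_shares(["foreign", "foreign", "appos", "root"])
for rel, n, pct in shares:
    print(f"{rel} ({n}; {pct}% instances)")
```

Applied to the counts reported here, 93 foreign attachments out of 285 X tokens gives 32.6%, which rounds to the 33% shown.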
Parents of X nodes belong to 6 different parts of speech: X (127; 45% instances), NOUN (80; 28% instances), VERB (37; 13% instances), PROPN (24; 8% instances), ROOT (14; 5% instances), ADJ (3; 1% instances)
160 (56%) X nodes are leaves.
39 (14%) X nodes have one child.
25 (9%) X nodes have two children.
61 (21%) X nodes have three or more children.
The highest child degree of an X node is 11.
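The child-degree figures can be obtained by counting, for each X node, how many other nodes name it as their HEAD. The following sketch assumes sentences are given as lists of `(id, head, upos)` tuples; the function names and the toy sentence are hypothetical, and only the four counts passed to `percentages` come from this page.

```python
from collections import Counter


def child_degree_buckets(sentences, tag="X"):
    """Count `tag` nodes with 0, 1, 2, or 3+ children (the 3+ bucket is key 3)."""
    buckets = Counter()
    for sent in sentences:
        # Number of dependents per head ID within this sentence.
        n_children = Counter(head for _id, head, _upos in sent)
        for node_id, _head, upos in sent:
            if upos == tag:
                buckets[min(n_children[node_id], 3)] += 1
    return buckets


def percentages(counts):
    """Share of each count in the total, rounded to whole percent."""
    total = sum(counts)
    return [round(100 * n / total) for n in counts]


# Toy sentence (hypothetical): node 1 heads nodes 2, 3 and 4.
toy = [[(1, 0, "X"), (2, 1, "X"), (3, 1, "X"), (4, 1, "PUNCT")]]
buckets = child_degree_buckets(toy)
print(dict(buckets))

# Counts reported on this page for UD_Finnish: leaves, one, two, three or more.
print(percentages([160, 39, 25, 61]))  # [56, 14, 9, 21]
```

The four reported counts sum to 285, the total number of X tokens, so every X node falls into exactly one bucket.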
Children of X nodes are attached using 25 different relations: punct (119; 32% instances), foreign (93; 25% instances), conj (26; 7% instances), name (26; 7% instances), nmod (20; 5% instances), amod (17; 5% instances), cc (14; 4% instances), appos (13; 4% instances), advmod (6; 2% instances), nmod:poss (5; 1% instances), cop (4; 1% instances), nsubj:cop (4; 1% instances), acl:relcl (3; 1% instances), advcl (2; 1% instances), compound:nn (2; 1% instances), det (2; 1% instances), nummod (2; 1% instances), parataxis (2; 1% instances), remnant (2; 1% instances), acl (1; 0% instances), aux (1; 0% instances), cc:preconj (1; 0% instances), discourse (1; 0% instances), mark (1; 0% instances), nsubj (1; 0% instances)
Children of X nodes belong to 12 different parts of speech: PUNCT (127; 35% instances), X (127; 35% instances), NOUN (41; 11% instances), ADJ (20; 5% instances), VERB (17; 5% instances), CONJ (12; 3% instances), PROPN (10; 3% instances), ADV (6; 2% instances), PRON (3; 1% instances), SYM (3; 1% instances), AUX (1; 0% instances), SCONJ (1; 0% instances)
Treebank Statistics (UD_Finnish-FTB)
There are 270 X lemmas (1%), 269 X types (1%) and 304 X tokens (0%).
Out of 16 observed tags, the rank of X is: 8 in number of lemmas, 11 in number of types and 15 in number of tokens.
The 10 most frequent X lemmas: 70-, in, sosiaali-, the, ala-, kauppa-, keng-, maa-, 50-, aquis
The 10 most frequent X types: 70-, in, sosiaali-, the, Kauppa-, ala-, keng-, maa-, 50-, Lilla
The 10 most frequent ambiguous lemmas: the (X 4, PROPN 1), out (X 2, NOUN 2), home (NOUN 3, X 1), is (PROPN 2, X 1), made (NOUN 1, X 1), me (PRON 483, DET 74, X 1), new (PROPN 8, X 1), partners (X 1, PROPN 1), queen (PROPN 1, X 1), ride (PROPN 1, X 1)
The 10 most frequent ambiguous types: out (NOUN 1, X 1), New (PROPN 8, X 1), Ride (PROPN 1, X 1), m- (X 1, PRON 1), me (PRON 123, X 1, VERB 1), se- (X 1, PRON 1), termi (NOUN 2, X 1)
Morphology
The form / lemma ratio of X is 0.996296 (the average of all parts of speech is 2.044212).
The 1st highest number of forms (1) was observed with the lemma “10-”: 10-.
The 2nd highest number of forms (1) was observed with the lemma “100-”: 100-.
The 3rd highest number of forms (1) was observed with the lemma “150-”: 150-.
X does not occur with any features.
Relations
X nodes are attached to their parents using 21 different relations: conj (149; 49% instances), amod (26; 9% instances), dep (24; 8% instances), nmod (23; 8% instances), reparandum (17; 6% instances), root (15; 5% instances), nsubj (13; 4% instances), advmod (6; 2% instances), foreign (6; 2% instances), dobj (4; 1% instances), nsubj:cop (4; 1% instances), ccomp (3; 1% instances), compound:nn (3; 1% instances), name (3; 1% instances), case (2; 1% instances), acl (1; 0% instances), aux (1; 0% instances), cc (1; 0% instances), compound:prt (1; 0% instances), csubj:cop (1; 0% instances), vocative (1; 0% instances)
Parents of X nodes belong to 10 different parts of speech: NOUN (151; 50% instances), X (64; 21% instances), VERB (29; 10% instances), ADJ (21; 7% instances), PROPN (19; 6% instances), ROOT (15; 5% instances), PRON (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)
223 (73%) X nodes are leaves.
48 (16%) X nodes have one child.
14 (5%) X nodes have two children.
19 (6%) X nodes have three or more children.
The highest child degree of an X node is 7.
Children of X nodes are attached using 23 different relations: punct (34; 21% instances), amod (27; 17% instances), nmod (19; 12% instances), advmod (10; 6% instances), conj (10; 6% instances), dep (8; 5% instances), cop (7; 4% instances), acl (6; 4% instances), foreign (6; 4% instances), nsubj:cop (6; 4% instances), cc (5; 3% instances), nsubj (5; 3% instances), aux (3; 2% instances), case (3; 2% instances), name (2; 1% instances), vocative (2; 1% instances), compound:nn (1; 1% instances), compound:prt (1; 1% instances), csubj:cop (1; 1% instances), det (1; 1% instances), dobj (1; 1% instances), mark (1; 1% instances), neg (1; 1% instances)
Children of X nodes belong to 13 different parts of speech: X (64; 40% instances), PUNCT (34; 21% instances), VERB (22; 14% instances), PROPN (9; 6% instances), NOUN (8; 5% instances), ADJ (6; 4% instances), PRON (5; 3% instances), ADV (4; 3% instances), CONJ (4; 3% instances), ADP (1; 1% instances), DET (1; 1% instances), PART (1; 1% instances), SCONJ (1; 1% instances)
X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]