X
: other
The tag X
is used for words that for some reason cannot be assigned
a real part-of-speech category.
Foreign words appearing inside native text are tagged X
(see also
Foreign).
Examples
- [fi] Uskoo ken tahtsssszzt brrrzzzt.
- [fi] Opimme fyysikoiden “Let’s assume a spherical cow” -lähestymistavan.
Treebank Statistics (UD_Finnish)
There are 228 X
lemmas (1%), 237 X
types (0%) and 285 X
tokens (0%).
Out of 15 observed tags, the rank of X
is: 7 in number of lemmas, 8 in number of types and 14 in number of tokens.
The 10 most frequent X
lemmas: metal, common, death, dolly, eHealth, a, and, api, fun, it
The 10 most frequent X
types: metal, common, death, a, and, eHealth, fun, it, pic, DIY
The 10 most frequent ambiguous lemmas: a (NOUN 30, X 3, PROPN 1), and (PROPN 10, X 3), I (ADJ 34, PROPN 5, X 2), Diàoyútái (X 1, PROPN 1), Do (PROPN 5, X 1), Don’t (X 1, PROPN 1), Finnish (PROPN 2, X 1), Grey (PROPN 1, X 1), Life (PROPN 2, X 1), Yourself (PROPN 1, X 1)
The 10 most frequent ambiguous types: a (NOUN 30, X 3, PROPN 1), and (PROPN 10, X 3), I (ADJ 26, PROPN 3, X 2), Do (PROPN 5, X 1), Don’t (X 1, PROPN 1), Finnish (PROPN 2, X 1), Life (PROPN 1, X 1), On (VERB 92, AUX 13, PROPN 5, X 1), Yourself (X 1, PROPN 1), by (X 1, PROPN 1)
- a
- and
- I
- Do
- Don’t
- Finnish
- Life
- On
- Yourself
- X 1: Kuten tuli tuossa ensimmäisessä Arduinoa käsitelleessä postauksessa luvattua , suunnitelmissani on rakentaa DIY ( Do It Yourself , Tee Se Itse ) intervalliajastin sekä moottoroitu alusta kameralle .
- PROPN 1: Go Chuck Yourself ( Euroopassa ja Pohjois-Amerikassa ) tai Happy Live Surprise ( Japanissa ) on yhtyeen Sum 41 livealbumi , joka nauhoitettiin Lontoossa , Ontariossa huhtikuussa 2005 .
- by
Morphology
The form / lemma ratio of X
is 1.039474 (the average of all parts of speech is 2.036755).
The 1st highest number of forms (3) was observed with the lemma “dolly”: dolly, dollyja, dollyn.
The 2nd highest number of forms (2) was observed with the lemma “API”: API, APIn.
The 3rd highest number of forms (2) was observed with the lemma “eHealth”: eHealth, eHealthin.
X
occurs with 1 features: fi-feat/Foreign (276; 97% instances)
X
occurs with 2 feature-value pairs: Foreign=Foreign
, Foreign=Fscript
X
occurs with 3 feature combinations.
The most frequent feature combination is Foreign=Foreign
(249 tokens).
Examples: metal, common, death, a, and, eHealth, fun, it, pic, DIY
Relations
X
nodes are attached to their parents using 22 different relations: fi-dep/foreign (93; 33% instances), fi-dep/appos (48; 17% instances), fi-dep/compound:nn (32; 11% instances), fi-dep/name (21; 7% instances), fi-dep/conj (14; 5% instances), fi-dep/root (14; 5% instances), fi-dep/nmod (13; 5% instances), fi-dep/nsubj (11; 4% instances), fi-dep/discourse (10; 4% instances), fi-dep/dobj (10; 4% instances), fi-dep/nmod:gobj (3; 1% instances), fi-dep/amod (2; 1% instances), fi-dep/cc (2; 1% instances), fi-dep/nmod:poss (2; 1% instances), fi-dep/parataxis (2; 1% instances), fi-dep/remnant (2; 1% instances), fi-dep/advmod (1; 0% instances), fi-dep/ccomp (1; 0% instances), fi-dep/csubj:cop (1; 0% instances), fi-dep/goeswith (1; 0% instances), fi-dep/mwe (1; 0% instances), fi-dep/nsubj:cop (1; 0% instances)
Parents of X
nodes belong to 6 different parts of speech: X (127; 45% instances), NOUN (80; 28% instances), VERB (37; 13% instances), PROPN (24; 8% instances), ROOT (14; 5% instances), ADJ (3; 1% instances)
160 (56%) X
nodes are leaves.
39 (14%) X
nodes have one child.
25 (9%) X
nodes have two children.
61 (21%) X
nodes have three or more children.
The highest child degree of a X
node is 11.
Children of X
nodes are attached using 25 different relations: fi-dep/punct (119; 32% instances), fi-dep/foreign (93; 25% instances), fi-dep/conj (26; 7% instances), fi-dep/name (26; 7% instances), fi-dep/nmod (20; 5% instances), fi-dep/amod (17; 5% instances), fi-dep/cc (14; 4% instances), fi-dep/appos (13; 4% instances), fi-dep/advmod (6; 2% instances), fi-dep/nmod:poss (5; 1% instances), fi-dep/cop (4; 1% instances), fi-dep/nsubj:cop (4; 1% instances), fi-dep/acl:relcl (3; 1% instances), fi-dep/advcl (2; 1% instances), fi-dep/compound:nn (2; 1% instances), fi-dep/det (2; 1% instances), fi-dep/nummod (2; 1% instances), fi-dep/parataxis (2; 1% instances), fi-dep/remnant (2; 1% instances), fi-dep/acl (1; 0% instances), fi-dep/aux (1; 0% instances), fi-dep/cc:preconj (1; 0% instances), fi-dep/discourse (1; 0% instances), fi-dep/mark (1; 0% instances), fi-dep/nsubj (1; 0% instances)
Children of X
nodes belong to 12 different parts of speech: PUNCT (127; 35% instances), X (127; 35% instances), NOUN (41; 11% instances), ADJ (20; 5% instances), VERB (17; 5% instances), CONJ (12; 3% instances), PROPN (10; 3% instances), ADV (6; 2% instances), PRON (3; 1% instances), SYM (3; 1% instances), AUX (1; 0% instances), SCONJ (1; 0% instances)
Treebank Statistics (UD_Finnish-FTB)
There are 270 X
lemmas (1%), 269 X
types (1%) and 304 X
tokens (0%).
Out of 16 observed tags, the rank of X
is: 8 in number of lemmas, 11 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: 70-, in, sosiaali-, the, ala-, kauppa-, keng-, maa-, 50-, aquis
The 10 most frequent X
types: 70-, in, sosiaali-, the, Kauppa-, ala-, keng-, maa-, 50-, Lilla
The 10 most frequent ambiguous lemmas: the (X 4, PROPN 1), out (X 2, NOUN 2), home (NOUN 3, X 1), is (PROPN 2, X 1), made (NOUN 1, X 1), me (PRON 483, DET 74, X 1), new (PROPN 8, X 1), partners (X 1, PROPN 1), queen (PROPN 1, X 1), ride (PROPN 1, X 1)
The 10 most frequent ambiguous types: out (NOUN 1, X 1), New (PROPN 8, X 1), Ride (PROPN 1, X 1), m- (X 1, PRON 1), me (PRON 123, X 1, VERB 1), se- (X 1, PRON 1), termi (NOUN 2, X 1)
- out
- New
- Ride
- m-
- me
- se-
- termi
Morphology
The form / lemma ratio of X
is 0.996296 (the average of all parts of speech is 2.044212).
The 1st highest number of forms (1) was observed with the lemma “10-”: 10-.
The 2nd highest number of forms (1) was observed with the lemma “100-”: 100-.
The 3rd highest number of forms (1) was observed with the lemma “150-”: 150-.
X
does not occur with any features.
Relations
X
nodes are attached to their parents using 21 different relations: fi-dep/conj (149; 49% instances), fi-dep/amod (26; 9% instances), fi-dep/dep (24; 8% instances), fi-dep/nmod (23; 8% instances), fi-dep/reparandum (17; 6% instances), fi-dep/root (15; 5% instances), fi-dep/nsubj (13; 4% instances), fi-dep/advmod (6; 2% instances), fi-dep/foreign (6; 2% instances), fi-dep/dobj (4; 1% instances), fi-dep/nsubj:cop (4; 1% instances), fi-dep/ccomp (3; 1% instances), fi-dep/compound:nn (3; 1% instances), fi-dep/name (3; 1% instances), fi-dep/case (2; 1% instances), fi-dep/acl (1; 0% instances), fi-dep/aux (1; 0% instances), fi-dep/cc (1; 0% instances), fi-dep/compound:prt (1; 0% instances), fi-dep/csubj:cop (1; 0% instances), fi-dep/vocative (1; 0% instances)
Parents of X
nodes belong to 10 different parts of speech: NOUN (151; 50% instances), X (64; 21% instances), VERB (29; 10% instances), ADJ (21; 7% instances), PROPN (19; 6% instances), ROOT (15; 5% instances), PRON (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)
223 (73%) X
nodes are leaves.
48 (16%) X
nodes have one child.
14 (5%) X
nodes have two children.
19 (6%) X
nodes have three or more children.
The highest child degree of a X
node is 7.
Children of X
nodes are attached using 23 different relations: fi-dep/punct (34; 21% instances), fi-dep/amod (27; 17% instances), fi-dep/nmod (19; 12% instances), fi-dep/advmod (10; 6% instances), fi-dep/conj (10; 6% instances), fi-dep/dep (8; 5% instances), fi-dep/cop (7; 4% instances), fi-dep/acl (6; 4% instances), fi-dep/foreign (6; 4% instances), fi-dep/nsubj:cop (6; 4% instances), fi-dep/cc (5; 3% instances), fi-dep/nsubj (5; 3% instances), fi-dep/aux (3; 2% instances), fi-dep/case (3; 2% instances), fi-dep/name (2; 1% instances), fi-dep/vocative (2; 1% instances), fi-dep/compound:nn (1; 1% instances), fi-dep/compound:prt (1; 1% instances), fi-dep/csubj:cop (1; 1% instances), fi-dep/det (1; 1% instances), fi-dep/dobj (1; 1% instances), fi-dep/mark (1; 1% instances), fi-dep/neg (1; 1% instances)
Children of X
nodes belong to 13 different parts of speech: X (64; 40% instances), PUNCT (34; 21% instances), VERB (22; 14% instances), PROPN (9; 6% instances), NOUN (8; 5% instances), ADJ (6; 4% instances), PRON (5; 3% instances), ADV (4; 3% instances), CONJ (4; 3% instances), ADP (1; 1% instances), DET (1; 1% instances), PART (1; 1% instances), SCONJ (1; 1% instances)
X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]