X

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.


`X`: other

The tag X is used for words that cannot be assigned a real part-of-speech category.

Foreign words appearing inside native text are tagged X (see also Foreign).

Examples

[fi] Uskoo ken tahtsssszzt brrrzzzt.
[fi] Opimme fyysikoiden “Let’s assume a spherical cow” -lähestymistavan.

Treebank Statistics (UD_Finnish)

There are 228 X lemmas (1%), 237 X types (0%) and 285 X tokens (0%). Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 14 in number of tokens.
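Lemma, type and token counts of this kind can be recomputed from a treebank file in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the function name `count_tag` and the sample rows are hypothetical, and it assumes that a "type" means a distinct, case-sensitive word form.

```python
# Hypothetical four-token sample in CoNLL-U column order:
# ID, FORM, LEMMA, UPOSTAG, XPOSTAG, FEATS, HEAD, DEPREL, DEPS, MISC.
# The rows are made up for illustration and are not taken from UD_Finnish.
SAMPLE = "\n".join("\t".join(row) for row in [
    ["1", "dolly", "dolly", "X", "_", "Foreign=Foreign", "4", "nsubj", "_", "_"],
    ["2", "dollyn", "dolly", "X", "_", "Foreign=Foreign", "1", "foreign", "_", "_"],
    ["3", "metal", "metal", "X", "_", "Foreign=Foreign", "1", "foreign", "_", "_"],
    ["4", "on", "olla", "VERB", "_", "_", "0", "root", "_", "_"],
])


def count_tag(conllu_text, tag="X"):
    """Return (distinct lemmas, distinct forms, tokens) for one POS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip malformed rows and multiword-token ranges such as "1-2".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        if upos == tag:
            tokens += 1
            types.add(form)
            lemmas.add(lemma)
    return len(lemmas), len(types), tokens


print(count_tag(SAMPLE))  # (2, 3, 3)
```

In the sample, the forms dolly and dollyn share the lemma dolly, so there are two lemmas but three types.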

The 10 most frequent X lemmas: metal, common, death, dolly, eHealth, a, and, api, fun, it

The 10 most frequent X types: metal, common, death, a, and, eHealth, fun, it, pic, DIY

The 10 most frequent ambiguous lemmas: a (NOUN 30, X 3, PROPN 1), and (PROPN 10, X 3), I (ADJ 34, PROPN 5, X 2), Diàoyútái (X 1, PROPN 1), Do (PROPN 5, X 1), Don’t (X 1, PROPN 1), Finnish (PROPN 2, X 1), Grey (PROPN 1, X 1), Life (PROPN 2, X 1), Yourself (PROPN 1, X 1)

The 10 most frequent ambiguous types: a (NOUN 30, X 3, PROPN 1), and (PROPN 10, X 3), I (ADJ 26, PROPN 3, X 2), Do (PROPN 5, X 1), Don’t (X 1, PROPN 1), Finnish (PROPN 2, X 1), Life (PROPN 1, X 1), On (VERB 92, AUX 13, PROPN 5, X 1), Yourself (X 1, PROPN 1), by (X 1, PROPN 1)

a
- NOUN 30: ” 3 a artikla
- X 3: It’s a fact .
- PROPN 1: Eilisellä yökyläreissulla katsoimme ystäväni kanssa aivan mahtavan animaation nimeltään Cloudy with a Chance of Meatballs .
and
- PROPN 10: Wilsonin sooloalbumi Love and Youth julkaistiin Ruotsissa 2005 .
- X 3: Illan pääesiintyjä oli jouluna paluunsa ilmoittanut death and roll -yhtye Deuteronomium .
I
- ADJ 26: 1 ) Korvataan liite I tämän päätöksen liitteellä I .
- PROPN 3: I Wanna Be Adored rock-yhtye The Stone Rosesin kolmas singlejulkaisu albumilta The Stone Roses .
- X 2: 2. Kirjoita viaksi “ I can’t find myself in search ”
Do
- PROPN 5: Iglesias on levyttänyt Do You Know ? :n myös espanjaksi nimellä Dímelo .
- X 1: Kuten tuli tuossa ensimmäisessä Arduinoa käsitelleessä postauksessa luvattua , suunnitelmissani on rakentaa DIY ( Do It Yourself , Tee Se Itse ) intervalliajastin sekä moottoroitu alusta kameralle .
Don’t
- X 1: Don’t worry , be happy oli vanhuksen motto ollut viimeiset vuosikymmenet ja yksinkertaisuudessaan tämä ilahdutti miestä .
- PROPN 1: Justin Timberlake laulaa kappaleella “ My Style “ , funk-legenda James Brown kappaleella “ They Don’t Want Music “ ja Sting kappaleella “ Union “ .
Finnish
- PROPN 2: Päivän seminaarit aloitti Finnish Linux User Groupin Arto Teräs .
- X 1: Suomen saa käyttöön menemällä polkua Tools - > Preferences - > Languages - > Finnish , jonka jälkeen Luminance HDR tulee käynnistää uudelleen .
Life
- PROPN 1: Trail of Life Decayed on ruotsalaisen death metal -yhtye Dark Tranquillityn ensimmäinen demo ja se julkaistiin vuonna 1991 .
- X 1: TIPissä yhdistyvät ICT- , Life Science- ja luovien alojen tutkimus sekä liiketoimintaosaaminen .
On
- VERB 92: On kyllä kiva , kun tuolla sai hipelöidä kaikkea . :)
- AUX 13: On nimittäin myös huomannut , että joskus kirkkainkin aurinko pimenee .
- PROPN 5: Aikaa myöten On A Friday:sta kehittyi Radiohead .
- X 1: On - off .
Yourself
- X 1: Kuten tuli tuossa ensimmäisessä Arduinoa käsitelleessä postauksessa luvattua , suunnitelmissani on rakentaa DIY ( Do It Yourself , Tee Se Itse ) intervalliajastin sekä moottoroitu alusta kameralle .
- PROPN 1: Go Chuck Yourself ( Euroopassa ja Pohjois-Amerikassa ) tai Happy Live Surprise ( Japanissa ) on yhtyeen Sum 41 livealbumi , joka nauhoitettiin Lontoossa , Ontariossa huhtikuussa 2005 .
by
- X 1: - Yksityisten toimijoiden välillä ilmenevät kysymykset ovat niin monitahoisia , että usein asioita voidaan selittää vain “ case by case “ -mentaliteetilla , Rosas sanoi .
- PROPN 1: The Garden Collection by H&M

Morphology

The form / lemma ratio of X is 1.039474 (the average of all parts of speech is 2.036755).
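The reported ratio is consistent with dividing the number of X types by the number of X lemmas given at the top of this section. This reading is inferred from the figures on this page rather than from a documented definition, and the helper name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_types, n_lemmas):
    """Ratio of distinct word forms (types) to distinct lemmas."""
    return n_types / n_lemmas


# UD_Finnish: 237 X types and 228 X lemmas, as reported on this page.
print(round(form_lemma_ratio(237, 228), 6))  # 1.039474
```

A value close to 1 means that most X lemmas occur in a single form, which fits a tag dominated by uninflected foreign words.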

The 1st highest number of forms (3) was observed with the lemma “dolly”: dolly, dollyja, dollyn.

The 2nd highest number of forms (2) was observed with the lemma “API”: API, APIn.

The 3rd highest number of forms (2) was observed with the lemma “eHealth”: eHealth, eHealthin.

X occurs with 1 feature: fi-feat/Foreign (276; 97% instances)

X occurs with 2 feature-value pairs: Foreign=Foreign, Foreign=Fscript

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Foreign (249 tokens). Examples: metal, common, death, a, and, eHealth, fun, it, pic, DIY
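The three counts above (features, feature-value pairs, feature combinations) can be derived from the FEATS column of a CoNLL-U file. The sketch below shows one way to do so; the function names and the sample column are hypothetical, and it assumes that the empty value `_` counts as a combination of its own, which would be one way to arrive at 3 combinations from 2 feature-value pairs.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_summary(feats_column):
    """Count feature names, feature-value pairs and whole combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # the raw string identifies the combination
        for name, value in parse_feats(feats).items():
            names[name] += 1
            pairs[f"{name}={value}"] += 1
    return names, pairs, combos


# Hypothetical FEATS values for four X tokens (not real treebank data).
column = ["Foreign=Foreign", "Foreign=Foreign", "Foreign=Fscript", "_"]
names, pairs, combos = feature_summary(column)
print(len(names), len(pairs), len(combos))  # 1 2 3

# The percentage on this page: 276 of 285 X tokens carry the Foreign feature.
print(round(100 * 276 / 285))  # 97
```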

Relations

X nodes are attached to their parents using 22 different relations: fi-dep/foreign (93; 33% instances), fi-dep/appos (48; 17% instances), fi-dep/compound:nn (32; 11% instances), fi-dep/name (21; 7% instances), fi-dep/conj (14; 5% instances), fi-dep/root (14; 5% instances), fi-dep/nmod (13; 5% instances), fi-dep/nsubj (11; 4% instances), fi-dep/discourse (10; 4% instances), fi-dep/dobj (10; 4% instances), fi-dep/nmod:gobj (3; 1% instances), fi-dep/amod (2; 1% instances), fi-dep/cc (2; 1% instances), fi-dep/nmod:poss (2; 1% instances), fi-dep/parataxis (2; 1% instances), fi-dep/remnant (2; 1% instances), fi-dep/advmod (1; 0% instances), fi-dep/ccomp (1; 0% instances), fi-dep/csubj:cop (1; 0% instances), fi-dep/goeswith (1; 0% instances), fi-dep/mwe (1; 0% instances), fi-dep/nsubj:cop (1; 0% instances)

Parents of X nodes belong to 6 different parts of speech: X (127; 45% instances), NOUN (80; 28% instances), VERB (37; 13% instances), PROPN (24; 8% instances), ROOT (14; 5% instances), ADJ (3; 1% instances)

160 (56%) X nodes are leaves.

39 (14%) X nodes have one child.

25 (9%) X nodes have two children.

61 (21%) X nodes have three or more children.

The highest child degree of an X node is 11.
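The leaf and child-degree figures above come from counting, for each X node, how many tokens name it as their head. The four counts (160, 39, 25 and 61) add up to the 285 X tokens, and the percentages are those counts divided by 285. A minimal sketch of the computation follows; the function name, the tuple layout and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degree_buckets(sentences, tag="X"):
    """Bucket nodes with the given tag by their number of children.

    Each sentence is a list of (token_id, head_id, upos) tuples,
    where head_id 0 stands for the artificial root.
    """
    buckets = Counter()
    for sent in sentences:
        # Number of children per node = how often its id appears as a head.
        n_children = Counter(head for _tok_id, head, _upos in sent)
        for tok_id, _head, upos in sent:
            if upos != tag:
                continue
            k = n_children[tok_id]
            if k == 0:
                buckets["leaf"] += 1
            elif k == 1:
                buckets["one"] += 1
            elif k == 2:
                buckets["two"] += 1
            else:
                buckets["three_plus"] += 1
    return buckets


# Hypothetical sentence: token 2 is the root and governs tokens 1 and 3.
sentence = [(1, 2, "X"), (2, 0, "X"), (3, 2, "PUNCT")]
print(dict(child_degree_buckets([sentence])))  # {'leaf': 1, 'two': 1}

# Checking the percentages reported on this page for UD_Finnish.
counts = [160, 39, 25, 61]
print(sum(counts), [round(100 * c / 285) for c in counts])  # 285 [56, 14, 9, 21]
```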

Children of X nodes are attached using 25 different relations: fi-dep/punct (119; 32% instances), fi-dep/foreign (93; 25% instances), fi-dep/conj (26; 7% instances), fi-dep/name (26; 7% instances), fi-dep/nmod (20; 5% instances), fi-dep/amod (17; 5% instances), fi-dep/cc (14; 4% instances), fi-dep/appos (13; 4% instances), fi-dep/advmod (6; 2% instances), fi-dep/nmod:poss (5; 1% instances), fi-dep/cop (4; 1% instances), fi-dep/nsubj:cop (4; 1% instances), fi-dep/acl:relcl (3; 1% instances), fi-dep/advcl (2; 1% instances), fi-dep/compound:nn (2; 1% instances), fi-dep/det (2; 1% instances), fi-dep/nummod (2; 1% instances), fi-dep/parataxis (2; 1% instances), fi-dep/remnant (2; 1% instances), fi-dep/acl (1; 0% instances), fi-dep/aux (1; 0% instances), fi-dep/cc:preconj (1; 0% instances), fi-dep/discourse (1; 0% instances), fi-dep/mark (1; 0% instances), fi-dep/nsubj (1; 0% instances)

Children of X nodes belong to 12 different parts of speech: PUNCT (127; 35% instances), X (127; 35% instances), NOUN (41; 11% instances), ADJ (20; 5% instances), VERB (17; 5% instances), CONJ (12; 3% instances), PROPN (10; 3% instances), ADV (6; 2% instances), PRON (3; 1% instances), SYM (3; 1% instances), AUX (1; 0% instances), SCONJ (1; 0% instances)

Treebank Statistics (UD_Finnish-FTB)

There are 270 X lemmas (1%), 269 X types (1%) and 304 X tokens (0%). Out of 16 observed tags, the rank of X is: 8 in number of lemmas, 11 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: 70-, in, sosiaali-, the, ala-, kauppa-, keng-, maa-, 50-, aquis

The 10 most frequent X types: 70-, in, sosiaali-, the, Kauppa-, ala-, keng-, maa-, 50-, Lilla

The 10 most frequent ambiguous lemmas: the (X 4, PROPN 1), out (X 2, NOUN 2), home (NOUN 3, X 1), is (PROPN 2, X 1), made (NOUN 1, X 1), me (PRON 483, DET 74, X 1), new (PROPN 8, X 1), partners (X 1, PROPN 1), queen (PROPN 1, X 1), ride (PROPN 1, X 1)

The 10 most frequent ambiguous types: out (NOUN 1, X 1), New (PROPN 8, X 1), Ride (PROPN 1, X 1), m- (X 1, PRON 1), me (PRON 123, X 1, VERB 1), se- (X 1, PRON 1), termi (NOUN 2, X 1)

out
- NOUN 1: Aini toivoi , että työterveysasema olisi koonnut burn out -ryhmän , missä olisi voinut jakaa kokemuksia toisten kansa .
- X 1: Kuten tunnettua yleensä vasta Microsoft pystyy muuttamaan seuraavan lauseen imperfektiin : the cash is out there .
New
- PROPN 8: Lontoo on Tokion ja New Yorkin aikavyöhykkeiden välissä .
- X 1: Nykyisin hän lentää aina muutaman viikon väliajoin Bostonista Tokioon kahdeksi viikoksi johtamaan nuorta New Japan Philharmonic Orchestraa ja lisäksi johtaa aika ajoin Euroopassa .
Ride
- PROPN 1: Hiukan eilistä lämpimämmällä säällä vauhtihirmu olisi varmasti hätyytellyt Ride The Nightin nimissä olevaa SE:tä 14,4 .
- X 1: Express Ride vain mennä hutkutteli loppumatkan .
m-
- X 1: siin poika sit meinas , m- pudotti hattunsa päästä ja … katso taakse siinä niinku … kaatu … ajoi siin semmoseen … suureen kiveen ja kaatu pyörällääm siinä ja …
- PRON 1: Toi ihmettelee välillä että m- nukun selkä häneem päin
me
- PRON 123: no sit yks toinen kaveri lähti armeijaan ja me oltiin sovittu .
- X 1: Why did you do this to me !? Colin valitti itkuisen humalaisella äänellä .
- VERB 1: se oli pannus semmosia ehtoja ett ei semmosiin ehtoin kukaam me
se-
- X 1: ko se on se- semmonen ujo ollu
- PRON 1: Onko se- ku se täällä Pieksämäellä on ni onko se sit ihan täällä kokonaan ett ei se tuu yökskään kottiin ,
termi
- NOUN 2: PC-kortti on selvästi parempi termi kuin PCMCIA .
- X 1: Muuten asialliseen kiuasartikkeliinne oli tullut ikävä , joskin yleinen asia- ( termi ) virhe .

Morphology

The form / lemma ratio of X is 0.996296 (the average of all parts of speech is 2.044212).

The 1st highest number of forms (1) was observed with the lemma “10-”: 10-.

The 2nd highest number of forms (1) was observed with the lemma “100-”: 100-.

The 3rd highest number of forms (1) was observed with the lemma “150-”: 150-.

X does not occur with any features.

Relations

X nodes are attached to their parents using 21 different relations: fi-dep/conj (149; 49% instances), fi-dep/amod (26; 9% instances), fi-dep/dep (24; 8% instances), fi-dep/nmod (23; 8% instances), fi-dep/reparandum (17; 6% instances), fi-dep/root (15; 5% instances), fi-dep/nsubj (13; 4% instances), fi-dep/advmod (6; 2% instances), fi-dep/foreign (6; 2% instances), fi-dep/dobj (4; 1% instances), fi-dep/nsubj:cop (4; 1% instances), fi-dep/ccomp (3; 1% instances), fi-dep/compound:nn (3; 1% instances), fi-dep/name (3; 1% instances), fi-dep/case (2; 1% instances), fi-dep/acl (1; 0% instances), fi-dep/aux (1; 0% instances), fi-dep/cc (1; 0% instances), fi-dep/compound:prt (1; 0% instances), fi-dep/csubj:cop (1; 0% instances), fi-dep/vocative (1; 0% instances)

Parents of X nodes belong to 10 different parts of speech: NOUN (151; 50% instances), X (64; 21% instances), VERB (29; 10% instances), ADJ (21; 7% instances), PROPN (19; 6% instances), ROOT (15; 5% instances), PRON (2; 1% instances), ADV (1; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)

223 (73%) X nodes are leaves.

48 (16%) X nodes have one child.

14 (5%) X nodes have two children.

19 (6%) X nodes have three or more children.

The highest child degree of an X node is 7.

Children of X nodes are attached using 23 different relations: fi-dep/punct (34; 21% instances), fi-dep/amod (27; 17% instances), fi-dep/nmod (19; 12% instances), fi-dep/advmod (10; 6% instances), fi-dep/conj (10; 6% instances), fi-dep/dep (8; 5% instances), fi-dep/cop (7; 4% instances), fi-dep/acl (6; 4% instances), fi-dep/foreign (6; 4% instances), fi-dep/nsubj:cop (6; 4% instances), fi-dep/cc (5; 3% instances), fi-dep/nsubj (5; 3% instances), fi-dep/aux (3; 2% instances), fi-dep/case (3; 2% instances), fi-dep/name (2; 1% instances), fi-dep/vocative (2; 1% instances), fi-dep/compound:nn (1; 1% instances), fi-dep/compound:prt (1; 1% instances), fi-dep/csubj:cop (1; 1% instances), fi-dep/det (1; 1% instances), fi-dep/dobj (1; 1% instances), fi-dep/mark (1; 1% instances), fi-dep/neg (1; 1% instances)

Children of X nodes belong to 13 different parts of speech: X (64; 40% instances), PUNCT (34; 21% instances), VERB (22; 14% instances), PROPN (9; 6% instances), NOUN (8; 5% instances), ADJ (6; 4% instances), PRON (5; 3% instances), ADV (4; 3% instances), CONJ (4; 3% instances), ADP (1; 1% instances), DET (1; 1% instances), PART (1; 1% instances), SCONJ (1; 1% instances)

