home fi/feat edit page issue tracker

Foreign: foreign word

Is this a foreign word? Not a loan word and not a foreign name but a genuinely foreign word appearing inside native text. This feature would apply either to the “X” part of speech (unanalyzable token), or to other parts of speech if we know and are willing to annotate the class to which the word belongs in its original language.

Note: the UD Tscript (transcribed) value is not used in UD Finnish.

Foreign: it is foreign

Examples

TODO

Fscript: it is foreign and written in a foreign script

Examples

TODO


Treebank Statistics (UD_Finnish)

This feature is language-specific. It occurs with 2 different values: Foreign, Fscript.

276 tokens (0%) have a non-empty value of Foreign. 233 types (0%) occur at least once with a non-empty value of Foreign. 224 lemmas (1%) occur at least once with a non-empty value of Foreign. The feature is used with 1 part-of-speech tags: X (276; 0% instances).

X

276 X tokens (97% of all X tokens) have a non-empty value of Foreign.

X tokens may have the following values of Foreign:

Foreign seems to be lexical feature of X. 100% lemmas (224) occur only with one value of Foreign.

Relations with Agreement in Foreign

The 10 most frequent relations where parent and child node agree in Foreign: X –[foreign]–> X (93; 100%), X –[name]–> X (14; 100%), X –[conj]–> X (7; 70%), X –[remnant]–> X (2; 100%), X –[cc]–> X (1; 100%), X –[compound:nn]–> X (1; 100%), X –[nmod:poss]–> X (1; 100%), X –[nsubj:cop]–> X (1; 100%).