Foreign: foreign word
Is this a foreign word? Not a loan word and not a foreign name but a genuinely foreign word appearing inside native text. This feature would apply either to the “X” part of speech (unanalyzable token), or to other parts of speech if we know and are willing to annotate the class to which the word belongs in its original language.
Note: the UD Tscript (transcribed) value is not used in UD Finnish.
Foreign: it is foreign
Examples
TODO
Fscript: it is foreign and written in a foreign script
Examples
TODO
Treebank Statistics (UD_Finnish)
This feature is language-specific.
It occurs with 2 different values: Foreign, Fscript.
276 tokens (0%) have a non-empty value of Foreign.
233 types (0%) occur at least once with a non-empty value of Foreign.
224 lemmas (1%) occur at least once with a non-empty value of Foreign.
The feature is used with 1 part-of-speech tag: X (276; 0% instances).
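Counts like the ones above can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: it assumes the standard 10-column CoNLL-U layout with the feature stored in the FEATS column as Foreign=Foreign or Foreign=Fscript. The function names and the sample rows are hypothetical and only illustrate the input format.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Foreign=Fscript' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def foreign_stats(conllu_text):
    """Count tokens, types and lemmas that carry a non-empty Foreign value."""
    values = Counter()
    types, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        value = parse_feats(cols[5]).get("Foreign")
        if value:
            values[value] += 1
            types.add(cols[1])   # FORM
            lemmas.add(cols[2])  # LEMMA
    return {
        "values": dict(values),
        "tokens": sum(values.values()),
        "types": len(types),
        "lemmas": len(lemmas),
    }


# Hypothetical rows for illustration only; they are not taken from UD_Finnish.
SAMPLE_ROWS = [
    ("1", "death", "death", "X", "_", "Foreign=Foreign", "2", "foreign", "_", "_"),
    ("2", "metal", "metal", "X", "_", "Foreign=Foreign", "0", "root", "_", "_"),
    ("3", "ОАО", "ОАО", "X", "_", "Foreign=Fscript", "2", "conj", "_", "_"),
    ("4", "yhtye", "yhtye", "NOUN", "_", "Case=Nom|Number=Sing", "2", "conj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

if __name__ == "__main__":
    print(foreign_stats(SAMPLE))
```

Run on a full treebank file, the "values" entry would correspond to the per-value counts and the other entries to the token, type and lemma counts listed above, provided the file uses the same feature encoding.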
X
276 X tokens (97% of all X tokens) have a non-empty value of Foreign.
X tokens may have the following values of Foreign:
Foreign (249; 90% of non-empty Foreign): metal, common, death, a, and, eHealth, fun, it, pic, DIY
Fscript (27; 10% of non-empty Foreign): ОАО, Стаханoв, Aεροδρόμιο, Διεθνές, Διόνυσος, Εὐπάτωρ, Λάρνακας, Μιθριδάτης, Кадіївка, Не́рехта
Foreign seems to be a lexical feature of X. 100% of lemmas (224) occur with only one value of Foreign.
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign:
X –[foreign]–> X (93; 100%),
X –[name]–> X (14; 100%),
X –[conj]–> X (7; 70%),
X –[remnant]–> X (2; 100%),
X –[cc]–> X (1; 100%),
X –[compound:nn]–> X (1; 100%),
X –[nmod:poss]–> X (1; 100%),
X –[nsubj:cop]–> X (1; 100%).
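An agreement table of this kind can be approximated with a short script. This is a sketch under stated assumptions rather than the procedure used to generate the page: it assumes CoNLL-U input, and it assumes that the percentage is computed over parent-child pairs in which both nodes have a non-empty Foreign value, which this page does not state. The function names and sample rows are hypothetical.

```python
from collections import Counter


def foreign_value(feats):
    """Return the value of Foreign in a FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Foreign":
            return value
    return None


def agreement_by_relation(conllu_text):
    """Map 'PARENT -[deprel]-> CHILD' to (agreeing pairs, all pairs).

    Assumption: only pairs where both nodes have a non-empty Foreign
    value are counted.
    """
    agree, total = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue  # comments, multiword-token ranges, empty nodes
            rows[cols[0]] = cols
        for cols in rows.values():
            parent = rows.get(cols[6])  # HEAD column; "0" (root) has no row
            if parent is None:
                continue
            child_val = foreign_value(cols[5])
            parent_val = foreign_value(parent[5])
            if child_val is None or parent_val is None:
                continue
            key = f"{parent[3]} -[{cols[7]}]-> {cols[3]}"
            total[key] += 1
            if child_val == parent_val:
                agree[key] += 1
    return {key: (agree[key], total[key]) for key in total}


# Hypothetical rows for illustration only; they are not taken from UD_Finnish.
SAMPLE_ROWS = [
    ("1", "death", "death", "X", "_", "Foreign=Foreign", "2", "foreign", "_", "_"),
    ("2", "metal", "metal", "X", "_", "Foreign=Foreign", "0", "root", "_", "_"),
    ("3", "ОАО", "ОАО", "X", "_", "Foreign=Fscript", "2", "conj", "_", "_"),
    ("4", "yhtye", "yhtye", "NOUN", "_", "Case=Nom|Number=Sing", "2", "conj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

if __name__ == "__main__":
    for relation, (agreeing, pairs) in agreement_by_relation(SAMPLE).items():
        print(f"{relation} ({agreeing}; {100 * agreeing // pairs}%)")
```

In the sample, the conj pair disagrees because one node is Foreign and the other Fscript, which is the kind of mismatch that would pull a percentage below 100%.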