Treebank Statistics: UD_Icelandic-IcePaHC: Features: Foreign
This feature is universal.
It occurs with 1 different values: Yes
.
4726 tokens (0%) have a non-empty value of Foreign
.
2283 types (3%) occur at least once with a non-empty value of Foreign
.
2159 lemmas (6%) occur at least once with a non-empty value of Foreign
.
The feature is used with 14 part-of-speech tags: PROPN (2337; 0% instances), X (1542; 0% instances), NOUN (333; 0% instances), VERB (108; 0% instances), ADV (107; 0% instances), DET (96; 0% instances), NUM (51; 0% instances), ADJ (47; 0% instances), AUX (27; 0% instances), PRON (26; 0% instances), ADP (21; 0% instances), INTJ (20; 0% instances), CCONJ (10; 0% instances), PUNCT (1; 0% instances).
PROPN
2337 PROPN tokens (6% of all PROPN
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which PROPN
and Foreign
co-occurred: Case=EMPTY (2337; 100%), Definite=EMPTY (2337; 100%), Gender=EMPTY (2337; 100%), Number=EMPTY (2337; 100%).
PROPN
tokens may have the following values of Foreign
:
Yes
(2337; 100% of non-emptyForeign
): Erasmus, Metternich, Darius, Vali, Dominus, Pelissier, Moyses, Menon, Petrus, ThiersEMPTY
(39050): guð, guðs, herra, jesús, guði, drottinn, jesú, Illugi, Jón, Finnbogi
Foreign
seems to be lexical feature of PROPN
. 100% lemmas (1012) occur only with one value of Foreign
.
X
1542 X tokens (68% of all X
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which X
and Foreign
co-occurred: Case=EMPTY (1542; 100%), Definite=EMPTY (1542; 100%), Gender=EMPTY (1542; 100%), Number=EMPTY (1542; 100%).
X
tokens may have the following values of Foreign
:
Yes
(1542; 100% of non-emptyForeign
): anno, in, item, domini, et, Dominus, etc, de, Achior, corpusEMPTY
(733): Trankival, sankti, Item, domini, Majst, Ektor, sanktus, Anno, Vernakíus, Darii
Foreign
seems to be lexical feature of X
. 100% lemmas (859) occur only with one value of Foreign
.
NOUN
333 NOUN tokens (0% of all NOUN
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which NOUN
and Foreign
co-occurred: Case=EMPTY (333; 100%), Definite=EMPTY (333; 100%), Gender=EMPTY (333; 100%), Number=EMPTY (333; 100%).
NOUN
tokens may have the following values of Foreign
:
Yes
(333; 100% of non-emptyForeign
): son, anno, dal, kap, Majestets, hold, hertug, leon, von, LofEMPTY
(145554): menn, maður, konungur, manna, biskup, mönnum, móti, orð, dag, tíma
Foreign
seems to be lexical feature of NOUN
. 100% lemmas (226) occur only with one value of Foreign
.
VERB
108 VERB tokens (0% of all VERB
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which VERB
and Foreign
co-occurred: Case=EMPTY (108; 100%), Gender=EMPTY (108; 100%), Mood=EMPTY (108; 100%), Number=EMPTY (108; 100%), Person=EMPTY (108; 100%), Tense=EMPTY (108; 100%), VerbForm=EMPTY (108; 100%), Voice=EMPTY (108; 100%).
VERB
tokens may have the following values of Foreign
:
Yes
(108; 100% of non-emptyForeign
): Bar, Gessovel, Vita, Komu, Tel, talt, Sest, Stend, Vil, doEMPTY
(128579): sagði, segir, kom, mælti, fór, tók, varð, gekk, fara, sjá
Foreign
seems to be lexical feature of VERB
. 100% lemmas (73) occur only with one value of Foreign
.
ADV
107 ADV tokens (0% of all ADV
tokens) have a non-empty value of Foreign
.
ADV
tokens may have the following values of Foreign
:
Yes
(107; 100% of non-emptyForeign
): ei, sicut, so, ogsvo, Mart, item, fraMe, nu, Allvel, BrottEMPTY
(78913): þá, svo, þar, ekki, nú, eigi, þó, hér, síðan, og
Foreign
seems to be lexical feature of ADV
. 100% lemmas (24) occur only with one value of Foreign
.
DET
96 DET tokens (0% of all DET
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which DET
and Foreign
co-occurred: Case=EMPTY (96; 100%), Definite=EMPTY (96; 100%), Degree=EMPTY (96; 100%), Gender=EMPTY (96; 100%), Number=EMPTY (96; 100%), PronType=EMPTY (96; 100%).
DET
tokens may have the following values of Foreign
:
Yes
(96; 100% of non-emptyForeign
): in, þenna, engi, ina, inu, mart, enu, Allan, Alt, EinaEMPTY
(44833): þetta, sá, allt, einn, það, þeim, þessi, þann, allir, þá
Foreign
seems to be lexical feature of DET
. 100% lemmas (11) occur only with one value of Foreign
.
NUM
51 NUM tokens (1% of all NUM
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which NUM
and Foreign
co-occurred: Case=EMPTY (51; 100%), Gender=EMPTY (51; 100%), NumType=EMPTY (51; 100%), Number=EMPTY (51; 100%).
NUM
tokens may have the following values of Foreign
:
Yes
(51; 100% of non-emptyForeign
): ij, iij, iiij, iiii, xii, vii, ccc, ix, xi, xiiiEMPTY
(4361): tveir, tólf, tvo, fimm, tvö, sex, þrír, þrjú, sjö, þrjá
Foreign
seems to be lexical feature of NUM
. 100% lemmas (16) occur only with one value of Foreign
.
ADJ
47 ADJ tokens (0% of all ADJ
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which ADJ
and Foreign
co-occurred: Case=EMPTY (47; 100%), Definite=EMPTY (47; 100%), Degree=EMPTY (47; 100%), Gender=EMPTY (47; 100%), Number=EMPTY (47; 100%).
ADJ
tokens may have the following values of Foreign
:
Yes
(47; 100% of non-emptyForeign
): Vant, Aum, Darius, Heil, iiii, Besti, Gamall, Heili, Italiani, OfanvertEMPTY
(37125): sama, gott, góða, satt, góður, sömu, stór, fyrsta, góð, fyrstu
Foreign
seems to be lexical feature of ADJ
. 100% lemmas (39) occur only with one value of Foreign
.
AUX
27 AUX tokens (0% of all AUX
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which AUX
and Foreign
co-occurred: Mood=EMPTY (27; 100%), Number=EMPTY (27; 100%), Person=EMPTY (27; 100%), Tense=EMPTY (27; 100%), VerbForm=EMPTY (27; 100%), Voice=EMPTY (27; 100%).
AUX
tokens may have the following values of Foreign
:
Yes
(27; 100% of non-emptyForeign
): Vil, man, myni, Munu, Vilda, emk, er, hefir, hefoiEMPTY
(51225): var, er, voru, hafði, vera, væri, hafa, eru, mun, verið
PRON
26 PRON tokens (0% of all PRON
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which PRON
and Foreign
co-occurred: Case=EMPTY (26; 100%), Gender=EMPTY (26; 100%), Number=EMPTY (26; 100%), Person=EMPTY (26; 100%), PronType=EMPTY (26; 100%).
PRON
tokens may have the following values of Foreign
:
Yes
(26; 100% of non-emptyForeign
): Oss, þaug, huer, þeira, Haun, Minn, Sitt, Soddan, Vor, VortEMPTY
(120310): hann, það, þeir, því, þú, eg, ég, honum, hans, hún
Foreign
seems to be lexical feature of PRON
. 100% lemmas (12) occur only with one value of Foreign
.
ADP
21 ADP tokens (0% of all ADP
tokens) have a non-empty value of Foreign
.
ADP
tokens may have the following values of Foreign
:
Yes
(21; 100% of non-emptyForeign
): fyr, of, umb, Und, intra, nemaEMPTY
(103598): í, á, til, af, með, um, fyrir, að, við, upp
INTJ
20 INTJ tokens (2% of all INTJ
tokens) have a non-empty value of Foreign
.
INTJ
tokens may have the following values of Foreign
:
Yes
(20; 100% of non-emptyForeign
): Hei, Jaaaá, hahaha, he, Vei, Bless, Blubbs, Eia, Hahahaha, OEMPTY
(1178): já, nei, ó, amen, æ, jú, nú, jæja, ja, ha
Foreign
seems to be lexical feature of INTJ
. 100% lemmas (10) occur only with one value of Foreign
.
CCONJ
10 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of Foreign
.
CCONJ
tokens may have the following values of Foreign
:
Yes
(10; 100% of non-emptyForeign
): oc, etEMPTY
(57243): og, en, eða, eður, bæði, né, hvorki, enda, hvörki, ýmist
PUNCT
1 PUNCT tokens (0% of all PUNCT
tokens) have a non-empty value of Foreign
.
PUNCT
tokens may have the following values of Foreign
:
Yes
(1; 100% of non-emptyForeign
): —EMPTY
(113952): ,, ., “, :, ;, ?, !, -, …, —
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign
:
X –[flat:foreign]–> X (599; 85%),
X –[dep]–> X (11; 92%),
PROPN –[flat:foreign]–> X (9; 60%),
PROPN –[iobj]–> PROPN (5; 100%),
X –[obl]–> X (4; 100%),
X –[appos]–> X (3; 100%),
INTJ –[discourse]–> INTJ (2; 100%),
NOUN –[discourse]–> NOUN (2; 100%),
X –[appos]–> PROPN (2; 67%),
X –[nmod:poss]–> PROPN (2; 67%).