Treebank Statistics: UD_Icelandic-IcePaHC: Features: Foreign
This feature is universal.
It occurs with 1 different values: Yes.
5170 tokens (1%) have a non-empty value of Foreign.
2524 types (4%) occur at least once with a non-empty value of Foreign.
2375 lemmas (7%) occur at least once with a non-empty value of Foreign.
The feature is used with 14 part-of-speech tags: PROPN (2304; 0% instances), X (2108; 0% instances), NOUN (317; 0% instances), VERB (107; 0% instances), ADV (99; 0% instances), NUM (51; 0% instances), ADJ (46; 0% instances), DET (34; 0% instances), AUX (26; 0% instances), PRON (25; 0% instances), ADP (22; 0% instances), INTJ (20; 0% instances), CCONJ (10; 0% instances), PUNCT (1; 0% instances).
PROPN
2304 PROPN tokens (6% of all PROPN tokens) have a non-empty value of Foreign.
The most frequent other feature values with which PROPN and Foreign co-occurred: Case=EMPTY (2304; 100%), Definite=EMPTY (2304; 100%), Gender=EMPTY (2304; 100%), Number=EMPTY (2304; 100%).
PROPN tokens may have the following values of Foreign:
Yes(2304; 100% of non-emptyForeign): Erasmus, Metternich, Darius, Dominus, Pelissier, Moyses, Menon, Petrus, Thiers, GeorgíusEMPTY(39080): guð, guðs, herra, jesús, guði, drottinn, jesú, Illugi, Jón, Finnbogi
Foreign seems to be lexical feature of PROPN. 100% lemmas (1003) occur only with one value of Foreign.
X
2108 X tokens (93% of all X tokens) have a non-empty value of Foreign.
X tokens may have the following values of Foreign:
Yes(2108; 100% of non-emptyForeign): anno, item, in, domini, Dominus, et, Majst, Trankival, etc, sanktiEMPTY(169): sankti, sanktus, Miraculum, Potú, trinitatis, Exordium, Item, Taraskonum, delictum, peccatum
Foreign seems to be lexical feature of X. 100% lemmas (1107) occur only with one value of Foreign.
NOUN
317 NOUN tokens (0% of all NOUN tokens) have a non-empty value of Foreign.
The most frequent other feature values with which NOUN and Foreign co-occurred: Case=EMPTY (317; 100%), Definite=EMPTY (317; 100%), Gender=EMPTY (317; 100%), Number=EMPTY (317; 100%).
NOUN tokens may have the following values of Foreign:
Yes(317; 100% of non-emptyForeign): son, anno, dal, kap, Majestets, hold, hertug, leon, von, alteraEMPTY(145562): menn, maður, konungur, manna, biskup, mönnum, móti, orð, dag, tíma
Foreign seems to be lexical feature of NOUN. 100% lemmas (214) occur only with one value of Foreign.
VERB
107 VERB tokens (0% of all VERB tokens) have a non-empty value of Foreign.
The most frequent other feature values with which VERB and Foreign co-occurred: Case=EMPTY (107; 100%), Gender=EMPTY (107; 100%), Mood=EMPTY (107; 100%), Number=EMPTY (107; 100%), Person=EMPTY (107; 100%), Tense=EMPTY (107; 100%), VerbForm=EMPTY (107; 100%), Voice=EMPTY (107; 100%).
VERB tokens may have the following values of Foreign:
Yes(107; 100% of non-emptyForeign): Bar, Gessovel, Vita, Komu, Tel, talt, Sest, Stend, Vil, doEMPTY(128582): sagði, segir, kom, mælti, fór, tók, varð, gekk, fara, sjá
Foreign seems to be lexical feature of VERB. 100% lemmas (72) occur only with one value of Foreign.
ADV
99 ADV tokens (0% of all ADV tokens) have a non-empty value of Foreign.
ADV tokens may have the following values of Foreign:
Yes(99; 100% of non-emptyForeign): ei, sicut, so, item, Mart, fraMe, nu, ogsvo, Allvel, BrottEMPTY(78920): þá, svo, þar, ekki, nú, eigi, þó, hér, síðan, og
Foreign seems to be lexical feature of ADV. 100% lemmas (24) occur only with one value of Foreign.
NUM
51 NUM tokens (1% of all NUM tokens) have a non-empty value of Foreign.
The most frequent other feature values with which NUM and Foreign co-occurred: Case=EMPTY (51; 100%), Gender=EMPTY (51; 100%), NumType=EMPTY (51; 100%), Number=EMPTY (51; 100%).
NUM tokens may have the following values of Foreign:
Yes(51; 100% of non-emptyForeign): ij, iij, iiij, iiii, xii, vii, ccc, ix, xi, xiiiEMPTY(4362): tveir, tólf, tvo, fimm, tvö, sex, þrír, þrjú, sjö, þrjá
Foreign seems to be lexical feature of NUM. 100% lemmas (16) occur only with one value of Foreign.
ADJ
46 ADJ tokens (0% of all ADJ tokens) have a non-empty value of Foreign.
The most frequent other feature values with which ADJ and Foreign co-occurred: Case=EMPTY (46; 100%), Definite=EMPTY (46; 100%), Degree=EMPTY (46; 100%), Gender=EMPTY (46; 100%), Number=EMPTY (46; 100%).
ADJ tokens may have the following values of Foreign:
Yes(46; 100% of non-emptyForeign): Vant, Aum, Darius, Heil, iiii, Besti, Gamall, Heili, Italiani, OfanvertEMPTY(37117): sama, gott, góða, satt, góður, sömu, stór, fyrsta, góð, fyrstu
Foreign seems to be lexical feature of ADJ. 100% lemmas (38) occur only with one value of Foreign.
DET
34 DET tokens (0% of all DET tokens) have a non-empty value of Foreign.
The most frequent other feature values with which DET and Foreign co-occurred: Case=EMPTY (34; 100%), Definite=EMPTY (34; 100%), Degree=EMPTY (34; 100%), Gender=EMPTY (34; 100%), Number=EMPTY (34; 100%), PronType=EMPTY (34; 100%).
DET tokens may have the following values of Foreign:
Yes(34; 100% of non-emptyForeign): engi, mart, enu, Allan, Alt, Eina, Eitthvert, Margs, Meir, allsEMPTY(44913): þetta, sá, allt, einn, það, þeim, þessi, þann, allir, þá
Foreign seems to be lexical feature of DET. 100% lemmas (10) occur only with one value of Foreign.
AUX
26 AUX tokens (0% of all AUX tokens) have a non-empty value of Foreign.
The most frequent other feature values with which AUX and Foreign co-occurred: Mood=EMPTY (26; 100%), Number=EMPTY (26; 100%), Person=EMPTY (26; 100%), Tense=EMPTY (26; 100%), VerbForm=EMPTY (26; 100%), Voice=EMPTY (26; 100%).
AUX tokens may have the following values of Foreign:
Yes(26; 100% of non-emptyForeign): Vil, man, myni, Munu, Vilda, emk, er, hefir, hefoiEMPTY(51218): var, er, voru, hafði, vera, væri, hafa, eru, mun, verið
PRON
25 PRON tokens (0% of all PRON tokens) have a non-empty value of Foreign.
The most frequent other feature values with which PRON and Foreign co-occurred: Case=EMPTY (25; 100%), Gender=EMPTY (25; 100%), Number=EMPTY (25; 100%), Person=EMPTY (25; 100%), PronType=EMPTY (25; 100%).
PRON tokens may have the following values of Foreign:
Yes(25; 100% of non-emptyForeign): Oss, þaug, huer, þeira, Haun, Minn, Sitt, Soddan, Vor, VortEMPTY(120312): hann, það, þeir, því, þú, eg, ég, honum, hans, hún
Foreign seems to be lexical feature of PRON. 100% lemmas (10) occur only with one value of Foreign.
ADP
22 ADP tokens (0% of all ADP tokens) have a non-empty value of Foreign.
ADP tokens may have the following values of Foreign:
Yes(22; 100% of non-emptyForeign): fyr, of, umb, Und, intra, nema, umEMPTY(103598): í, á, til, af, með, um, fyrir, að, við, upp
INTJ
20 INTJ tokens (2% of all INTJ tokens) have a non-empty value of Foreign.
INTJ tokens may have the following values of Foreign:
Yes(20; 100% of non-emptyForeign): Hei, Jaaaá, hahaha, he, Vei, Bless, Blubbs, Eia, Hahahaha, OEMPTY(1178): já, nei, ó, amen, æ, jú, nú, jæja, ja, ha
Foreign seems to be lexical feature of INTJ. 100% lemmas (10) occur only with one value of Foreign.
CCONJ
10 CCONJ tokens (0% of all CCONJ tokens) have a non-empty value of Foreign.
CCONJ tokens may have the following values of Foreign:
Yes(10; 100% of non-emptyForeign): oc, etEMPTY(57243): og, en, eða, eður, bæði, né, hvorki, enda, hvörki, ýmist
PUNCT
1 PUNCT tokens (0% of all PUNCT tokens) have a non-empty value of Foreign.
PUNCT tokens may have the following values of Foreign:
Yes(1; 100% of non-emptyForeign): —EMPTY(113950): ,, ., “, :, ;, ?, !, -, …, —
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign:
X –[flat:foreign]–> X (737; 100%),
X –[dep]–> X (13; 100%),
X –[conj]–> X (10; 56%),
X –[flat:foreign]–> PROPN (7; 58%),
PROPN –[iobj]–> PROPN (5; 100%),
X –[amod]–> X (5; 100%),
X –[obl]–> X (4; 100%),
X –[appos]–> X (3; 100%),
X –[flat:name]–> X (3; 60%),
INTJ –[discourse]–> INTJ (2; 100%).