Treebank Statistics: UD_Western_Sierra_Puebla_Nahuatl-MesoTree: Features: Foreign
This feature is universal.
It occurs with 1 value: Yes.
971 tokens (5%) have a non-empty value of Foreign.
365 types (8%) occur at least once with a non-empty value of Foreign.
293 lemmas (11%) occur at least once with a non-empty value of Foreign.
The feature is used with 13 part-of-speech tags: NOUN (467; 2% instances), ADP (218; 1% instances), ADV (115; 1% instances), SCONJ (47; 0% instances), ADJ (33; 0% instances), PROPN (24; 0% instances), CCONJ (19; 0% instances), DET (12; 0% instances), VERB (11; 0% instances), NUM (9; 0% instances), INTJ (8; 0% instances), PRON (7; 0% instances), AUX (1; 0% instances).
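The token, type, lemma, and per-tag counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch and not the script that generated this page. It assumes that "types" means distinct word forms (case-sensitive) and that multiword-token ranges and empty nodes are skipped. The helper names `parse_feats`, `read_tokens`, and `feature_summary` are hypothetical, and the sample sentence is a toy example rather than treebank data.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Foreign=Yes|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def read_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for each regular word line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])

def feature_summary(conllu_text, feature="Foreign"):
    """Count tokens, distinct forms, distinct lemmas, and tokens per UPOS with the feature."""
    marked = [tok for tok in read_tokens(conllu_text) if feature in tok[3]]
    return {
        "tokens": len(marked),
        "types": len({form for form, _, _, _ in marked}),
        "lemmas": len({lemma for _, lemma, _, _ in marked}),
        "by_upos": Counter(upos for _, _, upos, _ in marked),
    }

# Toy input for illustration only (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = toy example",
    "1\tpara\tpara\tADP\t_\tForeign=Yes\t2\tcase\t_\t_",
    "2\tpueblo\tpueblo\tNOUN\t_\tForeign=Yes\t0\troot\t_\t_",
    "3\tatl\tatl\tNOUN\t_\t_\t2\tconj\t_\t_",
])

summary = feature_summary(SAMPLE)
print(summary["tokens"], summary["types"], summary["lemmas"], dict(summary["by_upos"]))
```

Running the same function over the full treebank file would be one way to check the figures reported here, provided the counting conventions match.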
NOUN
467 NOUN tokens (12% of all NOUN tokens) have a non-empty value of Foreign.
The most frequent other feature values with which NOUN and Foreign co-occurred: Case=EMPTY (467; 100%).
NOUN tokens may have the following values of Foreign:
Yes (467; 100% of non-empty Foreign): pueblo, topueblo, escuela, rana, danzas, irana, burro, guerra, vez, años
EMPTY (3331): itich, ica, tlaxcal, atl, cali, ich, itzcuintli, canasta, comitl, tlacatl
Foreign seems to be a lexical feature of NOUN. 100% of lemmas (212) occur with only one value of Foreign.
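The "lexical feature" observation can be checked by grouping tokens by lemma and counting how many distinct values of Foreign each lemma takes. The sketch below is one possible reading, not the page generator's definition. It assumes that EMPTY counts as a value, so a lemma seen both with and without Foreign=Yes is not consistent, and that only lemmas with at least one non-empty occurrence are considered. The function name `lemma_value_consistency` and the toy tokens are hypothetical.

```python
from collections import defaultdict

def lemma_value_consistency(tokens, upos, feature="Foreign"):
    """Return (consistent_lemmas, marked_lemmas, share) for one UPOS tag.

    tokens: iterable of (lemma, upos, feats_dict) triples.
    Assumption: a missing feature is treated as the value 'EMPTY'.
    """
    values_by_lemma = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos:
            values_by_lemma[lemma].add(feats.get(feature, "EMPTY"))
    # Keep only lemmas that carry the feature at least once.
    marked = {lemma: vals for lemma, vals in values_by_lemma.items() if vals != {"EMPTY"}}
    consistent = [lemma for lemma, vals in marked.items() if len(vals) == 1]
    share = len(consistent) / len(marked) if marked else 0.0
    return len(consistent), len(marked), share

# Toy tokens for illustration only (not taken from the treebank).
toy_tokens = [
    ("pueblo", "NOUN", {"Foreign": "Yes"}),
    ("pueblo", "NOUN", {"Foreign": "Yes"}),
    ("atl", "NOUN", {}),
    ("de", "ADP", {"Foreign": "Yes"}),
    ("de", "ADP", {}),
]

noun_result = lemma_value_consistency(toy_tokens, "NOUN")
adp_result = lemma_value_consistency(toy_tokens, "ADP")
print(noun_result, adp_result)
```

Under this reading, a tag whose lemmas appear both with and without the feature would score below 100%.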
ADP
218 ADP tokens (55% of all ADP tokens) have a non-empty value of Foreign.
ADP tokens may have the following values of Foreign:
Yes (218; 100% of non-empty Foreign): de, para, por, hasta, a, en, desde, como, sin
EMPTY (181): de, den, para, quemeh, hasta, ic, por, que, desde, ik
ADV
115 ADV tokens (4% of all ADV tokens) have a non-empty value of Foreign.
ADV tokens may have the following values of Foreign:
Yes (115; 100% of non-empty Foreign): después, entonces, pues, ahorita, siempre, igual, ahora, bueno, más, casi
EMPTY (2467): amo, ya, simi, ompa, axan, nochipa, yalua, y, nikah, san
Foreign seems to be a lexical feature of ADV. 100% of lemmas (20) occur with only one value of Foreign.
SCONJ
47 SCONJ tokens (10% of all SCONJ tokens) have a non-empty value of Foreign.
SCONJ tokens may have the following values of Foreign:
Yes (47; 100% of non-empty Foreign): porque, que, como, cuando, para, hasta, Mejor
EMPTY (404): tla, para, ihcuac, que, n, porque, tleca, nic, tlen, cuando
ADJ
33 ADJ tokens (7% of all ADJ tokens) have a non-empty value of Foreign.
The most frequent other feature values with which ADJ and Foreign co-occurred: Number=Sing (23; 70%).
ADJ tokens may have the following values of Foreign:
Yes (33; 100% of non-empty Foreign): Nuevo, atrasado, cerca, cerquita, civil, mexicano, patronal, primera, reconocido, tranquilo
EMPTY (440): ueyi, cuali, tliltic, sisic, yancuic, istac, cualli, kwale, kwaltsih, tzocotzin
Foreign seems to be a lexical feature of ADJ. 100% of lemmas (22) occur with only one value of Foreign.
PROPN
24 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Foreign.
PROPN tokens may have the following values of Foreign:
Yes (24; 100% of non-empty Foreign): Juan, estados, unidos, español, Juana, dios
EMPTY (448): juan, Ticpintzin, Ticpinitos, pedro, uamantla, Huamantla, Lupita, San, Dios, Marìa
CCONJ
19 CCONJ tokens (4% of all CCONJ tokens) have a non-empty value of Foreign.
CCONJ tokens may have the following values of Foreign:
Yes (19; 100% of non-empty Foreign): pero, y, o
EMPTY (488): uan, wan, huan, pero, o, ica, dion, masqui, mas, noso
DET
12 DET tokens (1% of all DET tokens) have a non-empty value of Foreign.
DET tokens may have the following values of Foreign:
Yes (12; 100% of non-empty Foreign): cada, l, las, cualquier, un
EMPTY (1902): in, n, se, nin, non, necah, necan, quesqui, ce, tlen
VERB
11 VERB tokens (0% of all VERB tokens) have a non-empty value of Foreign.
The most frequent other feature values with which VERB and Foreign co-occurred: Aspect=EMPTY (11; 100%), Mood=Ind (10; 91%), VerbForm=Fin (7; 64%), Tense=Pres (6; 55%).
VERB tokens may have the following values of Foreign:
Yes (11; 100% of non-empty Foreign): ver, sé, Anda, dar, ponen, sale, sea, sirves
EMPTY (3937): oquis, catqui, katki, oquihtoh, quipia, mota, icpia, onauat, oyah, omic
NUM
9 NUM tokens (4% of all NUM tokens) have a non-empty value of Foreign.
NUM tokens may have the following values of Foreign:
Yes (9; 100% of non-empty Foreign): ocho, quince, dieciocho, nueve, siete, veinte, millones
EMPTY (199): se, ome, yeyi, omeh, caxtol, quince, ce, mahtlactl, mil, simpohual
INTJ
8 INTJ tokens (9% of all INTJ tokens) have a non-empty value of Foreign.
INTJ tokens may have the following values of Foreign:
Yes (8; 100% of non-empty Foreign): bueno, sí, A
EMPTY (85): quema, quemah, amo, aa, ay, queutoc, cuixi, ja, sí, Ah
PRON
7 PRON tokens (1% of all PRON tokens) have a non-empty value of Foreign.
The most frequent other feature values with which PRON and Foreign co-occurred: Number=EMPTY (7; 100%), Person=EMPTY (7; 100%), PronType=EMPTY (7; 100%).
PRON tokens may have the following values of Foreign:
Yes (7; 100% of non-empty Foreign): eso, que, nada, tercero, todo
EMPTY (1195): yeh, tlen, neh, tleno, non, yehuan, teh, tlenoh, tehuan, nin
AUX
1 AUX token (0% of all AUX tokens) has a non-empty value of Foreign.
The most frequent other feature values with which AUX and Foreign co-occurred: Tense=EMPTY (1; 100%).
AUX tokens may have the following values of Foreign:
Yes (1; 100% of non-empty Foreign): es
EMPTY (363): catqui, ma, o, mo, uili, isqui, katka, oc, ocatca, uilis
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign:
NOUN –[case]–> ADP (66; 60%),
NOUN –[conj]–> NOUN (28; 68%),
NOUN –[amod]–> ADJ (11; 73%),
ADP –[fixed]–> NOUN (9; 100%),
ADJ –[nsubj]–> NOUN (5; 56%),
PROPN –[flat]–> PROPN (5; 100%),
ADJ –[case]–> ADP (3; 75%),
NOUN –[flat]–> ADJ (3; 100%),
ADJ –[conj]–> NOUN (1; 100%),
ADJ –[mark]–> ADV (1; 100%).
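Agreement counts of this kind can be approximated by walking each dependency edge and comparing the Foreign value of the child with that of its parent. The sketch below is an illustration under stated assumptions, not the page's actual procedure. It assumes that an edge is counted only when the child has a non-empty value, and that the percentage is the share of such edges where the parent has the same value. The page may define its denominator differently. The function name `agreement_by_relation`, the token dict layout, and the toy sentence are hypothetical.

```python
from collections import Counter

def agreement_by_relation(sentences, feature="Foreign"):
    """Map (parent_upos, deprel, child_upos) to (agreeing_edges, share_of_edges).

    sentences: list of sentences, each a list of token dicts with the keys
    'id', 'head', 'upos', 'deprel', and 'feats' (a dict).
    Assumption: only edges whose child carries the feature are counted.
    """
    agree, total = Counter(), Counter()
    for sent in sentences:
        by_id = {tok["id"]: tok for tok in sent}
        for child in sent:
            parent = by_id.get(child["head"])
            if parent is None:
                continue  # the root has no parent token
            value = child["feats"].get(feature)
            if value is None:
                continue  # child does not carry the feature
            key = (parent["upos"], child["deprel"], child["upos"])
            total[key] += 1
            if parent["feats"].get(feature) == value:
                agree[key] += 1
    return {key: (agree[key], agree[key] / total[key]) for key in total if agree[key]}

# Toy sentence for illustration only (not taken from the treebank).
toy_sentence = [
    {"id": 1, "head": 2, "upos": "ADP", "deprel": "case", "feats": {"Foreign": "Yes"}},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "feats": {"Foreign": "Yes"}},
    {"id": 3, "head": 4, "upos": "ADP", "deprel": "case", "feats": {"Foreign": "Yes"}},
    {"id": 4, "head": 2, "upos": "NOUN", "deprel": "conj", "feats": {}},
]

result = agreement_by_relation([toy_sentence])
print(result)
```

In the toy sentence, one of the two `case` edges links a Spanish-marked adposition to a Spanish-marked noun, so the NOUN, case, ADP relation agrees in 1 of 2 edges.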