Treebank Statistics: UD_Western_Sierra_Puebla_Nahuatl-ITML: Features: Foreign
This feature is universal.
It occurs with 1 different values: Yes
.
940 tokens (9%) have a non-empty value of Foreign
.
353 types (14%) occur at least once with a non-empty value of Foreign
.
285 lemmas (17%) occur at least once with a non-empty value of Foreign
.
The feature is used with 13 part-of-speech tags: NOUN (467; 5% instances), ADP (213; 2% instances), ADV (115; 1% instances), SCONJ (45; 0% instances), ADJ (21; 0% instances), CCONJ (18; 0% instances), PROPN (13; 0% instances), DET (12; 0% instances), VERB (11; 0% instances), NUM (9; 0% instances), INTJ (8; 0% instances), PRON (7; 0% instances), AUX (1; 0% instances).
NOUN
467 NOUN tokens (31% of all NOUN
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which NOUN
and Foreign
co-occurred: Case=EMPTY (467; 100%), NounType=EMPTY (467; 100%), Number[psor]=EMPTY (397; 85%), Person[psor]=EMPTY (397; 85%).
NOUN
tokens may have the following values of Foreign
:
Yes
(467; 100% of non-emptyForeign
): pueblo, topueblo, escuela, rana, danzas, irana, burro, guerra, vez, añosEMPTY
(1017): ich, itich, atl, ica, ika, itzcuintli, telpukatl, tonal, ilwitl, tokniwah
Foreign
seems to be lexical feature of NOUN
. 100% lemmas (212) occur only with one value of Foreign
.
ADP
213 ADP tokens (92% of all ADP
tokens) have a non-empty value of Foreign
.
ADP
tokens may have the following values of Foreign
:
Yes
(213; 100% of non-emptyForeign
): de, para, a, por, hasta, en, desde, como, sinEMPTY
(19): quemeh, ic, ik, kemej, kemeh, que, asta, queme
ADV
115 ADV tokens (9% of all ADV
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which ADV
and Foreign
co-occurred: Polarity=EMPTY (115; 100%).
ADV
tokens may have the following values of Foreign
:
Yes
(115; 100% of non-emptyForeign
): después, entonces, pues, ahorita, siempre, ahora, igual, bueno, más, casiEMPTY
(1227): amo, ya, ompa, nikah, y, simi, ohcon, san, ok, axan
Foreign
seems to be lexical feature of ADV
. 100% lemmas (20) occur only with one value of Foreign
.
SCONJ
45 SCONJ tokens (15% of all SCONJ
tokens) have a non-empty value of Foreign
.
SCONJ
tokens may have the following values of Foreign
:
Yes
(45; 100% of non-emptyForeign
): porque, que, como, cuando, hasta, para, MejorEMPTY
(247): para, que, tla, n, porque, ihcuac, nic, tlen, cuando, ijkwak
ADJ
21 ADJ tokens (15% of all ADJ
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which ADJ
and Foreign
co-occurred: Number=Sing (16; 76%).
ADJ
tokens may have the following values of Foreign
:
Yes
(21; 100% of non-emptyForeign
): atrasado, Primera, mexicano, patronal, primer, reconocido, tranquilo, Pobre, amables, chistosoEMPTY
(116): cualli, kwale, kwaltsih, weyi, kwali, Nuevo, chihchikichih, igual, chikawak, cocolisohqueh
Foreign
seems to be lexical feature of ADJ
. 100% lemmas (16) occur only with one value of Foreign
.
CCONJ
18 CCONJ tokens (5% of all CCONJ
tokens) have a non-empty value of Foreign
.
CCONJ
tokens may have the following values of Foreign
:
Yes
(18; 100% of non-emptyForeign
): pero, y, oEMPTY
(381): wan, uan, huan, pero, o, mas, y, Peroh, yan
PROPN
13 PROPN tokens (6% of all PROPN
tokens) have a non-empty value of Foreign
.
PROPN
tokens may have the following values of Foreign
:
Yes
(13; 100% of non-emptyForeign
): estados, unidos, español, diosEMPTY
(198): Ticpintzin, Ticpinitos, Lupita, San, Dios, Pedro, Francisco, Lencho, Mexico, Luis
DET
12 DET tokens (1% of all DET
tokens) have a non-empty value of Foreign
.
DET
tokens may have the following values of Foreign
:
Yes
(12; 100% of non-emptyForeign
): cada, l, las, cualquier, unEMPTY
(933): n, in, se, non, nin, ce, nochi, siki, mik, mic
VERB
11 VERB tokens (1% of all VERB
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which VERB
and Foreign
co-occurred: Aspect=EMPTY (11; 100%), Number[obj]=EMPTY (11; 100%), Person[obj]=EMPTY (11; 100%), Mood=Ind (10; 91%), Subcat=Tran (7; 64%), VerbForm=Fin (7; 64%), Number[subj]=Sing (6; 55%), Tense=Pres (6; 55%).
VERB
tokens may have the following values of Foreign
:
Yes
(11; 100% of non-emptyForeign
): ver, sé, Anda, dar, ponen, sale, sea, sirvesEMPTY
(1762): katki, quihtoh, nauat, yah, mota, katka, niquihtoz, peu, yuwi, nesi
NUM
9 NUM tokens (12% of all NUM
tokens) have a non-empty value of Foreign
.
NUM
tokens may have the following values of Foreign
:
Yes
(9; 100% of non-emptyForeign
): ocho, quince, dieciocho, nueve, siete, veinte, millonesEMPTY
(65): ome, quince, yeyi, caxtol, ce, cuatro, ocho, omi, tres, dieciocho
INTJ
8 INTJ tokens (13% of all INTJ
tokens) have a non-empty value of Foreign
.
INTJ
tokens may have the following values of Foreign
:
Yes
(8; 100% of non-emptyForeign
): bueno, sí, AEMPTY
(53): quemah, aa, ay, quema, queutoc, cuixi, ja, sí, Ah, Ayy
PRON
7 PRON tokens (1% of all PRON
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which PRON
and Foreign
co-occurred: Number=EMPTY (7; 100%), Person=EMPTY (7; 100%), PronType=EMPTY (7; 100%).
PRON
tokens may have the following values of Foreign
:
Yes
(7; 100% of non-emptyForeign
): eso, que, nada, tercero, todoEMPTY
(501): neh, yeh, tlen, non, nochi, teh, yej, tlenoh, tehwah, ye
AUX
1 AUX tokens (0% of all AUX
tokens) have a non-empty value of Foreign
.
AUX
tokens may have the following values of Foreign
:
Yes
(1; 100% of non-emptyForeign
): esEMPTY
(815): o, ma, mo, katka, pewi, mach, wili, nimi, catca, huili
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign
:
NOUN –[case]–> ADP (66; 61%),
NOUN –[conj]–> NOUN (28; 68%),
ADP –[fixed]–> NOUN (9; 100%),
NOUN –[amod]–> ADJ (8; 53%),
PROPN –[flat]–> PROPN (5; 100%),
ADJ –[nsubj]–> NOUN (4; 57%),
ADJ –[case]–> ADP (2; 67%),
ADJ –[conj]–> ADJ (1; 100%),
ADJ –[mark]–> ADV (1; 100%),
ADJ –[obj]–> NOUN (1; 100%).