Treebank Statistics: UD_Czech-PDT: Features: Foreign
This feature is universal.
It occurs with 1 different values: Yes
.
9318 tokens (1%) have a non-empty value of Foreign
.
3670 types (3%) occur at least once with a non-empty value of Foreign
.
3485 lemmas (6%) occur at least once with a non-empty value of Foreign
.
The feature is used with 14 part-of-speech tags: PROPN (3685; 0% instances), ADJ (2670; 0% instances), NOUN (1813; 0% instances), ADP (592; 0% instances), PART (120; 0% instances), VERB (117; 0% instances), ADV (116; 0% instances), CCONJ (80; 0% instances), PRON (60; 0% instances), NUM (29; 0% instances), DET (20; 0% instances), SCONJ (8; 0% instances), INTJ (6; 0% instances), AUX (2; 0% instances).
PROPN
3685 PROPN tokens (4% of all PROPN
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which PROPN
and Foreign
co-occurred: Polarity=Pos (3685; 100%), Case=EMPTY (2905; 79%), Abbr=EMPTY (2671; 72%), NameType=Com (2512; 68%), Animacy=EMPTY (2259; 61%), Number=EMPTY (2177; 59%).
PROPN
tokens may have the following values of Foreign
:
Yes
(3685; 100% of non-emptyForeign
): HZDS, IRA, Floyd, Nature, International, Science, Sinn, Fein, Times, CupEMPTY
(80347): Praha, ČR, Praze, LN, ODS, USA, J, Jiří, Jan, OSN
Foreign
seems to be lexical feature of PROPN
. 100% lemmas (1422) occur only with one value of Foreign
.
ADJ
2670 ADJ tokens (1% of all ADJ
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which ADJ
and Foreign
co-occurred: VerbForm=EMPTY (2669; 100%), Voice=EMPTY (2669; 100%), Polarity=Pos (2666; 100%), Degree=Pos (2655; 99%), Animacy=EMPTY (2571; 96%), Case=EMPTY (2546; 95%), Number=EMPTY (2447; 92%), Gender=EMPTY (2439; 91%).
ADJ
tokens may have the following values of Foreign
:
Yes
(2670; 100% of non-emptyForeign
): New, the, open, US, Pink, la, Le, Deutsche, die, UnitedEMPTY
(186516): první, další, české, nové, druhé, poslední, státní, dalších, možné, vlastní
Foreign
seems to be lexical feature of ADJ
. 100% lemmas (1002) occur only with one value of Foreign
.
NOUN
1813 NOUN tokens (0% of all NOUN
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which NOUN
and Foreign
co-occurred: Polarity=Pos (1812; 100%), Case=EMPTY (1250; 69%), Animacy=EMPTY (1015; 56%), Number=EMPTY (975; 54%).
NOUN
tokens may have the following values of Foreign
:
Yes
(1813; 100% of non-emptyForeign
): play, managementu, management, CD, s, facto, st, o, homo, neemEMPTY
(370487): roku, korun, let, roce, strany, procent, společnosti, době, případě, firmy
Foreign
seems to be lexical feature of NOUN
. 100% lemmas (945) occur only with one value of Foreign
.
ADP
592 ADP tokens (0% of all ADP
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which ADP
and Foreign
co-occurred: AdpType=Prep (592; 100%), Case=EMPTY (353; 60%).
ADP
tokens may have the following values of Foreign
:
Yes
(592; 100% of non-emptyForeign
): de, of, di, van, in, von, versus, ad, Pro, toEMPTY
(145352): v, na, o, z, s, do, ve, k, pro, za
Foreign
seems to be lexical feature of ADP
. 100% lemmas (55) occur only with one value of Foreign
.
PART
120 PART tokens (1% of all PART
tokens) have a non-empty value of Foreign
.
PART
tokens may have the following values of Foreign
:
Yes
(120; 100% of non-emptyForeign
): off, džambo, not, t, oui, Bienvenue, So, ne, sorry, vivaEMPTY
(8412): jen, až, asi, li, ne, nejen, prý, to, jenom, ano
Foreign
seems to be lexical feature of PART
. 100% lemmas (28) occur only with one value of Foreign
.
VERB
117 VERB tokens (0% of all VERB
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which VERB
and Foreign
co-occurred: Aspect=EMPTY (117; 100%), Gender=EMPTY (112; 96%), Polarity=Pos (111; 95%), Person=EMPTY (63; 54%), Tense=EMPTY (62; 53%), Voice=EMPTY (62; 53%), VerbForm=Fin (59; 50%).
VERB
tokens may have the following values of Foreign
:
Yes
(117; 100% of non-emptyForeign
): is, Be, can, est, transit, Check, Come, Habent, Keep, LoveEMPTY
(130157): má, může, řekl, měl, mají, musí, jde, měla, lze, mít
Foreign
seems to be lexical feature of VERB
. 100% lemmas (84) occur only with one value of Foreign
.
ADV
116 ADV tokens (0% of all ADV
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which ADV
and Foreign
co-occurred: PronType=EMPTY (114; 98%), Degree=EMPTY (107; 92%), Polarity=EMPTY (107; 92%).
ADV
tokens may have the following values of Foreign
:
Yes
(116; 100% of non-emptyForeign
): cca, priori, Today, live, Here, Only, Sic, Very, dove, echtEMPTY
(80116): tak, už, také, jak, včera, ještě, již, tedy, dnes, pak
Foreign
seems to be lexical feature of ADV
. 100% lemmas (71) occur only with one value of Foreign
.
CCONJ
80 CCONJ tokens (0% of all CCONJ
tokens) have a non-empty value of Foreign
.
CCONJ
tokens may have the following values of Foreign
:
Yes
(80; 100% of non-emptyForeign
): and, et, und, As, or, ma, So, e, nEMPTY
(56792): a, i, ale, však, nebo, ani, či, proto, až, ovšem
PRON
60 PRON tokens (0% of all PRON
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which PRON
and Foreign
co-occurred: PrepCase=EMPTY (60; 100%), Reflex=EMPTY (59; 98%), Variant=EMPTY (59; 98%), Gender=EMPTY (43; 72%), PronType=Prs (42; 70%), Number=Sing (32; 53%).
PRON
tokens may have the following values of Foreign
:
Yes
(60; 100% of non-emptyForeign
): it, All, you, I, Me, We, Us, She, WAS, jaEMPTY
(44755): se, si, co, nás, je, nám, nich, kdo, což, mu
Foreign
seems to be lexical feature of PRON
. 100% lemmas (22) occur only with one value of Foreign
.
NUM
29 NUM tokens (0% of all NUM
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which NUM
and Foreign
co-occurred: Gender=EMPTY (29; 100%), NumForm=Word (29; 100%), NumType=Card (29; 100%), Case=EMPTY (26; 90%), NumValue=1,2,3 (24; 83%), Number=Plur (22; 76%).
NUM
tokens may have the following values of Foreign
:
Yes
(29; 100% of non-emptyForeign
): Four, Twenty, Seven, Six, one, Five, Three, Tre, Tri, seděmEMPTY
(41479): 1, 2, 3, dva, tři, 4, jeden, 6, dvě, tisíc
Foreign
seems to be lexical feature of NUM
. 100% lemmas (12) occur only with one value of Foreign
.
DET
20 DET tokens (0% of all DET
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which DET
and Foreign
co-occurred: Animacy=EMPTY (19; 95%), Number[psor]=EMPTY (16; 80%), Case=EMPTY (15; 75%), Gender=EMPTY (15; 75%), Person=EMPTY (14; 70%), Poss=EMPTY (12; 60%).
DET
tokens may have the following values of Foreign
:
Yes
(20; 100% of non-emptyForeign
): Some, My, That, This, Your, sua, C, Notre, These, ceEMPTY
(56196): to, které, který, jeho, která, jejich, své, tím, kteří, tom
Foreign
seems to be lexical feature of DET
. 100% lemmas (13) occur only with one value of Foreign
.
SCONJ
8 SCONJ tokens (0% of all SCONJ
tokens) have a non-empty value of Foreign
.
SCONJ
tokens may have the following values of Foreign
:
Yes
(8; 100% of non-emptyForeign
): as, If, When, ak, ako, gdyž, kakEMPTY
(27485): že, jako, aby, než, když, pokud, protože, zda, jak, zatímco
INTJ
6 INTJ tokens (5% of all INTJ
tokens) have a non-empty value of Foreign
.
INTJ
tokens may have the following values of Foreign
:
Yes
(6; 100% of non-emptyForeign
): O, propos, Bang, Boom, CrashEMPTY
(107): PA, Pink, ach, Inu, hle, proboha, Haló, což, fajn, Ó
AUX
2 AUX tokens (0% of all AUX
tokens) have a non-empty value of Foreign
.
The most frequent other feature values with which AUX
and Foreign
co-occurred: Gender=Neut (2; 100%), Mood=EMPTY (2; 100%), Number=Sing (2; 100%), Person=EMPTY (2; 100%), Polarity=Pos (2; 100%), Tense=Past (2; 100%), VerbForm=Part (2; 100%), Voice=Act (2; 100%).
AUX
tokens may have the following values of Foreign
:
Yes
(2; 100% of non-emptyForeign
): boloEMPTY
(46598): je, by, jsou, bude, byl, být, není, bylo, jsem, jsme
Relations with Agreement in Foreign
The 10 most frequent relations where parent and child node agree in Foreign
:
ADJ –[flat:foreign]–> PROPN (837; 99%),
ADJ –[flat:foreign]–> NOUN (434; 100%),
ADJ –[flat:foreign]–> ADJ (380; 100%),
PROPN –[flat:foreign]–> PROPN (240; 98%),
NOUN –[flat:foreign]–> ADJ (160; 99%),
PROPN –[flat:foreign]–> ADJ (118; 99%),
NOUN –[flat:foreign]–> NOUN (90; 98%),
PROPN –[flat:foreign]–> ADP (72; 95%),
NOUN –[flat:foreign]–> PART (65; 100%),
ADJ –[flat:foreign]–> ADP (60; 100%).