This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home cs/dep issue tracker

foreign: foreign words

We use foreign to label sequences of foreign words. These are given a linear analysis: the head is the first token in the foreign phrase.

foreign does not apply to loanwords or to foreign names. It applies to quoted foreign text incorporated in a sentence/discourse of the host language (unless we want to and know how to annotate the internal structure according to the syntax of the foreign language).

Jarmusch se objevil ve Wangově snímku Modrá ve tváři ( Blue in the Face ) .
foreign(Blue, in)
foreign(Blue, the)
foreign(Blue, Face)

Treebank Statistics (UD_Czech)

This relation is universal.

3123 nodes (0%) are attached to their parents as foreign.

2420 instances of foreign (77%) are right-to-left (child precedes parent). Average distance between parent and child is 1.69452449567723.

The following 75 pairs of parts of speech are connected with foreign: cs-pos/PROPN-cs-pos/ADJ (920; 29% instances), cs-pos/NOUN-cs-pos/ADJ (594; 19% instances), cs-pos/PROPN-cs-pos/ADP (293; 9% instances), cs-pos/PROPN-cs-pos/PROPN (288; 9% instances), cs-pos/NOUN-cs-pos/NOUN (165; 5% instances), cs-pos/ADJ-cs-pos/ADJ (139; 4% instances), cs-pos/NOUN-cs-pos/ADP (127; 4% instances), cs-pos/ADJ-cs-pos/PROPN (96; 3% instances), cs-pos/NOUN-cs-pos/PART (51; 2% instances), cs-pos/ADJ-cs-pos/NOUN (40; 1% instances), cs-pos/NOUN-cs-pos/PROPN (31; 1% instances), cs-pos/PROPN-cs-pos/NOUN (27; 1% instances), cs-pos/PROPN-cs-pos/CONJ (26; 1% instances), cs-pos/NOUN-cs-pos/VERB (24; 1% instances), cs-pos/NOUN-cs-pos/PRON (19; 1% instances), cs-pos/PART-cs-pos/NOUN (18; 1% instances), cs-pos/NOUN-cs-pos/CONJ (15; 0% instances), cs-pos/VERB-cs-pos/NOUN (14; 0% instances), cs-pos/NOUN-cs-pos/ADV (13; 0% instances), cs-pos/PRON-cs-pos/VERB (13; 0% instances), cs-pos/VERB-cs-pos/PRON (13; 0% instances), cs-pos/ADV-cs-pos/VERB (12; 0% instances), cs-pos/ADV-cs-pos/ADJ (10; 0% instances), cs-pos/VERB-cs-pos/ADJ (10; 0% instances), cs-pos/VERB-cs-pos/ADV (9; 0% instances), cs-pos/ADJ-cs-pos/ADV (7; 0% instances), cs-pos/ADJ-cs-pos/VERB (7; 0% instances), cs-pos/VERB-cs-pos/VERB (7; 0% instances), cs-pos/ADP-cs-pos/PROPN (6; 0% instances), cs-pos/VERB-cs-pos/PART (6; 0% instances), cs-pos/ADJ-cs-pos/ADP (5; 0% instances), cs-pos/ADJ-cs-pos/CONJ (5; 0% instances), cs-pos/ADJ-cs-pos/PART (5; 0% instances), cs-pos/ADP-cs-pos/ADJ (5; 0% instances), cs-pos/ADP-cs-pos/NOUN (5; 0% instances), cs-pos/ADV-cs-pos/PROPN (5; 0% instances), cs-pos/CONJ-cs-pos/ADJ (5; 0% instances), cs-pos/PRON-cs-pos/PRON (5; 0% instances), cs-pos/PROPN-cs-pos/PART (5; 0% instances), cs-pos/PROPN-cs-pos/PRON (5; 0% instances), cs-pos/ADV-cs-pos/ADP (4; 0% instances), cs-pos/ADV-cs-pos/PRON (4; 0% instances), cs-pos/PART-cs-pos/ADJ (4; 0% instances), cs-pos/PRON-cs-pos/ADJ (4; 0% instances), cs-pos/PRON-cs-pos/ADP (4; 0% instances), cs-pos/PRON-cs-pos/ADV (4; 0% instances), cs-pos/PRON-cs-pos/NOUN (4; 0% instances), cs-pos/VERB-cs-pos/ADP (4; 0% instances), cs-pos/ADJ-cs-pos/PRON (3; 0% instances), cs-pos/PROPN-cs-pos/ADV (3; 0% instances), cs-pos/PROPN-cs-pos/VERB (3; 0% instances), cs-pos/ADV-cs-pos/ADV (2; 0% instances), cs-pos/ADV-cs-pos/CONJ (2; 0% instances), cs-pos/CONJ-cs-pos/CONJ (2; 0% instances), cs-pos/NOUN-cs-pos/SCONJ (2; 0% instances), cs-pos/PART-cs-pos/PRON (2; 0% instances), cs-pos/PART-cs-pos/VERB (2; 0% instances), cs-pos/PRON-cs-pos/PART (2; 0% instances), cs-pos/PRON-cs-pos/PROPN (2; 0% instances), cs-pos/ADJ-cs-pos/SCONJ (1; 0% instances), cs-pos/ADP-cs-pos/ADP (1; 0% instances), cs-pos/ADV-cs-pos/NOUN (1; 0% instances), cs-pos/ADV-cs-pos/PART (1; 0% instances), cs-pos/CONJ-cs-pos/ADV (1; 0% instances), cs-pos/CONJ-cs-pos/PART (1; 0% instances), cs-pos/CONJ-cs-pos/PRON (1; 0% instances), cs-pos/CONJ-cs-pos/PROPN (1; 0% instances), cs-pos/CONJ-cs-pos/VERB (1; 0% instances), cs-pos/NUM-cs-pos/NOUN (1; 0% instances), cs-pos/PART-cs-pos/PART (1; 0% instances), cs-pos/PART-cs-pos/PROPN (1; 0% instances), cs-pos/PART-cs-pos/SCONJ (1; 0% instances), cs-pos/PRON-cs-pos/INTJ (1; 0% instances), cs-pos/VERB-cs-pos/CONJ (1; 0% instances), cs-pos/VERB-cs-pos/PROPN (1; 0% instances).

# visual-style 1	bgColor:blue
# visual-style 1	fgColor:white
# visual-style 2	bgColor:blue
# visual-style 2	fgColor:white
# visual-style 2 1 foreign	color:blue
1	British	British	ADJ	AAXXX----1A----	Degree=Pos|Foreign=Foreign|NameType=Com|Negative=Pos	2	foreign	_	LId=British-1|LGloss=(souč._názvů,_např._B._Airways)
2	Petroleum	Petroleum	PROPN	NNNXX-----A----	Foreign=Foreign|Gender=Neut|NameType=Com|Negative=Pos	3	nsubj	_	_
3	rozšiřuje	rozšiřovat	VERB	VB-S---3P-AA---	Aspect=Imp|Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act	0	root	_	_
4	síť	síť	NOUN	NNFS4-----A----	Case=Acc|Gender=Fem|Negative=Pos|Number=Sing	3	dobj	_	_
5	čerpacích	čerpací	ADJ	AAFP2----1A----	Case=Gen|Degree=Pos|Gender=Fem|Negative=Pos|Number=Plur	6	amod	_	LDeriv=čerpat
6	stanic	stanice	NOUN	NNFP2-----A----	Case=Gen|Gender=Fem|Negative=Pos|Number=Plur	4	nmod	_	_
7	v	v	ADP	RR--6----------	AdpType=Prep|Case=Loc	8	case	_	LId=v-1
8	ČR	ČR	PROPN	NNFXX-----A---8	Abbr=Yes|Gender=Fem|NameType=Geo|Negative=Pos	4	nmod	_	LId=ČR-1|LGloss=(Česká_republika)

# visual-style 1	bgColor:blue
# visual-style 1	fgColor:white
# visual-style 2	bgColor:blue
# visual-style 2	fgColor:white
# visual-style 2 1 foreign	color:blue
1	Prime	Prime	ADJ	AAXXX----1A----	Degree=Pos|Foreign=Foreign|Negative=Pos	2	foreign	_	LId=Prime-1|LGloss=(fin.:_souč._term._Prime_Rate)
2	Rate	Rate	NOUN	NNFXX-----A----	Foreign=Foreign|Gender=Fem|Negative=Pos	3	nsubj	_	LId=Rate-1|LGloss=(fin.:_souč._term._Prime_Rate)
3	označuje	označovat	VERB	VB-S---3P-AA---	Aspect=Imp|Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act	0	root	_	_
4	sazbu	sazba	NOUN	NNFS4-----A----	Case=Acc|Gender=Fem|Negative=Pos|Number=Sing	3	dobj	_	_
5	pro	pro	ADP	RR--4----------	AdpType=Prep|Case=Acc	7	case	_	LId=pro-1
6	špičkové	špičkový	ADJ	AAMP4----1A----	Animacy=Anim|Case=Acc|Degree=Pos|Gender=Masc|Negative=Pos|Number=Plur	7	amod	_	_
7	klienty	klient	NOUN	NNMP4-----A----	Animacy=Anim|Case=Acc|Gender=Masc|Negative=Pos|Number=Plur	4	nmod	_	SpaceAfter=No
8	.	.	PUNCT	Z:-------------	_	3	punct	_	_

# visual-style 10	bgColor:blue
# visual-style 10	fgColor:white
# visual-style 11	bgColor:blue
# visual-style 11	fgColor:white
# visual-style 11 10 foreign	color:blue
1	Jsem	být	VERB	VB-S---1P-AA---	Mood=Ind|Negative=Pos|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin|Voice=Act	9	ccomp	_	_
2	prý	prý	PART	TT-------------	_	1	advmod	_	_
3	v	v	ADP	RR--6----------	AdpType=Prep|Case=Loc	4	case	_	LId=v-1
4	USA	USA	PROPN	NNIPX-----A----	Animacy=Inan|Gender=Masc|NameType=Geo|Negative=Pos|Number=Plur	1	nmod	_	_
5	a	a	CONJ	J^-------------	_	1	cc	_	LId=a-1
6	hry	hra	NOUN	NNFP1-----A----	Case=Nom|Gender=Fem|Negative=Pos|Number=Plur	7	nsubj	_	LGloss=(dětská;_v_divadle;...)
7	skončily	skončit	VERB	VpTP---XR-AA---	Animacy=Inan|Aspect=Perf|Gender=Fem,Masc|Negative=Pos|Number=Plur|Tense=Past|VerbForm=Part|Voice=Act	1	conj	_	SpaceAfter=No
8	,	,	PUNCT	Z:-------------	_	1	punct	_	_
9	uvedl	uvést	VERB	VpYS---XR-AA---	Gender=Masc|Negative=Pos|Number=Sing|Tense=Past|VerbForm=Part|Voice=Act	0	root	_	_
10	de	de	ADP	RR--1----------	AdpType=Prep|Case=Nom|Foreign=Foreign	11	foreign	_	LGloss=(z_franc._de,_zejm._ve_jménech)
11	Merode	Merode	PROPN	NNMS1-----A----	Animacy=Anim|Case=Nom|Gender=Masc|NameType=Sur|Negative=Pos|Number=Sing	9	nsubj	_	SpaceAfter=No
12	.	.	PUNCT	Z:-------------	_	9	punct	_	_


Treebank Statistics (UD_Czech-CAC)

This relation is universal.

142 nodes (0%) are attached to their parents as foreign.

88 instances of foreign (62%) are right-to-left (child precedes parent). Average distance between parent and child is 1.38732394366197.

The following 22 pairs of parts of speech are connected with foreign: cs-pos/NOUN-cs-pos/ADJ (35; 25% instances), cs-pos/NOUN-cs-pos/ADP (18; 13% instances), cs-pos/NOUN-cs-pos/NOUN (17; 12% instances), cs-pos/PROPN-cs-pos/ADP (14; 10% instances), cs-pos/ADJ-cs-pos/NOUN (13; 9% instances), cs-pos/PROPN-cs-pos/ADJ (7; 5% instances), cs-pos/NOUN-cs-pos/PART (6; 4% instances), cs-pos/PROPN-cs-pos/PROPN (5; 4% instances), cs-pos/ADJ-cs-pos/ADJ (4; 3% instances), cs-pos/ADJ-cs-pos/ADP (4; 3% instances), cs-pos/NOUN-cs-pos/PRON (4; 3% instances), cs-pos/ADJ-cs-pos/PROPN (2; 1% instances), cs-pos/PRON-cs-pos/PRON (2; 1% instances), cs-pos/PROPN-cs-pos/PART (2; 1% instances), cs-pos/VERB-cs-pos/NOUN (2; 1% instances), cs-pos/ADP-cs-pos/PROPN (1; 1% instances), cs-pos/ADV-cs-pos/ADJ (1; 1% instances), cs-pos/ADV-cs-pos/ADP (1; 1% instances), cs-pos/NOUN-cs-pos/PROPN (1; 1% instances), cs-pos/PROPN-cs-pos/PRON (1; 1% instances), cs-pos/VERB-cs-pos/PRON (1; 1% instances), cs-pos/VERB-cs-pos/VERB (1; 1% instances).

# visual-style 6	bgColor:blue
# visual-style 6	fgColor:white
# visual-style 7	bgColor:blue
# visual-style 7	fgColor:white
# visual-style 7 6 foreign	color:blue
1	Hostům	host	NOUN	NNMP3-----A----	Animacy=Anim|Case=Dat|Gender=Masc|Negative=Pos|Number=Plur	4	dobj	_	_
2	na	na	ADP	RR--4----------	AdpType=Prep|Case=Acc	3	case	_	LId=na-1
3	*	*	SYM	Xx-------------	Abbr=Yes	4	advmod	_	_
4	nepomohla	pomoci	VERB	VpQW---XR-NA---	Gender=Fem,Neut|Negative=Neg|Number=Plur,Sing|Tense=Past|VerbForm=Part|Voice=Act	0	root	_	_
5	ani	ani	CONJ	J^-------------	_	7	advmod:emph	_	_
6	power	power	ADJ	AAXXX----1A----	Degree=Pos|Foreign=Foreign|Negative=Pos	7	foreign	_	_
7	play	play	NOUN	NNFXX-----A----	Foreign=Foreign|Gender=Fem|Negative=Pos	4	nsubj	_	SpaceAfter=No|LId=play-1
8	.	.	PUNCT	Z:-------------	_	4	punct	_	_

# visual-style 9	bgColor:blue
# visual-style 9	fgColor:white
# visual-style 10	bgColor:blue
# visual-style 10	fgColor:white
# visual-style 10 9 foreign	color:blue
1	Všechny	všechno	PRON	PLIP1----------	Animacy=Inan|Case=Nom|Gender=Masc|Number=Plur|PronType=Tot	2	nmod	_	_
2	druhy	druh	NOUN	NNIP1-----A----	Animacy=Inan|Case=Nom|Gender=Masc|Negative=Pos|Number=Plur	4	nsubj	_	LId=druh-1|LGloss=(typ)
3	jedenáctistovek	jedenáctistovka	NOUN	NNFP2-----A----	Case=Gen|Gender=Fem|Negative=Pos|Number=Plur	2	nmod	_	_
4	byly	být	VERB	VpTP---XR-AA---	Animacy=Inan|Gender=Fem,Masc|Negative=Pos|Number=Plur|Tense=Past|VerbForm=Part|Voice=Act	0	root	_	_
5	vždy	vždy	ADV	Db-------------	_	4	advmod	_	_
6	jen	jen	PART	TT-------------	_	8	advmod:emph	_	LId=jen-1|LGloss=(pouze)
7	v	v	ADP	RR--6----------	AdpType=Prep|Case=Loc	8	case	_	LId=v-1
8	provedení	provedení	NOUN	NNNS6-----A----	Case=Loc|Gender=Neut|Negative=Pos|Number=Sing	4	nmod	_	LDeriv=provést
9	de	de	ADP	RR--X----------	AdpType=Prep|Foreign=Foreign	10	foreign	_	_
10	luxe	luxe	NOUN	NNXXX-----A----	Foreign=Foreign|Negative=Pos	8	nmod	_	_
11	.	.	PUNCT	Z:-------------	_	4	punct	_	_

# visual-style 14	bgColor:blue
# visual-style 14	fgColor:white
# visual-style 12	bgColor:blue
# visual-style 12	fgColor:white
# visual-style 12 14 foreign	color:blue
1	Jemný	jemný	ADJ	AAIS4----1A----	Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc|Negative=Pos|Number=Sing	2	amod	_	_
2	způsob	způsob	NOUN	NNIS4-----A----	Animacy=Inan|Case=Acc|Gender=Masc|Negative=Pos|Number=Sing	4	dobj	_	_
3	peelingu	peeling	NOUN	NNIS2-----A----	Animacy=Inan|Case=Gen|Foreign=Foreign|Gender=Masc|Negative=Pos|Number=Sing	2	nmod	_	_
4	představuje	představovat	VERB	VB-S---3P-AA---	Aspect=Imp|Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act	0	root	_	_
5	směs	směs	NOUN	NNFS1-----A----	Case=Nom|Gender=Fem|Negative=Pos|Number=Sing	4	nsubj	_	_
6	mandlových	mandlový	ADJ	AAFP2----1A----	Case=Gen|Degree=Pos|Gender=Fem|Negative=Pos|Number=Plur	7	amod	_	_
7	otrub	otruby	NOUN	NNFP2-----A----	Case=Gen|Gender=Fem|Negative=Pos|Number=Plur	5	nmod	_	_
8	a	a	CONJ	J^-------------	_	7	cc	_	LId=a-1
9	mořského	mořský	ADJ	AAIS2----1A----	Animacy=Inan|Case=Gen|Degree=Pos|Gender=Masc|Negative=Pos|Number=Sing	10	amod	_	_
10	písku	písek	NOUN	NNIS2-----A----	Animacy=Inan|Case=Gen|Gender=Masc|Negative=Pos|Number=Sing	7	conj	_	SpaceAfter=No
11	,	,	PUNCT	Z:-------------	_	5	punct	_	_
12	grains	grains	NOUN	NNIPX-----A----	Animacy=Inan|Foreign=Foreign|Gender=Masc|Negative=Pos|Number=Plur	5	dep	_	_
13	de	de	ADP	RR--1----------	AdpType=Prep|Case=Nom|Foreign=Foreign	12	foreign	_	LGloss=(z_franc._de,_zejm._ve_jménech)
14	beauté	beauté	NOUN	NNFXX-----A----	Foreign=Foreign|Gender=Fem|Negative=Pos	12	foreign	_	SpaceAfter=No
15	.	.	PUNCT	Z:-------------	_	4	punct	_	_


foreign in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]