This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

foreign: foreign words

We use foreign to label sequences of foreign words. These are given a linear analysis: the head is the first token in the foreign phrase.

I guess that c' est la vie
nsubj(guess-2, I-1)
ccomp(guess-2, c'-4)
mark(c'-4, that-3)
foreign(c'-4, est-5)
foreign(c'-4, la-6)
foreign(c'-4, vie-7)
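
The linear analysis can be sketched in a few lines of Python. This is only an illustration: the helper name `foreign_arcs` and the 1-based `(relation, head, dependent)` tuple layout are assumptions made here, not part of any UD tool.

```python
def foreign_arcs(tokens, start, end):
    """Return foreign arcs for the 1-based, inclusive token span start..end.

    The first token of the span is the head; every later token in the span
    attaches directly to it (a flat, "linear" analysis).
    """
    if not (1 <= start <= end <= len(tokens)):
        raise ValueError("span out of range")
    head = start
    return [("foreign", head, dep) for dep in range(start + 1, end + 1)]


tokens = ["I", "guess", "that", "c'", "est", "la", "vie"]
for rel, head, dep in foreign_arcs(tokens, 4, 7):
    # Print in the same relation(head-index, dependent-index) notation as above.
    print(f"{rel}({tokens[head - 1]}-{head}, {tokens[dep - 1]}-{dep})")
```

Run on the span from token 4 to token 7, the sketch prints the same three foreign lines as the example above. The attachment of the head itself (here ccomp to guess) depends on the surrounding sentence and is left out.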

Treebank Statistics (UD_English)

This relation is universal.

16 nodes (0% after rounding) are attached to their parents as foreign.

14 instances of foreign (88%) are right-to-left (child precedes parent). Average distance between parent and child is 2.875.

The following 1 pair of parts of speech is connected with foreign: X-X (16; 100% of instances).

# visual-style 1	bgColor:blue
# visual-style 1	fgColor:white
# visual-style 8	bgColor:blue
# visual-style 8	fgColor:white
# visual-style 8 1 foreign	color:blue
1	A	a	X	FW	_	8	foreign	_	_
2	la	la	X	FW	_	8	foreign	_	_
3	guerre	guerre	X	FW	_	8	foreign	_	_
4	c'est	c'est	X	FW	_	8	foreign	_	_
5	comme	comme	X	FW	_	8	foreign	_	_
6	a	a	X	FW	_	8	foreign	_	_
7	la	la	X	FW	_	8	foreign	_	_
8	guerre	guerre	X	FW	_	0	root	_	SpaceAfter=No
9	!	!	PUNCT	.	_	8	punct	_	_
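
Direction and distance figures of the kind reported in these statistics can be recomputed from CoNLL-U data. The following is a minimal sketch, assuming the usual ten tab-separated columns (ID in the first column, HEAD in the seventh, DEPREL in the eighth); the function name `relation_stats` is hypothetical, and the sample is the sentence above reduced to the columns the computation needs.

```python
def relation_stats(conllu, deprel):
    """Count arcs labelled `deprel`, how many have the child before the
    parent, and the mean absolute parent-child distance."""
    count = child_first = total_distance = 0
    for line in conllu.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges and empty nodes
        if cols[7] != deprel:
            continue
        child, parent = int(cols[0]), int(cols[6])
        count += 1
        child_first += child < parent  # True counts as 1
        total_distance += abs(parent - child)
    mean = total_distance / count if count else 0.0
    return count, child_first, mean


# The example sentence as (ID, FORM, HEAD, DEPREL); the remaining columns
# are filled with "_" when the rows are joined into CoNLL-U lines.
rows = [(1, "A", 8, "foreign"), (2, "la", 8, "foreign"),
        (3, "guerre", 8, "foreign"), (4, "c'est", 8, "foreign"),
        (5, "comme", 8, "foreign"), (6, "a", 8, "foreign"),
        (7, "la", 8, "foreign"), (8, "guerre", 0, "root"),
        (9, "!", 8, "punct")]
sample = "\n".join(
    "\t".join([str(i), form, "_", "_", "_", "_", str(head), rel, "_", "_"])
    for i, form, head, rel in rows
)

print(relation_stats(sample, "foreign"))  # (7, 7, 4.0)
```

For this single sentence, all seven foreign arcs have the child preceding the parent, with a mean distance of 4.0. The treebank-wide figures are aggregated over every sentence, so they differ from the per-sentence result.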


Treebank Statistics (UD_English-ESL)

This relation is universal.

4 nodes (0% after rounding) are attached to their parents as foreign.

4 instances of foreign (100%) are left-to-right (parent precedes child). Average distance between parent and child is 1.75.

The following 2 pairs of parts of speech are connected with foreign: X-X (3; 75% of instances), NOUN-NOUN (1; 25% of instances).

# visual-style 11	bgColor:blue
# visual-style 11	fgColor:white
# visual-style 10	bgColor:blue
# visual-style 10	fgColor:white
# visual-style 10 11 foreign	color:blue
1	_	_	CONJ	CC	_	10	cc	_	_
2	_	_	PUNCT	,	_	10	punct	_	_
3	_	_	ADV	RB	_	10	advmod	_	_
4	_	_	PUNCT	,	_	10	punct	_	_
5	_	_	DET	DT	_	6	det	_	_
6	_	_	NOUN	NN	_	8	nmod:poss	_	_
7	_	_	PART	POS	_	6	case	_	_
8	_	_	NOUN	NN	_	10	nsubj	_	_
9	_	_	VERB	VBZ	_	10	cop	_	_
10	_	_	X	FW	_	0	root	_	_
11	_	_	X	FW	_	10	foreign	_	_
12	_	_	X	FW	_	10	foreign	_	_
13	_	_	X	FW	_	10	foreign	_	_
14	_	_	CONJ	CC	_	10	cc	_	_
15	_	_	PRON	DT	_	19	nsubj	_	_
16	_	_	VERB	VBZ	_	19	cop	_	_
17	_	_	DET	DT	_	19	det	_	_
18	_	_	ADJ	JJS	_	19	amod	_	_
19	_	_	NOUN	NN	_	10	conj	_	_
20	_	_	PART	TO	_	21	mark	_	_
21	_	_	VERB	VB	_	19	acl	_	_
22	_	_	ADP	IN	_	21	nmod	_	_
23	_	_	PUNCT	.	_	10	punct	_	_

# visual-style 15	bgColor:blue
# visual-style 15	fgColor:white
# visual-style 14	bgColor:blue
# visual-style 14	fgColor:white
# visual-style 14 15 foreign	color:blue
1	_	_	ADV	RB	_	2	advmod	_	_
2	_	_	ADV	RB	_	8	advmod	_	_
3	_	_	PUNCT	,	_	8	punct	_	_
4	_	_	DET	DT	_	5	det	_	_
5	_	_	NOUN	NN	_	8	nmod:tmod	_	_
6	_	_	ADJ	JJR	_	5	amod	_	_
7	_	_	PRON	PRP	_	8	nsubj	_	_
8	_	_	VERB	VBD	_	0	root	_	_
9	_	_	PRON	PRP	_	8	iobj	_	_
10	_	_	DET	DT	_	11	det	_	_
11	_	_	NOUN	NN	_	8	dobj	_	_
12	_	_	ADP	IN	_	14	case	_	_
13	_	_	DET	DT	_	14	det	_	_
14	_	_	NOUN	NN	_	11	acl	_	_
15	_	_	NOUN	NN	_	14	foreign	_	_
16	_	_	PUNCT	,	_	11	punct	_	_
17	_	_	VERB	VBG	_	11	acl:relcl	_	_
18	_	_	NOUN	NN	_	17	dobj	_	_
19	_	_	ADP	IN	_	22	case	_	_
20	_	_	NUM	CD	_	22	nummod	_	_
21	_	_	ADJ	JJ	_	22	amod	_	_
22	_	_	NOUN	NNS	_	18	nmod	_	_
23	_	_	PRON	DT	_	25	nsubj	_	_
24	_	_	ADV	RB	_	25	advmod	_	_
25	_	_	VERB	VBD	_	22	acl:relcl	_	_
26	_	_	PROPN	NNP	_	25	dobj	_	_
27	_	_	ADP	IN	_	29	case	_	_
28	_	_	DET	DT	_	29	det	_	_
29	_	_	NOUN	NN	_	25	nmod	_	_
30	_	_	PUNCT	.	_	8	punct	_	_


foreign in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]