home eu/dep edit page issue tracker

punct: punctuation

punct relation is used to annotate tpunctuation marks.

Two kinds of punctuations can be distinguished:

  1. Punctuation in coordination

Coordination in the Basque UD annotation follows the general schema where the first element of the conjunction is the head, and each conjunct, conjunction complementizer or puntuation mark acting as a conjunction should be attached to the first conjunct::

Example:

Zidane, Henry, Barthez, Deschamps, Blanc and the rest form the most robust team according to most experts.

Zidane, Henry, Barthez, Deschamps, Blanc eta enparauek Europako Talde sendoena osatzen dute aditu gehienentzat.

Zidane, Henry, Barthez, Deschamps, Blanc and rest-the-erg Europe-gen team robust-the-most form aux-they-transtive-present expert most-for.

Note that it is the last conjunct the element showing the ergative case. The ergative case corresponds to subjects of transitive verbs. In this case to form

~~~ sdparse Zidane , Henry , Barthez , Deschamps , Blanc eta enparauek Europako talde sendoena osatzen dute aditu gehienentzat . punct(Zidane-1, ,-2) conj(Zidane-1, Henry-3) punct(Zidane-1, ,-4) conj(Zidane-1, Barthez-5) punct(Zidane-1, ,-6) conj(Zidane-1, Deschamps-7) conj(Zidane-1, ,-8) conj(Zidane-1, Blanc-9) cc(Zidane-1, eta-10) conj(Zidane-1, enparauek-11) nmod(talde-13, Europako-12) dobj(osatzen-15, talde-13) amod(talde-13, sendoena-14) aux(osatzen-15, dute-16) nmod(osatzen-15, aditu-17) det(aditu-17, gehienentzat-18) punct(osatzen-15, .-19) ~~~ .

  1. The rest of the punctuations

Treebank Statistics (UD_Basque)

This relation is universal.

19808 nodes (16%) are attached to their parents as punct.

15294 instances of punct (77%) are left-to-right (parent precedes child). Average distance between parent and child is 5.93820678513732.

The following 19 pairs of parts of speech are connected with punct: eu-pos/VERB-eu-pos/PUNCT (16109; 81% instances), eu-pos/NOUN-eu-pos/PUNCT (2099; 11% instances), eu-pos/ADJ-eu-pos/PUNCT (616; 3% instances), eu-pos/PROPN-eu-pos/PUNCT (593; 3% instances), eu-pos/ADV-eu-pos/PUNCT (100; 1% instances), eu-pos/NUM-eu-pos/PUNCT (78; 0% instances), eu-pos/CONJ-eu-pos/PUNCT (52; 0% instances), eu-pos/AUX-eu-pos/PUNCT (47; 0% instances), eu-pos/DET-eu-pos/PUNCT (42; 0% instances), eu-pos/PUNCT-eu-pos/PUNCT (32; 0% instances), eu-pos/ADP-eu-pos/PUNCT (16; 0% instances), eu-pos/VERB-eu-pos/CONJ (6; 0% instances), eu-pos/VERB-eu-pos/PROPN (6; 0% instances), eu-pos/X-eu-pos/PUNCT (4; 0% instances), eu-pos/NOUN-eu-pos/PROPN (3; 0% instances), eu-pos/PART-eu-pos/PUNCT (2; 0% instances), eu-pos/INTJ-eu-pos/PUNCT (1; 0% instances), eu-pos/PRON-eu-pos/PUNCT (1; 0% instances), eu-pos/VERB-eu-pos/NOUN (1; 0% instances).

# visual-style 3	bgColor:blue
# visual-style 3	fgColor:white
# visual-style 8	bgColor:blue
# visual-style 8	fgColor:white
# visual-style 8 3 punct	color:blue
1	Atenasen	Atenas	PROPN	_	Case=Ine|Definite=Def|Number=Sing	8	nmod	_	_
2	ordea	ordea	CONJ	_	_	8	advmod	_	_
3	,	,	PUNCT	_	_	8	punct	_	_
4	beste	beste	DET	_	_	6	det	_	_
5	bost	bost	NUM	_	_	6	nummod	_	_
6	jarduera	jarduera	NOUN	_	Case=Abs|Definite=Ind	8	nsubj	_	_
7	gehiago	gehiago	DET	_	Case=Abs|Definite=Ind	6	det	_	_
8	izan	izan	VERB	_	Aspect=Perf|VerbForm=Part	0	root	_	_
9	daitezke	*edin	AUX	_	Mood=Pot|Number[abs]=Plur|Person[abs]=3	8	aux	_	_
10	.	.	PUNCT	_	_	8	punct	_	_

# visual-style 9	bgColor:blue
# visual-style 9	fgColor:white
# visual-style 4	bgColor:blue
# visual-style 4	fgColor:white
# visual-style 4 9 punct	color:blue
1	Gure	gu	PRON	_	PronType=Prs	2	nmod	_	_
2	etxea	etxe	NOUN	_	Animacy=Inan|Case=Abs|Definite=Def|Number=Sing	11	dobj	_	_
3	,	,	PUNCT	_	_	11	punct	_	_
4	egun	egun	NOUN	_	_	11	nmod	_	_
5	batean	bat	NUM	_	NumType=Card	4	nummod	_	_
6	,	,	PUNCT	_	_	11	punct	_	_
7	ordu	ordu	NOUN	_	_	4	appos	_	_
8	batzuetan	batzuk	DET	_	Case=Ine|Definite=Def|Number=Plur	7	det	_	_
9	,	,	PUNCT	_	_	4	punct	_	_
10	zeharo	zeharo	ADV	_	_	11	advmod	_	_
11	aldatu	aldatu	VERB	_	Aspect=Perf|VerbForm=Part	0	root	_	_
12	zen	izan	AUX	_	Mood=Ind|Number[abs]=Sing|Person[abs]=3	11	aux	_	_
13	.	.	PUNCT	_	_	11	punct	_	_

# visual-style 7	bgColor:blue
# visual-style 7	fgColor:white
# visual-style 4	bgColor:blue
# visual-style 4	fgColor:white
# visual-style 4 7 punct	color:blue
1	Hurrengo	hurrengo	ADJ	_	_	2	amod	_	_
2	orrialdeko	orrialde	NOUN	_	Animacy=Inan|Case=Loc|Definite=Def|Number=Sing	3	nmod	_	_
3	mapa	mapa	NOUN	_	Case=Abs|Definite=Def|Number=Sing	4	nsubj	_	_
4	baliagarria	baliagarri	ADJ	_	Case=Abs|Definite=Def|Number=Sing	0	root	_	_
5	izan	izan	VERB	_	VerbForm=Inf	4	cop	_	_
6	dakizuke	*edin	AUX	_	Mood=Pot|Number[abs]=Sing|Number[dat]=Sing|Person[abs]=3|Person[dat]=2	5	aux	_	_
7	.	.	PUNCT	_	_	4	punct	_	_


punct in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]