
This page pertains to UD version 2.

Dependencies for French corpora

One of the French corpora, French-Spoken, uses a large number of additional relations that do not appear in the other French corpora.

We first present the common list of relations used in most treebanks. The full list, which includes the dependencies specific to French-Spoken, appears at the end of the page.

Common list

Core arguments
  Nominals: nsubj, nsubj:caus, nsubj:pass, obj, obj:agent, iobj, iobj:agent
  Clauses: csubj, csubj:pass, ccomp, xcomp
Non-core dependents
  Nominals: obl, obl:agent, vocative, expl, dislocated
  Clauses: advcl
  Modifier words: advmod, discourse
  Function words: aux, aux:caus, aux:pass, cop, mark
Nominal dependents
  Nominals: nmod, appos, nummod
  Clauses: acl, acl:relcl
  Modifier words: amod
  Function words: det, case

Coordination: conj, cc
MWE: fixed, flat, compound
Loose: list, parataxis
Special: orphan, goeswith, reparandum
Other: punct, root, dep
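
Several labels in this list carry a subtype after a colon (nsubj:pass, aux:caus, acl:relcl). In UD, such a label extends the universal relation named before the colon. The following is a minimal sketch of separating the two parts; the helper name split_relation is hypothetical and does not belong to any UD tool.

```python
def split_relation(label):
    """Split a relation label into (universal relation, subtype or None)."""
    # partition() returns an empty subtype when the label has no colon.
    base, _, subtype = label.partition(":")
    return base, (subtype or None)


# A few labels taken from the common list above.
for label in ["nsubj", "nsubj:pass", "aux:caus", "acl:relcl"]:
    base, subtype = split_relation(label)
    print(f"{label}: universal relation = {base}, subtype = {subtype}")
```

Grouping by the part before the colon is one way to compare corpora that do not use the same subtypes.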

Comparison of relations in French corpora (version 2.1)

Each cell gives the number of occurrences of the relation in the corpus, followed in brackets by its share of that corpus's TOTAL row. A cell filled with =========== marks a relation that does not occur in that corpus.

Relation UD_French UD_French-FTB UD_French-PUD UD_French-ParTUT UD_French-Sequoia
TOTAL 402404 573370 24734 28597 70624
acl 5770 [1.43%] 7249 [1.26%] 28 [0.11%] 485 [1.70%] 1077 [1.52%]
acl:cleft =========== =========== =========== 2 [0.01%] ===========
acl:relcl 3369 [0.84%] 5171 [0.90%] 226 [0.91%] 301 [1.05%] 538 [0.76%]
advcl 3123 [0.78%] 4457 [0.78%] 219 [0.89%] 302 [1.06%] 695 [0.98%]
advmod 13047 [3.24%] 24697 [4.31%] 1002 [4.05%] 1093 [3.82%] 2468 [3.49%]
amod 20081 [4.99%] 24324 [4.24%] 1393 [5.63%] 1455 [5.09%] 3768 [5.34%]
appos 6677 [1.66%] 12 [0.00%] 275 [1.11%] 66 [0.23%] 489 [0.69%]
aux 4723 [1.17%] 7196 [1.26%] 569 [2.30%] 547 [1.91%] 947 [1.34%]
aux:caus 262 [0.07%] 258 [0.04%] =========== 13 [0.05%] 34 [0.05%]
aux:pass 2858 [0.71%] 3333 [0.58%] 227 [0.92%] 242 [0.85%] 756 [1.07%]
case 58713 [14.59%] 70858 [12.36%] 3428 [13.86%] 4077 [14.26%] 9774 [13.84%]
cc 10809 [2.69%] 11751 [2.05%] 544 [2.20%] 876 [3.06%] 1651 [2.34%]
ccomp 1380 [0.34%] 1783 [0.31%] 305 [1.23%] 217 [0.76%] 352 [0.50%]
compound 879 [0.22%] =========== 78 [0.32%] 72 [0.25%] ===========
conj 14625 [3.63%] 15067 [2.63%] 653 [2.64%] 1031 [3.61%] 2032 [2.88%]
cop 5022 [1.25%] 3789 [0.66%] 226 [0.91%] 311 [1.09%] 566 [0.80%]
csubj 44 [0.01%] 83 [0.01%] 23 [0.09%] 64 [0.22%] 3 [0.00%]
csubj:pass 1 [0.00%] =========== 1 [0.00%] 1 [0.00%] 1 [0.00%]
dep 151 [0.04%] 1453 [0.25%] 9 [0.04%] 1 [0.00%] 156 [0.22%]
det 61639 [15.32%] 84136 [14.67%] 3589 [14.51%] 4757 [16.63%] 10238 [14.50%]
det:predet =========== =========== 20 [0.08%] =========== ===========
discourse 44 [0.01%] =========== 30 [0.12%] 15 [0.05%] ===========
dislocated 28 [0.01%] 2 [0.00%] 3 [0.01%] 8 [0.03%] 17 [0.02%]
dislocated:cleft =========== =========== =========== 3 [0.01%] ===========
expl 1344 [0.33%] 4112 [0.72%] 85 [0.34%] 225 [0.79%] 351 [0.50%]
fixed 4269 [1.06%] 50148 [8.75%] 452 [1.83%] 298 [1.04%] 1889 [2.67%]
flat =========== 2 [0.00%] 17 [0.07%] 139 [0.49%] ===========
flat:foreign 5 [0.00%] =========== =========== 1 [0.00%] 76 [0.11%]
flat:name 7122 [1.77%] 4014 [0.70%] 227 [0.92%] 60 [0.21%] 898 [1.27%]
goeswith 22 [0.01%] =========== 3 [0.01%] 4 [0.01%] 2 [0.00%]
iobj 837 [0.21%] 2302 [0.40%] 36 [0.15%] 111 [0.39%] 237 [0.34%]
iobj:agent 20 [0.00%] =========== =========== 1 [0.00%] ===========
mark 6547 [1.63%] 12544 [2.19%] 450 [1.82%] 850 [2.97%] 1483 [2.10%]
nmod 33759 [8.39%] 45783 [7.98%] 1820 [7.36%] 2433 [8.51%] 6561 [9.29%]
nmod:poss =========== =========== 277 [1.12%] =========== ===========
nsubj 20970 [5.21%] 29623 [5.17%] 1425 [5.76%] 1420 [4.97%] 3090 [4.38%]
nsubj:caus 145 [0.04%] 17 [0.00%] =========== 4 [0.01%] 16 [0.02%]
nsubj:expl =========== =========== =========== 2 [0.01%] ===========
nsubj:pass 2569 [0.64%] =========== 200 [0.81%] 224 [0.78%] 594 [0.84%]
nummod 5510 [1.37%] 10236 [1.79%] 243 [0.98%] 314 [1.10%] 1438 [2.04%]
obj 14232 [3.54%] 17456 [3.04%] 1095 [4.43%] 1099 [3.84%] 2230 [3.16%]
obj:agent 109 [0.03%] =========== =========== 9 [0.03%] 10 [0.01%]
obl 26310 [6.54%] 32429 [5.66%] 1404 [5.68%] 1465 [5.12%] 3724 [5.27%]
obl:agent 19 [0.00%] =========== =========== 69 [0.24%] 282 [0.40%]
obl:tmod =========== =========== 80 [0.32%] =========== ===========
orphan 6 [0.00%] 755 [0.13%] 4 [0.02%] 3 [0.01%] 39 [0.06%]
parataxis 604 [0.15%] 2661 [0.46%] 106 [0.43%] 7 [0.02%] 59 [0.08%]
punct 44360 [11.02%] 68826 [12.00%] 2553 [10.32%] 2631 [9.20%] 7871 [11.14%]
reparandum 7 [0.00%] =========== =========== =========== ===========
root 16448 [4.09%] 18535 [3.23%] 1000 [4.04%] 1020 [3.57%] 3099 [4.39%]
vocative 8 [0.00%] 1 [0.00%] 1 [0.00%] 71 [0.25%] 53 [0.08%]
xcomp 3937 [0.98%] 8307 [1.45%] 408 [1.65%] 198 [0.69%] 1060 [1.50%]
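
Counts of this kind can be recomputed from a treebank's CoNLL-U files, where the relation label is the eighth tab-separated column (DEPREL). The sketch below is only an illustration under stated assumptions: the function names count_relations and format_cell are hypothetical, the three-word sample sentence is invented rather than taken from any corpus, and skipping multiword-token ranges and empty nodes is an assumption about how words are counted, not a description of how the figures on this page were produced.

```python
from collections import Counter


def count_relations(conllu_text):
    """Count DEPREL values over the syntactic words of a CoNLL-U string."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U word line
        token_id, deprel = cols[0], cols[7]
        # Assumption: ignore multiword-token ranges (e.g. 3-4) and empty nodes (e.g. 5.1).
        if "-" in token_id or "." in token_id:
            continue
        counts[deprel] += 1
    return counts


def format_cell(count, total):
    """Render a cell in the style of the table: count, then share of the total."""
    return f"{count} [{100 * count / total:.2f}%]"


# Invented sample sentence, for illustration only.
SAMPLE = "\n".join([
    "# text = Il dort.",
    "1\tIl\til\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tdort\tdormir\tVERB\t_\t_\t0\troot\t_\tSpaceAfter=No",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])

counts = count_relations(SAMPLE)
total = sum(counts.values())
for relation in sorted(counts):
    print(relation, format_cell(counts[relation], total))
```

Applied to the first data row of the table, format_cell(5770, 402404) reproduces the cell shown for acl in UD_French.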

Full list

Core arguments
  Nominals: nsubj, nsubj:caus, nsubj:expl, nsubj:pass, nsubj:quasi, obj, obj:agent, iobj, iobj:agent, obl:comp
  Clauses: csubj, csubj:pass, csubj:quasi, ccomp, ccomp:cleft, xcomp
Non-core dependents
  Nominals: obl, obl:agent, obl:mod, obl:periph, vocative, expl, dislocated, dislocated:cleft
  Clauses: advcl, advcl:periph
  Modifier words: advmod, advmod:periph, discourse
  Function words: aux, aux:caus, aux:pass, cop, mark
Nominal dependents
  Nominals: nmod, nmod:appos, appos, nummod
  Clauses: acl, acl:cleft, acl:relcl
  Modifier words: amod
  Function words: det, det:complex, case, case:complex

Coordination: conj, conj:appos, conj:coord, conj:dicto, cc
MWE: fixed, flat, compound
Loose: list, parataxis, parataxis:conj, parataxis:discourse, parataxis:dislocated, parataxis:insert, parataxis:obj, parataxis:parenth
Special: orphan, goeswith, reparandum
Other: punct, root, dep
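
The relations that the full list adds to the common list can be found with a set difference. The sketch below copies both lists from this page; the variable names are hypothetical. Not every added label is exclusive to French-Spoken: the version 2.1 comparison table shows acl:cleft, dislocated:cleft and nsubj:expl in UD_French-ParTUT as well.

```python
# Labels copied from the "Common list" section of this page.
COMMON = set("""
nsubj nsubj:caus nsubj:pass obj obj:agent iobj iobj:agent csubj csubj:pass
ccomp xcomp obl obl:agent vocative expl dislocated advcl advmod discourse
aux aux:caus aux:pass cop mark nmod appos nummod acl acl:relcl amod det case
conj cc fixed flat compound list parataxis orphan goeswith reparandum punct
root dep
""".split())

# Labels copied from the "Full list" section of this page.
FULL = set("""
nsubj nsubj:caus nsubj:expl nsubj:pass nsubj:quasi obj obj:agent iobj
iobj:agent obl:comp csubj csubj:pass csubj:quasi ccomp ccomp:cleft xcomp
obl obl:agent obl:mod obl:periph vocative expl dislocated dislocated:cleft
advcl advcl:periph advmod advmod:periph discourse aux aux:caus aux:pass cop
mark nmod nmod:appos appos nummod acl acl:cleft acl:relcl amod det
det:complex case case:complex conj conj:appos conj:coord conj:dicto cc fixed
flat compound list parataxis parataxis:conj parataxis:discourse
parataxis:dislocated parataxis:insert parataxis:obj parataxis:parenth orphan
goeswith reparandum punct root dep
""".split())

# Labels present only in the full list, grouped by their universal relation.
extra = sorted(FULL - COMMON)
by_base = {}
for label in extra:
    by_base.setdefault(label.split(":")[0], []).append(label)

for base, labels in sorted(by_base.items()):
    print(base, "->", ", ".join(labels))
```

Every label in the difference is a subtype of a relation that already appears in the common list, so the additional relations refine the shared inventory rather than introducing new universal relations.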