Regenerated
Aux chain
Auxiliary dependencies should not form a chain. All auxiliaries of a verb should be attached directly to that verb.
Search expression: _ <aux (_ <aux _)
Correct example:
Do you think that he will have left when we come ?
aux(think, Do)
aux(left, will)
aux(left, have)
Incorrect example:
Do you think that he will have left when we come ?
aux(think, Do)
aux(have, will)
aux(left, have)
ADJ | AUX | VERB | |
---|---|---|---|
UD_Basque | 3 | ||
UD_Catalan | 36 | ||
UD_Dutch | 2 | 62 | |
UD_Galician | 11 | ||
UD_Italian | 2 | ||
UD_Latvian | 1 | ||
UD_Persian | 1 | 1 | |
UD_Spanish-AnCora | 28 |
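The chain pattern can also be checked outside the search tool. The following is a minimal Python sketch, not the implementation behind this report. It assumes each token is a hypothetical `(id, form, head, deprel)` tuple; the names `find_chains` and `build` are invented for illustration. Tokens whose attachment is not given in the examples above get head 0 and relation `_`.

```python
def find_chains(tokens, relation):
    """Return ids of tokens attached via `relation` to a head that is
    itself attached via `relation` (the pattern _ <rel (_ <rel _))."""
    deprel_of = {tid: deprel for tid, _form, _head, deprel in tokens}
    return [tid for tid, _form, head, deprel in tokens
            if deprel == relation and deprel_of.get(head) == relation]


def build(words, edges):
    """Build token tuples from a word list and {dependent_id: (head_id, deprel)}.
    Tokens without an entry get head 0 and relation '_' (unspecified)."""
    return [(i, w, *edges.get(i, (0, "_"))) for i, w in enumerate(words, start=1)]


words = "Do you think that he will have left when we come ?".split()
# Correct: aux(think, Do), aux(left, will), aux(left, have)
correct = build(words, {1: (3, "aux"), 6: (8, "aux"), 7: (8, "aux")})
# Incorrect: aux(think, Do), aux(have, will), aux(left, have)
incorrect = build(words, {1: (3, "aux"), 6: (7, "aux"), 7: (8, "aux")})

print(find_chains(correct, "aux"))    # []
print(find_chains(incorrect, "aux"))  # [6], i.e. "will" hangs on the auxiliary "have"
```

Passing `mwe`, `foreign`, or `appos` as `relation` gives the analogous check for the other chain tests in this report.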
MWE chain
MWE dependencies should not be chained. All dependents should be attached directly to the first word of the multi-word expression.
Search expression: _ <mwe (_ <mwe _)
ADJ | ADP | ADV | CONJ | DET | NOUN | NUM | PRON | SCONJ | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|
UD_Buryat | 1 | 11 | |||||||||
UD_Croatian | 7 | 2 | 7 | 1 | |||||||
UD_Galician-TreeGal | 6 | 1 | |||||||||
UD_German | 4 | 3 | 2 | ||||||||
UD_Indonesian | 8 | 1 | 2 | 2 | 4 | 1 | |||||
UD_Persian | 3 | 13 | 2 | 233 | 44 | 1 | 38 | 18 | |||
UD_Portuguese-Bosque | 6 | 10 | 24 | 89 | |||||||
UD_Romanian | 2 | 1 | |||||||||
UD_Russian | 13 | 6 | 5 | ||||||||
UD_Russian-SynTagRus | 1 | 1 | 1 |
Foreign chain
Foreign dependencies should not be chained. All dependents should be attached directly to the first word of the foreign segment. If we wish to annotate the real syntactic structure of foreign material, we must not use the foreign relation.
Search expression: _ <foreign (_ <foreign _)
ADJ | ADP | ADV | CONJ | NOUN | PART | PRON | PROPN | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|
UD_Chinese | 7 | |||||||||
UD_Croatian | 9 | 2 | 3 | |||||||
UD_Czech | 66 | 5 | 2 | 16 | 1 | 2 | 25 | 1 | ||
UD_Czech-CAC | 3 | 3 | 1 | 1 | ||||||
UD_Danish | 8 | 17 | ||||||||
UD_Irish | 1 | |||||||||
UD_Latin-PROIEL | 2 | |||||||||
UD_Persian | 1 | 1 | 136 | |||||||
UD_Russian | 8 | |||||||||
UD_Russian-SynTagRus | 703 | 188 | ||||||||
UD_Slovak | 5 |
Conj is right-headed
Coordination dependencies should be left-headed, not right-headed: the first conjunct is the head.
Search expression: _ <conj@R _
Correct example:
Bill is big and honest
conj(big, honest)
Incorrect example:
Bill is big and honest
conj(honest, big)
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 82 | 2 | 22 | 1 | 467 | 30 | 256 | ||||||||||
UD_Ancient_Greek-PROIEL | 30 | 4 | 18 | 2 | 5 | 2 | 9 | ||||||||||
UD_Basque | 2 | 4 | 1 | 5 | |||||||||||||
UD_Bulgarian | 2 | ||||||||||||||||
UD_Buryat | 4 | 5 | 3 | ||||||||||||||
UD_Chinese | 3 | 4 | 8 | ||||||||||||||
UD_Croatian | 5 | 1 | 2 | ||||||||||||||
UD_Danish | 1 | 1 | 2 | 7 | 1 | ||||||||||||
UD_Dutch-LassySmall | 1 | ||||||||||||||||
UD_English | 1 | 15 | 3 | 7 | 1 | 6 | 2 | 4 | 2 | 8 | |||||||
UD_Finnish-FTB | 2 | 3 | 1 | 20 | 133 | ||||||||||||
UD_Galician-TreeGal | 1 | 2 | |||||||||||||||
UD_German | 7 | 2 | 4 | 1 | 2 | 35 | 2 | 3 | 10 | 5 | |||||||
UD_Gothic | 13 | 8 | 1 | 4 | |||||||||||||
UD_Greek | 1 | 12 | 1 | ||||||||||||||
UD_Hindi | 1 | 3 | 2 | 1 | 6 | ||||||||||||
UD_Hungarian | 5 | 1 | 1 | 9 | 1 | 2 | 1 | ||||||||||
UD_Indonesian | 1 | 2 | 6 | 1 | 1 | 7 | 3 | 2 | 1 | ||||||||
UD_Irish | 1 | ||||||||||||||||
UD_Kazakh | 31 | 128 | 11 | 1 | 49 | 88 | |||||||||||
UD_Latin | 22 | 11 | 181 | 8 | 48 | ||||||||||||
UD_Latin-PROIEL | 21 | 15 | 1 | 3 | |||||||||||||
UD_Old_Church_Slavonic | 16 | 2 | 3 | 2 | |||||||||||||
UD_Persian | 7 | 1 | 1 | 49 | 4 | 4 | 2 | ||||||||||
UD_Portuguese | 1 | ||||||||||||||||
UD_Portuguese-Bosque | 1 | 3 | 1 | 1 | |||||||||||||
UD_Romanian | 1 | 1 | 1 | 2 | 1 | ||||||||||||
UD_Russian | 2 | 9 | 1 | 1 | |||||||||||||
UD_Russian-SynTagRus | 2 | 23 | 3 | ||||||||||||||
UD_Spanish | 3 | 4 | 10 | 2 | 35 | 8 | 7 | 9 | 1 | 8 | |||||||
UD_Turkish | 377 | 4 | 81 | 230 | 3 | 1066 | 42 | 42 | 208 | 135 | 1495 | ||||||
UD_Uyghur | 17 | 1 | 2 | 1 | 57 | 2 | 71 | ||||||||||
UD_Vietnamese | 1 | 1 | 4 | 15 | 12 | 4 |
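The direction test can be sketched in a few lines of Python. This is an illustration only, assuming that `@R` in the search expression means the governor stands to the right of the dependent (which is what the test title implies). Tokens are hypothetical `(id, form, head, deprel)` tuples and `right_headed` is an invented name.

```python
def right_headed(tokens, relation):
    """Return (dependent_id, head_id) pairs where a `relation` dependent
    precedes its head, i.e. the head lies to the right."""
    return [(tid, head) for tid, _form, head, deprel in tokens
            if deprel == relation and head > tid]


# Bill(1) is(2) big(3) and(4) honest(5); only the conj edge is listed.
correct = [(5, "honest", 3, "conj")]   # conj(big, honest)
incorrect = [(3, "big", 5, "conj")]    # conj(honest, big)

print(right_headed(correct, "conj"))    # []
print(right_headed(incorrect, "conj"))  # [(3, 5)]
```

The same function called with `appos` covers the right-headed apposition test.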
Conj is left-headed
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. In contrast to the previous test, this test shows examples where coordination is left-headed, which is the expected direction.
Search expression: _ <conj@L _
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 1073 | 3 | 145 | 25 | 3256 | 21 | 140 | 1 | 6742 | 1 | |||||||
UD_Ancient_Greek-PROIEL | 940 | 2 | 102 | 5 | 2 | 2806 | 96 | 153 | 555 | 10 | 5406 | ||||||
UD_Arabic | 1194 | 24 | 35 | 18 | 2 | 6128 | 1029 | 27 | 155 | 4 | 3001 | 1611 | |||||
UD_Basque | 279 | 91 | 8 | 37 | 26 | 2 | 1385 | 98 | 5 | 782 | 1 | 2475 | 3 | ||||
UD_Bulgarian | 444 | 6 | 97 | 11 | 5 | 2412 | 42 | 3 | 12 | 633 | 1904 | ||||||
UD_Buryat | 25 | 1 | 143 | 4 | 2 | 19 | 26 | ||||||||||
UD_Catalan | 1287 | 34 | 242 | 26 | 7 | 61 | 6104 | 442 | 2 | 127 | 3464 | 4 | 1 | 44 | 4141 | ||
UD_Chinese | 85 | 3 | 1 | 3 | 3 | 1628 | 52 | 521 | 2 | 521 | 317 | 45 | |||||
UD_Coptic | 7 | 68 | 1 | 4 | 51 | ||||||||||||
UD_Croatian | 781 | 15 | 96 | 33 | 4 | 2900 | 111 | 4 | 39 | 803 | 1 | 1449 | 12 | ||||
UD_Czech | 7760 | 16 | 2334 | 6 | 32 | 10 | 27838 | 4243 | 158 | 839 | 8174 | 29 | 6 | 31 | 19611 | ||
UD_Czech-CAC | 4345 | 4 | 1088 | 9 | 5 | 2 | 16518 | 499 | 33 | 322 | 1545 | 8 | 17 | 290 | 7744 | ||
UD_Czech-CLTT | 211 | 1 | 35 | 1685 | 44 | 10 | 3 | 280 | 130 | ||||||||
UD_Danish | 323 | 9 | 102 | 6 | 15 | 1062 | 22 | 1 | 36 | 314 | 1323 | 40 | |||||
UD_Dutch | 431 | 9 | 79 | 173 | 6 | 1 | 1900 | 135 | 68 | 839 | 1 | 15 | 1137 | 43 | |||
UD_Dutch-LassySmall | 291 | 11 | 19 | 45 | 1485 | 122 | 14 | 982 | 1 | 1 | 80 | 650 | 52 | ||||
UD_English | 1084 | 132 | 7 | 5 | 35 | 4 | 2933 | 91 | 21 | 170 | 998 | 1 | 2 | 18 | 3656 | 54 | |
UD_English-ESL | 425 | 10 | 60 | 4 | 6 | 3 | 1 | 1026 | 27 | 7 | 56 | 93 | 5 | 1603 | 13 | ||
UD_English-LinES | 286 | 6 | 73 | 9 | 2 | 1103 | 36 | 5 | 55 | 139 | 1579 | 2 | |||||
UD_Estonian | 1144 | 5 | 242 | 4113 | 108 | 140 | 986 | 2 | 5 | 4263 | |||||||
UD_Faroese | 3 | ||||||||||||||||
UD_Finnish | 692 | 6 | 134 | 3 | 1 | 3823 | 106 | 103 | 901 | 1 | 12 | 3463 | 14 | ||||
UD_Finnish-FTB | 646 | 12 | 163 | 13 | 3 | 22 | 1937 | 98 | 25 | 186 | 403 | 35 | 2779 | 16 | |||
UD_French | 1181 | 22 | 83 | 7 | 12 | 10 | 2 | 6067 | 373 | 3 | 174 | 2676 | 3 | 52 | 3658 | 119 | |
UD_Galician-TreeGal | 138 | 1 | 12 | 1 | 1 | 413 | 18 | 21 | 135 | 1 | 261 | 4 | |||||
UD_German | 1196 | 31 | 89 | 14 | 14 | 1 | 4933 | 363 | 6 | 66 | 2540 | 1 | 2756 | 12 | |||
UD_Gothic | 189 | 3 | 18 | 8 | 840 | 13 | 46 | 120 | 3 | 1950 | |||||||
UD_Greek | 198 | 6 | 36 | 2 | 1251 | 38 | 7 | 18 | 597 | ||||||||
UD_Hebrew | 417 | 7 | 76 | 119 | 8 | 2 | 2329 | 37 | 47 | 428 | 1961 | 1 | |||||
UD_Hindi | 316 | 5 | 11 | 4 | 7 | 2664 | 21 | 1 | 28 | 2478 | 1896 | 1 | |||||
UD_Hungarian | 206 | 22 | 1 | 1 | 3 | 573 | 21 | 18 | 170 | 721 | |||||||
UD_Indonesian | 158 | 9 | 16 | 1 | 1890 | 49 | 5 | 1539 | 4 | 1110 | 1 | ||||||
UD_Irish | 37 | 36 | 4 | 1 | 2 | 378 | 15 | 18 | 49 | 6 | 158 | 7 | |||||
UD_Italian | 1021 | 5 | 93 | 3 | 1 | 11 | 4812 | 186 | 194 | 830 | 1 | 6 | 3 | 2678 | 4 | ||
UD_Japanese | 284 | 10 | 8 | 1 | 3771 | 171 | 9 | 4094 | |||||||||
UD_Japanese-KTC | 103 | 9 | 2 | 3195 | 16 | 4 | 399 | 299 | |||||||||
UD_Latin | 240 | 1 | 35 | 10 | 1 | 1080 | 4 | 29 | 1 | 1794 | |||||||
UD_Latin-ITTB | 1586 | 5 | 452 | 2 | 4583 | 61 | 697 | 70 | 1 | 2982 | 35 | ||||||
UD_Latin-PROIEL | 624 | 1 | 148 | 2 | 2528 | 34 | 190 | 429 | 17 | 4897 | 26 | ||||||
UD_Latvian | 66 | 12 | 443 | 16 | 16 | 121 | 17 | 1 | 11 | 304 | 4 | ||||||
UD_Norwegian | 1345 | 92 | 44 | 8 | 77 | 7 | 4567 | 139 | 105 | 1152 | 4 | 4112 | 2 | ||||
UD_Old_Church_Slavonic | 87 | 10 | 3 | 658 | 25 | 27 | 109 | 1 | 2293 | ||||||||
UD_Persian | 1284 | 3 | 83 | 3 | 1 | 1 | 3 | 4810 | 187 | 65 | 7 | 1 | 2252 | 5 | |||
UD_Polish | 230 | 2 | 7 | 897 | 22 | 14 | 122 | 2 | 1381 | 4 | |||||||
UD_Portuguese | 534 | 7 | 107 | 1 | 9 | 2 | 2535 | 132 | 103 | 1375 | 24 | 1818 | 2 | ||||
UD_Portuguese-BR | 435 | 58 | 3 | 12 | 5 | 3841 | 286 | 47 | 82 | 2366 | 3121 | 32 | |||||
UD_Portuguese-Bosque | 463 | 11 | 46 | 1 | 1 | 24 | 2295 | 125 | 72 | 623 | 8 | 1673 | |||||
UD_Romanian | 878 | 27 | 140 | 7 | 30 | 21 | 1 | 3828 | 288 | 11 | 115 | 471 | 1 | 2 | 2789 | 6 | |
UD_Russian | 491 | 237 | 52 | 5 | 6 | 2038 | 57 | 1 | 1 | 631 | 6 | 8 | 989 | ||||
UD_Russian-SynTagRus | 5822 | 1655 | 20908 | 159 | 52 | 399 | 2705 | 3 | 17504 | 6 | |||||||
UD_Sanskrit | 1 | 21 | 7 | 7 | |||||||||||||
UD_Slovak | 338 | 1 | 93 | 3 | 1274 | 119 | 16 | 62 | 298 | 1 | 1943 | 33 | |||||
UD_Slovenian | 814 | 4 | 99 | 1 | 2287 | 101 | 41 | 361 | 1555 | 4 | |||||||
UD_Slovenian-SST | 87 | 36 | 219 | 30 | 12 | 29 | 49 | 365 | 9 | ||||||||
UD_Spanish | 1433 | 14 | 185 | 8 | 14 | 38 | 6956 | 428 | 208 | 3243 | 75 | 4331 | 179 | ||||
UD_Spanish-AnCora | 1628 | 45 | 258 | 107 | 9 | 55 | 5657 | 304 | 1 | 185 | 2612 | 2 | 4 | 20 | 5197 | ||
UD_Swedish | 505 | 18 | 138 | 15 | 7 | 2310 | 27 | 45 | 265 | 1011 | |||||||
UD_Swedish-LinES | 320 | 30 | 98 | 4 | 3 | 1159 | 25 | 4 | 45 | 150 | 1835 | 1 | |||||
UD_Swedish_Sign_Language | 4 | 8 | 13 | 15 | 170 | 20 | |||||||||||
UD_Tamil | 3 | 113 | 4 | 4 | 87 | 19 | |||||||||||
UD_Turkish | 3 | 30 | 8 | 13 | 6 | 1 | |||||||||||
UD_Ukrainian | 2 | 7 | 6 | ||||||||||||||
UD_Uyghur | 1 | ||||||||||||||||
UD_Vietnamese | 121 | 5 | 531 | 32 | 8 | 17 | 781 | 11 |
Appos is right-headed
Apposition dependencies should be left-headed, not right-headed: the first nominal is the head of the apposition.
Search expression: _ <appos@R _
Correct example:
Sam , my brother , arrived
appos(Sam, brother)
Incorrect example:
Sam , my brother , arrived
appos(brother, Sam)
ADJ | ADP | ADV | AUX | CONJ | DET | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 2 | 3 | 4 | 1 | ||||||||||||
UD_Ancient_Greek-PROIEL | 45 | 5 | 61 | 5 | 24 | 11 | 1 | 36 | ||||||||
UD_Arabic | 13 | 1 | 143 | 10 | 7 | 1 | 24 | |||||||||
UD_Basque | 1 | 3 | 1 | |||||||||||||
UD_Bulgarian | 1 | 1 | ||||||||||||||
UD_Catalan | 1 | 1 | 25 | 1 | 1 | 7 | 1 | |||||||||
UD_Chinese | 1 | 488 | 2 | 84 | 3 | 8 | 8 | 3 | ||||||||
UD_Croatian | 8 | 1 | 10 | 3 | ||||||||||||
UD_Czech | 17 | |||||||||||||||
UD_Czech-CAC | 6 | 4 | ||||||||||||||
UD_Czech-CLTT | 1 | |||||||||||||||
UD_Danish | 1 | 6 | ||||||||||||||
UD_Dutch | 1 | |||||||||||||||
UD_Dutch-LassySmall | 1 | 1 | 7 | |||||||||||||
UD_English | 1 | 1 | 7 | 37 | 6 | 1 | 2 | |||||||||
UD_English-ESL | 1 | |||||||||||||||
UD_English-LinES | 1 | 1 | ||||||||||||||
UD_Estonian | 1058 | 3 | ||||||||||||||
UD_French | 1 | 123 | 4 | 6 | 29 | 1 | 1 | 2 | ||||||||
UD_Galician-TreeGal | 2 | 1 | ||||||||||||||
UD_German | 10 | 1 | 9 | 1 | 38 | 2 | 7 | 4 | 26 | 1 | 7 | 2 | ||||
UD_Gothic | 12 | 1 | 12 | 5 | 20 | 4 | 8 | |||||||||
UD_Hebrew | 2 | 3 | ||||||||||||||
UD_Hungarian | 42 | |||||||||||||||
UD_Indonesian | 16 | 3 | 2 | 21 | 1 | 1 | ||||||||||
UD_Italian | 1 | |||||||||||||||
UD_Japanese | 2 | 1067 | 103 | 5 | 7 | |||||||||||
UD_Japanese-KTC | 3 | 619 | 12 | 1 | 157 | |||||||||||
UD_Kazakh | 6 | 1 | ||||||||||||||
UD_Latin | 1 | 2 | ||||||||||||||
UD_Latin-ITTB | 1 | |||||||||||||||
UD_Latin-PROIEL | 124 | 1 | 5 | 77 | 14 | 21 | 7 | 1 | 2 | |||||||
UD_Old_Church_Slavonic | 14 | 3 | 17 | 7 | 2 | 2 | 1 | |||||||||
UD_Persian | 1 | 8 | 1 | |||||||||||||
UD_Portuguese | 1 | 2 | ||||||||||||||
UD_Portuguese-BR | 4 | 4 | ||||||||||||||
UD_Portuguese-Bosque | 1 | 1 | 1 | 1 | 4 | |||||||||||
UD_Romanian | 3 | 1 | ||||||||||||||
UD_Russian | 2 | 7 | 1 | 1 | 4 | |||||||||||
UD_Russian-SynTagRus | 3 | 2 | 63 | 1 | 5 | 7 | ||||||||||
UD_Spanish | 1 | 2 | 11 | 3 | 2 | 43 | 57 | 6 | ||||||||
UD_Spanish-AnCora | 59 | 1 | 3 | 2 | ||||||||||||
UD_Turkish | 19 | |||||||||||||||
UD_Uyghur | 12 | 1 | ||||||||||||||
UD_Vietnamese | 1 |
Copula is not VERB
Copulas should always be verbs (tagged VERB or AUX), not punctuation (dashes) and definitely not nominals.
Search expression: (!(VERB|AUX)) <cop _
ADJ | ADP | ADV | AUX | CONJ | DET | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Arabic | 15 | 14 | 12 | 5 | 110 | 26 | 14 | 17 | 1 | 1 | 33 | ||||
UD_Basque | 2 | 3 | |||||||||||||
UD_Catalan | 3 | 4 | 2 | 44 | 1 | ||||||||||
UD_Chinese | 8 | ||||||||||||||
UD_Croatian | 1 | 5 | 1 | ||||||||||||
UD_Czech-CAC | 9 | 2 | 1 | 15 | 1 | 20 | |||||||||
UD_Czech-CLTT | 7 | 23 | |||||||||||||
UD_Dutch | 5 | 5 | 5 | 31 | 8 | 18 | 36 | 1 | 3 | ||||||
UD_English-LinES | 1 | ||||||||||||||
UD_Finnish-FTB | 2 | ||||||||||||||
UD_Galician | 818 | 168 | 17 | 6 | 586 | 10 | 69 | 13 | 11 | 1 | |||||
UD_Galician-TreeGal | 7 | ||||||||||||||
UD_German | 31 | 27 | 1 | 86 | 8 | 6 | 23 | 1 | 2 | ||||||
UD_Hebrew | 7 | ||||||||||||||
UD_Hindi | 1 | ||||||||||||||
UD_Irish | 3 | 1 | |||||||||||||
UD_Latin | 1 | ||||||||||||||
UD_Latin-ITTB | 6 | 4 | 27 | 1 | 2 | 2 | |||||||||
UD_Persian | 3 | ||||||||||||||
UD_Polish | 5 | 1 | |||||||||||||
UD_Portuguese | 2 | 1 | 203 | 2 | 6 | 24 | 4 | 6 | 2 | 21 | |||||
UD_Portuguese-BR | 1 | 3 | 1 | 16 | 1 | ||||||||||
UD_Portuguese-Bosque | 1 | 7 | 2 | ||||||||||||
UD_Romanian | 1 | 3 | |||||||||||||
UD_Russian | 1 | 23 | 5 | 28 | 1 | ||||||||||
UD_Russian-SynTagRus | 636 | ||||||||||||||
UD_Slovak | 1 | ||||||||||||||
UD_Slovenian-SST | 1 | ||||||||||||||
UD_Spanish | 13 | 3 | 23 | 4 | 20 | 1 | 5 | ||||||||
UD_Spanish-AnCora | 4 | 1 | 7 | 3 | 47 | 6 | 1 | 3 | |||||||
UD_Uyghur | 2 | 1 | 1 | 1 | |||||||||||
UD_Vietnamese | 2 |
PRON is mark
Pronouns must not be attached using the mark relation. Relative pronouns must not be confused with subordinating conjunctions, even if the word is ambiguous.
Search expression: PRON <mark _
Correct example:
This is a fact that we cannot ignore . You know that we cannot ignore it .
dobj(ignore-8, that-5)
mark(ignore-15, that-12)
Incorrect example:
This is a fact that we cannot ignore .
mark(ignore-8, that-5)
Relation det is used for node that is neither DET nor PRON
The det relation is primarily intended for determiners, i.e. words tagged DET. Pronouns are tolerated at least until the borderline between the two classes is better investigated and defined.
Search expression: (!DET&!PRON) <det _
Relation punct is used for node that is not PUNCT
Only nodes tagged PUNCT can be attached using the punct relation.
Search expression: !PUNCT <punct _
PUNCT is attached as neither punct nor root
Nodes tagged PUNCT can only be attached using the punct relation (and exceptionally, if it is the only node in the sentence, also root).
Search expression: PUNCT (<!punct&<!root) _
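The tests that pair a part-of-speech tag with a relation (copula, mark, det, punct) all have the same shape, so they can be sketched as one rule table. The Python below is an assumption-laden illustration, not the tool used for this report: tokens are hypothetical `(id, form, upos, deprel)` tuples, and `RULES` and `check_tokens` are invented names. Like the search expression, the PUNCT rule does not verify that a root punctuation token is the only node in the sentence.

```python
# Each rule: (test name, predicate over (upos, deprel)); True means a hit.
RULES = [
    ("Copula is not VERB",
     lambda upos, rel: rel == "cop" and upos not in {"VERB", "AUX"}),
    ("PRON is mark",
     lambda upos, rel: rel == "mark" and upos == "PRON"),
    ("det used for node that is neither DET nor PRON",
     lambda upos, rel: rel == "det" and upos not in {"DET", "PRON"}),
    ("punct used for node that is not PUNCT",
     lambda upos, rel: rel == "punct" and upos != "PUNCT"),
    ("PUNCT attached as neither punct nor root",
     lambda upos, rel: upos == "PUNCT" and rel not in {"punct", "root"}),
]


def check_tokens(tokens):
    """Return (id, form, test name) for every rule a token violates."""
    hits = []
    for tid, form, upos, deprel in tokens:
        for name, is_hit in RULES:
            if is_hit(upos, deprel):
                hits.append((tid, form, name))
    return hits


# "This is a fact that we cannot ignore ." with the relative pronoun "that" (5).
good = [(5, "that", "PRON", "dobj"), (9, ".", "PUNCT", "punct")]
bad = [(5, "that", "PRON", "mark"), (9, ".", "PUNCT", "punct")]

print(check_tokens(good))  # []
print(check_tokens(bad))   # [(5, 'that', 'PRON is mark')]
```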
PRON or DET lacks the PronType feature
All pronouns and determiners should be further categorized using the PronType feature. Other POS may or may not have this feature.
Search expression: (PRON|DET)&!PronType
NUM lacks the NumType feature
All numerals should be further categorized using the NumType feature. Other POS may or may not have this feature.
Search expression: NUM&!NumType
VERB or AUX lacks the VerbForm feature
All verbs should be further categorized using the VerbForm feature. Other POS may or may not have this feature.
Search expression: (VERB|AUX)&!VerbForm
Finite verb is not a verb
Only non-finite VerbForms are expected to appear with non-verb parts of speech (NOUN, ADJ, ADV).
Search expression: (VerbForm=Fin)&!(VERB|AUX)
Finite verb lacks the Mood feature
All finite verb forms should be further categorized using the Mood feature.
Search expression: (VerbForm=Fin)&!Mood
Degree feature used with a word that is neither adjective nor adverb
The Degree feature is normally associated with (a subset of) adjectives and adverbs. Tags other than ADJ and ADV are probably wrong.
Search expression: Degree&!ADJ&!ADV
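The feature tests above need only the tag and the feature set of a single token. A minimal Python sketch follows; it assumes features arrive as a CoNLL-U FEATS string (`Name=Value` pairs joined by `|`, or `_` when empty), and the function names are invented for illustration. Multi-valued features such as `VerbForm=Fin,Inf` are not handled.

```python
def parse_feats(feats):
    """Parse a FEATS string such as 'Mood=Ind|VerbForm=Fin' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_hits(upos, feats):
    """Return the names of the feature tests that this token fails."""
    f = parse_feats(feats)
    hits = []
    if upos in {"PRON", "DET"} and "PronType" not in f:
        hits.append("PRON or DET lacks the PronType feature")
    if upos == "NUM" and "NumType" not in f:
        hits.append("NUM lacks the NumType feature")
    if upos in {"VERB", "AUX"} and "VerbForm" not in f:
        hits.append("VERB or AUX lacks the VerbForm feature")
    if f.get("VerbForm") == "Fin" and upos not in {"VERB", "AUX"}:
        hits.append("Finite verb is not a verb")
    if f.get("VerbForm") == "Fin" and "Mood" not in f:
        hits.append("Finite verb lacks the Mood feature")
    if "Degree" in f and upos not in {"ADJ", "ADV"}:
        hits.append("Degree used with a word that is neither ADJ nor ADV")
    return hits


print(feature_hits("VERB", "Mood=Ind|VerbForm=Fin"))  # []
print(feature_hits("VERB", "VerbForm=Fin"))  # ['Finite verb lacks the Mood feature']
print(feature_hits("PRON", "_"))  # ['PRON or DET lacks the PronType feature']
```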
Maximum one subject
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. No predicate can have more than one subject. Note that subordinate clauses do not head copulas for this reason.
Search expression: _ >nsubj|>csubj|>nsubjpass|>csubjpass _ >nsubj|>csubj|>nsubjpass|>csubjpass _
ADJ | ADP | ADV | AUX | CONJ | DET | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 21 | 2 | 5 | 5 | 1 | 13 | 1 | 3 | 4 | 156 | |||||
UD_Ancient_Greek-PROIEL | 3 | 31 | 1 | 45 | |||||||||||
UD_Arabic | 34 | 1 | 1 | 24 | 4 | 3 | 100 | 10 | |||||||
UD_Basque | 1 | 2 | 2 | 1 | 31 | ||||||||||
UD_Bulgarian | 2 | ||||||||||||||
UD_Buryat | 2 | ||||||||||||||
UD_Catalan | 25 | 18 | 26 | 2 | 6 | 17 | 1 | 331 | |||||||
UD_Chinese | 2 | 3 | 1 | 84 | |||||||||||
UD_Croatian | 5 | 1 | 7 | ||||||||||||
UD_Czech | 9 | 3 | 56 | ||||||||||||
UD_Czech-CAC | 3 | 2 | 15 | ||||||||||||
UD_Czech-CLTT | 2 | 5 | |||||||||||||
UD_Danish | 8 | 1 | 8 | 1 | 1 | 219 | |||||||||
UD_Dutch | 104 | 10 | 20 | 1 | 32 | 4 | 61 | 3 | |||||||
UD_Dutch-LassySmall | 1 | 2 | 14 | ||||||||||||
UD_English | 4 | 1 | 1 | 1 | 1 | 27 | |||||||||
UD_English-ESL | 3 | 2 | 20 | ||||||||||||
UD_English-LinES | 5 | 3 | 2 | 58 | |||||||||||
UD_Finnish | 606 | ||||||||||||||
UD_Finnish-FTB | 2 | ||||||||||||||
UD_French | 13 | 8 | 1 | 1 | 89 | ||||||||||
UD_Galician | 632 | ||||||||||||||
UD_Galician-TreeGal | 1 | 12 | |||||||||||||
UD_German | 6 | 2 | 1 | 39 | |||||||||||
UD_Gothic | 2 | 7 | |||||||||||||
UD_Greek | 2 | 1 | 1 | 27 | |||||||||||
UD_Hebrew | 3 | 2 | 1 | 22 | |||||||||||
UD_Hindi | 16 | 13 | 5 | 170 | |||||||||||
UD_Hungarian | 9 | 1 | 7 | 1 | 23 | ||||||||||
UD_Indonesian | 8 | 6 | 3 | 79 | |||||||||||
UD_Irish | 1 | 1 | 26 | ||||||||||||
UD_Italian | 1 | 4 | |||||||||||||
UD_Japanese-KTC | 19 | 17 | 73 | ||||||||||||
UD_Latin | 9 | 2 | 3 | 1 | 60 | ||||||||||
UD_Latin-ITTB | 4 | 1 | 5 | 1 | 2 | 50 | |||||||||
UD_Latin-PROIEL | 4 | 18 | 3 | 128 | 1 | ||||||||||
UD_Latvian | 1 | 6 | |||||||||||||
UD_Norwegian | 26 | 1 | 1 | 1 | 16 | 1 | 2 | 216 | |||||||
UD_Old_Church_Slavonic | 2 | 3 | |||||||||||||
UD_Persian | 18 | 13 | 62 | ||||||||||||
UD_Portuguese | 4 | 4 | 6 | 1 | 2 | 1 | 18 | ||||||||
UD_Portuguese-BR | 2 | 1 | 12 | ||||||||||||
UD_Portuguese-Bosque | 4 | 5 | 24 | 37 | |||||||||||
UD_Romanian | 2 | 1 | 1 | 22 | |||||||||||
UD_Russian | 2 | 1 | 15 | ||||||||||||
UD_Russian-SynTagRus | 3 | 1 | 2 | 2 | 11 | ||||||||||
UD_Spanish | 5 | 5 | 1 | 1 | 46 | 1 | |||||||||
UD_Spanish-AnCora | 30 | 6 | 18 | 5 | 6 | 13 | 3 | 293 | |||||||
UD_Swedish-LinES | 10 | 1 | 84 | ||||||||||||
UD_Tamil | 1 | 2 | |||||||||||||
UD_Turkish | 1 | 1 | 40 | ||||||||||||
UD_Vietnamese | 7 | 6 | 67 |
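Counting dependents per head is enough to reproduce this kind of test. The sketch below is illustrative Python under stated assumptions: tokens are hypothetical `(id, head, deprel)` tuples, the example trees are invented, and the search expression is taken to require two distinct subject dependents.

```python
from collections import Counter

SUBJECT_RELS = {"nsubj", "csubj", "nsubjpass", "csubjpass"}


def heads_with_multiple(tokens, relations):
    """Return sorted ids of heads that have more than one dependent
    attached via any relation in `relations`."""
    counts = Counter(head for _tid, head, deprel in tokens if deprel in relations)
    return sorted(head for head, n in counts.items() if n > 1)


# Invented trees: token 3 is the predicate.
one_subject = [(1, 3, "nsubj"), (2, 3, "advmod"), (3, 0, "root")]
two_subjects = [(1, 3, "nsubj"), (2, 3, "csubj"), (3, 0, "root")]

print(heads_with_multiple(one_subject, SUBJECT_RELS))   # []
print(heads_with_multiple(two_subjects, SUBJECT_RELS))  # [3]
```

Calling the same function with `{"dobj", "ccomp"}` gives the corresponding check for the direct-object test.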
Maximum one direct object
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. No predicate can have more than one direct object. Ccomp counts as a direct object. (To a certain extent xcomp does too, but dobj can co-occur with xcomp in cases of secondary predication, so this test does not look at xcomp.)
Search expression: _ >dobj|>ccomp _ >dobj|>ccomp _
ADJ | ADP | ADV | AUX | CONJ | DET | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 2 | 4 | 25 | 2 | 7 | 1 | 869 | |||||||||
UD_Ancient_Greek-PROIEL | 2 | 2 | 1 | 1 | 1 | 244 | ||||||||||
UD_Arabic | 43 | 1 | 7 | 151 | 1 | 3 | 484 | 107 | ||||||||
UD_Basque | 1 | 2 | 1 | 33 | ||||||||||||
UD_Bulgarian | 32 | |||||||||||||||
UD_Buryat | 4 | |||||||||||||||
UD_Catalan | 54 | 27 | 37 | 25 | 30 | 18 | 1 | 5586 | ||||||||
UD_Chinese | 242 | |||||||||||||||
UD_Coptic | 37 | |||||||||||||||
UD_Croatian | 4 | 1 | 1 | 124 | ||||||||||||
UD_Czech | 58 | 3 | 1 | 1 | 1134 | |||||||||||
UD_Czech-CAC | 22 | 1 | 2 | 343 | ||||||||||||
UD_Czech-CLTT | 2 | 11 | ||||||||||||||
UD_Danish | 55 | 1 | ||||||||||||||
UD_Dutch | 100 | 5 | 85 | 119 | 30 | 2 | 3 | 2 | 1315 | 5 | ||||||
UD_Dutch-LassySmall | 1 | 2 | 2 | 60 | ||||||||||||
UD_English | 4 | 225 | ||||||||||||||
UD_English-ESL | 1 | 144 | ||||||||||||||
UD_English-LinES | 4 | 95 | ||||||||||||||
UD_Estonian | 82 | |||||||||||||||
UD_Finnish | 4 | 6 | 22 | 42 | 808 | |||||||||||
UD_Finnish-FTB | 2 | |||||||||||||||
UD_French | 1 | 2 | 1 | 216 | ||||||||||||
UD_Galician | 4 | 98 | 4 | 140 | 2 | 2 | 1 | 1587 | ||||||||
UD_Galician-TreeGal | 2 | 34 | ||||||||||||||
UD_German | 1 | 108 | ||||||||||||||
UD_Gothic | 1 | 1 | 86 | |||||||||||||
UD_Greek | 2 | 1 | 1 | 95 | ||||||||||||
UD_Hebrew | 1 | 27 | ||||||||||||||
UD_Hindi | 4 | |||||||||||||||
UD_Hungarian | 16 | |||||||||||||||
UD_Indonesian | 2 | 1 | 2 | 1 | 1 | 147 | ||||||||||
UD_Irish | 1 | 9 | 48 | |||||||||||||
UD_Italian | 115 | |||||||||||||||
UD_Japanese | 1 | |||||||||||||||
UD_Japanese-KTC | 141 | |||||||||||||||
UD_Kazakh | 6 | |||||||||||||||
UD_Latin | 1 | 2 | 1 | 1 | 2 | 77 | ||||||||||
UD_Latin-ITTB | 5 | 2 | 4 | 1 | 1 | 313 | ||||||||||
UD_Latin-PROIEL | 1 | 3 | 195 | 2 | ||||||||||||
UD_Latvian | 51 | |||||||||||||||
UD_Old_Church_Slavonic | 1 | 67 | ||||||||||||||
UD_Persian | 11 | 3 | 1 | 13 | 530 | |||||||||||
UD_Polish | 1 | 10 | 154 | |||||||||||||
UD_Portuguese | 3 | 1 | 8 | 2 | 610 | |||||||||||
UD_Portuguese-BR | 46 | |||||||||||||||
UD_Portuguese-Bosque | 1 | 5 | 72 | |||||||||||||
UD_Romanian | 1 | 1 | 64 | |||||||||||||
UD_Russian | 23 | |||||||||||||||
UD_Russian-SynTagRus | 37 | |||||||||||||||
UD_Slovak | 97 | |||||||||||||||
UD_Slovenian | 1 | |||||||||||||||
UD_Slovenian-SST | 22 | |||||||||||||||
UD_Spanish | 1 | 2 | 1 | 162 | ||||||||||||
UD_Spanish-AnCora | 77 | 11 | 44 | 13 | 33 | 1 | 16 | 3 | 1 | 4732 | ||||||
UD_Swedish-LinES | 2 | 63 | ||||||||||||||
UD_Tamil | 7 | 1 | 1 | 1 | 25 | |||||||||||
UD_Turkish | 4 | 4 | 142 | |||||||||||||
UD_Vietnamese | 12 | 2 | 519 |
Case not dependent on nmod
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. If a word is attached as case, its head should usually be attached as a nominal modifier (nmod). Note, however, that several legitimate exceptions occur, in particular through coordination (where the head is labeled conj instead of nmod) and in nominal clauses (where the head is the main predicate of a clause). In some languages, case can also modify objects and other nominals.
Search expression: _ <case (_ (<!nmod&<!nmod:agent&<!conj) _)
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 5 | 5200 | 39 | 30 | 7 | 4 | |||||||||||
UD_Ancient_Greek-PROIEL | 7206 | ||||||||||||||||
UD_Arabic | 13234 | 22 | 24 | 4 | 5 | 360 | |||||||||||
UD_Basque | 253 | 3 | 1 | ||||||||||||||
UD_Bulgarian | 3388 | 5 | |||||||||||||||
UD_Buryat | 18 | ||||||||||||||||
UD_Catalan | 17630 | 31 | 10 | 19 | 439 | 11 | |||||||||||
UD_Chinese | 785 | ||||||||||||||||
UD_Coptic | 121 | 1 | 21 | ||||||||||||||
UD_Croatian | 2 | 1006 | 8 | 8 | 3 | 2 | 1 | 28 | 1 | 7 | |||||||
UD_Czech | 1 | 27254 | 37 | 1 | 40 | 2 | 2 | 11 | 6 | ||||||||
UD_Czech-CAC | 8960 | 10 | 1 | 15 | 14 | 7 | 8 | 2 | |||||||||
UD_Czech-CLTT | 605 | 1 | 1 | ||||||||||||||
UD_Danish | 739 | 2 | |||||||||||||||
UD_Dutch | 3 | 4543 | 2 | 75 | 14 | 1 | 1 | 142 | 252 | 2 | |||||||
UD_Dutch-LassySmall | 174 | 1 | |||||||||||||||
UD_English | 364 | 7 | 1 | 2 | 774 | 1 | 2 | ||||||||||
UD_English-ESL | 4 | 275 | 2 | 2 | 161 | 6 | 3 | 3 | |||||||||
UD_English-LinES | 4 | 760 | 3 | 310 | 1 | ||||||||||||
UD_Estonian | 180 | ||||||||||||||||
UD_Faroese | 381 | ||||||||||||||||
UD_Finnish | 31 | 1 | |||||||||||||||
UD_Finnish-FTB | 124 | ||||||||||||||||
UD_French | 3 | 630 | 4 | 12 | 12 | ||||||||||||
UD_Galician | 6 | 11101 | 1030 | 1 | 50 | 5 | 5 | 88 | 3 | 1 | 39 | 1 | 191 | ||||
UD_Galician-TreeGal | 218 | 8 | 5 | ||||||||||||||
UD_German | 515 | 16 | 1 | 1 | 1 | ||||||||||||
UD_Gothic | 2336 | ||||||||||||||||
UD_Greek | 1233 | 47 | |||||||||||||||
UD_Hebrew | 4965 | 13 | 45 | 2 | 4 | 23 | 10 | 7 | |||||||||
UD_Hindi | 158 | 15642 | 10 | 7 | 5 | 323 | 316 | ||||||||||
UD_Hungarian | 79 | 1 | |||||||||||||||
UD_Indonesian | 1391 | ||||||||||||||||
UD_Irish | 642 | 2 | 1 | 1 | |||||||||||||
UD_Italian | 628 | 53 | |||||||||||||||
UD_Japanese-KTC | 30501 | 22 | 979 | 2 | 3 | ||||||||||||
UD_Kazakh | 23 | ||||||||||||||||
UD_Latin | 182 | 4 | 6 | ||||||||||||||
UD_Latin-ITTB | 7270 | 1 | |||||||||||||||
UD_Latin-PROIEL | 6531 | ||||||||||||||||
UD_Latvian | 341 | ||||||||||||||||
UD_Norwegian | 1393 | ||||||||||||||||
UD_Old_Church_Slavonic | 2550 | ||||||||||||||||
UD_Persian | 887 | 4 | 2 | 2533 | |||||||||||||
UD_Polish | 62 | 3236 | 26 | 5 | 131 | 10 | 16 | 35 | 4 | 94 | 50 | ||||||
UD_Portuguese | 3 | 7029 | 34 | 2 | 23 | 1 | 1 | 6 | 7 | ||||||||
UD_Portuguese-BR | 1 | 859 | 7 | 3 | 5 | 19 | 8 | ||||||||||
UD_Portuguese-Bosque | 1168 | ||||||||||||||||
UD_Romanian | 1 | 4051 | 45 | 1 | 8 | 2 | 2 | 3 | 21 | 1 | |||||||
UD_Russian | 1 | 149 | 1 | 1 | 2 | 1 | |||||||||||
UD_Russian-SynTagRus | 7209 | 1 | |||||||||||||||
UD_Slovak | 1354 | 1 | 28 | ||||||||||||||
UD_Slovenian | 434 | 19 | |||||||||||||||
UD_Slovenian-SST | 129 | 1 | 10 | 1 | |||||||||||||
UD_Spanish | 1 | 4150 | 43 | 70 | 1 | 4 | 4 | 2 | 3 | ||||||||
UD_Spanish-AnCora | 6 | 17004 | 18 | 4 | 9 | 25 | 14 | 16 | |||||||||
UD_Swedish | 211 | 2 | 2 | 6 | |||||||||||||
UD_Swedish-LinES | 891 | ||||||||||||||||
UD_Tamil | 16 | 1 | |||||||||||||||
UD_Turkish | 841 | ||||||||||||||||
UD_Ukrainian | 7 | ||||||||||||||||
UD_Uyghur | 9 | 3 | 4 | 7 | 1 | ||||||||||||
UD_Vietnamese | 443 |
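The case test looks one level up the tree: from the case dependent to its head, and then at the relation by which that head is attached. A hedged Python sketch follows, with hypothetical `(id, head, deprel)` tuples, invented example fragments, and an invented function name. It only reports heads that are present in the token list, and it treats every relation outside the allowed set (including root) as a hit, which is one reading of the search expression.

```python
ALLOWED_HEAD_RELS = {"nmod", "nmod:agent", "conj"}


def case_outside_nmod(tokens, allowed=ALLOWED_HEAD_RELS):
    """Return (case_token_id, head_id) pairs where the head of a case
    dependent is attached via a relation not listed in `allowed`."""
    deprel_of = {tid: deprel for tid, _head, deprel in tokens}
    return [(tid, head) for tid, head, deprel in tokens
            if deprel == "case"
            and head in deprel_of
            and deprel_of[head] not in allowed]


# Invented fragments: token 1 is an adposition, token 2 its nominal head.
as_nmod = [(1, 2, "case"), (2, 3, "nmod"), (3, 0, "root")]
as_dobj = [(1, 2, "case"), (2, 3, "dobj"), (3, 0, "root")]

print(case_outside_nmod(as_nmod))  # []
print(case_outside_nmod(as_dobj))  # [(1, 2)]
```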
NOUN and case
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. A word tagged NOUN should not be attached as a case dependent.
Search expression: NOUN <case _
NOUN | |
---|---|
UD_Ancient_Greek | 15 |
UD_Arabic | 100 |
UD_Basque | 10 |
UD_Catalan | 340 |
UD_Croatian | 19 |
UD_Czech | 645 |
UD_Czech-CAC | 194 |
UD_Czech-CLTT | 5 |
UD_Dutch | 19 |
UD_English | 5 |
UD_English-ESL | 1 |
UD_Finnish | 10 |
UD_French | 75 |
UD_Galician | 101 |
UD_German | 4 |
UD_Greek | 1 |
UD_Hebrew | 4 |
UD_Hindi | 10 |
UD_Hungarian | 2 |
UD_Irish | 1 |
UD_Italian | 39 |
UD_Japanese-KTC | 166 |
UD_Persian | 8 |
UD_Polish | 194 |
UD_Portuguese | 151 |
UD_Portuguese-BR | 36 |
UD_Romanian | 58 |
UD_Russian | 2 |
UD_Slovak | 6 |
UD_Spanish | 91 |
UD_Spanish-AnCora | 326 |
UD_Swedish | 3 |
UD_Uyghur | 11 |
UD_Vietnamese | 1 |
ADP is not leaf
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. By default, an adposition is a leaf node attached to a nominal as case. Exceptions where adpositions are not leaves involve technical relations such as mwe and conj.
Search expression: ADP >!conj&>!cc&>!punct&>!mwe&>!foreign _
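A possible reading of this expression is: find every ADP that has at least one dependent attached by a relation other than conj, cc, punct, mwe, or foreign. The Python sketch below follows that reading. Tokens are hypothetical `(id, upos, head, deprel)` tuples, the example trees are invented, and `non_leaf_adpositions` is not a function of any real tool.

```python
TOLERATED = {"conj", "cc", "punct", "mwe", "foreign"}


def non_leaf_adpositions(tokens, tolerated=TOLERATED):
    """Return (adposition_id, dependent_id) pairs where an ADP governs a
    dependent through a relation that is not in `tolerated`."""
    upos_of = {tid: upos for tid, upos, _head, _deprel in tokens}
    return [(head, tid) for tid, _upos, head, deprel in tokens
            if upos_of.get(head) == "ADP" and deprel not in tolerated]


# Invented trees for a preposition (1), determiner (2) and noun (3).
leaf = [(1, "ADP", 3, "case"), (2, "DET", 3, "det"), (3, "NOUN", 0, "root")]
non_leaf = [(1, "ADP", 0, "root"), (2, "DET", 3, "det"), (3, "NOUN", 1, "nmod")]

print(non_leaf_adpositions(leaf))      # []
print(non_leaf_adpositions(non_leaf))  # [(1, 3)]
```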
Appos chain
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. Apposition dependencies should not be chained. Multiple appositions should all be attached to the head, or they should be coordinated (with the first apposition as the head). Legitimate exceptions do occur, however, in the rare case when an apposition has an apposition of its own, which does not apply to the head of the first apposition.
Search expression: _ <appos (_ <appos _)
Correct example:
Sam , my brother , John 's cousin , arrived
appos(Sam, brother)
appos(Sam, cousin)
Incorrect example:
Sam , my brother , John 's cousin , arrived
appos(Sam, brother)
appos(brother, cousin)
ADJ | ADP | ADV | AUX | CONJ | DET | NOUN | NUM | PART | PRON | PROPN | PUNCT | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek-PROIEL | 12 | 40 | 1 | 2 | 7 | 9 | |||||||||
UD_Arabic | 3 | 1 | |||||||||||||
UD_Basque | 1 | 1 | 1 | ||||||||||||
UD_Catalan | 1 | 219 | 68 | 4 | 205 | 1 | |||||||||
UD_Chinese | 12 | 2 | 15 | ||||||||||||
UD_Croatian | 15 | 38 | 1 | ||||||||||||
UD_Czech | 3 | 113 | 1 | 1 | 18 | 6 | |||||||||
UD_Czech-CAC | 2 | 27 | 1 | ||||||||||||
UD_Danish | 1 | 3 | 1 | ||||||||||||
UD_Dutch | 5 | 4 | 45 | 2 | 1 | ||||||||||
UD_Dutch-LassySmall | 1 | 8 | 14 | 19 | 1 | ||||||||||
UD_English | 2 | 1 | 26 | 7 | 1 | 70 | 1 | 9 | |||||||
UD_English-LinES | 1 | 5 | 2 | 1 | |||||||||||
UD_Finnish | 24 | 3 | 29 | 3 | 6 | ||||||||||
UD_French | 2 | 1 | 135 | 4 | 7 | 198 | 5 | 16 | |||||||
UD_German | 10 | 2 | 76 | 93 | 1 | 519 | 11 | ||||||||
UD_Gothic | 2 | 9 | 4 | ||||||||||||
UD_Hebrew | 2 | 11 | 2 | 65 | 1 | ||||||||||
UD_Hungarian | 4 | 2 | |||||||||||||
UD_Indonesian | 1 | 26 | 3 | 534 | 4 | 14 | |||||||||
UD_Japanese | 33 | 2 | |||||||||||||
UD_Japanese-KTC | 3 | 1 | |||||||||||||
UD_Kazakh | 1 | ||||||||||||||
UD_Latin-ITTB | 3 | 1 | 1 | ||||||||||||
UD_Latin-PROIEL | 7 | 30 | 1 | 4 | |||||||||||
UD_Norwegian | 1 | 2 | 7 | 1 | 4 | ||||||||||
UD_Old_Church_Slavonic | 1 | 2 | 7 | 1 | |||||||||||
UD_Persian | 3 | ||||||||||||||
UD_Polish | 1 | 27 | |||||||||||||
UD_Portuguese | 1 | ||||||||||||||
UD_Portuguese-BR | 91 | 126 | 2 | 3 | 238 | 4 | |||||||||
UD_Portuguese-Bosque | 2 | 125 | 11 | 2 | 180 | ||||||||||
UD_Romanian | 1 | 1 | 2 | 1 | |||||||||||
UD_Russian | 15 | 2 | 6 | 61 | 2 | 1 | 93 | 5 | |||||||
UD_Russian-SynTagRus | 3 | 140 | 17 | 355 | 30 | ||||||||||
UD_Slovak | 4 | 1 | |||||||||||||
UD_Spanish | 6 | 1 | 1 | 127 | 75 | 6 | 384 | 8 | 2 | 53 | |||||
UD_Spanish-AnCora | 3 | 145 | 32 | 7 | 202 | 1 | |||||||||
UD_Swedish | 7 | ||||||||||||||
UD_Swedish-LinES | 1 | 1 | 2 | ||||||||||||
UD_Ukrainian | 1 | ||||||||||||||
UD_Vietnamese | 1 |
Relation advmod used for node that is not ADV
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. Advmod is intended only for adverbs. Modifiers that are called adverbial in some traditional grammars, but are in fact prepositional or noun phrases, should be attached as nmod.
Search expression: (!ADV) <advmod _
Heads of an advmod are nominal
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. The head of an adverbial modifier should normally not be nominal (noun, proper noun, numeral, or pronoun). Exceptions occur in nominal clauses, where a nominal is the main predicate and can therefore take clause adverbials. Example: This is probably an exception.
Search expression: _ <advmod (NOUN|PROPN|NUM|PRON)
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 831 | 9 | 3033 | 211 | 26 | 3 | 117 | 2 | 45 | 3 | 475 | ||||||
UD_Ancient_Greek-PROIEL | 156 | 4 | 659 | 1 | |||||||||||||
UD_Arabic | 164 | 1 | 77 | 13 | 110 | 3 | 1 | 220 | |||||||||
UD_Basque | 1 | 19 | 32 | 1 | |||||||||||||
UD_Bulgarian | 1012 | ||||||||||||||||
UD_Buryat | 59 | ||||||||||||||||
UD_Catalan | 39 | 172 | 1796 | 34 | 24 | 21 | 4 | 7 | 15 | 3 | 5 | 1 | |||||
UD_Chinese | 3 | 241 | 8 | 58 | 4 | 1 | 2 | ||||||||||
UD_Coptic | 2 | 36 | 1 | 2 | 6 | 2 | 1 | ||||||||||
UD_Croatian | 12 | 21 | 877 | 2 | 9 | 4 | 5 | 1 | 2 | ||||||||
UD_Czech | 51 | 1 | 2825 | 21 | 13 | 44 | 71 | 61 | 1 | 14 | 1 | 2 | |||||
UD_Czech-CAC | 15 | 2 | 918 | 3 | 9 | 3 | 35 | 28 | 27 | 13 | |||||||
UD_Czech-CLTT | 45 | 2 | 4 | ||||||||||||||
UD_Danish | 71 | 828 | 1 | 39 | 29 | ||||||||||||
UD_Dutch | 281 | 18 | 1350 | 3 | 1 | 1 | 113 | 21 | 39 | 2 | 31 | 60 | |||||
UD_Dutch-LassySmall | 2 | 2 | 681 | 4 | 1 | 344 | |||||||||||
UD_English | 47 | 24 | 1484 | 1 | 8 | 9 | 8 | 1 | 1 | 13 | 3 | 30 | |||||
UD_English-ESL | 8 | 548 | 1 | 1 | 1 | 2 | |||||||||||
UD_English-LinES | 14 | 12 | 507 | 8 | 2 | 10 | 3 | 2 | 7 | ||||||||
UD_Estonian | 3739 | ||||||||||||||||
UD_Faroese | 1 | ||||||||||||||||
UD_Finnish | 14 | 1 | 3207 | 6 | 5 | 14 | 13 | 2 | 1 | ||||||||
UD_Finnish-FTB | 1039 | 5 | 3 | 1322 | 9 | 2 | 5 | ||||||||||
UD_French | 8 | 108 | 1750 | 2 | 9 | 1 | 7 | 9 | 5 | 3 | 4 | 2 | |||||
UD_Galician | 195 | 9 | |||||||||||||||
UD_Galician-TreeGal | 1 | 1 | 179 | 1 | 2 | 2 | |||||||||||
UD_German | 425 | 22 | 3085 | 4 | 3 | 5 | 16 | 32 | 24 | ||||||||
UD_Gothic | 9 | 255 | |||||||||||||||
UD_Greek | 5 | 268 | 2 | 4 | 1 | 7 | |||||||||||
UD_Hebrew | 8 | 12 | 365 | 23 | 2 | 42 | 5 | 51 | 5 | 1 | 9 | 4 | |||||
UD_Hindi | 13 | 58 | 1 | 21 | 5 | 29 | 3 | ||||||||||
UD_Hungarian | 1 | ||||||||||||||||
UD_Indonesian | 6 | 4 | 635 | 1 | 2 | 12 | 6 | 1 | 14 | 7 | 14 | ||||||
UD_Irish | 38 | 4 | 134 | 3 | 39 | 1 | 7 | 1 | |||||||||
UD_Italian | 2 | 1912 | 8 | 1 | 2 | ||||||||||||
UD_Japanese | 37 | 523 | 1 | 135 | 12 | 3 | |||||||||||
UD_Japanese-KTC | 4 | 299 | 2 | 2 | 1 | ||||||||||||
UD_Kazakh | 1 | 11 | 1 | 30 | |||||||||||||
UD_Latin | 7 | 313 | 234 | 13 | 4 | 1 | 1 | 7 | |||||||||
UD_Latin-ITTB | 418 | 1 | 1209 | 17 | 16 | 5 | |||||||||||
UD_Latin-PROIEL | 212 | 4 | 536 | 1 | 2 | ||||||||||||
UD_Latvian | 91 | ||||||||||||||||
UD_Norwegian | 595 | 1835 | |||||||||||||||
UD_Old_Church_Slavonic | 10 | 258 | |||||||||||||||
UD_Persian | 77 | 3 | 527 | 1 | 105 | 2 | 6 | 209 | |||||||||
UD_Polish | 34 | 7 | 46 | 16 | 6 | 52 | 10 | ||||||||||
UD_Portuguese | 16 | 135 | 1044 | 8 | 15 | 38 | 32 | 24 | 4 | 3 | 1 | 2 | |||||
UD_Portuguese-BR | 15 | 1651 | 1 | 8 | 1 | 36 | 1 | ||||||||||
UD_Portuguese-Bosque | 7 | 1443 | 26 | 1 | 3 | 2 | |||||||||||
UD_Romanian | 17 | 207 | 1324 | 1 | 303 | 62 | 13 | 3 | 16 | 2 | |||||||
UD_Russian | 1 | 2 | 238 | 185 | 6 | 16 | 4 | 4 | 1 | 1 | 2 | ||||||
UD_Russian-SynTagRus | 104 | 5070 | 1 | 693 | 5 | 6084 | 21 | 27 | 6 | 32 | 1 | ||||||
UD_Sanskrit | 1 | 40 | |||||||||||||||
UD_Slovak | 78 | 6 | 5 | 52 | |||||||||||||
UD_Slovenian | 523 | 65 | 254 | 1 | 7 | ||||||||||||
UD_Slovenian-SST | 4 | 1 | 337 | 83 | 1 | 175 | 3 | ||||||||||
UD_Spanish | 10 | 77 | 1893 | 2 | 82 | 3 | 2 | 4 | 23 | 2 | 3 | ||||||
UD_Spanish-AnCora | 32 | 339 | 2002 | 66 | 5 | 48 | 12 | 6 | 14 | 2 | 2 | ||||||
UD_Swedish | 15 | 57 | 895 | 111 | 1 | 2 | 1 | ||||||||||
UD_Swedish-LinES | 6 | 13 | 609 | 13 | 2 | 1 | 10 | 7 | 7 | ||||||||
UD_Swedish_Sign_Language | 1 | 1 | |||||||||||||||
UD_Tamil | 3 | 26 | 4 | 1 | |||||||||||||
UD_Turkish | 249 | ||||||||||||||||
UD_Ukrainian | 14 | ||||||||||||||||
UD_Uyghur | 11 | 1 | 9 | 1 | 8 | 3 | 1 | 3 | 3 | ||||||||
UD_Vietnamese | 52 | 6 | 1 | 262 |
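The advmod check above can be approximated outside the search tool. The following is a minimal sketch, assuming the treebank is available as standard 10-column CoNLL-U text; the function names and the hand-built sample analysis are hypothetical and only illustrate the idea of looking up the POS tag of each advmod dependent's head.

```python
# Hypothetical helper names; only the 10-column CoNLL-U layout is standard.
NOMINAL_TAGS = {"NOUN", "PROPN", "NUM", "PRON"}


def read_sentences(conllu_text):
    """Yield each sentence as a list of (id, form, upos, head, deprel) tuples."""
    sentence = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
            sentence = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
            if cols[0].isdigit():
                sentence.append((int(cols[0]), cols[1], cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence


def advmod_under_nominal(sentence):
    """Return (form, upos) of advmod dependents whose head is tagged as a nominal."""
    upos_of = {tok_id: upos for tok_id, _, upos, _, _ in sentence}
    return [
        (form, upos)
        for _, form, upos, head, deprel in sentence
        if deprel == "advmod" and upos_of.get(head) in NOMINAL_TAGS
    ]


# Hand-built analysis of the example sentence from the test description.
ROWS = [
    "1 This this PRON _ _ 5 nsubj _ _",
    "2 is be VERB _ _ 5 cop _ _",
    "3 probably probably ADV _ _ 5 advmod _ _",
    "4 an a DET _ _ 5 det _ _",
    "5 exception exception NOUN _ _ 0 root _ _",
    "6 . . PUNCT _ _ 5 punct _ _",
]
SAMPLE = "\n".join(row.replace(" ", "\t") for row in ROWS) + "\n"

for sent in read_sentences(SAMPLE):
    print(advmod_under_nominal(sent))
```

In this sample, "probably" is reported because its head "exception" is a noun acting as the main predicate, which is exactly the kind of legitimate hit the test description warns about.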
Acl not dependent on NOUN/PROPN/PRON
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. Clausal modifiers of nouns should depend on nominals only (NOUN, PROPN, or PRON); those in the following table depend on other parts of speech.
Search expression: !PRON&!NOUN&!PROPN >acl _
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NUM | PART | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 210 | 1 | 36 | 5 | 10 | 3 | 1 | 41 | 1 | ||||||
UD_Ancient_Greek-PROIEL | 386 | 4 | 14 | 1 | 20 | 26 | |||||||||
UD_Arabic | 94 | 6 | 190 | 1 | 101 | 1 | 30 | 288 | |||||||
UD_Basque | 35 | 26 | 6 | 27 | 20 | 8 | |||||||||
UD_Bulgarian | 22 | 2 | 1 | 117 | 7 | 1 | 1 | ||||||||
UD_Buryat | 1 | ||||||||||||||
UD_Catalan | 149 | 7 | 47 | 14 | 14 | 22 | 1 | 10 | 1 | ||||||
UD_Chinese | 24 | 24 | 261 | 2 | 2759 | 13 | |||||||||
UD_Coptic | 57 | 1 | 2 | ||||||||||||
UD_Croatian | 37 | 1 | 10 | 1 | 10 | 1 | 12 | 2 | |||||||
UD_Czech | 151 | 2 | 11 | 2 | 89 | 47 | 1 | 21 | |||||||
UD_Czech-CAC | 58 | 4 | 30 | 11 | 1 | 7 | 24 | 25 | |||||||
UD_Czech-CLTT | 1 | 1 | |||||||||||||
UD_Dutch | 2 | 1 | 3 | 1 | 1 | 2 | 1 | 6 | |||||||
UD_Dutch-LassySmall | 38 | 4 | 5 | 21 | 14 | 1 | 2 | 17 | 8 | ||||||
UD_English | 16 | 1 | 1 | 4 | 1 | 2 | 8 | ||||||||
UD_English-ESL | 5 | 2 | 1 | 2 | |||||||||||
UD_English-LinES | 117 | 5 | 48 | 1 | 2 | 1 | 7 | ||||||||
UD_Estonian | 3 | ||||||||||||||
UD_Finnish | 23 | 1 | 2 | 2 | 2 | 19 | 1 | ||||||||
UD_Finnish-FTB | 337 | 125 | 2 | 13 | 3 | 35 | 5 | ||||||||
UD_French | 212 | 3 | 9 | 1 | 2 | 21 | 1 | 3 | 1734 | 5 | |||||
UD_Galician-TreeGal | 8 | 2 | 1 | 11 | |||||||||||
UD_German | 27 | 19 | 6 | 7 | 1 | 6 | 233 | 1 | |||||||
UD_Gothic | 128 | 2 | 16 | 4 | 8 | ||||||||||
UD_Greek | 27 | 5 | 1 | 3 | 1 | ||||||||||
UD_Hebrew | 2 | 2 | 5 | 1 | |||||||||||
UD_Hindi | 16 | 2 | 2 | 4 | 545 | ||||||||||
UD_Hungarian | 6 | 1 | 2 | 28 | |||||||||||
UD_Indonesian | 25 | 3 | 7 | 1 | 4 | 2 | 1 | 294 | |||||||
UD_Italian | 6 | 1 | |||||||||||||
UD_Japanese-KTC | 1 | 8 | 5 | 18 | |||||||||||
UD_Latin | 30 | 1 | 3 | 1 | 43 | ||||||||||
UD_Latin-ITTB | 295 | 1 | 12 | 7 | 52 | 5 | 251 | 4 | |||||||
UD_Latin-PROIEL | 443 | 49 | 38 | 10 | 4 | ||||||||||
UD_Latvian | 8 | 9 | 9 | 3 | 2 | 8 | 58 | 5 | |||||||
UD_Norwegian | 53 | 2 | 8 | 5 | 1 | ||||||||||
UD_Old_Church_Slavonic | 107 | 1 | 9 | 4 | |||||||||||
UD_Polish | 1 | 21 | 1 | ||||||||||||
UD_Portuguese | 103 | 3 | 15 | 1 | 66 | 14 | 12 | 8 | |||||||
UD_Portuguese-Bosque | 40 | 7 | 13 | 4 | 4 | 24 | |||||||||
UD_Romanian | 15 | 1 | 5 | 2 | 1 | 22 | 17 | 1 | |||||||
UD_Russian | 15 | 1 | 1 | 29 | |||||||||||
UD_Russian-SynTagRus | 45 | 70 | 39 | 2 | 1 | 29 | 1 | 858 | |||||||
UD_Slovak | 2 | 3 | 5 | 6 | |||||||||||
UD_Slovenian | 14 | 1 | 10 | 2 | |||||||||||
UD_Slovenian-SST | 7 | 16 | 4 | 5 | 1 | ||||||||||
UD_Spanish | 70 | 9 | 3 | 3 | 8 | 11 | 9 | 1 | 2658 | 11 | |||||
UD_Spanish-AnCora | 324 | 4 | 43 | 12 | 18 | 1 | 13 | 1 | 2 | ||||||
UD_Swedish | 7 | 5 | 3 | 2 | 4 | ||||||||||
UD_Swedish-LinES | 103 | 4 | 49 | 1 | 15 | ||||||||||
UD_Swedish_Sign_Language | 1 | ||||||||||||||
UD_Tamil | 1 | 8 | |||||||||||||
UD_Turkish | 133 | 4 | 9 | 2 | 1 | 1 | 2 | 301 | |||||||
UD_Ukrainian | 3 | ||||||||||||||
UD_Uyghur | 2 | 1 | 3 |
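A per-POS tally like one row of the acl table above could be produced along the following lines. This is a rough sketch under stated assumptions: the input is 10-column CoNLL-U text, each governing token is counted once even if it has several acl dependents, and all names and the two toy sentences are hypothetical. The actual counting rules of the search tool may differ.

```python
from collections import Counter

NOMINAL_HEADS = {"NOUN", "PROPN", "PRON"}


def read_sentences(conllu_text):
    """Yield each sentence as a list of dicts with id, upos, head and deprel."""
    sentence = []
    # The extra empty string flushes the last sentence if the text lacks a final blank line.
    for line in conllu_text.splitlines() + [""]:
        cols = line.split("\t")
        if not line.strip():
            if sentence:
                yield sentence
            sentence = []
        elif not line.startswith("#") and cols[0].isdigit():
            sentence.append(
                {"id": int(cols[0]), "upos": cols[3], "head": int(cols[6]), "deprel": cols[7]}
            )


def non_nominal_acl_heads(sentences):
    """Count, per POS tag, the tokens that govern an acl dependent but are not nominal."""
    counts = Counter()
    for sentence in sentences:
        upos_of = {tok["id"]: tok["upos"] for tok in sentence}
        # A set, so a head with several acl dependents is counted once (an assumption).
        head_ids = {tok["head"] for tok in sentence if tok["deprel"] == "acl"}
        for head_id in head_ids:
            head_pos = upos_of.get(head_id)
            if head_pos is not None and head_pos not in NOMINAL_HEADS:
                counts[head_pos] += 1
    return counts


# Two toy sentences: "the first to arrive" (ADJ head) and "a book to read" (NOUN head).
ROWS = [
    "1 the the DET _ _ 2 det _ _",
    "2 first first ADJ _ _ 0 root _ _",
    "3 to to PART _ _ 4 mark _ _",
    "4 arrive arrive VERB _ _ 2 acl _ _",
    "",
    "1 a a DET _ _ 2 det _ _",
    "2 book book NOUN _ _ 0 root _ _",
    "3 to to PART _ _ 4 mark _ _",
    "4 read read VERB _ _ 2 acl _ _",
]
SAMPLE = "\n".join(row.replace(" ", "\t") for row in ROWS)

print(dict(non_nominal_acl_heads(read_sentences(SAMPLE))))
```

Only the adjective "first" is tallied; the noun-headed acl in the second sentence is the expected configuration and is ignored.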
Marked as NUM but not nummod, nmod or compound
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. If a word is tagged as a numeral (POS), its dependency relation should be nummod, compound, or nmod. Exceptions occur when the numeral is promoted to a higher function through ellipsis. Example: Take five!
Search expression: NUM (<!nummod&<!nmod&<!compound) _
NUM | |
---|---|
UD_Ancient_Greek | 275 |
UD_Ancient_Greek-PROIEL | 453 |
UD_Arabic | 3913 |
UD_Basque | 1829 |
UD_Bulgarian | 234 |
UD_Buryat | 11 |
UD_Catalan | 1750 |
UD_Chinese | 324 |
UD_Coptic | 4 |
UD_Croatian | 389 |
UD_Czech | 17703 |
UD_Czech-CAC | 2684 |
UD_Czech-CLTT | 93 |
UD_Danish | 98 |
UD_Dutch | 1249 |
UD_Dutch-LassySmall | 974 |
UD_English | 800 |
UD_English-ESL | 119 |
UD_English-LinES | 112 |
UD_Estonian | 302 |
UD_Faroese | 174 |
UD_Finnish | 405 |
UD_Finnish-FTB | 173 |
UD_French | 615 |
UD_Galician | 885 |
UD_Galician-TreeGal | 27 |
UD_German | 1026 |
UD_Gothic | 177 |
UD_Greek | 154 |
UD_Hebrew | 966 |
UD_Hindi | 871 |
UD_Hungarian | 592 |
UD_Indonesian | 259 |
UD_Irish | 24 |
UD_Italian | 381 |
UD_Japanese | 1349 |
UD_Japanese-KTC | 128 |
UD_Kazakh | 134 |
UD_Latin | 53 |
UD_Latin-ITTB | 885 |
UD_Latin-PROIEL | 396 |
UD_Latvian | 48 |
UD_Norwegian | 475 |
UD_Old_Church_Slavonic | 328 |
UD_Persian | 337 |
UD_Polish | 640 |
UD_Portuguese | 679 |
UD_Portuguese-BR | 1065 |
UD_Portuguese-Bosque | 537 |
UD_Romanian | 1797 |
UD_Russian | 1280 |
UD_Russian-SynTagRus | 4459 |
UD_Slovak | 543 |
UD_Slovenian | 223 |
UD_Slovenian-SST | 107 |
UD_Spanish | 1127 |
UD_Spanish-AnCora | 1757 |
UD_Swedish | 101 |
UD_Swedish-LinES | 88 |
UD_Swedish_Sign_Language | 3 |
UD_Tamil | 31 |
UD_Turkish | 1152 |
UD_Ukrainian | 19 |
UD_Uyghur | 78 |
UD_Vietnamese | 87 |
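The NUM test above, like the AUX test later in this report, asks whether tokens with a given POS tag carry one of the expected relations. A minimal sketch of that kind of check follows, assuming 10-column CoNLL-U input and exact matching of relation labels (so a subtyped label would count as a hit, which may not match the search tool). Function names and sample rows are hypothetical.

```python
def read_tokens(conllu_text):
    """Yield (form, upos, deprel) for every regular token line."""
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comment lines, blank lines, ranges and empty nodes fail this test.
        if len(cols) == 10 and cols[0].isdigit():
            yield cols[1], cols[3], cols[7]


def pos_with_unexpected_relation(tokens, pos, expected_relations):
    """Return (form, deprel) for tokens tagged `pos` whose relation is not expected."""
    return [
        (form, deprel)
        for form, upos, deprel in tokens
        if upos == pos and deprel not in expected_relations
    ]


# "Take five !" (the exception from the description) and "two cats" (the regular case).
ROWS = [
    "1 Take take VERB _ _ 0 root _ _",
    "2 five five NUM _ _ 1 dobj _ _",
    "3 ! ! PUNCT _ _ 1 punct _ _",
    "",
    "1 two two NUM _ _ 2 nummod _ _",
    "2 cats cat NOUN _ _ 0 root _ _",
]
SAMPLE = "\n".join(row.replace(" ", "\t") for row in ROWS)

hits = pos_with_unexpected_relation(
    read_tokens(SAMPLE), "NUM", {"nummod", "nmod", "compound"}
)
print(hits)
```

Calling the same function with `"AUX"` and `{"aux", "auxpass"}` would give a comparable sketch of the AUX test.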
Marked as nummod but not NUM
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. If a word is attached as a numeric modifier (nummod), it should be tagged as a numeral (NUM).
Search expression: !NUM <nummod _
ADJ | ADP | ADV | CONJ | DET | NOUN | PRON | PROPN | PUNCT | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Arabic | 1 | 5 | ||||||||||
UD_Bulgarian | 1 | 1 | ||||||||||
UD_Catalan | 5 | 2 | 1 | |||||||||
UD_Chinese | 5 | 7 | 6 | 5 | ||||||||
UD_Croatian | 183 | 6 | 115 | 9 | 1 | 1 | ||||||
UD_Danish | 13 | 27 | 2 | |||||||||
UD_Dutch-LassySmall | 12 | |||||||||||
UD_English | 5 | 6 | 8 | 52 | 2 | 2 | 121 | |||||
UD_English-ESL | 3 | |||||||||||
UD_English-LinES | 2 | |||||||||||
UD_Faroese | 1 | |||||||||||
UD_Finnish | 549 | 32 | 2 | 1 | 9 | 3 | ||||||
UD_Finnish-FTB | 5 | |||||||||||
UD_French | 1 | 9 | 44 | 1 | 2 | |||||||
UD_Galician | 1 | |||||||||||
UD_Galician-TreeGal | 2 | 1 | 1 | |||||||||
UD_German | 1 | 2 | 85 | 54 | 3 | |||||||
UD_Hebrew | 7 | |||||||||||
UD_Hindi | 16 | |||||||||||
UD_Indonesian | 4 | 47 | 28 | 1 | 89 | 38 | ||||||
UD_Irish | 10 | 60 | ||||||||||
UD_Italian | 1 | 15 | ||||||||||
UD_Japanese | 6 | |||||||||||
UD_Japanese-KTC | 14 | |||||||||||
UD_Kazakh | 6 | |||||||||||
UD_Norwegian | 150 | |||||||||||
UD_Persian | 79 | 25 | 1 | |||||||||
UD_Portuguese | 1 | 1 | 1 | |||||||||
UD_Portuguese-BR | 22 | 8 | 16 | |||||||||
UD_Romanian | 32 | 8 | 4 | 14 | ||||||||
UD_Russian | 6 | 3 | 1 | |||||||||
UD_Russian-SynTagRus | 1 | 2 | 5 | |||||||||
UD_Sanskrit | 1 | |||||||||||
UD_Spanish | 895 | 1 | 195 | 1 | 117 | 23 | ||||||
UD_Spanish-AnCora | 2 | 1 | 1 | |||||||||
UD_Vietnamese | 1 |
Marked as AUX but not aux or auxpass
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. If a word is tagged with the auxiliary POS (AUX), its dependency relation should be either aux or auxpass.
Search expression: AUX (<!aux&<!auxpass) _
AUX | |
---|---|
UD_Basque | 68 |
UD_Bulgarian | 6 |
UD_Buryat | 4 |
UD_Catalan | 3767 |
UD_Chinese | 10 |
UD_Coptic | 53 |
UD_Croatian | 2643 |
UD_Czech | 6 |
UD_Czech-CAC | 23 |
UD_Czech-CLTT | 2 |
UD_Danish | 1761 |
UD_Dutch | 5112 |
UD_Dutch-LassySmall | 1322 |
UD_English | 88 |
UD_English-ESL | 23 |
UD_English-LinES | 148 |
UD_Estonian | 2246 |
UD_Finnish | 8 |
UD_French | 10 |
UD_German | 35 |
UD_Hebrew | 482 |
UD_Hindi | 10 |
UD_Hungarian | 7 |
UD_Indonesian | 1 |
UD_Italian | 5 |
UD_Japanese-KTC | 5824 |
UD_Kazakh | 175 |
UD_Latin-ITTB | 2 |
UD_Persian | 6 |
UD_Polish | 57 |
UD_Portuguese-BR | 13 |
UD_Portuguese-Bosque | 2 |
UD_Romanian | 324 |
UD_Russian-SynTagRus | 4468 |
UD_Sanskrit | 1 |
UD_Slovak | 1 |
UD_Slovenian-SST | 39 |
UD_Spanish | 36 |
UD_Spanish-AnCora | 5725 |
UD_Swedish-LinES | 55 |
UD_Tamil | 13 |
UD_Turkish | 975 |
UD_Uyghur | 1 |
Marked as aux or auxpass but not AUX
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. If a word is attached as a (passive) auxiliary dependent, it should be tagged with the auxiliary POS (AUX).
Search expression: !AUX (<aux|<auxpass) _
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 7 | 35 | |||||||||||||||
UD_Ancient_Greek-PROIEL | 8 | 28 | 2 | 6 | 73 | ||||||||||||
UD_Arabic | 5 | 6 | 4 | 1 | 24 | 1 | 1661 | 13 | 58 | ||||||||
UD_Basque | 1 | 6 | 1 | 4 | 1 | 369 | 5 | ||||||||||
UD_Bulgarian | 6 | 4162 | 7 | ||||||||||||||
UD_Buryat | 1 | 1 | |||||||||||||||
UD_Catalan | 17 | ||||||||||||||||
UD_Chinese | 424 | ||||||||||||||||
UD_Coptic | 4 | 1 | 1 | 1 | 3 | 26 | 17 | ||||||||||
UD_Croatian | 3 | 15 | 2 | 10 | 172 | 1 | 16 | ||||||||||
UD_Czech-CAC | 1 | 1 | 3 | 1 | 1 | 13 | 2 | ||||||||||
UD_Czech-CLTT | 2 | 2 | 2 | ||||||||||||||
UD_Dutch | 89 | 5 | 4 | 1 | 107 | 5 | 9 | 6 | 21 | 6 | |||||||
UD_English | 1 | 6 | 87 | ||||||||||||||
UD_English-ESL | 1 | 1 | |||||||||||||||
UD_English-LinES | 1 | 1 | 141 | ||||||||||||||
UD_Faroese | 11 | ||||||||||||||||
UD_Finnish | 4 | ||||||||||||||||
UD_Finnish-FTB | 1 | 4691 | 1 | ||||||||||||||
UD_Galician | 5 | 1 | 10 | 1347 | |||||||||||||
UD_Galician-TreeGal | 232 | ||||||||||||||||
UD_German | 11 | 2 | 12 | 12 | 6 | ||||||||||||
UD_Gothic | 7 | 48 | 2 | 12 | 4 | ||||||||||||
UD_Greek | 1880 | 228 | |||||||||||||||
UD_Hebrew | 772 | ||||||||||||||||
UD_Hindi | 9 | 7 | 2 | 2 | |||||||||||||
UD_Indonesian | 2 | 1 | |||||||||||||||
UD_Italian | 1 | ||||||||||||||||
UD_Kazakh | 4 | ||||||||||||||||
UD_Latin | 1 | 511 | |||||||||||||||
UD_Latin-ITTB | 2 | ||||||||||||||||
UD_Latin-PROIEL | 7 | 3 | 6 | 10 | 2245 | ||||||||||||
UD_Latvian | 9 | 564 | |||||||||||||||
UD_Old_Church_Slavonic | 5 | 4 | 2 | 7 | 199 | ||||||||||||
UD_Persian | 1768 | ||||||||||||||||
UD_Polish | 19 | ||||||||||||||||
UD_Portuguese | 1 | 1 | 2 | 1 | 1535 | ||||||||||||
UD_Portuguese-BR | 14 | 15 | |||||||||||||||
UD_Romanian | 1 | 4 | 1 | 9 | 10 | 1 | 179 | ||||||||||
UD_Russian | 597 | ||||||||||||||||
UD_Russian-SynTagRus | 1 | 1389 | 1 | ||||||||||||||
UD_Slovak | 14 | 1 | |||||||||||||||
UD_Slovenian | 6 | ||||||||||||||||
UD_Slovenian-SST | 3 | ||||||||||||||||
UD_Spanish | 2 | 6 | 3 | 32 | 9 | ||||||||||||
UD_Spanish-AnCora | 1 | 7 | 3 | ||||||||||||||
UD_Swedish-LinES | 6 | ||||||||||||||||
UD_Swedish_Sign_Language | 1 | 12 | |||||||||||||||
UD_Tamil | 1 | ||||||||||||||||
UD_Ukrainian | 4 | 17 | |||||||||||||||
UD_Uyghur | 1 | 141 | |||||||||||||||
UD_Vietnamese | 22 | 3 | 1 | 1 | 1 | 500 | 60 |
Marked as cc but not CONJ, SYM or ADV
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. The relation cc is for coordinating conjunctions, hence the most expected POS tag is CONJ. Symbols such as + may occasionally replace conjunctions, and in some languages some adverbs may take this syntactic function too. However, most other tags are at least suspicious.
Search expression: !CONJ&!SYM&!ADV <cc _
Cc is not leaf
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. A coordinating conjunction should be attached to the first conjunct and should not have dependents of its own.
Search expression: _ < (_ <cc _)
ADJ | ADP | ADV | AUX | CCONJ | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 39 | 11 | 145 | 25 | 9 | 343 | 30 | 150 | 13 | 128 | ||||||||
UD_Ancient_Greek-PROIEL | 1 | 22 | 4 | 1 | ||||||||||||||
UD_Arabic | 6 | 59 | 1 | 23 | 37 | 3 | 3 | 17 | 11 | 28 | 61 | |||||||
UD_Basque | 16 | 43 | 6 | 16 | 1 | 166 | 31 | 43 | 3 | 69 | 34 | 51 | 3 | |||||
UD_Bulgarian | 4 | 48 | 1 | 10 | 630 | 3 | 1 | |||||||||||
UD_Buryat | 3 | 4 | ||||||||||||||||
UD_Catalan | 27 | 306 | 101 | 2 | 42 | 48 | 543 | 38 | 21 | 10 | 73 | 365 | 154 | 26 | 191 | |||
UD_Chinese | 1 | 1 | ||||||||||||||||
UD_Croatian | 14 | 76 | 3 | 3 | 1 | 10 | 38 | 2 | ||||||||||
UD_Czech | 2 | 313 | 6 | 13 | ||||||||||||||
UD_Czech-CAC | 11 | 5 | 15 | 203 | 16 | 3 | 4 | 8 | 1 | 4 | 2 | 3 | ||||||
UD_Czech-CLTT | 31 | 1 | ||||||||||||||||
UD_Danish | 27 | 69 | 150 | 1 | 80 | 3 | 10 | 9 | 141 | 3 | 1 | 67 | 1 | |||||
UD_Dutch | 1 | 3 | 2 | 1 | 1 | 8 | 12 | 8 | ||||||||||
UD_Dutch-LassySmall | 1 | 1 | 7 | 1 | ||||||||||||||
UD_English | 40 | 31 | 48 | 4 | 8 | 1 | 32 | 3 | 8 | |||||||||
UD_English-ESL | 13 | 11 | 1 | 4 | 1 | 1 | 2 | 1 | ||||||||||
UD_English-LinES | 10 | 10 | 2 | 1 | 31 | |||||||||||||
UD_Estonian | 1 | 3 | 11 | 1 | ||||||||||||||
UD_Faroese | 353 | |||||||||||||||||
UD_Finnish | 8 | 6 | 15 | |||||||||||||||
UD_Finnish-FTB | 12 | 67 | ||||||||||||||||
UD_French | 4 | 145 | 10 | 2 | 7 | 1 | 1 | 1 | 93 | 169 | 1 | |||||||
UD_Galician | 3 | 3 | 5 | 1 | 2 | 5 | 2 | 4 | 2 | 2 | 48 | 1 | ||||||
UD_Galician-TreeGal | 1 | 1 | 1 | |||||||||||||||
UD_German | 69 | 6 | 4 | 1 | 16 | 1 | 6 | 17 | ||||||||||
UD_Gothic | 6 | 6 | 1 | |||||||||||||||
UD_Greek | 2 | 184 | 78 | 39 | 80 | 3 | 5 | 223 | 3 | |||||||||
UD_Hindi | 1 | 1 | 38 | 61 | 1 | |||||||||||||
UD_Hungarian | 2 | 2 | 1 | 2 | 2 | 2 | ||||||||||||
UD_Indonesian | 2 | 16 | 8 | 5 | 13 | 2 | 6 | 11 | ||||||||||
UD_Irish | 2 | 6 | 1 | 1 | 9 | 1 | 1 | 2 | 1 | 84 | 1 | 2 | 1 | |||||
UD_Italian | 1 | 2 | 13 | 10 | 1 | 5 | 3 | 80 | 2 | 10 | ||||||||
UD_Japanese | 2 | 2 | 161 | 119 | 7 | 1 | 87 | 11 | ||||||||||
UD_Japanese-KTC | 665 | |||||||||||||||||
UD_Kazakh | 1 | 3 | 1 | |||||||||||||||
UD_Latin | 2 | 11 | 1 | 7 | 18 | 4 | 18 | |||||||||||
UD_Latin-ITTB | 17 | 3 | 11 | 15 | 36 | 2 | 3 | 3 | 1 | 36 | ||||||||
UD_Latin-PROIEL | 9 | 6 | 1 | 1 | ||||||||||||||
UD_Latvian | 37 | 1 | ||||||||||||||||
UD_Norwegian | 1 | 5 | 22 | 5 | ||||||||||||||
UD_Old_Church_Slavonic | 1 | 3 | 3 | |||||||||||||||
UD_Persian | 2 | 2 | 3 | 158 | 1 | 3 | 1 | 16 | 1 | 1 | 1 | |||||||
UD_Polish | 2 | |||||||||||||||||
UD_Portuguese | 1 | 16 | 5 | 3 | 2 | 61 | 1 | 2 | 4 | 31 | 56 | 7 | ||||||
UD_Portuguese-BR | 6 | 1 | 2 | 47 | 26 | |||||||||||||
UD_Portuguese-Bosque | 9 | 15 | 2 | 1 | 92 | 1 | ||||||||||||
UD_Romanian | 3 | 4 | 3 | 1 | 101 | 2 | 13 | 2 | 5 | 1 | 41 | 3 | 18 | 3 | ||||
UD_Russian | 2 | 7 | 96 | 5 | 2 | 9 | 1 | 6 | 2 | |||||||||
UD_Russian-SynTagRus | 49 | 6 | 26 | 9 | 92 | 1 | 537 | 8 | 2 | 1306 | 11 | 19 | ||||||
UD_Slovak | 1 | 4 | ||||||||||||||||
UD_Slovenian | 1 | 9 | 19 | 6 | 15 | |||||||||||||
UD_Slovenian-SST | 1 | 54 | 6 | 1 | 1 | 45 | 15 | 7 | ||||||||||
UD_Spanish | 17 | 21 | 22 | 149 | 4 | 142 | 4 | 1 | 213 | 1 | 2 | 54 | 2 | |||||
UD_Spanish-AnCora | 34 | 85 | 69 | 6 | 32 | 30 | 399 | 9 | 30 | 12 | 37 | 342 | 166 | 3 | 178 | |||
UD_Swedish | 2 | |||||||||||||||||
UD_Swedish-LinES | 1 | 1 | 1 | 1 | 2 | 6 | 19 | |||||||||||
UD_Turkish | 10 | 3 | 14 | 4 | 67 | 1 | 37 | 1 | 4 | 12 | 6 | 10 | ||||||
UD_Uyghur | 4 | 1 | 20 | |||||||||||||||
UD_Vietnamese | 4 | 2 | 1 | 5 | 1 | 4 | 7 | 4 | 5 |
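The leaf condition tested above amounts to finding tokens whose head is itself attached via cc. A minimal sketch, assuming 10-column CoNLL-U input; the helper names and the toy analysis of "cats as well as dogs" are hypothetical.

```python
def read_sentences(conllu_text):
    """Yield each sentence as a list of (id, form, upos, head, deprel) tuples."""
    sentence = []
    for line in conllu_text.splitlines():
        if not line.strip():
            if sentence:
                yield sentence
            sentence = []
        elif not line.startswith("#"):
            cols = line.split("\t")
            # Skip multiword-token ranges and empty nodes.
            if cols[0].isdigit():
                sentence.append((int(cols[0]), cols[1], cols[3], int(cols[6]), cols[7]))
    if sentence:
        yield sentence


def dependents_of_relation(sentence, relation):
    """Return (form, upos) of tokens whose head is attached via `relation`."""
    rel_ids = {tok_id for tok_id, _, _, _, deprel in sentence if deprel == relation}
    return [(form, upos) for _, form, upos, head, _ in sentence if head in rel_ids]


# Toy analysis in which a multiword conjunction hangs its parts under the cc token.
ROWS = [
    "1 cats cat NOUN _ _ 0 root _ _",
    "2 as as ADV _ _ 1 cc _ _",
    "3 well well ADV _ _ 2 mwe _ _",
    "4 as as ADP _ _ 2 mwe _ _",
    "5 dogs dog NOUN _ _ 1 conj _ _",
]
SAMPLE = "\n".join(row.replace(" ", "\t") for row in ROWS)

for sent in read_sentences(SAMPLE):
    print(dependents_of_relation(sent, "cc"))
```

The two hits here come from a multiword conjunction, which shows why a nonzero count need not indicate an error. Passing `"expl"` instead of `"cc"` would sketch the "Dependents of expl" test in the same way.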
Marked as neg but not PART or ADV
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. The neg relation attaches negating particles, such as English "not", to the predicate or phrase they negate. (These particles are sometimes also tagged as adverbs, but we probably want to choose just one tag.) It is not used for negative determiners such as English "no" in "there is no chance", which should be attached as det.
Search expression: !PART&!ADV <neg _
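This test and the earlier "Marked as cc but not CONJ, SYM or ADV" test share one shape: for a given relation, tally the POS tags that fall outside an expected set. The sketch below assumes 10-column CoNLL-U input; the function name and the toy rows are hypothetical, and the result is a per-POS tally comparable to one table row in this report.

```python
from collections import Counter


def unexpected_pos_for_relation(conllu_text, relation, expected_pos):
    """Tally the POS tags of tokens attached via `relation` but not tagged as expected."""
    tally = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Only regular token lines have 10 columns and a plain integer ID.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos, deprel = cols[3], cols[7]
        if deprel == relation and upos not in expected_pos:
            tally[upos] += 1
    return tally


# Toy fragments: "tea plus milk", "cats and dogs", "no chance".
ROWS = [
    "1 tea tea NOUN _ _ 0 root _ _",
    "2 plus plus ADP _ _ 1 cc _ _",
    "3 milk milk NOUN _ _ 1 conj _ _",
    "",
    "1 cats cat NOUN _ _ 0 root _ _",
    "2 and and CONJ _ _ 1 cc _ _",
    "3 dogs dog NOUN _ _ 1 conj _ _",
    "",
    "1 no no DET _ _ 2 neg _ _",
    "2 chance chance NOUN _ _ 0 root _ _",
]
SAMPLE = "\n".join(row.replace(" ", "\t") for row in ROWS)

print(dict(unexpected_pos_for_relation(SAMPLE, "neg", {"PART", "ADV"})))
print(dict(unexpected_pos_for_relation(SAMPLE, "cc", {"CONJ", "SYM", "ADV"})))
```

The determiner "no" attached as neg is the case the description says should instead be attached as det, so it shows up in the neg tally.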
Parts of speech of expl
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. This test lists the parts of speech of words attached via the expletive relation (expl).
Search expression: _ <expl _
ADJ | ADP | ADV | AUX | DET | NOUN | PART | PRON | PROPN | SCONJ | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Bulgarian | 3502 | |||||||||||
UD_Croatian | 1 | 3 | 1 | 1 | 1 | 3 | 1 | |||||
UD_Czech | 17180 | |||||||||||
UD_Czech-CAC | 6066 | |||||||||||
UD_Czech-CLTT | 112 | |||||||||||
UD_Danish | 404 | 37 | ||||||||||
UD_Dutch | 460 | |||||||||||
UD_Dutch-LassySmall | 18 | 26 | ||||||||||
UD_English | 7 | 725 | ||||||||||
UD_English-ESL | 541 | |||||||||||
UD_English-LinES | 3 | 271 | ||||||||||
UD_Finnish-FTB | 3 | 74 | 447 | |||||||||
UD_French | 49 | 245 | 382 | 59 | ||||||||
UD_Galician-TreeGal | 284 | |||||||||||
UD_German | 402 | 1 | ||||||||||
UD_Italian | 2097 | 1 | ||||||||||
UD_Norwegian | 7 | 1 | 3231 | |||||||||
UD_Polish | 1708 | |||||||||||
UD_Portuguese-BR | 1 | 398 | 270 | 3 | ||||||||
UD_Romanian | 1 | 1 | 495 | 1 | ||||||||
UD_Russian-SynTagRus | 1 | 1 | 32 | |||||||||
UD_Sanskrit | 1 | 1 | ||||||||||
UD_Slovak | 2621 | |||||||||||
UD_Slovenian | 2298 | |||||||||||
UD_Slovenian-SST | 463 | 1 | ||||||||||
UD_Swedish | 493 | |||||||||||
UD_Swedish-LinES | 2 | 460 | ||||||||||
UD_Ukrainian | 1 | 1 |
Dependents of expl
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. This test lists the dependents of words attached via the expletive relation (expl). The result should be empty.
Search expression: _ < (_ <expl _)
ADJ | ADP | ADV | CONJ | DET | NOUN | PART | PRON | PROPN | PUNCT | SCONJ | VERB | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Bulgarian | 2 | 1 | 1 | |||||||||
UD_Croatian | 3 | |||||||||||
UD_Dutch | 1 | 1 | ||||||||||
UD_English-ESL | 1 | 1 | ||||||||||
UD_Finnish-FTB | 33 | 1 | 7 | 3 | 285 | |||||||
UD_French | 1 | 3 | 1 | 10 | 3 | 2 | 8 | |||||
UD_Norwegian | 2 | |||||||||||
UD_Portuguese-BR | 1 | 1 | ||||||||||
UD_Romanian | 13 | 1 | 1 | |||||||||
UD_Russian-SynTagRus | 1 | 2 | 6 | |||||||||
UD_Slovenian-SST | 1 | |||||||||||
UD_Swedish-LinES | 1 | 1 | 4 | 2 | 2 | 1 | 1 | 4 |
Heads of expl
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. This test lists the parts of speech of words that govern an expletive (expl) dependent.
Search expression: _ >expl _
ADJ | ADP | ADV | AUX | CONJ | DET | NOUN | NUM | PART | PRON | PROPN | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Bulgarian | 58 | 10 | 1 | 13 | 1 | 1 | 3396 | ||||||||
UD_Croatian | 2 | 1 | 3 | 1 | 4 | ||||||||||
UD_Czech | 538 | 3 | 1 | 16638 | |||||||||||
UD_Czech-CAC | 253 | 17 | 16 | 5780 | |||||||||||
UD_Czech-CLTT | 28 | 2 | 79 | 3 | |||||||||||
UD_Danish | 7 | 4 | 428 | 1 | |||||||||||
UD_Dutch | 43 | 17 | 19 | 15 | 1 | 2 | 2 | 360 | |||||||
UD_Dutch-LassySmall | 7 | 7 | 30 | ||||||||||||
UD_English | 133 | 2 | 34 | 2 | 4 | 556 | |||||||||
UD_English-ESL | 137 | 1 | 1 | 58 | 2 | 1 | 341 | ||||||||
UD_English-LinES | 20 | 2 | 10 | 32 | 4 | 2 | 1 | 203 | |||||||
UD_Finnish-FTB | 35 | 29 | 1 | 11 | 447 | ||||||||||
UD_French | 21 | 4 | 1 | 34 | 2 | 299 | 25 | 344 | |||||||
UD_Galician-TreeGal | 1 | 280 | |||||||||||||
UD_German | 30 | 1 | 2 | 9 | 1 | 1 | 359 | ||||||||
UD_Italian | 1 | 4 | 6 | 2 | 1 | 2061 | |||||||||
UD_Norwegian | 613 | 3 | 84 | 1 | 10 | 495 | 4 | 93 | 59 | 1874 | |||||
UD_Polish | 23 | 31 | 2 | 1652 | |||||||||||
UD_Portuguese-BR | 2 | 55 | 1 | 4 | 13 | 595 | |||||||||
UD_Romanian | 5 | 3 | 3 | 1 | 483 | ||||||||||
UD_Russian-SynTagRus | 2 | 1 | 1 | 30 | |||||||||||
UD_Sanskrit | 2 | ||||||||||||||
UD_Slovak | 1 | 5 | 2612 | ||||||||||||
UD_Slovenian | 4 | 1 | 1 | 2292 | |||||||||||
UD_Slovenian-SST | 6 | 1 | 4 | 5 | 1 | 3 | 4 | 433 | 3 | ||||||
UD_Swedish | 121 | 1 | 21 | 350 | |||||||||||
UD_Swedish-LinES | 67 | 3 | 5 | 2 | 51 | 34 | 10 | 290 | |||||||
UD_Ukrainian | 2 |
Heads of xcomp
DEBUGGING TEST. NONZERO HITS DO NOT MEAN THE DATA IS INVALID. This test lists the parts of speech of words that govern an open clausal complement (xcomp).
Search expression: _ >xcomp _
ADJ | ADP | ADV | AUX | CONJ | DET | INTJ | NOUN | NUM | PART | PRON | PROPN | PUNCT | SCONJ | SYM | VERB | X | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
UD_Ancient_Greek | 169 | 4 | 50 | 12 | 9 | 109 | 3 | 69 | 65 | 37 | 4537 | 1 | |||||
UD_Ancient_Greek-PROIEL | 1 | 3437 | |||||||||||||||
UD_Arabic | 26 | 139 | 9 | 19 | 1460 | 59 | |||||||||||
UD_Basque | 10 | 1 | 2 | 12 | 6 | 1 | 55 | 2 | 1 | 4 | 1287 | 4 | |||||
UD_Bulgarian | 497 | ||||||||||||||||
UD_Buryat | 6 | 2 | 133 | ||||||||||||||
UD_Catalan | 132 | 8 | 7 | 3 | 5 | 2 | 4 | 4 | 5 | 2271 | |||||||
UD_Chinese | 17 | 22 | 9 | 1 | 9 | 1 | 4 | 1653 | |||||||||
UD_Coptic | 5 | 51 | |||||||||||||||
UD_Croatian | 88 | 21 | 164 | 46 | 2 | 1 | 1674 | ||||||||||
UD_Czech | 621 | 6 | 4 | 1420 | 33 | 22 | 569 | 391 | 15256 | ||||||||
UD_Czech-CAC | 209 | 4 | 782 | 6 | 11 | 213 | 63 | 11 | 4144 | ||||||||
UD_Czech-CLTT | 77 | 1 | 47 | 1 | 7 | 140 | 3 | ||||||||||
UD_Danish | 7 | 3 | 562 | ||||||||||||||
UD_Dutch | 58 | 62 | 70 | 319 | 2 | 112 | 1 | 15 | 558 | 2 | 3350 | 22 | |||||
UD_Dutch-LassySmall | 1 | 122 | |||||||||||||||
UD_English | 245 | 5 | 6 | 1 | 1 | 3 | 1 | 1 | 3303 | ||||||||
UD_English-ESL | 160 | 4 | 8 | 15 | 1 | 4 | 2122 | ||||||||||
UD_English-LinES | 6 | 1 | 15 | 4 | 2 | 1 | 1236 | ||||||||||
UD_Estonian | 119 | 5 | 1 | 45 | 1 | 4 | 2 | 3099 | |||||||||
UD_Finnish | 19 | 16 | 2 | 2277 | |||||||||||||
UD_Finnish-FTB | 2 | 1266 | |||||||||||||||
UD_French | 10 | 1 | 1 | 13 | 1 | 2 | 2 | 1673 | |||||||||
UD_Galician-TreeGal | 7 | 3 | 2 | 1 | 190 | ||||||||||||
UD_German | 40 | 4 | 4 | 2 | 2 | 48 | 1 | 4 | 4 | 973 | |||||||
UD_Gothic | 1 | 1 | 1 | 2370 | |||||||||||||
UD_Greek | 3 | 33 | 8 | 15 | |||||||||||||
UD_Hebrew | 20 | 13 | 683 | 30 | 1 | 14 | 2 | 982 | |||||||||
UD_Hindi | 729 | ||||||||||||||||
UD_Hungarian | 34 | 8 | 1 | 268 | |||||||||||||
UD_Indonesian | 10 | 3 | 38 | 1 | 3 | 10 | 1256 | ||||||||||
UD_Irish | 15 | 3 | 1 | 1 | 143 | 13 | 6 | 3 | 389 | 3 | |||||||
UD_Italian | 6 | 1 | 2106 | ||||||||||||||
UD_Kazakh | 7 | ||||||||||||||||
UD_Latin | 33 | 5 | 2 | 34 | 13 | 4 | 6 | 753 | |||||||||
UD_Latin-ITTB | 63 | 1 | 1 | 193 | 1 | 91 | 23 | 3296 | |||||||||
UD_Latin-PROIEL | 2602 | 4 | |||||||||||||||
UD_Latvian | 1 | 154 | |||||||||||||||
UD_Norwegian | 13 | 2 | 3 | 159 | 4 | 9 | 35 | 3744 | |||||||||
UD_Old_Church_Slavonic | 2378 | ||||||||||||||||
UD_Persian | 35 | 2 | 45 | 2 | 370 | ||||||||||||
UD_Polish | 49 | 11 | 2 | 1 | 1063 | ||||||||||||
UD_Portuguese | 1 | 2 | 1 | 12 | 1 | 1 | 2813 | ||||||||||
UD_Portuguese-BR | 1 | 1 | 1 | 1 | 4 | 956 | 1 | ||||||||||
UD_Portuguese-Bosque | 5 | 13 | 5 | 41 | 3 | 9 | 11 | 6 | 1798 | ||||||||
UD_Romanian | 62 | 1 | 1 | 2 | 1 | 3 | 1 | 1053 | |||||||||
UD_Russian | 72 | 6 | 1 | 2 | 578 | ||||||||||||
UD_Russian-SynTagRus | 8571 | ||||||||||||||||
UD_Sanskrit | 3 | ||||||||||||||||
UD_Slovak | 20 | 8 | 70 | 2 | 20 | 34 | 1052 | ||||||||||
UD_Slovenian | 8 | 1188 | |||||||||||||||
UD_Slovenian-SST | 3 | 239 | |||||||||||||||
UD_Spanish | 4 | 3 | 3 | 6 | 2 | 9 | 1393 | ||||||||||
UD_Spanish-AnCora | 149 | 9 | 8 | 3 | 7 | 2 | 5 | 1 | 2394 | ||||||||
UD_Swedish | 11 | 1 | 15 | 1 | 3 | 1029 | |||||||||||
UD_Swedish-LinES | 7 | 1 | 24 | 7 | 14 | 1 | 1 | 1 | 1328 | ||||||||
UD_Swedish_Sign_Language | 3 | ||||||||||||||||
UD_Tamil | 3 | 3 | 26 | ||||||||||||||
UD_Ukrainian | 4 | 6 | 29 | ||||||||||||||
UD_Uyghur | 2 | ||||||||||||||||
UD_Vietnamese | 248 | 2 | 1490 | 2 | 18 | 1583 | 2 |