home edit page issue tracker

This page pertains to UD version 2.

Other Constructions

This section of the syntax overview contains detailed discussion of particular linguistic constructions that fall outside (or cut across) the main categories of simple clauses, complex clauses, and nominal phrases.

Coordination

As discussed in the section on complex clauses, coordination is essentially a symmetrical relation. However, in order to satisfy the dependency tree constraint, the first conjunct is treated as the parent (or “technical head”) of all following conjuncts via the conj relation. Coordinating conjunctions and punctuation delimiting the conjuncts are attached to the associated conjunct using the cc and punct relations respectively. This analysis is applied to all cases of coordination at the clause, phrase or word level.

Ellipsis

The UD approach to ellipsis can be summarized as follows:

  1. If the elided element has no overt dependents, we do nothing.
  2. If the elided element has overt dependents, we promote one of these to take the role of the head.
  3. If the elided element is a predicate and the promoted element is one of its arguments or adjuncts, we use the orphan relation when attaching other non-functional dependents to the promoted head.

Ellipsis in Nominals

If the head nominal is elided, we promote dependents in the following order: amod > nummod > det > nmod > case.

Examples:

Ellipsis in Clauses

If the main predicate is elided, we use simple promotion only if there is an aux or cop, or a mark in the case of an infinitival marker.

Example:

In more complicated cases where a predicate is elided but no aux or cop is present, simple promotion (without orphan deprels) could lead to very unnatural and confusing relations. For example, in the following sentence, you would be the subject of coffee, suggesting that the second clause contains a copular construction rather than an elided predicate.

Of course, one may decide to promote you instead of coffee, but suggesting that coffee is a direct object of you is no better:

To avoid confusion and signal that the dependency structure is incomplete, we use the special relation orphan to connect the non-promoted dependents with the promoted dependent. Orphaned dependents are considered for promotion in the following order: nsubj > obj > iobj > obl > advmod > csubj > xcomp > ccomp > advcl > dislocated > vocative. If it is necessary to select among several orphans of the same type (e.g., there are just two orphans and both are advmod), the orphan occurring first (closer to the sentence start) is promoted.

Note that the orphan relation is only used when an ordinary relation would be misleading (for example, when attaching an object to a subject). In particular, the ordinary cc relation should be used for the coordinating conjunction, which attaches to the pseudo-constituent formed through the orphan dependency.

In German the grammar requires that non-finite verbs are at the end of the clause, which may mean that they are far away from their finite auxiliary verbs, possibly with intervening conjuncts. In the following example we do not try to keep wurde geschieden in one constituent. Instead, the auxiliary verb wurde is promoted as the head of the first conjunct, and the content participle geschieden heads the third conjunct. There is no verb in the middle conjunct and the orphan relation is used instead. See also issue 522.

In verb-final languages the verb occurs in the last conjunct and gapping may occur in the earlier conjuncts; unlike German, not even an auxiliary occurs in the first conjunct. Note that coordination, being a non-dependency relation, must go left-to-right even in these languages. As a result, the first conjunct (clause with gap) may have to be headed by a non-verb, as the overt verb belongs to the last conjunct clause. The following example is from Malayalam.

Comparatives

The syntax of comparative constructions poses various challenges for linguistic theory. For English, many of these are discussed in Bresnan (1973) and Huddleston and Pullum (2002, chapter 13). We give a discussion of equality comparisons (That car is as big as mine) and inequality scalar comparisons (Sue is taller than Jim).

In constructions of the form as X as Y or the same X as Y, X and Y can be of a range of syntactic types, leading to surface forms such as those exemplified below:

We note that the head of the whole construction appears to be the head of the X phrase. We can simply say:

We then say that the first as is an independent modifier in the comparative, modifying something in the X phrase, in part because the following as Y is fairly optional:

However, this first as may not modify the head of X, it may modify an existing modifier of the head of X. Its role is adverbial (advmod) consistent with other kinds of degree modification:

We then take the complement of the comparative as an oblique dependent of the first part. It is clear that the material in the complement as Y can be clausal. It is also usually optional, as indicated above. For that reason, we usually make the complement an advcl, with the second as analyzed as a mark. That gives us:

We take the as Y clause as a dependent of the content-word whose degree is being assessed (here often). We take its head to be the head of the clause, here heard. An initially plausible alternative analysis would be to make the clausal dependent headed by as a dependent of the comparative modifier as, more, or less, and indeed this is the analysis which Huddleston and Pullum (2002) argue for in English. However, there are several reasons to doubt this analysis. One is the general principles of UD in favoring content words as heads. A second argument is motivated by a desire for crosslinguistic adequacy: in languages such as Finnish and Japanese, this functional element is not present.

Since the first as is a functional element, the dependent can be understood to modify the whole phrase as often, and therefore is attached to the head of that phrase. Additionally, it might be noted that comparatives without a comparative word occur in certain varieties of English. For example in Indian English you find usages such as So don’t worry if you think that you have a girl-friend, who is intelligent than you. One further argument from morphological comparatives is discussed below.

The same basic analysis is given for inequality scalar comparatives, with more or less or a comparative adjective and than, parallel to the two uses of as above, except that more can also directly modify a noun, and more is then taken to have the amod relation to the noun. In this case, we take the comparative complement as directly depending on more, roughly seeing it as elliptical for more numerous. In general, the comparative complement always depends on an adjective or adverb, and is usually an advcl except when it is directly analyzed as an obl (as discussed at the end of this section).

In addition to crosslinguistic adequacy, we can see here another possible advantage of not attaching the than clause to more: This analysis then means that the dependency structure is more parallel between cases with a periphrastic comparative like more intelligent and a morphological comparative like taller (even though in bound morpheme cases, the -er could be argued to be the comparative head).

If the head is elided, then the functional element can be promoted.

Very commonly the complement clause in a comparative undergoes various amounts of partial reduction or ellipsis, sometimes to a quite extreme extent:

In general, we treat whatever remnant that remains as still an advcl, as above.

However, a limiting case of this is that only a nominal is present:

The analysis in this case is unclear: Should the comparative complement still be analyzed as an extremely reduced complement clause or analyzed as simply a nominal modifier? There are arguments for both positions. For English, there is a long discussion of the arguments in section 2.2 of chapter 13 of Huddleston and Pullum (2002). We err on the side of minimizing the postulation of unobserved structure and opt to treat these cases as just an oblique nominal complement:

More than as a multiword expression

In certain contexts the comparative complement combines both the action or adjective that is being compared and the quantity it is compared to:

In these cases we consider more than to be a fixed multiword expression because the two words are inseparable. One cannot say *more percent than 90.

If the expression modifies a counted noun phrase, it attaches directly to the modified number:

If there is no number (because the indefinite article functions as the number “one”), it attaches directly to the head noun:

Sufficiency and Excess

Also involving degree semantics are constructions conveying a consequence/result of that degree. Like the as much flour as the recipe called for example above, we treat the degree-qualified adjective or adverb as the head of the additional clause that is licensed by the construction (as advcl):

Here, so is the degree modifier and dusty is the degree-modified adjective. The adjective heads the advcl.

Examples with degree modifiers too (pre-adverb) and enough (post-adjective):

Even if the thing measured is provided by a noun, it is the adjective that heads the additional clause:

The that-clause is licensed by the sufficiency construction: *There was much flour that I sneezed is not possible. Sometimes multiple clauses are licensed, one by the adjective (as ccomp) and one by the sufficiency construction (either can be omitted):

The adjective expressing sufficiency or excess may also occur on its own, without being headed by a noun:

Paratactic Constructions

The parataxis relation is used to analyze a number of constructions where clauses are combined by relations that are looser than standard coordination.

Side-by-side sentences (“run-on sentences”)

The parataxis relation is used for a pair of what could have been standalone sentences, but which are being treated together as a single sentence. This may happen because sentence segmentation of the sentence was done primarily following the presence of sentence-final punctuation, and these clauses are joined by punctuation such as a colon or comma, or not delimited by punctuation at all. In a spoken corpus, it may happen because what is labeled as a sentence is more commonly an utterance turn. Even if the treebanker is doing the sentence division, it may happen because there seems to be a clear discourse relation linking two clauses. Sometimes there are more than two sentences joined in this way. In this case we make all the later sentences dependents of the first one, to maximize similarity to the analysis used for conjunction.

This relation may happen with units that are smaller than sentences:

Paired clauses with non-conjunction connective (“X so Y” etc.)

The relation is also used for clauses connected by a word like so, then, therefore, or however if neither clause is interpreted as modifying the other, and there is no coordinating conjunction:

The following, by contrast, are advcl modifiers:

Note that if-clauses should almost always be analyzed as subordinate, even when then is present.

Reported speech

When a speech verb interrupts reported speech content, the interruption is treated as a parenthetical parataxis:

See further discussion of reported speech at ccomp.

News article bylines

We have used the parataxis relation to connect the parts of a news article byline. There does not seem to be a better relation to use.

Interjected Clauses

Single word or phrase interjections are analyzed as discourse, but when a whole clause is interjected, we use the relation parataxis.

In the second example, we treat the second half as the head of the dependency because the first half feels like a whole clause interjection, not like the main clause of the utterance.

Tag questions

We also use the parataxis relation for tag questions such as isn’t it? or haven’t you?.

Feedback words

In a sentence starting with a feedback word such as yes or no and continuing with a main clause, we take the predicate of the main clause to be the root of the sentence and attach the feedback word to this predicate with a discourse relation:

However, when the feedback is expressed by a full clause instead of a feedback word, the predicate of this clause is taken as the root and the predicate of the following clause is attached with a parataxis relation:

Punctuation

Tokens with the relation punct always attach to content words (except in cases of ellipsis) and can never have dependents. Since punct is not a normal dependency relation, the usual criteria for determining the head word do not apply. Instead, we use the following principles:

  1. A punctuation mark separating coordinated units is attached to the immediately following conjunct.
  2. A punctuation mark preceding or following a dependent unit is attached to that unit.
  3. Within the relevant unit, a punctuation mark is attached at the highest possible node that preserves projectivity.
  4. Paired punctuation marks (quotes and brackets) should be attached to the same word unless that would create non-projectivity. This word is usually the head of the phrase enclosed in the paired punctuation.

See also examples at parataxis.

BESbswyBESbswyBESbswyBESbswy