1 Introduction

At least since [7] one of the main problem in proof theory has been to find for a given first-order theory \(\textsf{T}\) an equivalent sequent calculus \(\textsf{G}\) satisfying Gentzen’s celebrated cut-elimination theorem.Footnote 1 When \(\textsf{T}\) is just classical or intuitionistic first-order logic, then we can see that the issue has already been settled in the early days of proof theory by Gentzen himself with the sequent calculi \(\textsf{LK}\) and \(\textsf{LJ}\), respectively. However, if \(\textsf{T}\) consists of non-logical axioms it is not entirely clear whether such a \(\textsf{G}\) can be found at all. For \(\textsf{G}\) cannot simply be \(\textsf{LK}\) or \(\textsf{LJ}\) extended with one new ‘axiomatic sequent’ \(\Rightarrow A\) for each non-logical axiom A of \(\textsf{T}\), since in this case there is no guarantee that \(\textsf{G}\) would still satisfy cut elimination.

In [6] Negri introduced cut-free sequent calculi for coherent theories, i.e., first-order theories in which the non-logical axioms are (closed) formulas of the form \(\forall \bar{x} (P_1 \wedge \dots \wedge P_n \supset \exists \bar{y} (C_1 \vee \dots \vee C_m))\), where each \(P_i\) is an atom and each \(C_j\) is a conjunction of atoms.Footnote 2 The methodology of [6] were later generalized in [3] by Negri and Dyckhoff, who showed that any first-order theory admits a cut-free systematization in sequent calculus. To attain such a greater level of generality, the methodology of [3] dictates that for theories not falling under the coherent fragment, the language is to be extended with new symbols and consequently more axioms need to be considered. For example, the axiom of strict seriality, namely \(\forall x \exists y (x \le y \wedge \lnot y \le x)\), is not coherent, nor there exists a coherent formula in the same language which is equivalent to it. The basic idea of [3] is that such an axiom is nevertheless equivalent to coherent one of another, richer, language. Specifically, if the original language is extended with a new binary predicate constant \(\nleq \) and a new atom \(x \nleq y\) is defined as \(\lnot x \le y\), then strict seriality can be written in such extended language as \(\forall x \exists y (x \le y \wedge y \nleq x)\), which is entirely coherent. However, we need consider also an extra axiom corresponding (at least partly) to the definition of \(x \nleq y\). In general, when the axioms of a theory are of certain syntactic complexity, extending the language may become exceedingly involved and in many ways undesirable. For example, it almost entirely prevents one to prove interesting results such as interpolation theorem which are notoriously sensitive to the choice of the non-logical constants in the language.Footnote 3

In this paper we suggest a much simpler way to extend the methodology of [6] beyond the coherent fragment without resorting to the linguistic extensions of [3]. In particular, we show that if the language is in negation normal form, then the class of theories admitting a cut-free systematization in sequent calculus is significantly larger than the class of coherent ones, although it is still smaller than the class of all first-order theories. Interestingly such a class includes all co-coherent theories investigated in [9], for which so far the methodology of [6] was not applicable, as well as well-known theories such as classical mereology [2] and Tarski’s formalization of geometry [12].

2 The calculus \(\textsf{G}_n\)

The language \(\mathcal {L}_n\) is a first-order language (without identity) in negation normal form, i.e., \(\mathcal {L}_n\) contains two sorts of atoms, the positive atoms and the negative ones. A positive (negative) atom is indicated as P (\(\bar{P}\), respectively). Compound formulas are built up from atoms, \(\bot \) and \(\top \) in the usual way using conjunction, disjunction and the quantifiers. The negation \(\lnot A\) of a formula A is defined inductively as follows: \(\lnot P:= \bar{P}\), \(\lnot \bar{P}:= P\), \(\lnot (B \wedge C):= \lnot B \vee \lnot C\), \(\lnot (B \vee C):= \lnot B \wedge \lnot C\), \(\lnot \forall x B:= \exists x \lnot B\) and \(\lnot \exists x B:= \forall x \lnot B\). The depth d(A) of a formula A is defined inductively as follows: \(d(\bot ) = d (\top ) = d(P) = d (\bar{P}) = 0\), \(d(B \wedge C) = d(B \vee C) = d(B) + d(C) + 1\), and \(d (\forall x B) = d (\exists x B) = d(B) + 1\). It follows that \(d (\lnot A) = d(A)\). We agree that \(A \supset B\) and \(A \equiv B \) are abbreviations for \(\lnot A \vee B\) and \((A \supset B) \wedge (B \supset A)\), respectively. A sequent is a pair \(\langle \Gamma , \Delta \rangle \) of (finite, possibly empty) multisets \(\Gamma \), \(\Delta \) of formulas and will be indicated as \(\Gamma \Rightarrow \Delta \). The sequent calculus \(\textsf{G}_n\) consists of the following initial sequents and inference rules, where z is the proper variable of \(R\forall \) and \(L\exists \), namely it does not occur free in their conclusion.

figure a

A logical axiom is an initial sequent or the conclusion of a rule with no premise.Footnote 4 The notions of derivation of a sequent and derivable sequent are defined as usual. We assume that if \(\Gamma \Rightarrow \Delta \) is derivable and \(\Gamma ' \Rightarrow \Delta '\) differs from \(\Gamma \Rightarrow \Delta \) only in the names of bound variables, then \(\Gamma ' \Rightarrow \Delta '\) is also derivable. The height \(h(\mathcal {D})\) of a derivation \(\mathcal {D}\) of \(\Gamma \Rightarrow \Delta \) is defined inductively as follows: if \(\Gamma \Rightarrow \Delta \) is a logical axiom, then \(h(\mathcal {D}) = 0\); if \(\Gamma \Rightarrow \Delta \) is the conclusion of a rule with two premises \(\Gamma ' \Rightarrow \Delta '\) and \(\Gamma '' \Rightarrow \Delta ''\) whose derivations are \(\mathcal {D}'\) and \(\mathcal {D}''\), respectively, then \(h(\mathcal {D}) = \max (h (\mathcal {D}'), h(\mathcal {D}'')) + 1\); if \(\Gamma \Rightarrow \Delta \) is the conclusion of a rule with one premise \(\Gamma ' \Rightarrow \Delta '\) whose derivation is \(\mathcal {D}'\), then \(h(\mathcal {D}) = h (\mathcal {D}') + 1\). If \(\mathcal {D}\) is a derivation of \(\Gamma \Rightarrow \Delta \) and \(n \in \mathbb {N}\), then we say that \(\Gamma \Rightarrow \Delta \) is n-derivable, if \(h (\mathcal {D}) \le n\). A rule is admissible if its conclusion is derivable, whenever its premises are derivable and it is height-preserving admissible if its conclusion is n-derivable, whenever its premises are n-derivable.

We shall also consider the standard structural rules of weakening, contraction and cut.

figure b

We do not assume any of these rules as primitive since they are all admissible. In fact, we shall see that weakening and contraction are also height-preserving admissible.

Finally, the rules for negations, i.e.,

will also be proved admissible. However, since we are primarily interested in the extensions of \(\textsf{G}_n\) we shall postpone the proof of the admissibility of the structural rules and of the negation rules once such extensions have been introduced.

3 The extensions of \(\textsf{G}_n\)

We now consider how to extend the calculus \(\textsf{G}_n\) with inference rules. We first introduce the notion of coherent formula and coherent theory in \(\mathcal {L}_n\). To ease the notation let a literal \(\ell \) be either a positive atom P or a negative one \(\bar{P}\). We shall use \(\ell , \jmath , \imath , \dots \) to indicate literals. A (closed) formula of \(\mathcal {L}_n\) is coherent when it is equivalent to:

$$\begin{aligned} \forall \bar{x} \Big (\ell _1 \wedge \dots \wedge \ell _n \supset \exists \bar{y} \big ((\jmath _{1_1} \wedge \dots \wedge \jmath _{1_p}) \vee \dots \vee (\jmath _{m_1} \wedge \dots \wedge \jmath _{m_q}) \big )\Big ) \end{aligned}$$

Without loss of generality we assume that none of the variables in \(\bar{y}\) occurs free in \(\ell _1 \wedge \dots \wedge \ell _n\). We shall also agree that if \(n = 0\) (\(m=0\)), then the consequent (antecedent) of \(\supset \) is \(\top \) (\(\bot \), respectively). A theory is coherent when all its non-logical axioms are coherent formulas. Given a coherent formula, the corresponding coherent rule \(C_n\) is:

where in each premise of \(C_n\) each variable in \(\bar{y}\) of the coherent formula has been replaced by a variable in \(\bar{z}\) not occurring free in the conclusion of \(C_n\). Let \(\textsf{G}_n^*\) be any extension of \(\textsf{G}_n\) with finitely many coherent rules. It is easy to see that a coherent formula A and the corresponding coherent rule \(C_n\) are equivalent in the that sense that the sequent \(\Rightarrow A\) is derivable in \(\textsf{G}_n^*\) and \(C_n\) is admissible in the calculus \(\textsf{G}_n\) plus the cut rule and the initial sequent \(\Rightarrow A\).

To prove the admissibility of the contraction rules, we shall assume that coherent rules satisfy the following closure condition for contraction. Let \(C_n\) be a coherent rule of \(\textsf{G}_n^*\) such that:

Then, the following rule \(cC_n\) is also a rule of \(\textsf{G}_n^*\):

We say that the rule \(cC_n\) is the contracted version of \(C_n\).

Perhaps the most well-known example of a coherent theory is first-order logic with identity. If the predicates \(=\) and \(\ne \) are added to \(\mathcal {L}_n\), then a sequent calculus for first-order logic with identity can be obtained by extending \(\textsf{G}_n\) with the following two coherent rules Ref\(_=\) and Repl\(_=\), corresponding each to the two coherent axioms of reflexivity and replacement of identicals, \(\forall x (\top \supset x = x)\) and \(\forall x \forall y (\ell (x) \wedge x = y \supset \ell (y))\).

figure c

In general, adding a coherent rule to \(\textsf{G}_n\) does not suffices to obtain a cut-free calculus for a given first-order theory. For example, consider a simple first-order theory axiomatized by the coherent formula \(\forall x (x < x \supset \bot )\) and let \(\textsf{G}_n^<\) be the extension of \(\textsf{G}_n\) by the following coherent rule with no premise:

As a counter-example to cut elimination in \(\textsf{G}_n^<\) consider the sequent \(\Rightarrow x \nless x\). Although such a sequent is derivable by cut, i.e.,

it is clear that a cut-free derivation of it can hardly be found, unless we avail ourselves with the rules of negation. Certainly, using the rules of negation, especially \(R_\lnot \), the sequent can easily be proved to be cut-free derivable.

However, since no rule of negation is primitive in \(\textsf{G}_n\), we should show that at least \(R_\lnot \) is admissible. Alas, it appears that in the presence of Irref\(_<\) the admissibility of \(R_\lnot \) essentially requires the admissibility of cut. To see this, let the premise of \(R_\lnot \) be an instance of Irref\(_<\) with A principal formula. Thus, A is \(x < x\) and the derivation of the conclusion of \(R_\lnot \) is:

However, it should be clear that the only way to derive the conclusion of \(R_\lnot \) is to use cut.

Thus, it seems that there is a circularity in the admissibility of cut and the rules of negation: the former presupposes the latter and vice versa.

To overcome this obstacle we shall impose on \(\textsf{G}_n^*\) an additional closure condition. Let \(C_n\) be a coherent rule of \(\textsf{G}_n^*\) except Repl\(_=\) and \(\Lambda \) the multi-set of its principal atoms. Moreover, let \(\textsf{Rel} (\Lambda ) = \bigcup _{\ell \in \Lambda } \textsf{Rel} (\ell )\), where \(\textsf{Rel} (\ell )\) is the set of predicates of \(\ell \). In \(\textsf{G}_n^<\), for example, we have \(\textsf{Rel} (\Lambda ) = \{<\}\), since Irref\(_<\) is the only coherent rule. Now, we shall assume that \(\textsf{G}_n^*\) contains all instances of the following rule:

where \(\textsf{Rel} (P) \subseteq \textsf{Rel} (\Lambda )\) or \(\textsf{Rel} (\bar{P}) \subseteq \textsf{Rel} (\Lambda )\). Moreover, if \(C_n\) is Repl\(_=\) and \(\Gamma \Rightarrow \Delta \) a sequent to be derived in \(\textsf{G}_n^*\) such that \(\Delta \) contains either P or \(\bar{P}\) and as well as an identity atom Q or \(\bar{Q}\) in \(\Lambda \), then we shall also add all instances of \(LEM^*\) such that \(\textsf{FV} (P,\bar{P}) \cap \textsf{FV} (Q,\bar{Q}) \ne \emptyset \), where \(\textsf{FV} (A)\) is the set of the free variables of a formula A.

Some remarks on \(LEM^*\) are in order. Firstly, \(LEM^*\) is a multi-succedent version of the rule Gem-at of [8], introduced to provide a single-succedent calculus for classical propositional logic; the main difference is that in Gem-at the principal formulas are an atom and its negation, whereas here we have two atoms.

Secondly, from a semantic point of view, \(LEM^*\) is just another way to express in sequent calculus the law of excluded middle; in this respect, it does not differ substantially from LEM. The motivation for having another rule for the law of excluded middle is that with \(LEM^*\) we can address satisfactorily the issue of circularity in the admissibility of negation rule and cut. In the case of \(\textsf{G}_n^<\) the sequent \(\Rightarrow x \nless x\) becomes cut-free derivable:

(The case of the admissibility of \(R_\lnot \) in \(\textsf{G}_n^<\) can be dealt with similarly). Much more generally, we shall see that in any extension \(\textsf{G}_n^*\) of \(\textsf{G}_n\), although \(R_\lnot \) is still needed in the proof of admissibility of cut (Theorem 2), cut is not needed at all in the proof of the admissibility of \(R_\lnot \) (Lemma 2).

Thirdly, \(LEM^*\) is manifestly a non-analytic rule; inasmuch as it makes P and \(\bar{P}\) disappear, \(LEM^*\) is similar to a cut rule. While the presence of a rule similar to cut in the extensions of first-order logic is arguably acceptable from a conceptual point of view, such a rule in a sequent calculus for first-order logic would certainly be a disturbing upshot for anyone subscribing to the idea that logic is analytic. Here we can retain the idea that at least \(\textsf{G}_n\) is analytic by imposing that \(LEM^*\) is only present in \(\textsf{G}_n^*\). Moreover, the side conditions circumscribe significantly the non-analytic character of \(\textsf{G}_n^*\) brought about by \(LEM^*\). Indeed, we need to consider the instances of \(LEM^*\) where all predicates occurring in the disappearing formulas occur also in the coherent rules, provided that these rule are not the identity rules. And since we only consider finitely many rules at a time, it is clear that the number of instances to be added is finite. Moreover, if we also consider the identity rules, especially Repl\(_=\) the disappearing formulas of \(LEM^*\) can be taken to be atoms whose variables are the variables occurring in the identity atoms principal in Repl\(_=\). Also in this case we shall add only finitely many instances of \(LEM^*\).

4 Admissibility results

In this section we shall prove the main admissibility results concerning the structural rules of \(\textsf{G}_n^*\) as well as the admissibility of the rules of negation. We proceed by showing first the height-preserving admissibility of weakening and contraction rules. These results will be then used in the proof of the admissibility of the negation rules which, in turn, will be applied in the proof of the admissibility of cut rule.

We need some preliminary results regarding the substitution of variables. Let the variable y be free for x in \(\Gamma , \Delta \). The substitution rule Sb is:

Lemma 1

The substitution rule is height-preserving admissible in \(\textsf{G}_n^*\).

Proof

Let \(\mathcal {D}\) be the derivation of the premise \(\Gamma \Rightarrow \Delta \). We proceed by induction on \(h (\mathcal {D})\). If \(h (\mathcal {D}) = 0\), then \(\Gamma \Rightarrow \Delta \) is a logical axiom or the conclusion of a coherent rule with no premise. Then, also the conclusion of Sb is a logical axiom or the conclusion of a coherent rule with no premise. If \(h (\mathcal {D}) > 0\), then we reason by cases on the last rule R applied in the derivation of \(\Gamma \Rightarrow \Delta \). If R is a propositional rule or a quantifier rule, then the proof is as in Lemma 4.1.2 of [8]. If R is \(LEM^*\), then the premises of \(LEM^*\) are \(P, \Gamma \Rightarrow \Delta \) and \(\bar{P}, \Gamma \Rightarrow \Delta \) and the conclusion of Sb can be derived as follows:

Notice that the two applications of Sb are height-preserving admissible by the inductive hypothesis and that the application of Sb is legitimate since if y is free for x in \(\Gamma ,\Delta \), then so is in \(P, \Gamma ,\Delta \) and \(\bar{P},\Gamma \Rightarrow \Delta \) (because P and \(\bar{P}\) are atoms). Finally, if R is a coherent rule \(C_n\) with at least one premise, then the proof does not differ substantially from that given in Lemma 1 of [6]. In this case \(\Gamma \) contains the literals \(\ell _1, \dots , \ell _n\). Since y may be one the proper variables of \(C_n\), we first need replace each premise of \(C_n\) all such variables with new ones; this can be done via several applications of Sb, all of which are height-preserving admissible by the inductive hypothesis. Next we apply again Sb to replace x with y and then \(C_n\) again. \(\square \)

Now we can prove that the structural rules, including cut, are admissible in \(\textsf{G}_n^*\). In fact, we shall prove something stronger, namely the height-preserving admissibility of weakening and contraction as well as the height-preserving invertibility of all rules.

Theorem 1

The rules of weakening and contraction are height-preserving admissible in \(\textsf{G}_n^*\). Moreover, all rules are height-preserving invertibile.

Proof

We start with the height-preserving admissibility of weakening. We only consider LW and leave RW to the reader. We proceed by induction on \(h(\mathcal {D})\), where \(\mathcal {D}\) is the derivation of the premise \(\Gamma \Rightarrow \Delta \) of LW. If \(h(\mathcal {D}) = 0\), then \(\Gamma \Rightarrow \Delta \) is a logical axiom or the conclusion of a coherent rule with no premise. Then, also the conclusion of LW is a logical axiom or the conclusion of a coherent rule with no premise. If \(h(\mathcal {D}) > 0\), then \(\Gamma \Rightarrow \Delta \) is the conclusion of some rule R and we proceed by cases on R. If R is a propositional or quantifier rule, then the proof is as in Theorem 2 of [6]. If R is \(LEM^*\), then from the premises \(P, \Gamma \Rightarrow \Delta \) and \(\bar{P}, \Gamma \Rightarrow \Delta \) of \(LEM^*\) the conclusion of LW is obtained as follows:

Notice that the two applications of LW in the derivation above are height-preserving admissible by the inductive hypothesis. If R is a coherent rule \(C_n\) with at least one premise, then \(\Gamma \) contains \(\ell _1, \dots , \ell _n\). Since A may contain some of the proper variables of \(C_n\), we first apply Sb on the premises of \(C_n\) to replace all such variables with new ones. Notice that Sb is height-preserving admissible by Lemma 1 and the substitution does not affect \(\ell _1, \dots , \ell _n, \Gamma ', \Delta \) in virtue of the variable condition. Then, the conclusion of LW is obtained from here by applying LW (which is height-preserving admissible by the inductive hypothesis) and \(C_n\) again.

The proof of height-preserving invertibility of logical rules is similar to the proof of Theorem 3.1.1 and Lemma 4.2.8 in [8]. Moreover, coherent rules as well as \(LEM^*\) are height-preserving invertible via the height-preserving admissibility of LW.

To prove height-preserving admissibility of contraction, we proceed by induction on the premise \(A, A, \Gamma \Rightarrow \Delta \) of LC (the case of RC is left to the reader). Let \(\mathcal {D}\) be the derivation of \(A, A, \Gamma \Rightarrow \Delta \). If \(h(\mathcal {D}) = 0\), then \(A, A, \Gamma \Rightarrow \Delta \) is a logical axiom or the conclusion of a coherent rule with no premise. If it is a logical axiom, then we have the following cases.

(i):

\(P \in \Gamma \cap \Delta \). The conclusion of LC is also a logical axiom.

(ii):

\(\bar{P} \in \Gamma \cap \Delta \). The conclusion of LC is also a logical axiom.

(iii):

\(P, \bar{P} \in \Gamma \). The conclusion of LC is also a logical axiom.

(iv):

\(P, \bar{P} \in \Delta \). The conclusion of LC is also a logical axiom.

(v):

A is P and \(\bar{P} \in \Gamma \). The conclusion of LC is also a logical axiom.

(vi):

A is \(\bar{P}\) and \(P \in \Gamma \). The conclusion of LC is also a logical axiom.

(vii):

\(\bot \in \Gamma \). The conclusion of LC is also a logical axiom.

(viii):

\(\top \in \Delta \). The conclusion of LC is also a logical axiom.

If the premise of LC is the conclusion of a coherent rule \(C_n\) with no premise, then we distinguish the following three cases:

(ix):

none of the occurrences of A is principal in \(C_n\). In this case, \(\ell _1, \dots , \ell _n \in \Gamma \). Then, the conclusion of LC is a conclusion of \(C_n\).

(x):

exactly one occurrence of A is principal in \(C_n\). In this case, A is \(\ell \) and \(\ell _1, \dots , \ell _{n-1} \in \Gamma \), namely the premise of LC is \(\ell , \ell , \ell _1, \dots , \ell _{n-1}, \Gamma ' \Rightarrow \Delta \). Then, the conclusion \(\ell ,, \ell _1, \dots , \ell _{n-1}, \Gamma ' \Rightarrow \Delta \) of LC is a conclusion of \(C_n\).

(xi):

both occurrences of A are principal of \(C_n\). Thus, A is \(\ell \) and \(\ell _1, \dots , \ell _{n-2} \in \Gamma \), i.e., the premise of LC is \(\ell , \ell , \ell _1, \dots , \ell _{n-2}, \Gamma ' \Rightarrow \Delta \). Notice that the conclusion \(\ell , \ell _1, \dots , \ell _{n-2}, \Gamma ' \Rightarrow \Delta \) of LC is not a conclusion of \(C_n\) (since \(C_n\) needs n literals to be applied). However the sequent \(\ell , \ell _1, \dots , \ell _{n-2}, \Gamma ' \Rightarrow \Delta \) is a conclusion of the contracted version \(cC_n\), which is a rule of \(\textsf{G}_n^*\) in virtue of the closure condition.

If \(h(\mathcal {D}) > 0\), then we need to reason by cases on the last rule R applied in the derivation of the premise of LC. If R is a propositional or quantifier rule, then see Theorem 4 of [6]. If R is \(LEM^*\), then none of the occurrences of the contracted formula A can be principal. Then, from the premises \(P, A, A, \Gamma \Rightarrow \Delta \) and \(\bar{P}, A, A, \Gamma \Rightarrow \Delta \) we derive the conclusion of LC as follows:

Clearly, the two applications of LC are height-preserving admissible by the inductive hypothesis. If R is coherent rule \(C_n\) with at least one premise, then have three cases, all similar to (ix)-(xi). Firstly, none of the occurrences of A is principal in \(C_n\). Then, we take the premises of \(C_n\) and via several applications of LC (all height-preserving admissible by the inductive hypothesis) and \(C_n\) we obtain the conclusion of LC as in (ix). Secondly, exactly one occurrence of A is principal of \(C_n\). The reasoning is as in (x). Thirdly, both occurrences of A are principal. In this case we the conclusion of LC is obtained via several height-preserving admissible applications of LC and the contracted version of \(cC_n\), like in (xi). \(\square \)

These preliminary results will be now used to prove the admissibility of the negation rules as well as the admissibility of the cut rule. We start with the admissibility of the rules of negation.

Lemma 2

The rules \(L_\lnot \) and \(R_\lnot \) are admissible in \(\textsf{G}_n^*\). Moreover, in each rule if the premise is n-derivable, then the conclusion is \(n+1\)-derivable.

Proof

We consider \(L_\lnot \) first. Let \(\mathcal {D}\) be the derivation of the premise \(\Gamma \Rightarrow \Delta , A\). We proceed by induction on \(h (\mathcal {D})\). If \(h (\mathcal {D}) = 0\), then \(\Gamma \Rightarrow \Delta , A\) is either a logical axiom or the conclusion of a coherent rule with no premise. If it is a logical axiom, then we consider the following cases.

(i):

\(P \in \Gamma \cap \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

(ii):

\(\bar{P} \in \Gamma \cap \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

(iii):

\(P, \bar{P} \in \Gamma \). The conclusion of \(L_\lnot \) is a logical axiom.

(iv):

\(P, \bar{P} \in \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

(v):

A is P and \(P \in \Gamma \). The conclusion of \(L_\lnot \) is \(\lnot P, P, \Gamma ' \Rightarrow \Delta \) and this is a logical axiom since \(\lnot P:= \bar{P}\).

(vi):

A is \(\bar{P}\) and \(\bar{P} \in \Gamma \). The conclusion of \(L_\lnot \) is \(\lnot \bar{P}, \bar{P}, \Gamma ' \Rightarrow \Delta \) and this is a logical axiom since \(\lnot \bar{P}:= P\).

(vii):

A is P and \(\bar{P} \in \Delta \). The conclusion of \(L_\lnot \) is \(\lnot P, \Gamma \Rightarrow \Delta ', \bar{P}\) and this is a logical axiom since \(\lnot P:= \bar{P}\).

(viii):

A is \(\bar{P}\) and \(P \in \Delta \). The conclusion of \(L_\lnot \) is \(\lnot \bar{P}, \Gamma \Rightarrow \Delta ', P\) and this is a logical axiom since \(\lnot \bar{P}:= P\).

(ix):

\(\bot \in \Gamma \). The conclusion of \(L_\lnot \) is a logical axiom.

(x):

\(\top \in \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

If \(\Gamma \Rightarrow \Delta , A\) is the conclusion of a coherent rule C with no premise, then \(\Gamma \) contains \(\ell _1, \dots , \ell _n\). Also in this case, the conclusion of \(L_\lnot \) is also a conclusion of C. If \(h(\mathcal {D}) > 0\), then we distinguish according to whether A is principal or not principal of R. In the latter case, then the conclusion of \(L_\lnot \) is obtained from the premise of R by an application of \(L_\lnot \) and then by one of R. Clearly, such an application of \(L_\lnot \) is admissible by the inductive hypothesis. (Here we tacitly assumed that R is a rule with one premise, but if it has has two or more premises the reasoning is the same). If A is principal of R, then R is either \(R_\wedge \) or \(R_\vee \) or \(R_\forall \) or else \(R_\exists \). We consider only the first case. If R is \(R_\wedge \), then A is \(B \wedge C\) and \(\mathcal {D}\) is:

Considering that \(\lnot (B \wedge C):= \lnot B \vee \lnot C\), the conclusion of \(L_\lnot \) is thus obtained as follows:

Once again the two applications of \(L_\lnot \) are admissible by the inductive hypothesis.

We now prove the admissibility of \(R_\lnot \) which is slightly more challenging. We proceed by induction on \(h(\mathcal {D})\), where \(\mathcal {D}\) is the derivation of the premise \(A, \Gamma \Rightarrow \Delta \). If \(h(\mathcal {D}) = 0\), then \(A, \Gamma \Rightarrow \Delta \) is a logical axiom or the conclusion of coherent rule with no premise. In the first case, we reason by cases.

(i):

\(P \in \Gamma \cap \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

(ii):

\(\bar{P} \in \Gamma \cap \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

(iii):

\(P, \bar{P} \in \Gamma \). The conclusion of \(L_\lnot \) is a logical axiom.

(iv):

\(P, \bar{P} \in \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

(v):

A is P and \(P \in \Delta \). The conclusion of \(L_\lnot \) is \(\Gamma \Rightarrow \Delta ', P, \lnot P\) and this is a logical axiom since \(\lnot P:= \bar{P}\).

(vi):

A is \(\bar{P}\) and \(\bar{P} \in \Delta \). The conclusion of \(L_\lnot \) is \(\Gamma \Rightarrow \Delta ', \bar{P}, \lnot \bar{P}\) and this is a logical axiom since \(\lnot \bar{P}:= P\).

(vii):

A is P and \(\bar{P} \in \Gamma \). The conclusion of \(L_\lnot \) is \(\bar{P}, \Gamma ' \Rightarrow \Delta \lnot P\) and this is a logical axiom since \(\lnot P:= \bar{P}\).

(viii):

A is \(\bar{P}\) and \(P \in \Gamma \). The conclusion of \(L_\lnot \) is \(P, \Gamma ' \Rightarrow \Delta \lnot \bar{P}\) and this is a logical axiom since \(\lnot \bar{P}:= P\).

(ix):

\(\bot \in \Gamma \). The conclusion of \(L_\lnot \) is a logical axiom.

(x):

\(\top \in \Delta \). The conclusion of \(L_\lnot \) is a logical axiom.

If \(A, \Gamma \Rightarrow \Delta \) is the conclusion of a coherent rule \(C_n\) with no premise, then we need to distinguish two cases. Either A is principal of \(C_n\) or it is not. If it is not, then the conclusion of \(R_\lnot \) is a conclusion of \(C_n\). Else, A is \(\ell _n\) and \(\ell _1, \dots , \ell _{n-1} \in \Gamma \). In this case we use \(LEM^*\). In particular, we need to find a derivation \(\mathcal {D}'\) of the the sequent \(\ell _1, \dots , \ell _{n-1}, \Gamma ' \Rightarrow \Delta , \lnot \ell _n\) starting from the conclusion of \(C_n\), namely \(\ell _1, \dots , \ell _{n-1}, \ell _n, \Gamma ' \Rightarrow \Delta \). Let \(\mathcal {D}'\) be the following derivation.

Since \(\lnot \ell _n:= \bar{\ell }_n\) we have derived \(\ell _1, \dots , \ell _{n-1}, \Gamma ' \Rightarrow \Delta , \lnot \ell _n\). If \(h(\mathcal {D}) > 0\), then we distinguish according to whether A is principal or not principal of the last rule R applied in the derivation of the premise. In the latter case, we reason as in the corresponding case in the proof of the admissibility of \(L_\lnot \). If A is principal of R, then we need to consider, besides the cases already considered in the proof of the admissibility of \(L_\lnot \), also the possibility that R is a coherent rule \(C_n\). Let \(\mathcal {D}\) be the derivation of the premise of \(R_\lnot \) with A principal of \(C_n\). To prevent the derivation trees to spread we assume without loss of generality that \(C_n\) has just one premise. Thus, premise of \(R_\lnot \) has the following derivation:

We can find a derivation \(\mathcal {D}'\) of the conclusion of \(R_\lnot \), namely the sequent \(\ell _1, \dots , \ell _{n-1}, \Gamma ' \Rightarrow \Delta , \lnot \ell _n\), by considering the following derivation:

Notice that the applicability of \(C_n\) is legitimate since no proper variable of \(C_n\) may occur in \(\bar{\ell }_n\) introduced in the application of RW. From here we immediately see that \(\ell _1, \dots , \ell _{n-1}, \Gamma ' \Rightarrow \Delta , \lnot \ell _n\) is derivable since \(\lnot \ell _n:= \bar{\ell }_n\). \(\square \)

We can now prove the central admissibility result for \(\textsf{G}_n^*\), namely the admissibility of the cut rule.

Theorem 2

The rule of cut is admissible in \(\textsf{G}_n^*\).

Proof

We proceed by induction on the depth d(A) of the cut formula A with a sub-induction on the height of a cut, defined as the sum of the heights of the two premises of cut. The proof follows the patter of the proof of Theorem 4.2.10 of [8]. We have the following two main cases. (A) One of the two premises of cut is a logical axiom or the conclusion of a coherent rule with no premise; (B) none of the two premise of cut is a logical axiom or the conclusion of a coherent rule with no premise. In case (A), we need to consider the following sub-cases: (i) the left premise is a logical axiom or the conclusion of a coherent rules with no premise; (ii) the right premise is a logical axiom or the conclusion of a coherent rules with no premise. In case (B), there are three sub-cases: (iii) A is not principal in the left premise; (iv) A is principal only in the left premise; (v) A is principal in both premises. We start from (i) and consider first the case in which the left premise is a logical axiom.

(i.1):

\(P \in \Gamma \cap \Delta \). The conclusion of cut is a logical axiom.

(i.2):

\(\bar{P} \in \Gamma \cap \Delta \). The conclusion of cut is a logical axiom.

(i.3):

A is \(P \in \Gamma \). The original cut is:

The conclusion of cut is obtained by several application of LW and RW, jointly indicated as W, from the right premise as follows:

(i.4):

A is \(\bar{P} \in \Gamma \). Similar to case (i.3).

(i.5):

\(P,\bar{P} \in \Gamma \). The conclusion of cut is a logical axiom.

(i.6):

\(P,\bar{P} \in \Delta \). The conclusion of cut is a logical axiom.

(i.7):

A is P and \(\bar{P} \in \Delta \). The original cut is:

The conclusion of cut is obtained by W and \(R_\lnot \) from the right premise as follows:

(i.8):

A is \(\bar{P}\) and \(P \in \Delta \). Similar to (i.7).

(i.9):

\(\bot \in \Gamma \). The conclusion of cut is a logical axiom.

(i.10):

\(\top \in \Delta \). The conclusion of cut is a logical axiom.

If the left premise of cut is the conclusion of a coherent \(C_n\) rule with no premise, then the cut formula A is not principal in \(C_n\). Thus, we have only one case, namely:

(i.11):

\(\ell _1, \dots , \ell _n \in \Gamma \). The conclusion of cut, i.e., \(\ell _1, \dots , \ell _n, \Gamma ', \Pi \Rightarrow \Delta , \Sigma \), is a conclusion of \(C_n\).

We now consider the case (ii). If the right premise of cut is a logical axiom, then there are several cases:

(ii.1):

\(P \in \Pi \cap \Sigma \). The conclusion of cut is a logical axiom.

(ii.2):

\(\bar{P} \in \Pi \cap \Sigma \). The conclusion of cut is a logical axiom.

(ii.3):

A is \(P \in \Sigma \). Similar to (i.3).

(ii.4):

A is \(\bar{P} \in \Sigma \). Similar to case (i.3).

(ii.5):

\(P,\bar{P} \in \Pi \). The conclusion of cut is a logical axiom.

(ii.6):

A is P and \(\bar{P} \in \Pi \). Similar to (i.7).

(ii.7):

A is \(\bar{P}\) and \(P \in \Pi \).. Similar to (i.7).

(ii.8):

\(P,\bar{P} \in \Sigma \). The conclusion of cut is a logical axiom.

(ii.9):

\(\bot \in \Pi \). The conclusion of cut is a logical axiom.

(ii.10):

\(\top \in \Sigma \). The conclusion of cut is a logical axiom.

If the right premise is the conclusion of a coherent rule \(C_n\) with no premise, then we need to distinguish according to whether the cut formula is principal or not principal in \(C_n\):

(ii.11):

A is not principal in \(C_n\). Then, \(\ell _1, \dots , \ell _n \in \Pi \). The conclusion of cut \(\Gamma , \ell _1, \dots , \ell _n, \Pi ' \Rightarrow \Delta , \Sigma \) is also a conclusion of \(C_n\).

(ii.12):

A is principal in \(C_n\). In this case, A is \(\ell _n\) and \(\ell _1, \dots , \ell _{n-1} \in \Pi \). Thus, the original cut is:

Here we need to reason on the left premise. Let \(\ell _n\) be P (the case of \(\bar{P}\) is similar). If the left premise is a logical axiom and P is principal, then either \(P \in \Gamma \) or \(\bar{P} \in \Delta \). In the first case, the conclusion of cut is obtained by W as in (i.3). In the second case, the conclusion of cut is obtained by W and \(R_\lnot \) as in (i.7). If the left premise is a logical axiom and P is not principal, then the conclusion of cut is a logical axiom, too. If the left premise is the conclusion of a coherent rule \(C'_n\) with no premise, then \(\Gamma \) contains \(\ell _1', \dots , \ell '_m\) and also the conclusion of cut is a conclusion of \(C_n'\). If the left premise is the conclusion of a rule R, then the cut formula \(\ell _n\), being atomic, cannot be principal in R. If R is a propositional rule or \(LEM^*\) or else a quantifier rule without variable condition, then we take the premise of R, apply an instance of cut on it and the right premise of cut and then R again. If R is a quantifier rule with variable condition or a coherent rule, we take again the premise(s) of R, apply the rule Sb in order to replace all the proper variable with new ones, apply an instance of cut on the right premise of cut and finally R again. This concludes the proof of case (ii).

Regarding case (iii) we proceed by cases on the last rule R applied in the derivation of the left premise. Here we permute cut and R as in case (ii.11). Regarding case (iv) we proceed by cases on the last rule R applied in the derivation of the right premise. Here, too, cut is permuted with R as in (ii.11). Finally, the case (v) does not differ from the same case in the proof of Theorem 4.2.10 of [8] since it involves only logical rules. \(\square \)

5 Beyond coherent theories

In this section we shall compare using some examples the present method with that of [6]. We first recall some definitions from [6]. Let \(\mathcal {L}\) be the standard first-order language (possibly with identity). A (closed) formula of \(\mathcal {L}\) is coherent when it is equivalent to:

$$\begin{aligned} \forall \bar{x} \Big (P_1 \wedge \dots \wedge P_n \supset \exists \bar{y} \big ((Q_{1_1} \wedge \dots \wedge Q_{1_p}) \vee \dots \vee (Q_{m_1} \wedge \dots \wedge Q_{m_q})\big )\Big ) \end{aligned}$$

We assume that no variable in \(\bar{y}\) occurs free in \(P_1 \wedge \dots \wedge P_n\). Given a coherent formula of \(\mathcal {L}\), a coherent rule C is:

where in each premise of C each variable in \(\bar{y}\) has been replaced by a variable in \(\bar{z}\) not occurring in the conclusion of C. Let \(\textsf{G}^*\) be any extension of the calculus \(\textsf{G3c}\) for first-order logic with finitely many coherent rules. As shown in [6], the admissibility results for structural rules hold for \(\textsf{G}^*\).

It is clear that a coherent formula of \(\mathcal {L}\) is a special case of a coherent formula of \(\mathcal {L}_n\), the one in which all literals are positive atoms. On the other hand, there are coherent formulas of \(\mathcal {L}_n\) that are not coherent in \(\mathcal {L}\). Consider once again the axiom of strict seriality \(\forall x \exists y (x \le y \wedge \lnot y \le x)\). In \(\mathcal {L}\) strict seriality is not coherent since \(\lnot y \le x\) is not atomic. However, in \(\mathcal {L}_n\) we have since the beginning that \(\lnot y \le x:= y \nleq x\) and consequently strict seriality is equivalent to the coherent formula \(\forall x \exists y (\top \supset x \le y \wedge y \nleq x)\). Thus, it corresponds to the following coherent rule StrSer, with z proper variable:

One may object that the example of strict seriality is too artificial. After all, such a principle can easily be treated by the standard coherent rules once it is formalized in \(\mathcal {L}\) as the coherent formula \(\forall x \exists y (x < y)\). While this certainly the case, for certain axioms there does not seem to be a similar solution. Consider, for example, the axiom of extensionality in set theory, namely the formula \(\forall x \forall y (\forall z (z \in x \equiv z \in y) \supset x =y)\). In \(\mathcal {L}_n\) such a formula is coherent, since it is equivalent to \(\forall x \forall y (x \ne y \supset \exists z ((z \in x \wedge z \notin y) \vee (z \notin x \wedge z \in y)))\). Thus, in \(\textsf{G}_n^*\) extensionality corresponds to the following coherent rule Ext, where u is the proper variable:

However, extensionality is not coherent in \(\mathcal {L}\). In \(\mathcal {L}\) the closest we can get to a coherent formulation of extensionality is \(\forall x \forall y (x = y \vee \exists z ((z \in x \wedge \lnot z \in y) \vee (\lnot z \in x \wedge z \in y)))\), which still fails to be coherent precisely because both \(\lnot z \in y\) and \(\lnot z \in x\) are compound formulas.

Other examples of axioms that are coherent in \(\mathcal {L}_n\) but not in \(\mathcal {L}\) include the supplementation principles in mereology.Footnote 5 Consider the so-called strong supplementation principle: \(\lnot x \text {P} y \rightarrow \exists z (z \text {P} x \wedge \lnot z \text {O} y)\), where \(\text {P}\) and \(\text {O}\) are the relations of part and overlap, respectively.Footnote 6 Strong supplementation is clearly not coherent in \(\mathcal {L}\). However, in \(\mathcal {L}_n\) the formulas \(\lnot x \text {P} y\) and \(\lnot z \text {O} y\) are the atoms \(x \bar{\text {P}} y\) and \(z \bar{\text {O}} y\) and strong supplementation can be converted into the following coherent rule of \(\textsf{G}_n\), where u is the proper variable:

A final example is from Tarski’s classical formalization of geometry in [12] based on a ternary predicate B of betweennes among points. As noted in [1], the theory is not coherent since the so-called lower 2-dimensional axiom \(\exists x\exists y \exists z (\lnot B(x,y,z) \wedge \lnot B (y,z,x) \wedge \lnot B (z,x,y))\) asserting the existence of three non-collinear points on the same plane is not coherent. Thus, in order to provide a cut-free sequent calculus equivalent to Tarski’s system, in [1] the language \(\mathcal {L}\) is extended with predicates corresponding to the negations of the primitive predicates, in particular of B. In \(\mathcal {L}_n\) this is not necessary since the predicate \(\bar{B}\) is already present. The lower 2-dimensional axiom can be thus reformulated in \(\mathcal {L}_n\) as the coherent formula \(\top \supset \exists x\exists y \exists z (\bar{B}(x,y,z) \wedge \bar{B} (y,z,x) \wedge \bar{B} (z,x,y))\), which corresponds to the following coherent rule of \(\textsf{G}_n^*\) with x, y and z not occurring free in the conclusion:

These examples suffice to show that the class of axioms to which the present method applies is larger that the class of ordinary coherent axioms. How much larger? We provide a partial answer to this question. For a start we notice that not any (first-order) axiom can be treated in \(\textsf{G}_n^*\). Consider, for example, the axiom of minimal posets from [3], according to which every element is less or equal than some minimal element, namely \(\forall x \exists y (\top \supset x \le y \wedge \forall z (y \le z \supset y = z))\). Even in \(\mathcal {L}_n\) such a formula is not coherent. Following the method of semindefinitional extensions developed in [3], we extend \(\mathcal {L}\) with a new symbol unary predicate M and define M(x) as \(\forall z (x \le z \supset x = z)\). Then, we take the axiom of minimal posets and replace \(\forall z (x \le z \supset x = z)\) with M(x). In this way the new axiom of minimal posets becomes the coherent formula \(\forall x \exists y (\top \supset x \le y \wedge M(y))\). Clearly this is not enough since there are no axiom governing M(x). However, as shown in [3], the only axiom for M(x) that needs to be considered is the coherent formula \(M(x) \wedge x \le z \supset x = z\), whereas the conditional \((\forall z (x \le z \supset x = z)) \supset M(x)\) is not needed since in the new axiom of minimal posets the atom M(y) occurs in the consequent of \(\supset \) (or positively, in the terminology of [3]). A sequent calculus for the theory of minimal posets is thus obtained by adding on top of \(\textsf{G3c}^=\) the following coherent rules, where u is proper variable in \(PM_1\).

The method of semidefinitional extensions is certainly very general: any first-order formulas can be converted into a rule of sequent calculus without jeopardizing cut elimination. In other words, the class of axioms to which the method of semidefinitional extensions applies is the class of all first-order formulas. The drawback of the semidefinitional approach is that the language \(\mathcal {L}\) needs to be extended with new symbols; and for certain theories this procedure, though viable in theory, is in practice exceeding complex. The examples above show that certain axioms can easily be treated with the method of semidefinitional extensions, but they do not need to. Indeed, those axioms already correspond to coherent rules of \(\textsf{G}_n\), with no need to extend the underlying language \(\mathcal {L}_n\). Thus, it appears that the class of axioms that can adequately dealt with in \(\textsf{G}_n\) is intermediate between the class of coherent axioms and the class of all axioms expressible in some extension of a standard first-order language.

6 Co-coherent formulas in negation normal form

In this section we shall see that the axioms that the present method can treat include all the so-called co-coherent formulas. Recall from [9] that a formula of \(\mathcal {L}\) is co-coherent when it is equivalent to:

$$\begin{aligned} \forall \bar{x} \Big (\forall \bar{y} \big ((Q_{1_1} \vee \dots \vee Q_{1_p}) \wedge \dots \wedge (Q_{m_1} \vee \dots \vee Q_{1_q})\big ) \supset P_1 \vee \dots \vee P_n\Big ) \end{aligned}$$

where we assume that none of the variables in \(\bar{y}\) occurs free in \(P_1 \vee \dots \vee P_n\). A theory is co-coherent when all its proper axioms are co-coherent formulas. Given a co-coherent formula, the corresponding rule \(co\text {-}C\) is:

where in each premise each variable in \(\bar{y}\) has been replaced by a variable in \(\bar{z}\) not occurring free in the conclusion. Let \(\mathsf {co\text {-}G^*}\) be any extension of \(\textsf{G3c}^=\) with finitely many co-coherent rules. Also \(\mathsf {co\text {-}G^*}\) satisfies the admissibility results of the previous section. In \(\mathcal {L}\) coherent rules fall short of treating co-coherent formulas, which require co-coherent rules. This is clearly disadvantageous since there might be theories in which some axioms are coherent and others are co-coherent. Alas, for such ‘mixed’ theories the standard method of [6] to preserve the admissibility results is not applicable. In fact, evidence suggests that when we work with such ‘combined calculi’, then cut elimination is jeopardized. Consider, for example, in \(\mathcal {L}\) a sequent calculus \(\textsf{GE}\) resulting from \(\textsf{G3c}^=\) by adding a zero-premise co-coherent rule \(E_1\) corresponding to the fact that a binary relation \(\sim \) is reflexive as well as a one-premise coherent rule \(E_2\) corresponding to the fact that \(\sim \) is Euclidean.

In such a calculus it is easy to find a counter-example to cut elimination. Indeed, the sequent corresponding to the symmetry of \(\sim \), namely \(x \sim y \Rightarrow y \sim x\), is not derivable without cut, though is derivable with it, as the following derivation shows:

While in \(\mathcal {L}\) the combination of coherent and co-coherent rules may impair cut elimination, this issue does not arise in \(\mathcal {L}_n\) for the simple reason that in \(\mathcal {L}_n\) co-coherent formulas are, in fact, coherent. More precisely, given a co-coherent formula of \(\mathcal {L}\):

$$\begin{aligned} \forall \bar{x} \Big (\forall \bar{y} \big ((Q_{1_1} \vee \dots \vee Q_{1_p}) \wedge \dots \wedge (Q_{m_1} \vee \dots \vee Q_{1_q})\big ) \supset P_1 \vee \dots \vee P_n\Big ) \end{aligned}$$

we can use classically valid equivalences to obtain the following

$$\begin{aligned} \forall \bar{x} \Big (\lnot P_1 \wedge \dots \wedge \lnot P_n \supset \exists \bar{y} \big ((\lnot Q_{1_1} \wedge \dots \wedge \lnot Q_{1_p}) \vee \dots \vee (\lnot Q_{m_1} \wedge \dots \wedge \lnot Q_{m_q})\big )\Big ) \end{aligned}$$

However, given the definition of negation in \(\mathcal {L}_n\), the latter is a coherent formula:

$$\begin{aligned} \forall \bar{x} \Big (\bar{P}_1 \wedge \dots \wedge \bar{P}_n \supset \exists \bar{y} \big ((\bar{Q}_{1_1} \wedge \dots \wedge \bar{Q}_{1_p}) \vee \dots \vee (\bar{Q}_{m_1} \wedge \dots \wedge \bar{Q}_{m_q})\big )\Big ) \end{aligned}$$

In other words, co-coherent formulas of \(\mathcal {L}\) are special cases of coherent formulas of \(\mathcal {L}_n\). This entails that a co-coherent formula of \(\mathcal {L}\) can be expressed by a coherent rule of \(\textsf{G}_n^*\).

Thus, in \(\mathcal {L}_n\) there is no need of special, co-coherent rules for co-coherent axioms as coherent rules in negation normal form suffice to treat both coherent and co-coherent formulas. This means that in \(\textsf{G}_n^*\) we can treat coherent and co-coherent theories alike.

7 Conclusions

Since Negri’s work [6] there has been a growing interest in methods for obtaining cut-free sequent calculi for first-order theories falling outside the coherent fragment. Recent works such as [3] introduced the method of semidefinitional extensions and showed any first-order theory admits a cut-free systematization in sequent calculus. While such a method, in virtue of its wide applicability, has superseded all previous ones, including Negri’s own method of systems of rules of [3], alternative approaches are still worth to be investigated, if only because semidefinitional extensions are arguably quite artificial and, though unavoidable in certain cases, they are not needed at all for certain others. To what extent are semidefinitional extensions dispensable is not clear and one of the main contribution of the present work was to take some steps towards providing an answer to this question. In particular, using languages in negation normal form we showed that without resorting to semidefinitional extensions we can still provide cut-free sequent calculi for a class of theories which include, among others, also co-coherent theories investigated in [9]. While the present method is inevitably a special case of the method of semidefinitional extensions, in the sense that the latter is applicable to all first-order theories while the former only to a proper subclass thereof, it has the advantage of being entirely standard in proof theory. Indeed, languages in negation normal form have been extensively investigated in proof theory and were the primary source of inspiration for Gentzen-Schütte sequent calculi for classical and intuitionistic first-order logic.Footnote 7 From this perspective, one can see that among the contributions of this paper there is also the development of extensions of Gentzen-Schütte sequent calculi which have so far been hardly considered at all.