The language of propositional modal logic has been used to shed further light on the logic of provability and Gödel’s incompleteness theorems.
The subject matter of arithmetic is the structure of the natural numbers with the usual order and the operations of addition and multiplication. The language of arithmetic is a first-order language with a constant symbol \(0\) for zero, a function symbol \(S\) for the successor operation, and two binary functional symbols \(+\) and \(\times\) for addition and multiplication, respectively. A numeral consists of either \(0\) or finite applications of \(S\) to \(0\), e.g., the numeral for zero is \(0\), the numeral for two is \(SS0\), and the numeral for five is \(SSSSS0\).
We may take for granted a recursive axiomatization of arithmetic such as Robinson’s Arithmetic. One group of axioms makes sure that zero is no successor and that successor is a one-one operation. \[ \begin{array}{lll} Q1 & \forall x \neg Sx = 0 &\\ Q2 & \forall x \forall y (Sx = Sy \to x =y) & \\ \end{array} \] The second group of axioms governs the behavior of addition and multiplication: \[ \begin{array}{lll} Q3 & \forall x (x+0) = x & \\ Q4 & \forall x \forall y(x+Sy) = S(x+y) &\\ Q5 & \forall x (x \times 0) = 0 &\\ Q6 & \forall x \forall y(x \times Sy) = (x\times y) + x & \\ \end{array} \] There is a final group of axioms on the structure of the less than relation on the natural numbers: \[ \begin{array}{lll} Q7 & \forall x \neg x <0 & \\ Q8 & \forall x \forall y(x <Sy \to (x <y \vee x =y)) & \\ Q9 & \forall x \forall y (x<y \vee x =y \vee y <x) & \\ \end{array} \] The axioms of Robinson’s Arithmetic are true in the natural number structure: \[ (\mathbb{N}, <, +, \times, S, 0) \] A proof of a formula \(\varphi\) of the language of arithmetic in \(Q\) is a finite sequence of formulas each of which is either an axiom of \(Q\) or derivable from earlier formulas in first-order logic. A formula \(\varphi\) is a theorem of \(Q\), written \(\vdash_Q \varphi\), if there is a proof of \(\varphi\) in \(Q\).
We say that Robinson’s Arithmetic is sound with respect to the natural number structure to mean that every theorem of Robinson’s Arithmetic is true in the natural number structure. In symbols: \[ \text{If} \ \vdash_{Q} \varphi, \ \text{then} \ (\mathbb{N}, <, +, \times, S, 0) \Vdash \varphi \]
Kurt Gödel famously established that we do not have the converse — or, more carefully, we do not have the converse if \(Q\) is a consistent theory. More generally, if \(T\) is a recursively axiomatizable theory, then if \(T\) is consistent, then there are sentences of the language of arithmetic which are true in \((\mathbb{N}, <, +, \times, S, 0)\) but are not theorems of \(T\).
He explained how to use natural numbers to encode formulas of the language of arithmetic and arithmetical formulas to encode syntactic facts. More generally, he found a way to encode syntactic facts by means of arithmetical sentences. To each sentence \(\varphi\) of the language, he assigned a natural number \(\ulcorner \varphi\urcorner\) as a code, and he managed to define an arithmetical formula \(Prov_Q(\ulcorner \varphi \urcorner)\) to encode the fact that \(\varphi\) is a theorem of the system \(Q\).
The provability predicate for a system like \(Q\) is governed by three conditions known as the Hilbert-Bernays provability conditions:
If \(\vdash_Q \varphi\), then \(\vdash_Q Prov_Q(\ulcorner \varphi\urcorner)\)
If \(\vdash_Q Prov_Q(\ulcorner \varphi \to \psi \urcorner)\), then if \(\vdash_Q Prov_Q(\ulcorner \varphi\urcorner)\), then \(\vdash_Q Prov_Q(\ulcorner \psi \urcorner)\)
If \(\vdash_Q Prov_Q(\ulcorner \varphi\urcorner)\), then \(\vdash_Q Prov_Q(\ulcorner Prov_Q(\ulcorner \varphi\urcorner) \urcorner)\)
If these conditions look familiar, it is because they are; they are completely parallel the rule of necessitation and axioms \(K\) and \(4\).
What Gödel noticed is that a framework such as Robinson’s Arithmetic provides the resources to build a self-referential arithmetical formula \(G\) such that:
\[ \vdash_Q G \leftrightarrow \neg Prov_Q(\ulcorner G \urcorner). \] He used that formula to prove his celebrated first incompleteness theorem, since he argued that unless \(Q\) is inconsistent, \(\nvdash_Q G\), which is exactly what \(G\) says.7 Why? On the one hand, by condition I, if \(\vdash_{Q} G\), then \(\vdash_{G} Prov_{Q}(\ulcorner G\urcorner)\). But on the other hand, if \(\vdash_{Q} G\), then since \(\vdash_Q G \leftrightarrow \neg Prov_Q(\ulcorner G \urcorner)\), given the provability conditions I and II, \(\vdash_{G} \neg Prov_{Q}(\ulcorner G\urcorner)\), which would make \(Q\) inconsistent.
Before we introduce the modal logic of provability, we should note that there is an important further condition on provability from a recursively axiomatizable system isolated by Martin Löb in response to a question by Leon Henkin. Henkin had been interested in the status of certain self-referential formula \(H\) for which: \[ \vdash_Q H \leftrightarrow Prov_Q(\ulcorner H \urcorner) \]
Löb proved, in particular, that \(Q\) proves the conditional \(Prov_Q(\ulcorner H\urcorner) \to H\) only if \(H\) is in fact a theorem of the system. More generally, if T is, as usual, a recursively axiomatizable arithmetical theory, then:
If \(\vdash_T Prov_T(\ulcorner \varphi \urcorner) \to \varphi\), then \(\vdash_T \varphi\).
This is Löb’s theorem for which there is a further formalization in the language of arithmetic:
\[ \vdash_T (Prov_T(\ulcorner Prov_T(\ulcorner \varphi \urcorner) \to \varphi\urcorner) \to Prov_T(\ulcorner \varphi \urcorner)). \] This is the formalized Löb’s theorem.
We will now consider an interpretation of \(\Box\) as “it is a theorem of first-order theory \(T\)”. To flesh out the interpretation of the language of propositional modal logic, we define a realization function as an assignment \(\alpha\) of formulas of arithmetic to propositional variables of propositional modal logic. That is, for each propositional variable \(p\), \(\alpha(p)\) is a formula of the language of arithmetic. This, in turn, determines a translation from the language of propositional modal logic into the language of arithmetic. The translation \(\varphi^\star\) of a modal formula \(\varphi\) under a realization \(\alpha\) is defined inductively:
\[ \begin{array}{lll} p^\star & = & \alpha(p) \\ (\neg \varphi)^\star & = & \neg \varphi^\star \\ (\varphi \to \psi)^\star & = & \varphi^\star \to \psi^\star\\ (\Box \varphi)^\star & = & Prov_T(\ulcorner \varphi^\star \urcorner) \end{array} \] We are now in a position to verify the axioms of \(K4\) for this interpretation of \(\Box\). The rule of necessitation corresponds to the first Hilbert-Bernays derivability condition whereas \(K\) corresponds to the second derivability condition. Finally, the third derivability condition is the translation of axiom \(4\).
\(K4\) is still too weak to encode the logic of provability, since we want to be able to deliver the formalized Löb’s theorem: \[ \vdash_T (Prov_T(\ulcorner Prov_T(\ulcorner \varphi \urcorner) \to \varphi\urcorner) \to Prov_T(\ulcorner \varphi \urcorner)). \] This brings us to the characteristic axiom of the modal logic of provability: \[\tag{L} \Box (\Box \varphi \to \varphi) \to \Box \varphi. \] The modal logic \(GL\) (for Gödel and Löb) is the normal modal logic, which extends \(K\) with all substitution instances of axiom \(L\).
One important observation at this point is that on pain of inconsistency, GL cannot in general extend KT.
Proposition 13.1 GL does not extend KT
Proof. \(GL\) is closed under necessitation. So, \(\vdash_{GL} \Box \bot \to \bot\) only if \(\vdash_{GL} \Box (\Box \bot \to \bot)\) and \(\vdash_{GL} \Box \bot\). It follows that if \(\vdash_{GL} \Box \bot \to \bot\), then it must be the case that \(\vdash_{GL} \bot\). Since \(\nvdash_{GL} \bot\), we conclude that \(\nvdash_{GL} \Box \bot \to \bot\).
This is a modal counterpart of the fact that a recursively axiomatizable consistent theory \(T\) cannot even prove the arithmetical formula \(Prov_{T}(\ulcorner 0=1\urcorner) \to 0=1\) on account of Löb’s theorem.
Proposition 13.2 GL extends K4
Proof. That is because \(\vdash_{GL} \Box p \to \Box \Box p\).
The key to this proof is to substitute \(\Box p \wedge p\) for \(p\) in \(L\):
\[ \Box(\Box (\Box p \wedge p) \to (\Box p \wedge p)) \to \Box (\Box p \wedge p) \] The antecedent of that conditional follows from \(\Box p\) alone: \[ \vdash_K \Box p \to \Box(\Box (\Box p \wedge p) \to (\Box p \wedge p)) \] Here is a justification for that fact: \[ \begin{array}{lllll} 1 & & (\Box p \wedge p) \to p & \text{PL} \\ 2 & & \Box (\Box p \wedge p) \to \Box p & \text{RK} \ 1 \\ 3 & & p \to (\Box (\Box p \wedge p) \to (\Box p \wedge p)) & \text{PL} \ 2 \\ 4 & & \Box p \to \Box(\Box (\Box p \wedge p) \to (\Box p \wedge p))& \text{RK} \ 3 \\ \end{array} \] So, it follows that \[ \vdash_{GL} \Box p \to \Box(\Box p \wedge p) \] and \[ \vdash_{GL} \Box p \to \Box \Box p \]
Part of the interest of the modal logic of provability is that it casts some light upon Gödel’s incompleteness theorems.
Proposition 13.3 If \(\vdash_{GL} p \leftrightarrow \neg \Box p\), then \(\vdash_{GL} \neg \Box \bot \to \neg \Box p\).
Given a realization \(\alpha\) for which \(\alpha(p)\) is the Gödel formula, \(\vdash_{GL} p \leftrightarrow \neg \Box p\) would correspond to the fact that \(\vdash_Q G \leftrightarrow \neg Prov_Q(\ulcorner G\urcorner)\). On the other hand, \(\neg \Box \bot\) corresponds to \(\neg Prov_Q(\ulcorner 0 =1\urcorner)\). Now:
Proof. We argue as follows: \[ \begin{array}{lllll} 1 & p \leftrightarrow \neg \Box p & \\ 2 & \Box p \to \neg p & \text{PL} \ 1 \\ 3 & \Box \Box p \to \Box \neg p & \text{RK} \ 2 \\ 4 & \Box p \to \Box \Box p & 4 \\ 5 & \Box p \to \Box \neg p & \text{PL} \ 3, 4\\ 6 & \Box p \to (\Box \neg p \wedge \Box p) & \text{PL} \ 5\\ 7 & (\Box \neg p \wedge \Box p) \to \Box (\neg p \wedge p) & \text{K} \\ 8 & \Box p \to \Box (\neg p \wedge p) & \text{PL} \ 6, 7\\ 9 & \Box p \to \Box \bot & \text{PL} \ 8\\ 10 & \neg \Box \bot \to \neg \Box p & \text{PL} \ 9\\ \end{array} \]
Proposition 13.4 \(\vdash_{GL} \neg \Box \bot \to \neg \Box \neg \Box \bot\).
This corresponds to the fact that if \(T\) is consistent, then \(T\) does not prove its own consistency.
Proof. We argue as follows: \[ \begin{array}{lllll} 1 & \neg \Box \bot \to (\Box \bot \to \bot) & \text{PL} \\ 2 & \Box \neg \Box \bot \to \Box (\Box \bot \to \bot) & \text{RK} \ 1 \\ 3 & \Box (\Box \bot \to \bot) \to \Box \bot & \text{L} \\ 4 & \Box \neg \Box \bot \to \Box \bot & \text{PL} \ 2, 3\\ 5 & \neg \Box \bot \to \neg \Box \neg \Box \bot & \text{PL} \ 4 \\ \end{array} \]
Definition 13.1 (Converse Well-Founded) A binary relation \(R\) on a set \(W\) is converse well-founded if, and only if, for every non-empty subset \(X\) of \(W\), there is a greatest \(R\)-member of \(X\), that is, there is some \(w \in X\) such that for every \(x \in X\), \(\neg Rwx\). In other words, \(R\) is converse well-founded on \(W\) if there are no infinite ascending \(R\)-chains of the form \(x_1Rx_2Rx_3 \cdots\).
Notice that a relation \(R\) is converse well-founded on \(W\) only if it is irreflexive and asymmetric on \(W\).
Proposition 13.5 All theorems of \(GL\) are valid in all transitive and converse well-founded frames.
Proof. All axioms of \(GL\) are valid in all transtive and converse well-founded frames. That is because \(\Box (\Box p \to p) \to \Box p\) modally defines the class of transtive converse well-founded frames — as observed in proposition 6.3 above.
Proposition 13.6 \(GL\) is complete with respect to the class of transitive and converse well-founded frames.
The argument for completeness is more elaborate than usual, since there is no reason to expect the canonical model for \(GL\) to be irreflexive, which means that we cannot prove that the canonical model for \(GL\) is converse well-founded.