A Few Translation Exercises

One of my favourite books is Gödel, Escher, Bach, by Douglas Hofstadter. If you’re patient enough to read it all, I highly recommend it: it’s a great book about mathematics and cognitive science.

One of the main goals of the book is to motivate, and sketch a proof of, Gödel’s first incompleteness theorem. At one point, he provides some exercises in transcribing number theoretical statements in a specific implementation of Peano arithmetic he calls TNT. They’re pretty easy, but there are a couple that can really stump you if you haven’t taken any first-order logic courses, like me.

It would befit me to detail Hofstadter’s notation for TNT, but it’s not really anything out of the ordinary. We denote material implication with the archaic $\def\impl{\mathbin\supset}\impl$ symbol, and negation with the tilde $\def\tnot{\mathop{\sim}}\tnot$ , but otherwise it is bog-standard, and the particulars are not that important.

Here is the relevant passage from Chapter 8 of GEB—in my 20th Anniversary Edition, it is on page 215.

A Few More Translation Exercises

And now, a few practice exercises for you, to test your understanding of the notation of TNT. Try to translate the first four of the following N-sentences into TNT-sentences, and the last one into an open well-formed formula.

All natural numbers are equal to 4.
There is no natural number which equals its own square.
Different natural numbers have different successors.
If 1 equals 0, then every number is odd.
$b$ is a power of 2.

The last one you may find a little tricky. But is it nothing, compared to this one:

$b$ is a power of 10.

Strangely, this one takes great cleverness to render in our notation. I would caution you to try it only if you are willing to spend hours and hours on it—and if you know quite a bit of number theory!

Let’s just jump right into it. I know you’re excited to see me bend over backwards to translate that last one, but I don’t like to just skip to the end of a story, however trite the crescendo.

The first few exercises are pretty much freebies, so I won’t say too much about them. Note that, as with any translation, we sometimes have to massage our goal a bit to be able to translate accurately.

All natural numbers are equal to 4.

$\def\0{\mathsf 0}\def\s{\mathsf S}\def\c{\mathpunct:}\def\eq{\mathbin=}\forall n\c n \eq \s\s\s\s\0$ .

There is no natural number which equals its own square.

$\tnot\exists n\c n \eq (n \cdot n)$ . Note that by convention, all binary arithmetical operations must be delimited with parentheses $({-})$ . This was probably done to avoid worrying about disambiguation, especially later on when proving the incompleteness theorem.

Different natural numbers have different successors.

$\def\<{\langle}\def\>{\rangle}\forall m\c \forall n\c \< \tnot m \eq n \impl \tnot \s m \eq \s n \>$ . Just as the arithmetical operations got parens, the binary logical connectives must be delimited with angle brackets $\<{-}\>$ . Don’t cross the streams and all that.

If 1 equals 0, then every number is odd.

$\< \s\0 \eq \0 \impl \forall n\c \exists a\c n = \s(a+a) \>$ . I interpret this exercise as a gentle reminder to keep your quantifiers as tight as possible.

$b$ is a power of 2.

Okay, finally something juicy. Here we must create a formula with one free variable, $b$ , which is true iff $b$ is a power of two. We have no way to directly transcribe exponentiation, as our only operations are $+$ and $\cdot$ . So how can we state that property sufficiently simply to implement that in first order logic?

By the Fundamental Theorem of Arithmetic, every nonzero number which is not a power of two has an odd factor besides $\mathsf{S0}$ . Every odd number is a factor of $\0$ , so that case is handled as well. Conversely, every factor of a power of two is either equal to $\s\0$ or is even. So we’ll use that reformulation.

$\def\u#1#2{ {\underbrace{#1}_{#2}}}\forall a\c \< \u{\exists c\c b = (\mathsf{SS} a \cdot c)}{\mathsf{SS} a\ \text{is a factor}} \impl \u{\exists c\c \mathsf{SS} a = (c+c)}{\mathsf{SS} a\ \text{is even}} \>$

Recall that if we want to talk about a number $a' \ge 2$ , then we have to write it as $a' = \mathsf{SS} a$ for $a \ge 0$ . Also recall that there’s nothing wrong with using $c$ in both the antecedent and the consequent, because of the tightness of my quantifier bindings.

$b$ is a power of 10.

Now we’re going to have to be even more clever than last time. The previous solution worked because 2 was prime, and we were able to get away with not caring which power of two $b$ was. But 10 isn’t a prime, and even if we want to say that $10^n = 2^n \cdot 5^n$ , we’ll somehow need to know what that exponent $n$ is.

Well, here’s a slick math trick. Let’s just say we know what $n$ is. Since we don’t really know how to read off $n$ easily from $b = 10^n$ , we’ll just sweep that problem under the rug with an $\exists n$ . So as long as we can calculate $10^n$ in TNT, we can stipulate that $b = 10^n$ and be done with it.

How do we do that? Well, let’s just compute $10^n$ the way we usually do. Namely, start with $\mathsf{S0}$ , then multiply it by $\mathsf{SSSSSSSSSS0}$ $n$ times. This gives a sequence of numbers, $x = (1,10,100,...,10^n)$ . What makes this sequence nice is that it has some defining characteristics which are very easily beheld, even by our good friend Giuseppe: to wit, $x_0 = 1$ , and $x_{i+1} = 10 \cdot x_i$ for all $i < n$ .

We can view this sequence $x$ as a certificate that $b$ equals $10^n$ , by asking that $x_n = b$ , and it looks like TNT is in principle smart enough to be able to verify this, since all we really need is multiplication by ten. Clearly, a certificate for $b$ can only exist iff $b$ is a power of ten, so our issue now is to write this certificate in a language that TNT can understand. It would suffice to encode $x$ as some fixed-length tuple of numbers, because then we can just existentially quantify those suckers and go home.

Our saviour comes in the form of the Chinese Remainder Theorem. If we have a pairwise-coprime set of moduli $m_0, ..., m_n$ that are sufficiently large, then we can solve the system of equations

$\begin{align*} a &\equiv x_0 \pmod{m_0} \\ a &\equiv x_1 \pmod{m_1} \\ &\;\mathrel{\vdots} \\ a &\equiv x_n \pmod{m_n} \\ \end{align*}$

for $a$ . That guarantees the existence of an encoding. Turning that on its head, given $a$ and $m_i$ , we can access $x_i$ as the remainder of dividing $a$ by $m_i$ . For convenience, let’s define a relation $R$ for saying that $r$ is the remainder when dividing $a$ by $m$ :

$R[a,m,r] := \< \u{\exists r'\c m \eq (r + \s r')}{r < m\vphantom\mid} \wedge \u{\exists q\c a \eq ((m \cdot q) + r)}{m \mid a - r} \>$

So now we just need a convenient collection of pairwise coprime $m_i$ to use. I’ll go with $m_i = 1 + k(i+1)$ , where $k$ is a sufficently large and highly divisible number, so that (1) $m_i > x_i$ , and (2) the $m_i$ are coprime. We can guarantee this works, because e.g. $k = \max(x_0, x_1, ..., x_n, n)!$ has precisely the properties we want. So our certificate will provide $k$ and $a$ . We also need to know $n$ to know when to stop and compare to $b$ .

We now have our certificate scheme $(n,a,k)$ , and can find one for any valid $b$ . They can get pretty large, though, especially using our overly generous estimate for $k$ . For example, $(1,43,5)$ works to prove that $10 = 10^1$ , since $43 \equiv 1 \pmod{1\cdot5+1}$ and $43 \equiv 10 \pmod{2\cdot5+1}$ . However, in our safe estimate, we took $k = 10! = 3628800$ so that the least result CRT can give us is $a = 20(k+1) - (2k+1) = 65318419$ . $(2,32026,34)$ will certify that $100 = 10^2$ , but our safe estimate gives $k = 100! \approx 9.33 \times 10^{157}$ and some value of $a < (k+1)(2k+1)(3k+1)$ .

In any case, we have enough to solve the problem, so let’s assemble the parts. Our solution will be $\exists n\c P[\mathsf{SSSSSSSSSS0},n,b]$ , where

$\begin{align*} P[m,n,b] &:= \exists a\c \exists k\c \< \< \u{R[a,\s k,\s\0]}{x_0=1} \wedge \u{R[a,\s(k \cdot \s n),b]}{x_n=b} \> \wedge \u{I[m,n,a,k]}{\text{inductive step}} \>, \\ I[m,n,a,k] &:= \forall i\c \< \u{\exists j\c n \eq (i + \s j)}{i < n} \impl \exists y\c \< \u{R[a,\s(k \cdot \s i),y]}{x_i = y} \wedge \u{R[a,\s(k \cdot \s\s i),(m \cdot y)]}{x_{i+1} = my} \> \>. \end{align*}$

Thankfully, that’s it, and we’re done. Those poor souls who wish to prove that this method of exponentiation is well-defined in Peano Arithmetic have it way way worse because they have to fiddle with Cantor coding probably and use induction to prove the arbitrary extension of sequences. It’s a real stinker, and mathematicians are fine with this because the mere existence of a proof is good enough to justify the Laconic axiom set.

To close out, I’ve gone and done the obvious thing, which was to inline all the definitions to give you one long string of pure TNT. I’ve thrown in a couple of optimizations to shave off what I could—including obviating an $\exists s\c \s k \eq (\s\0 + \s s)$ by using $\s k$ everywhere we’d otherwise use $k$ —but we’re still at 201 symbols. So brace yourselves…

$\begin{align*} \exists n\c{}&\exists a\c \exists k\c \< \< \exists q\c a \eq \s(\s\s k \cdot q) \wedge \< \exists s\c (\s k \cdot \s n) \eq (b + s) \wedge \exists q\c a \eq ((\s(\s k \cdot \s n) \cdot q) + b) \> \> \\ &{} \wedge \forall i\c \< \exists j\c n \eq (i + \s j) \impl \exists y\c \< \< \exists s\c (\s k \cdot \s i) \eq (y + s) \wedge \exists q\c a \eq ((\s(\s k \cdot \s i) \cdot q) + y) \> \\ &\qquad{} \wedge \< \exists s\c (\s k \cdot \s\s i) \eq ((\mathsf{SSSSSSSSSS0} \cdot y) + s) \\ &\qquad{} \wedge \exists q\c a \eq ((\s(\s k \cdot \s\s i) \cdot q) + (\mathsf{SSSSSSSSSS0} \cdot y)) \> \> \> \> \end{align*}$

Exercise. (For all the code golfers out there.) Find a formula with less than 200 symbols. Can you beat 193?

Comments have been disabled for this post.