Formulas of arithmetic that behave like decision agents

Nisan

3rd Feb 2012

I wrote this post in the course of working through Vladimir Slepnev's "A model of UDT with a halting oracle". This post contains some of the ideas of Slepnev's post, with all the proofs written out. The main formal difference is that while Slepnev's post is about programs with access to a halting oracle, the "decision agents" in this post are formulas in Peano arithmetic. They are generally uncomputable and do not reason under uncertainty.

These ideas are due to Vladimir Slepnev and Vladimir Nesov. (Please let me know if I should credit anyone else.) I'm pretty sure none of this is original material on my part. It is possible that I have misinterpreted Slepnev's post or introduced errors.

We are going to define a world function $U()$ , a $0$ -ary function¹ that outputs an ordered pair $(a,b) \in \mathbb{Q}^2$ of payoff values. There are functions $\pi_i$ such that $\pi_0(a,b) = a$ and $\pi_1(a,b) = b$ for any $a,b$ . In fact $\pi_i(a,b)$ is a function in the three variables $i, a,$ and $b$ .

We are also going to define an agent function $A(x,i)$ that outputs the symbol $C$ or $D$ . The argument $x$ is supposed to be the Gödel number of the world function, and $i \in \mathbb{N}$ is some sort of indexical information.

We want to define our agent such that

$A(\ulcorner \chi \urcorner,i) = \left\{ \begin{array}{ll}D & \mbox{if } \vdash A(\ulcorner \chi \urcorner, \underline{i}) = C \mbox{; else}\\C & \mbox{if } \vdash A(\ulcorner \chi \urcorner, \underline{i}) = D \mbox{; else}\\C & \mbox{if } \exists a,b ( \vdash ( A(\ulcorner \chi \urcorner,\underline{i}) = C \to \pi_{\underline{i}}(\chi()) = \underline{a})\\ & \mbox{\quad} \wedge \mbox{\quad} (A(\ulcorner \chi \urcorner,\underline{i}) = D \to \pi_{\underline{i}}(\chi()) = \underline{b} ))\\ & \mbox{\quad} \wedge \mbox{\quad}a > b \mbox{; else}\\ D &\end{array} \right.$

( $\ulcorner \omega \urcorner$ denotes the Gödel number of $\omega$ . $\vdash \omega$ means that $\omega$ is provable in Peano arithmetic. $\underline{i}$ represents the numeral for $i$ . I don't care what value $A(x,i)$ has when $x$ isn't the Gödel number of an appropriate $0$ -ary function.)

There is some circularity in this tentative definition, because a formula standing for $A$ appears in the definition of $A$ itself. We get around this by using diagonalization. We'll describe how this works just this once: First define the function $\phi$ as follows:

$\phi(\ulcorner \chi \urcorner,i,\ulcorner \psi \urcorner) = \left\{ \begin{array}{ll}D & \mbox{if } \vdash \psi(\ulcorner \chi \urcorner, \underline{i}) = C \mbox{; else}\\C & \mbox{if } \vdash \psi(\ulcorner \chi \urcorner, \underline{i}) = D \mbox{; else}\\C & \mbox{if } \exists a,b ( \vdash ( \psi(\ulcorner \chi \urcorner,\underline{i}) = C \to \pi_{\underline{i}}(\chi()) = \underline{a})\\ & \mbox{\quad} \wedge \mbox{\quad} (\psi(\ulcorner \chi \urcorner,\underline{i}) = D \to \pi_{\underline{i}}(\chi()) = \underline{b} ))\\ & \mbox{\quad} \wedge \mbox{\quad}a > b \mbox{; else}\\ D &\end{array} \right.$

This function can be defined by a formula. Then the diagonal lemma gives us a formula $A$ such that $\vdash A(x,i) = \phi(x,i,\ulcorner A \urcorner)$ .
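
To unpack this step using the convention of the footnote: let $\Phi(x,i,z,y)$ be a formula defining $\phi$ , and let $\mathcal{A}(x,i,y)$ be the formula obtained by diagonalization (the names $\Phi$ and $\mathcal{A}$ are introduced here only for illustration and do not appear in the post). Assuming the diagonal lemma is applied in its version with parameters, it yields

$\begin{align*}\quad\quad \vdash \mathcal{A}(x,i,y) \leftrightarrow \Phi(x,i,\ulcorner \mathcal{A} \urcorner,y) &&\end{align*}$

Provided $\Phi$ is provably functional in $y$ , so is $\mathcal{A}$ , and the equation $A(x,i) = \phi(x,i,\ulcorner A \urcorner)$ is the functional shorthand for this equivalence.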

This is our (somewhat) rational decision agent. If it can prove it will do one thing, it does another; this is what Slepnev calls "playing chicken with the universe". If it can prove that $C$ is an optimal strategy, it chooses $C$ ; and otherwise it chooses $D$ .
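
The case analysis just described can be mirrored in a toy Python sketch. It is only an illustration: the real agent is an arithmetic formula and is generally uncomputable, so the provability tests are passed in as hypothetical oracle callables (`proves_action`, `proves_payoffs`), and the existential quantifier over all rational pairs is replaced by a finite list `candidates`. None of these names come from the post.

```python
from fractions import Fraction
from typing import Callable, Iterable, Tuple

Pair = Tuple[Fraction, Fraction]


def decide(
    proves_action: Callable[[str], bool],
    proves_payoffs: Callable[[Fraction, Fraction], bool],
    candidates: Iterable[Pair],
) -> str:
    """Toy mirror of the four-case definition of A (all inputs are assumed oracles)."""
    # Cases 1 and 2: "playing chicken". If the agent's own action is provable,
    # do the opposite.
    if proves_action("C"):
        return "D"
    if proves_action("D"):
        return "C"
    # Case 3: look for payoffs a > b such that
    # "(A = C -> payoff = a) and (A = D -> payoff = b)" is provable.
    for a, b in candidates:
        if a > b and proves_payoffs(a, b):
            return "C"
    # Case 4: default action.
    return "D"


# Usage with stub oracles: nothing is provable about the action itself,
# and exactly one payoff pair is provable.
no_action_proof = lambda action: False
only_pair = lambda a, b: (a, b) == (Fraction(3), Fraction(1))
print(decide(no_action_proof, only_pair, [(Fraction(3), Fraction(1))]))  # prints C
```

The order of the `if` statements matters: the two chicken cases are checked before any payoff comparison, exactly as in the definition.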

First, a lemma about the causes and consequences of playing chicken:

Lemma 1. For any $\chi$ ,

(1) $\vdash \operatorname{Prv}( A(\ulcorner \chi \urcorner, \underline{i}) = C) \to \neg \operatorname{Con}(\operatorname{PA})$
(2) $\vdash \neg \operatorname{Con}(\operatorname{PA}) \to A(\ulcorner \chi \urcorner, \underline{i}) = D$
(3) $\vdash \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i})=D) \to \neg\operatorname{Con}(\operatorname{PA}+\operatorname{Con}(\operatorname{PA}))$
(4) $\vdash \operatorname{Con}(\operatorname{PA}) \wedge \neg\operatorname{Con}(\operatorname{PA}+\operatorname{Con}(\operatorname{PA})) \to A(\ulcorner \chi \urcorner,\underline{i}) = C$

( $\operatorname{Prv}$ is a binary-valued function such that $\operatorname{Prv}(\ulcorner \omega \urcorner)$ is true exactly when there is a proof of $\omega$ in Peano arithmetic. For brevity we write $\operatorname{Prv}(\omega)$ instead. $\operatorname{Con}(\operatorname{PA}+\operatorname{Con}(\operatorname{PA}))$ is the proposition that Peano arithmetic, plus the axiom that Peano arithmetic is consistent, is a consistent theory.)
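
For reference, the proof that follows appears to rely on these standard properties of $\operatorname{Prv}$ (the Hilbert-Bernays-Löb derivability conditions), stated here for arbitrary sentences $\omega, \omega'$ ; the post itself does not spell them out:

$\begin{align*}\quad\quad & \mbox{if } \vdash \omega \mbox{ then } \vdash \operatorname{Prv}(\omega) &\\& \vdash \operatorname{Prv}(\omega \to \omega') \to (\operatorname{Prv}(\omega) \to \operatorname{Prv}(\omega')) &\\& \vdash \operatorname{Prv}(\omega) \to \operatorname{Prv}(\operatorname{Prv}(\omega)) &\end{align*}$

Parts (3) and (4) additionally use the formalized deduction theorem in the form $\vdash \neg\operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) \leftrightarrow \operatorname{Prv}(\neg\operatorname{Con}(\operatorname{PA}))$ .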

Proof. (1) By definition of $A$ ,

$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) & \to A(\ulcorner \chi \urcorner,\underline{i}) = D &\end{align*}$

$\begin{align*}\quad\quad \vdash \operatorname{Prv}(\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C)) & \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) &\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) & \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) \wedge \operatorname{Prv}(\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C)) &\\& \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) \wedge \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) &\\& \to \neg \operatorname{Con}(\operatorname{PA}) &\end{align*}$

(2) By the principle of explosion,

$\begin{align*}\quad\quad \vdash \neg \operatorname{Con}(\operatorname{PA}) & \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) &\\ & \to A(\ulcorner \chi \urcorner,\underline{i}) = D &\end{align*}$

(3) By the definition of $A$ ,

$\begin{align*}\quad\quad \vdash \neg \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) \wedge \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) & \to A(\ulcorner \chi \urcorner,\underline{i}) = C &\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) & \to \biggl( \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) \wedge \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) \biggr) \vee A(\ulcorner \chi \urcorner,\underline{i}) = C &\\& \to \neg \operatorname{Con}(\operatorname{PA}) \vee A(\ulcorner \chi \urcorner,\underline{i}) = C\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Prv}(\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D)) & \to \operatorname{Prv} \biggl( \neg\operatorname{Con}(\operatorname{PA}) \vee A(\ulcorner \chi \urcorner,\underline{i}) = C \biggr) &\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) & \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) \wedge \operatorname{Prv}(\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D)) &\\& \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) \wedge \operatorname{Prv} \biggl(\neg \operatorname{Con}(\operatorname{PA}) \vee A(\ulcorner \chi \urcorner,\underline{i}) = C \biggr) &\\& \to \operatorname{Prv} \biggl( A(\ulcorner \chi \urcorner,\underline{i}) = D \wedge \biggl(\neg\operatorname{Con}(\operatorname{PA}) \vee A(\ulcorner \chi \urcorner,\underline{i}) = C \biggr) \biggr) &\\& \to \operatorname{Prv}(\neg\operatorname{Con}(\operatorname{PA})) &\\& \to \neg\operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) &\end{align*}$

(4)

$\begin{align*}\quad\quad \vdash \neg\operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) & \to \operatorname{Prv}(\neg\operatorname{Con}(\operatorname{PA})) &\\& \to \operatorname{Prv}(\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C)) &\\& \to \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) &\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Con}(\operatorname{PA}) \wedge \neg\operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) & \to \operatorname{Con}(\operatorname{PA}) \wedge \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) &\\& \to \neg\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) \wedge \operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) &\\& \to A(\ulcorner \chi \urcorner,\underline{i}) = C &\end{align*}$

$\square$

If we assume consistency of $\operatorname{PA} + \operatorname{Con}(\operatorname{PA})$ (which entails consistency of $\operatorname{PA}$ ), then parts (1) and (3) of Lemma 1 tell us that for any $i$ , $\nvdash A(\ulcorner \chi \urcorner,\underline{i}) = C$ and $\nvdash A(\ulcorner \chi \urcorner,\underline{i}) = D$ . So the agent never actually plays chicken.
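
Spelled out, and reading Lemma 1 as a list of true statements about the natural numbers (which takes the truth of PA's theorems for granted, as the post seems to do throughout), the step uses the contrapositives of parts (1) and (3):

$\begin{align*}\quad\quad \operatorname{Con}(\operatorname{PA}) & \to \neg\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = C) &\\ \operatorname{Con}(\operatorname{PA}+\operatorname{Con}(\operatorname{PA})) & \to \neg\operatorname{Prv}(A(\ulcorner \chi \urcorner,\underline{i}) = D) &\end{align*}$

With neither action provable, the first two cases of the definition of $A$ never apply, and the agent's choice is settled by the payoff comparison in the third case or by the default in the fourth.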

Now let's see how our agent fares on a straightforward decision problem:

Proposition 2. Let $\alpha, \beta \in \mathbb{Q}$ and suppose

$U() = \left\{ \begin{array}{ll}(\alpha, 0) & \mbox{\rm if } A(\ulcorner U \urcorner,0) = C\\ (\beta, 0) & \mbox{\rm if } A(\ulcorner U \urcorner,0) = D\end{array} \right.$

Assume consistency of $\operatorname{PA}+\operatorname{Con}(\operatorname{PA})$ . Then $A(\ulcorner U \urcorner,0) = C$ if and only if $\alpha > \beta$ .

Proof. If we assume consistency of $\operatorname{PA} + \operatorname{Con}(\operatorname{PA})$ , then Lemma 1 tells us that the agent doesn't play chicken. So the agent will choose $C$ if and only if it determines that choosing $C$ is optimal.

We have

$\begin{align*}\quad\quad \vdash \biggl( A(\ulcorner U \urcorner,0) = C \to \pi_0 U() = \underline{\alpha} \biggr) \wedge \biggl( A(\ulcorner U \urcorner,0) = D \to \pi_0 U() = \underline{\beta} \biggr) &&\end{align*}$

Suppose $\alpha > \beta$ . Then clearly

$\begin{align*}\quad\quad \exists a,b \vdash \biggl( A(\ulcorner U \urcorner,0) = C \to \pi_0 U() = \underline{a} \biggr) \wedge \biggl( A(\ulcorner U \urcorner,0) = D \to \pi_0 U() = \underline{b} \biggr) \wedge a > b &&\end{align*}$

So $A(\ulcorner U \urcorner,0) = C$ .

As for the converse, suppose $\alpha \leq \beta$ . We have $\vdash (A(\ulcorner U \urcorner,0) = C \to \pi_0 U() = \underline{\alpha})$ . If also

$\begin{align*}\quad\quad \vdash (A(\ulcorner U \urcorner,0) = C \to \pi_0 U() = \underline{a}) &&\end{align*}$

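and $a \neq \alpha$ , then $\vdash(A(\ulcorner U \urcorner,0) = D)$ , because the two implications assign contradictory payoffs to the case where the agent outputs $C$ . By Lemma 1(3) and consistency of $\operatorname{PA} + \operatorname{Con}(\operatorname{PA})$ , this cannot happen. So for all $a \neq \alpha$ ,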

$\begin{align*}\quad\quad \nvdash (A(\ulcorner U \urcorner,0) = C \to \pi_0 U() = \underline{a}) &&\end{align*}$

Similarly, we have

$\begin{align*}\quad\quad \nvdash (A(\ulcorner U \urcorner,0) = D \to \pi_0 U() = \underline{b}) &&\end{align*}$

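for all $b \neq \beta$ . The only candidate pair is therefore $(a,b) = (\alpha,\beta)$ , which fails $a > b$ when $\alpha \leq \beta$ . So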

$\begin{align*}\quad\quad \nexists a,b \vdash(A(\ulcorner U \urcorner,0) = C \to \pi_0 U() = \underline{a}) \wedge (A(\ulcorner U \urcorner,0) = D \to \pi_0 U() = \underline{b}) \wedge a>b &&\end{align*}$

So the agent doesn't decide that $C$ is optimal, and $A(\ulcorner U \urcorner,0) = D$ .

$\square$

Now let's see how $A$ fares on a symmetric Prisoner's Dilemma with itself:

Proposition 3. Let

$U() = \left\{ \begin{array}{ll}(1,1) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=C \wedge A(\ulcorner U \urcorner,1)=C\\ (-1,2) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=C \wedge A(\ulcorner U \urcorner,1)=D\\ (2,-1) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=D \wedge A(\ulcorner U \urcorner,1)=C\\ (0,0) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=D \wedge A(\ulcorner U \urcorner,1)=D\end{array} \right.$

Then, assuming consistency of $\operatorname{PA} + \operatorname{Con}(\operatorname{PA})$ , we have $A(\ulcorner U \urcorner,0) = C = A(\ulcorner U \urcorner,1)$ .

Proof.

(This proof uses Löb's theorem, and that makes it confusing. Vladimir Slepnev points out that Löb's theorem is not really necessary here; a simpler proof appears in the comments.)

$\begin{align*}\quad\quad \vdash A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1) & \to \biggl(\biggl( A(\ulcorner U \urcorner,0)=C \to \pi_0 U() = 1 \biggr) \wedge \biggl( A(\ulcorner U \urcorner,0)=D \to \pi_0 U() = 0 \biggr)\biggr) &\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1)) & \to \exists a,b. \operatorname{Prv}\biggl((A(\ulcorner U \urcorner,0)=C \to \pi_0 U() = \underline{a}) &\\& \quad\quad \wedge (A(\ulcorner U \urcorner,0)=D \to \pi_0 U() = \underline{b})\biggr) \wedge a > b &\end{align*}$
Looking at the definition of $A$ , we see that

$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1)) & \to \operatorname{Prv}(A(\ulcorner U \urcorner,0)=C) \vee \operatorname{Prv}(A(\ulcorner U \urcorner,0)=D) &\\&\quad\quad \vee A(\ulcorner U \urcorner,0)=C &\end{align*}$
By Lemma 1, (1) and (3),

$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1)) & \to \neg \operatorname{Con}(\operatorname{PA}) \vee \neg \operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) \vee A(\ulcorner U \urcorner,0)=C &\end{align*}$

Similarly,

$\begin{align*} (*)\quad \vdash \operatorname{Prv}(A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1)) & \to \neg \operatorname{Con}(\operatorname{PA}) \vee \biggl(\operatorname{Con}(\operatorname{PA}) \wedge \neg \operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) \biggr) &\\&\quad\quad \vee A(\ulcorner U \urcorner,0)=C=A(\ulcorner U \urcorner,1) &\end{align*}$

Applying Lemma 1(2) and (4),

$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1)) & \to A(\ulcorner U \urcorner,0) = D = A(\ulcorner U \urcorner,1) &\\&\quad\quad \vee A(\ulcorner U \urcorner,0) = C = A(\ulcorner U \urcorner,1) &\\&\quad\quad \vee A(\ulcorner U \urcorner,0)=C=A(\ulcorner U \urcorner,1) &\\& \to A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1) &\end{align*}$

By Löb's theorem,

$\begin{align*}\quad\quad \vdash A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1) &&\end{align*}$
$\begin{align*}\quad\quad \vdash \operatorname{Prv}(A(\ulcorner U \urcorner,0) = A(\ulcorner U \urcorner,1)) &&\end{align*}$

By $(*)$ , we have

$\begin{align*}\quad\quad \vdash \neg \operatorname{Con}(\operatorname{PA}) \vee \biggl(\operatorname{Con}(\operatorname{PA}) \wedge \neg \operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA})) \biggr) \vee A(\ulcorner U \urcorner,0)=C=A(\ulcorner U \urcorner,1) &&\end{align*}$

So, assuming $\operatorname{Con}(\operatorname{PA} + \operatorname{Con}(\operatorname{PA}))$ , we conclude that $A(\ulcorner U \urcorner,0)=C=A(\ulcorner U \urcorner,1)$ .

$\square$
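
The first step of the proof above only needs the diagonal of the payoff table: if both copies of the agent act alike, cooperating pays 1 and defecting pays 0. Here is a small Python check of that arithmetic, with the table transcribed from Proposition 3 and the helper name `diagonal_payoffs` made up for this illustration. It checks the payoff comparison only, not any provability claim.

```python
# Payoff table of Proposition 3:
# (action of player 0, action of player 1) -> (payoff to player 0, payoff to player 1)
PAYOFFS = {
    ("C", "C"): (1, 1),
    ("C", "D"): (-1, 2),
    ("D", "C"): (2, -1),
    ("D", "D"): (0, 0),
}


def diagonal_payoffs(player: int) -> tuple:
    """Return (a, b): the payoff to `player` when both play C, and when both play D."""
    a = PAYOFFS[("C", "C")][player]
    b = PAYOFFS[("D", "D")][player]
    return a, b


for player in (0, 1):
    a, b = diagonal_payoffs(player)
    # The third case of the definition of A requires a > b.
    print(player, a, b, a > b)
```

For both players the pair is $(a,b) = (1,0)$ , which is the pair the proof feeds into the third case of the definition of $A$ .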

The definition of $A$ treats the choices $C$ and $D$ differently; so it is worth checking that $A$ behaves correctly in the Prisoner's Dilemma when the effects of $C$ and $D$ are switched:

Proposition 4. Let

$U() = \left\{ \begin{array}{ll}(0,0) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=C \wedge A(\ulcorner U \urcorner,1)=C\\ (2,-1) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=C \wedge A(\ulcorner U \urcorner,1)=D\\ (-1,2) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=D \wedge A(\ulcorner U \urcorner,1)=C\\ (1,1) & \mbox{\rm if } A(\ulcorner U \urcorner,0)=D \wedge A(\ulcorner U \urcorner,1)=D\end{array} \right.$

Then, assuming consistency of $\operatorname{PA} + \operatorname{Con}(\operatorname{PA})$ , we have $A(\ulcorner U \urcorner,0) = D = A(\ulcorner U \urcorner,1)$ .

A proof appears in the comments.

There are a number of questions one can explore with this formalism: What is the correct generalization of $A$ that can choose between $n$ actions, and not just two? How about infinitely many actions? What about theories other than Peano arithmetic? How do we accommodate payoffs that are real numbers? How do we make agents that can reason under uncertainty? How do we make agents that are computable algorithms rather than arithmetic formulas? How does $A$ fare on a Prisoner's Dilemma with an asymmetric payoff matrix? In a two-person game where the payoff to player $A_0$ is independent of the behavior of $A_1$ , can $A_1$ deduce the behavior of $A_0$ ? What happens when we replace the third line of the definition of $A$ with

$\begin{align*} \quad\quad \vdash \exists a,b & (A(\ulcorner U\urcorner,\underline{i})=C \to \pi_{\underline{i}} U() = a) \\& \wedge ( A(\ulcorner U \urcorner,\underline{i})=D \to \pi_{\underline{i}} U() = b) \\& \wedge a > b \end{align*}$

? What is a (good) definition of "decision problem"? Is there a theorem that says that our agent $A$ is, in a certain sense, optimal?

¹Every $k$ -ary function $U$ in this article is defined by a formula $\phi(x_1, \dots, x_k, y)$ with $k+1$ free variables such that $\vdash \exists y. \phi(x_1, \dots, x_k, y)$ and $\vdash \phi(x_1, \dots, x_k, y) \wedge \phi(x_1, \dots, x_k, y') \to y = y'$ . By a standard abuse of notation, when the name of a function like $U$ appears in a formula of arithmetic, what we really mean is the formula $\phi$ that defines it.
