The Calculus of Nash Equilibria

Heighn

Now that we know a bit about derivatives, it's time to use them to find dominant strategies and Nash equilibria. It helps if the reader is familiar with Nash equilibria already.

Prisoner's dilemma

The payoff matrix of the Prisoner's dilemma can be as follows:

We can see that the payoff for Prisoner 1 depends on her own action (Cooperate/Defect) but also on the action of Prisoner 2. Therefore, the payoff function for Prisoner 1 is a multivariable function: , where $a_{n}$ is the action of Prisoner $n$ (and $n \in {1, 2}$ ). Let's say $a_{n} = 0$ when the action of Prisoner $n$ is Cooperate, and $a_{n} = 1$ for Defect. So $a_{n} \in {0, 1}$ . Then $V 1 (a_{1}, a_{2}) = 20 - 20 a_{2} + 10 a_{1}$ , and crucially, $V 1_{a 1}^{'} (a_{1}, a_{2}) = 10$ . So for Defect ( $a_{1} = 1$ ), Prisoner 1's payoff will be $1$ 0 higher than for Cooperate ( $a_{2} = 0$ ), as can be confirmed in the table. Note that $a_{2}$ doesn't show up in $V 1_{a_{1}}^{'} (a_{1}, a_{2})$ : Defect gives $$ 10$ more for Prisoner 1 regardless of what Prisoner 2 does, which makes Defect a dominant strategy. Don't get me wrong: Prisoner 1's payoff certainly does depend on what Prisoner 2 does. The point is that no matter what Prisoner 2 does, Prisoner 1's payoff will be $10 higher when she (Prisoner 1) defects - and that's what's reflected in $V 1_{a 1}^{'} (a_{1}, a_{2}) = 10$ .

Since the payoff matrix is symmetrical, $V 2 (a_{1}, a_{2}) = 20 - 20 a_{1} + 10 a_{2}$ and $V 2_{a_{2}}^{'} (a_{1}, a_{2}) = 10$ . Prisoner 2 therefore also has a dominant strategy: Defect. The Prisoner's dilemma, then, has a Nash equilibrium: when both prisoners defect. With the partial derivatives, we demonstrated that when both prisoners defect, no one prisoner can do better by changing her action to Cooperate. If e.g. Prisoner 1 were to do this, then $a_{1}$ would go from $1$ to $0$ , and since $V 1_{a_{1}}^{'} (a_{1}, a_{2}) > 0$ , that would lower $V 1$ (regardless of $a_{2}$ ). By symmetry, the same is true for Prisoner 2.

Nonlinear payoff functions

In the Prisoner's dilemma, the payoffs of both players (prisoners) can be modelled by linear payoff functions. What if the payoffs are nonlinear?

Let's say $V 1 (a_{1}, a_{2}) = - a_{1}^{2}$ and $V 2 (a_{1}, a_{2}) = - a_{2}^{2} + a_{2}$ . Then $V 1_{a_{1}}^{'} (a_{1}, a_{2}) = - 2 a_{1}$ and $V 2_{a_{2}}^{'} (a_{1}, a_{2}) = - 2 a_{2} + 1$ . A Nash equilibrium is a point where no player can do better by doing another action given the action of the other player; therefore, $V 1 (a_{1}, a_{2})$ should be maximized with respect to $a_{1}$ while keeping $a_{2}$ constant, whereas $V 2 (a_{1}, a_{2})$ should be maximized with respect to $a_{2}$ while keeping $a_{1}$ constant. If $V 1 (a_{1}, a_{2})$ has a peak value with respect to $a_{1}$ , $V 1_{a_{1}}^{'} (a_{1}, a_{2}) = - 2 a_{1}$ must be $0$ in that point. $V 1_{a_{1}}^{'} (a_{1}, a_{2}) = - 2 a_{1} = 0$ gives $a_{1} = \frac{0}{- 2} = 0$ . So $a_{1} = 0$ could represent a peak, but also a valley, since $V 1_{a_{1}}^{'} (a_{1}, a_{2})$ would be $0$ in both. If $V 1_{a_{1}}^{'} (a_{1}, a_{2}) = - 2 a_{1}$ , $V 1_{a_{1}}^{''} (a_{1}, a_{2}) = - 2 < 0$ . So $a_{1} = 0$ represents a local maximum in $V 1 (a_{1}, a_{2})$ (when $a_{2}$ is held constant)! Since $V 1 (a_{1}, a_{2})$ is quadratic, we can be sure this local maximum is the global maximum too (so there are no values for $a_{1}$ for which $V 1 (a_{1}, a_{2})$ is higher when $a_{2}$ is held constant).

$V 2_{a_{2}}^{'} (a_{1}, a_{2}) = - 2 a_{2} + 1 = 0$ gives $2 a_{2} = 1$ and $a_{2} = \frac{1}{2}$ . $V 2_{a_{2}}^{''} (a_{1}, a_{2}) = - 2 < 0$ , so $a_{2} = \frac{1}{2}$ again represents a local maximum. $V 2 (a_{1}, a_{2})$ is quadratic, so this is a global maximum as well.

So $a_{1} = 0$ represents a global maximum for $V 1$ (for a constant $a_{2}$ ), and $a_{2} = \frac{1}{2}$ represents a global maximum for $V 2$ (for a constant $a_{1}$ ). That means $a_{1} = 0$ is a dominant strategy for player 1, $a_{2} = \frac{1}{2}$ is a dominant strategy for player 2 and we have a Nash equilibrium in $(a_{1} = 0, a_{2} = \frac{1}{2})$ .

Making things a bit more complicated

Let's now define $V 1 (a_{1}, a_{2}) = - a_{1}^{2} * a^{2}$ and $V 2 (a_{1}, a_{2}) = - (a_{2} - 1)^{2}$ . Then for $V 1_{a_{1}}^{'} (a_{1}, a_{2}) = - 2 a_{2} * a_{1} = 0$ , we have $a_{2} = 0 \lor a_{1} = 0$ . $V 1_{a_{1}}^{''} (a_{1}, a_{2}) = - 2 a_{2}$ , which is negative when $a_{2} > 0$ .

For $V 2_{a_{2}}^{'} = - 2 a_{2} + 2 = 0$ we have $a_{2} = 1$ . $V 2_{a_{2}}^{''} = - 2 < 0$ , so this is a local optimum - and also the global one, since $V 2 (a_{1}, a_{2})$ is quadratic. For $a_{2} = 1$ , $V 1_{a_{1}}^{'} = - 2 * 1 * a_{1} = - 2 a_{1}$ . Solving for $0$ gives $a_{1} = 0$ (which we found earlier as well). And since $a_{2} = 1 > 0$ and therefore $V 1_{a_{1}}^{''} (a_{1}, a_{2}) < 0$ , we now have a local maximum for $V 1 (a_{1}, a_{2})$ ! For a constant $a_{2}$ , $V 1 (a_{1}, a_{2})$ is quadratic, so this is the global maximum as well. We found a Nash equilibrium: $(a_{1} = 0, a_{2} = 1)$ .

LESSWRONG
LW

LESSWRONG
LW

5

The Calculus of Nash Equilibria

5

Prisoner's dilemma

Nonlinear payoff functions

Making things a bit more complicated

5

5