This is a supplemental post to Geometric Utilitarianism (And Why It Matters), in which I show how I derived the weights ψ which make any Pareto optimal point p optimal according to the geometric weighted average. This is a subproblem of the proof laid out in the first post of this sequence, and the main post describes why that problem is interesting.
Overview
So how are we going to calculate weights ψ which make p optimal among F?
The idea here is to identify the Harsanyi hyperplane H, which contains all of the joint utilities $u\in\mathbb{R}^n$ which satisfy H(u,ϕ)=H(p,ϕ), where ϕ are the weights which make our chosen point $p\in\mathbb{R}^n$ optimal with respect to H(_,ϕ). We're then going to calculate new weights ψ which make p optimal with respect to G(_,ψ). It turns out it's sufficient to make p optimal among H: p will then also be optimal across our entire feasible set F.
In terms of calculus, we're going to be constructing a function G∘H, which tells us about how moving around on H changes G. And we're going to choose weights ψ which make the gradient ∇v(G∘H) equal 0 at p. This makes it a local optimum, and it will turn out to be a global maximum across H, which in turn will make it a global maximum across F.
Geometrically, we can think of that as the surface gradient of G across H. And so in terms of the overall gradient ∇uG, we're designing ψ so that ∇uG is perpendicular to H at p.
Parameterizing the Harsanyi Hyperplane
When thinking about moving around on the Harsanyi hyperplane H, we have a linear constraint that says no matter which u∈H we pick, we know that u⋅ϕ=p⋅ϕ=H(p,ϕ). If we know u lies on H, we can calculate the n-th agent's utility $u_n$ from the first n−1 utilities. We'll be referring to these first n−1 utilities a lot, so let's call them $v\in\mathbb{R}^{n-1}$, so that $v_i=u_i$ for all $i<n$.
H and G are both symmetrical with respect to shuffling the indices of agents around, so without loss of generality we'll assume that the n-th agent is one we're assigning positive Harsanyi weight to: ϕn > 0. This is necessary for the reconstruction to work for all ϕ.
So we can think of H as a function $H:\mathbb{R}^{n-1}\to\mathbb{R}^n$, where the j-th output is $H_j(v)=u_j$ for $j<n$. We can use the n-th output to reconstruct $u_n$ given $v$ like this: $H_n(v)=\frac{H(p,\phi)-\sum_{i=1}^{n-1}v_i\phi_i}{\phi_n}$. This lets us move around $\mathbb{R}^{n-1}$ to pick $v$ however we want, and the function H will map that to its image, helpfully also called H!
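Here's a minimal sketch of this parameterization in Python (not from the original post; the function name and example numbers are my own), which the later checks will reuse:

```python
import numpy as np

def harsanyi_point(v, p, phi):
    """Map v in R^(n-1) to the point on the Harsanyi hyperplane
    H = {u : u . phi = p . phi}, assuming phi[-1] > 0 so the last
    coordinate can be reconstructed. (Sketch, not a canonical implementation.)"""
    v, p, phi = np.asarray(v, float), np.asarray(p, float), np.asarray(phi, float)
    # Solve v . phi[:-1] + u_n * phi[-1] = p . phi for u_n:
    u_n = (p @ phi - v @ phi[:-1]) / phi[-1]
    return np.append(v, u_n)
```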
Alright, now we have H:Rn−1→Rn and we also have G:Rn×[0,1]n→R, which is the geometric weighted average whose gradient we're trying to design through our choice of ψ. So let's compose them together to form G∘H:Rn−1×[0,1]n→R. And since we want p to be an optimum of G across the hyperplane H, we can set the gradient ∇(G∘H)(q,ψ)=0, where q∈Rn−1 are the first n−1 utilities of our target joint utility p. Solving this equation for ψ will give us the weights we need!
This amounts to solving a family of n−1 equations $\frac{\partial(G\circ H)}{\partial v_i}(q,\psi)=0$, where we're holding the weights constant for the purposes of differentiation, but we'll be solving for the weights that make the derivative 0 at p.
How Does G Change as We Change These Parameters?
Ok, we've built up a few layers of abstraction, so let's start unpacking. By the chain rule, and using the notation that Hj is the j-th element of the output of H:
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\sum_{j=1}^{n}\frac{\partial G}{\partial H_j}(H(v),\psi)\,\frac{\partial H_j}{\partial v_i}(v)$$
How does our point on H change as we change these parameters?
Let's start by computing ∂Hj∂vi(v).
For the first n−1 terms this is simple, because $H_j$ simply returns $v_j$. So $\frac{\partial H_j}{\partial v_i}$ is 1 when $i=j$, and 0 otherwise, which we can represent using the Kronecker delta $\delta^j_i$. And $\frac{\partial H_n}{\partial v_i}=-\frac{\phi_i}{\phi_n}$.
Geometrically, this is telling us about the slope of H. Note that:
$\frac{\partial H_n}{\partial v_i}$ is constant and doesn't depend on our choice of $v_i$
$\frac{\partial H_n}{\partial v_i}\leq 0$ (We can never increase agent n's utility by increasing another agent's utility. This is always true at the Pareto frontier.)
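If you want to sanity-check those slopes numerically, here's a quick finite-difference sketch (it assumes the harsanyi_point helper from the sketch above, and the example p and ϕ are arbitrary):

```python
# dH_n/dv_i should equal -phi_i/phi_n for every i < n.
p, phi = np.array([2.0, 1.0, 3.0]), np.array([0.2, 0.3, 0.5])
v0, eps = p[:-1].copy(), 1e-6
for i in range(len(v0)):
    dv = np.zeros_like(v0); dv[i] = eps
    slope = (harsanyi_point(v0 + dv, p, phi)[-1] - harsanyi_point(v0, p, phi)[-1]) / eps
    assert abs(slope - (-phi[i] / phi[-1])) < 1e-6
```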
How Does G Change as We Move Around on H?
We can start solving for ∂G∂Hj by substituting in the definition of G:
$$\frac{\partial G}{\partial H_j}=\frac{\partial}{\partial H_j}\left(\prod_{k=1}^{n}H_k^{\psi_k}\right)$$
From here we can apply the n-factor product rule:
$$\frac{\partial G}{\partial H_j}=\left(\prod_{k=1}^{n}H_k^{\psi_k}\right)\left(\sum_{k=1}^{n}\frac{\frac{\partial}{\partial H_j}\left(H_k^{\psi_k}\right)}{H_k^{\psi_k}}\right)$$
Thankfully, $\frac{\partial}{\partial H_j}\left(H_k^{\psi_k}\right)=0$ whenever $k\neq j$, leaving just $\frac{\partial}{\partial H_j}\left(H_j^{\psi_j}\right)=\psi_j H_j^{\psi_j-1}$. We can also notice $\prod_{k=1}^{n}H_k^{\psi_k}=G$, leaving us with the much nicer $\frac{\partial G}{\partial H_j}=G\,\frac{\psi_j H_j^{\psi_j-1}}{H_j^{\psi_j}}=G\,\frac{\psi_j}{H_j}=\frac{\psi_j}{H_j}G$.
It will be important later that this partial derivative is undefined when Hj=0, aka wherever any agent is receiving their least feasible utility.
Writing function arguments explicitly:
$$\frac{\partial G}{\partial H_j}(H(v),\psi)=\frac{\psi_j}{H_j(v)}\,G(H(v),\psi)$$
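As a quick sanity check, here's a small Python sketch comparing this formula against finite differences (the function G below is my own stand-in for the weighted geometric average, and the example numbers are arbitrary):

```python
# Check dG/dH_j = (psi_j / H_j) * G numerically.
def G(u, psi):
    """Weighted geometric average: the product of u_k ** psi_k."""
    return float(np.prod(np.asarray(u, float) ** np.asarray(psi, float)))

u, psi, eps = np.array([2.0, 1.0, 3.0]), np.array([0.2, 0.3, 0.5]), 1e-6
for j in range(len(u)):
    du = np.zeros_like(u); du[j] = eps
    numeric = (G(u + du, psi) - G(u, psi)) / eps
    analytic = psi[j] / u[j] * G(u, psi)
    assert abs(numeric - analytic) < 1e-4
```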
Putting These Terms Together
Let's start putting these together. We can start by breaking apart the two cases of ∂Hj∂vi, like this:
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\sum_{j=1}^{n}\frac{\partial G}{\partial H_j}(H(v),\psi)\,\frac{\partial H_j}{\partial v_i}(v)$$
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\left(\sum_{j=1}^{n-1}\frac{\partial G}{\partial H_j}(H(v),\psi)\,\frac{\partial H_j}{\partial v_i}(v)\right)+\frac{\partial G}{\partial H_n}(H(v),\psi)\,\frac{\partial H_n}{\partial v_i}(v)$$
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\left(\sum_{j=1}^{n-1}\frac{\partial G}{\partial H_j}(H(v),\psi)\,\delta^j_i\right)-\frac{\partial G}{\partial H_n}(H(v),\psi)\,\frac{\phi_i}{\phi_n}$$
Here's one reason why it's useful to know about the Kronecker delta δji: it filters out all but the i-th element of a sum: ∑jajδji=ai. When you're working in Einstein notation (which is great by the way), you just write it as ajδji=ai and you can think of the j's as "cancelling".
That leaves us with:
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\frac{\partial G}{\partial H_i}(H(v),\psi)-\frac{\partial G}{\partial H_n}(H(v),\psi)\,\frac{\phi_i}{\phi_n}$$
And we know ∂G∂Hi(H(v),ψ), so let's plug that in:
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\frac{\psi_i}{H_i(v)}\,G(H(v),\psi)-\frac{\psi_n}{H_n(v)}\,G(H(v),\psi)\,\frac{\phi_i}{\phi_n}$$
$$\frac{\partial (G\circ H)}{\partial v_i}(v,\psi)=\left(\frac{\psi_i}{H_i(v)}-\frac{\psi_n\,\phi_i}{H_n(v)\,\phi_n}\right)G(H(v),\psi)$$
And that is the family of n−1 equations that we want to all be 0 when v=q. (This causes H(v)=u=p.) We'll call this gradient ∇v(G∘H) to remind ourselves that this is the gradient of (G∘H)(v,ψ) where we're holding the weights ψ constant.
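Here's a rough Python sketch of this gradient (reusing the harsanyi_point and G helpers from the earlier sketches; the example numbers are mine), spot-checked against finite differences at an arbitrary interior point:

```python
# The analytic gradient of (G o H) with respect to v, following the formula above.
def grad_GH(v, p, phi, psi):
    u = harsanyi_point(v, p, phi)
    g = G(u, psi)
    return np.array([(psi[i] / u[i] - psi[-1] * phi[i] / (u[-1] * phi[-1])) * g
                     for i in range(len(v))])

# Spot-check against finite differences:
p, phi, psi = np.array([2.0, 1.0, 3.0]), np.array([0.2, 0.3, 0.5]), np.array([0.3, 0.3, 0.4])
v0, eps = np.array([1.5, 1.2]), 1e-6
for i in range(len(v0)):
    dv = np.zeros_like(v0); dv[i] = eps
    numeric = (G(harsanyi_point(v0 + dv, p, phi), psi) - G(harsanyi_point(v0, p, phi), psi)) / eps
    assert abs(numeric - grad_GH(v0, p, phi, psi)[i]) < 1e-4
```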
Solving for the Geometric Weights
Ok, now we can set v=q, ∂(G∘H)∂vi=0 and solve for ψi, for i<n:
$$\left(\frac{\psi_i}{H_i(q)}-\frac{\psi_n\,\phi_i}{H_n(q)\,\phi_n}\right)G(H(q),\psi)=0$$
$$\frac{\psi_i}{H_i(q)}\,G(H(q),\psi)-\frac{\psi_n\,\phi_i}{H_n(q)\,\phi_n}\,G(H(q),\psi)=0$$
$$\frac{\psi_i}{H_i(q)}\,G(H(q),\psi)=\frac{\psi_n\,\phi_i}{H_n(q)\,\phi_n}\,G(H(q),\psi)$$
$$\frac{\psi_i}{p_i}\,G(p,\psi)=\frac{\psi_n\,\phi_i}{p_n\,\phi_n}\,G(p,\psi)$$
$$\frac{\psi_i}{p_i}=\frac{\psi_n\,\phi_i}{p_n\,\phi_n}$$
$$\psi_i=\frac{\psi_n\,p_i\,\phi_i}{p_n\,\phi_n}$$
This is still a system of linear equations we need to solve, since each ψi for i<n depends on ψn, which in turn satisfies ψn=1−∑n−1i=1ψi. So let's solve it for ψn!
$$\psi_n=1-\sum_{i=1}^{n-1}\psi_i$$
$$\psi_n=1-\sum_{i=1}^{n-1}\frac{\psi_n\,p_i\,\phi_i}{p_n\,\phi_n}$$
$$\psi_n=1-\psi_n\sum_{i=1}^{n-1}\frac{p_i\,\phi_i}{p_n\,\phi_n}$$
$$\psi_n+\psi_n\sum_{i=1}^{n-1}\frac{p_i\,\phi_i}{p_n\,\phi_n}=1$$
$$\psi_n\left(1+\sum_{i=1}^{n-1}\frac{p_i\,\phi_i}{p_n\,\phi_n}\right)=1$$
$$\psi_n=\frac{1}{1+\sum_{i=1}^{n-1}\frac{p_i\,\phi_i}{p_n\,\phi_n}}$$
Remembering that $H(p,\phi)=\sum_{i=1}^{n}p_i\phi_i=\sum_{i=1}^{n-1}p_i\phi_i+p_n\phi_n$, we can notice that:
$$\frac{H(p,\phi)}{p_n\,\phi_n}=\frac{\sum_{i=1}^{n-1}p_i\phi_i+p_n\phi_n}{p_n\,\phi_n}$$
$$\frac{H(p,\phi)}{p_n\,\phi_n}=\sum_{i=1}^{n-1}\frac{p_i\,\phi_i}{p_n\,\phi_n}+1$$
This lets us simplify ψn down to
$$\psi_n=\frac{1}{\left(\frac{H(p,\phi)}{p_n\,\phi_n}\right)}$$
$$\psi_n=\frac{p_n\,\phi_n}{H(p,\phi)}$$
$$\psi_n=\frac{\phi_n\,p_n}{H(p,\phi)}$$
And now we can plug that back into the formula for all the other ψi!
$$\psi_i=\frac{\phi_n\,p_n}{H(p,\phi)}\cdot\frac{p_i\,\phi_i}{p_n\,\phi_n}$$
$$\psi_i=\frac{p_i\,\phi_i}{H(p,\phi)}$$
$$\psi_i=\frac{\phi_i\,p_i}{H(p,\phi)}$$
Well isn't that convenient! The formula for all ψi has the same form, and we can think of it like starting with the Harsanyi weights ϕ (which make p optimal according to H(_,ϕ), along with anything else with the same Harsanyi score H(p,ϕ)), and then tweaking them to get G(_,ψ) to target p in particular.
We can simplify our formula by noting that $H(p,\phi)=p\cdot\phi=\sum_{j=1}^{n}p_j\phi_j$:
$$\psi_i=\frac{p_i\,\phi_i}{p\cdot\phi}$$
To make the formula a little prettier, and to get some extra geometric insight, we can introduce the element-wise product ⊙, where (p⊙ϕ)i=piϕi.
$$\psi=\frac{p\odot\phi}{p\cdot\phi}$$
Here's a good opportunity to make sure our weights ψ sum up to 1:
$$\sum_{i=1}^{n}\psi_i=\sum_{i=1}^{n}\frac{p_i\,\phi_i}{p\cdot\phi}$$
$$\sum_{i=1}^{n}\psi_i=\frac{1}{p\cdot\phi}\sum_{i=1}^{n}p_i\,\phi_i$$
$$\sum_{i=1}^{n}\psi_i=\frac{1}{p\cdot\phi}\,(p\cdot\phi)$$
$$\sum_{i=1}^{n}\psi_i=1$$
Great! p⋅ϕ is acting like a normalization term, and we can think of p⊙ϕ as telling us which direction ψ points in. This vector of weights is then scaled to land on the hypersurface of weights that sum to 1, known as the standard simplex Δn, which we'll discuss more later.
We can also think of ϕ as a function ϕ:Rn×Cn→Rn denoted as ϕ(p,F) which returns the Harsanyi weights ϕ for p in the context of a compact, convex subset F⊂Rn. This is it, so let's make a new heading to find it later!
How to Calculate Weights for p
We now have a formula for $\psi:\mathbb{R}^n\times[0,1]^n\to[0,1]^n$, which we can write as
$$\psi(p,\phi(p,F))=\frac{p\odot\phi(p,F)}{p\cdot\phi(p,F)}$$
Or we can suppress function arguments and simply write
$$\psi=\frac{p\odot\phi}{p\cdot\phi}$$
Where $p\odot\phi\in\mathbb{R}^n$ is the element-wise product of p and ϕ, $(p\odot\phi)_i=p_i\phi_i$, and $p\cdot\phi\in\mathbb{R}$ is the dot product $p\cdot\phi=\sum_{j=1}^{n}p_j\phi_j$.
For a single component ψi, we have
$$\psi_i=\frac{p_i\,\phi_i}{p\cdot\phi}$$
Note that ψ isn't defined when p⋅ϕ=0.
Is this a problem? Not really! $p\cdot\phi=H(p,\phi)$, the Harsanyi aggregate utility of p when ϕ has been chosen to make p optimal under H(_,ϕ). When this is 0, it means the individual utilities must all be 0 and the entire feasible set F must be a single point at the origin. When that happens, any weights will make p optimal according to G(_,ψ) or H(_,ϕ). Feel free to use any convention that works for your application: if we're in a context where ϕ(0) is defined, we can inherit ψ(0)=ϕ(0), and if F is shrinking towards becoming a single point, we can use $\psi(0)=\lim_{p\to 0}\psi(p)$.
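As a sketch, the whole formula fits in a few lines of Python (the name geometric_weights is my own, and the convention used at p⋅ϕ=0 is just one of the options above):

```python
def geometric_weights(p, phi):
    """psi = (p (element-wise product) phi) / (p . phi)."""
    p, phi = np.asarray(p, float), np.asarray(phi, float)
    denom = p @ phi
    if denom == 0.0:          # F is a single point at the origin; any weights work,
        return phi.copy()     # so here we just inherit the Harsanyi weights.
    return p * phi / denom

print(geometric_weights([2.0, 1.0, 3.0], [0.2, 0.3, 0.5]))  # ~[0.182, 0.136, 0.682], sums to 1
```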
Checking Our Solution
Assuming we calculated ∇v(G∘H)(q,ψ) correctly, we can verify that these weights lead to ∇v(G∘H)(q,ψ)=0. This requires ∂(G∘H)∂vi(q,ψ)=0 for the first n−1 utilities, so let's check that:
$$\frac{\partial (G\circ H)}{\partial v_i}(q,\psi)=0$$
$$\left(\frac{\psi_i}{H_i(q)}-\frac{\psi_n\,\phi_i}{H_n(q)\,\phi_n}\right)G(H(q),\psi)=0$$
$$\left(\frac{\psi_i}{p_i}-\frac{\psi_n\,\phi_i}{p_n\,\phi_n}\right)G(p,\psi)=0$$
$$\frac{\psi_i}{p_i}\,G(p,\psi)=\frac{\psi_n\,\phi_i}{p_n\,\phi_n}\,G(p,\psi)$$
$$\frac{1}{p_i}\,\frac{p_i\,\phi_i}{p\cdot\phi}\,G(p,\psi)=\frac{\phi_i}{p_n\,\phi_n}\,\frac{p_n\,\phi_n}{p\cdot\phi}\,G(p,\psi)$$
$$\frac{\phi_i}{p\cdot\phi}\,G(p,\psi)=\frac{\phi_i\,\phi_n}{\phi_n\,(p\cdot\phi)}\,G(p,\psi)$$
$$\frac{\phi_i}{p\cdot\phi}\,G(p,\psi)=\frac{\phi_i}{p\cdot\phi}\,G(p,\psi)$$
Success! p is an optimum of G among H. But is it unique?
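Before tackling uniqueness, we can also confirm this numerically (reusing the earlier sketches and their example numbers): the gradient along H vanishes at q, and sliding away from q along H only lowers G.

```python
p, phi = np.array([2.0, 1.0, 3.0]), np.array([0.2, 0.3, 0.5])
psi = geometric_weights(p, phi)
q = p[:-1]
print(grad_GH(q, p, phi, psi))  # ~[0, 0]

# Moving away from q along the hyperplane strictly decreases G:
for dv in ([0.1, 0.0], [0.0, 0.1], [-0.1, 0.05]):
    assert G(harsanyi_point(q + np.array(dv), p, phi), psi) < G(p, psi)
```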
P Is the Unique Optimum of G When Weights Are Positive
Let's see how ψ and p influenced the outcome here, and keep track of the critical points which can make $\nabla_v(G\circ H)_i=0$ or undefined. These are the only points which can be extrema, and for each we need to check whether it's a minimum or a maximum among H. (G doesn't have any saddle points, and H doesn't have any boundaries of its own to worry about. Where H meets the boundary of G's domain, the $u_i=0$ axes, G is 0.)
For example, whenever any individual utility ui=0, ∂G∂Hi is undefined, which causes ∇v(G∘H)i to be undefined. But note that these will be minimal points of G, unless ψi=0. To find maximal points u∈H of G across H we need
$$\frac{\psi_i}{u_i}-\frac{\psi_n\,\phi_i}{u_n\,\phi_n}=0$$
$$\frac{\psi_i\,u_n\,\phi_n}{u_i\,u_n\,\phi_n}-\frac{u_i\,\psi_n\,\phi_i}{u_i\,u_n\,\phi_n}=0$$
$$\frac{\psi_i\,u_n\,\phi_n-u_i\,\psi_n\,\phi_i}{u_i\,u_n\,\phi_n}=0$$
If ui or un are 0, then ∇v(G∘H)i is undefined, and we'll check later if these can still be optimal. We assumed that the index n refers to an agent with ϕn>0, in order to prevent that exact case from breaking our entire solution.
$$\psi_i\,u_n\,\phi_n-u_i\,\psi_n\,\phi_i=0$$
$$u_n\,\phi_n\,\psi_i-u_i\,\phi_i\,\psi_n=0$$
If we were handed ψ from some external source, we could solve this equation to see which u∈H happened to be optimal. But we designed ψ, so let's see what we caused to be optimal.
$$u_n\,\phi_n\,\frac{p_i\,\phi_i}{p\cdot\phi}-u_i\,\phi_i\,\frac{p_n\,\phi_n}{p\cdot\phi}=0$$
$$\frac{u_n\,\phi_n\,p_i\,\phi_i-u_i\,\phi_i\,p_n\,\phi_n}{p\cdot\phi}=0$$
If $p\cdot\phi=0$ then $\nabla_v(G\circ H)_i$ is undefined. This only happens when F is a single point, in which case p is indeed the unique optimum of G.
$$u_n\,\phi_n\,p_i\,\phi_i-u_i\,\phi_i\,p_n\,\phi_n=0$$
$$u_i\,p_n\,\phi_i\,\phi_n=u_n\,p_i\,\phi_i\,\phi_n$$
Here we're going to be careful about which weights can be 0. We'll again use the fact that ϕn>0 to safely divide it from both sides.
$$u_i\,p_n\,\phi_i=u_n\,p_i\,\phi_i$$
Here again we can see that u=p solves this family of n-1 equations. And this is very exciting because this is our first maximum of G! Are there any other solutions?
Each of these equations is satisfied when one of the following is true:
$\phi_i=0$
$u_i\,p_n=u_n\,p_i$
In other words, assigning an agent 0 Harsanyi weight ϕi (and thus geometric weight ψi) can allow G(_,ψ) to have multiple optima among H, which can give it multiple optima among F.
What about when all geometric weights ψ are positive? Are there any other solutions to that second family of n-1 equations?
Having all positive geometric weights ψ implies having all positive Harsanyi weights ϕ, and all positive individual utilities p. It also implies that any optimum of G(_,ψ) will have all positive individual utilities u. This lets us freely divide by any of these terms, without needing to worry that we might be dividing by 0.
$$\frac{u_i}{u_n}=\frac{p_i}{p_n}$$
Since un and pn are both positive, we can think of un as a scaled version of pn.
$$u_n=\lambda\,p_n$$
How does this scalar influence the other terms in these equations?
$$\frac{u_i}{\lambda\,p_n}=\frac{p_i}{p_n}$$
$$u_i=\lambda\,p_i$$
$$u=\lambda\,p$$
This is the line through the origin and p, which only intersects H at p. (Since scaling p up or down changes H(p,ϕ).) So when all geometric weights ψ are positive, p is the unique optimum of G(_,ψ) among H!
When ϕi=0, ψi is also 0, so ui doesn't affect G(_,ψ). We can start with p, and then freely vary the utilities of any agent with 0 weight and remain optimal.
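Here's a crude numerical version of that uniqueness claim (again reusing the earlier sketches and their example numbers): sample random points on H and confirm that none of them score higher than p when all the weights are positive.

```python
rng = np.random.default_rng(0)
p, phi = np.array([2.0, 1.0, 3.0]), np.array([0.2, 0.3, 0.5])
psi = geometric_weights(p, phi)
best = G(p, psi)
for _ in range(10_000):
    v = rng.uniform(0.01, 4.0, size=2)   # an arbitrary window of the v-plane
    u = harsanyi_point(v, p, phi)
    if np.all(u > 0):                    # stay inside G's domain of non-negative utilities
        assert G(u, psi) <= best + 1e-12
```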
Interactive Implementations
We can also check our entire calculation, including those pages of calculus, by actually implementing our solution and seeing if it works! A graphing calculator is sufficient to check this in 2 and 3 dimensions. We can show all the points which satisfy G(s,ψ)=G(p,ψ) and they should trace out the contours of G, showing all the joint utilities which have the same G score as p.
In 2 dimensions, the graph looks like this:
The Harsanyi hyperplane is a line, and the contour curves are skewed hyperbolas.
As expected, taking p out of the positive quadrant violates our assumption that utilities are non-negative, leading to invalid settings for ψ. Similarly, if H has a positive slope, this violates our assumption that p is on the Pareto frontier. (A positive slope implies that we can make both players better off simultaneously. If we calculate ϕ anyway, a positive slope implies that ϕi is negative for the agent on the x axis.) This allows H to pass up through the hyperbola at a point other than p, but this never happens when ϕi≥0.
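If you'd rather script it than use a graphing calculator, here's a rough matplotlib sketch of the same 2-agent picture (the example p and ϕ are arbitrary, and it assumes the helpers from the earlier sketches):

```python
import matplotlib.pyplot as plt

# Two agents: plot the contour of G through p and the Harsanyi line H.
p, phi = np.array([2.0, 1.0]), np.array([0.3, 0.7])
psi = geometric_weights(p, phi)

xs = np.linspace(0.01, 4, 400)
X, Y = np.meshgrid(xs, xs)
plt.contour(X, Y, X**psi[0] * Y**psi[1], levels=[G(p, psi)])  # the contour through p
plt.plot(xs, (p @ phi - phi[0] * xs) / phi[1])                # the Harsanyi line H
plt.scatter(*p)                                               # p, where they touch
plt.gca().set_aspect("equal"); plt.xlim(0, 4); plt.ylim(0, 4)
plt.show()
```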
With 3 agents, the graph looks like this:
In 3 dimensions, the Harsanyi hyperplane is a plane, and the contour surfaces are skewed hyperboloids.
We can move p around on the hyperplane, and this changes ψ, which changes where the contour touches H. We can see that p always lies at the intersection of this contour curve and H, and this is a visual proof that p maximizes G(_,ψ) among H. And when H corresponds to all agents having positive Harsanyi weight ϕ, this intersection only happens at p!