tl;dr
Organizations that enforce rationality at the collective level can get very different voting outcomes than organizations that enforce rationality at the individual level, per known results in social choice theory. This has implications for real-world expert panels.
Here, "rationality" is logical consistency - it is possible for the majority of members to vote to reject a conclusion while also believing the necessary conditions to accept it hold, and vice-versa, even if they all independently evaluated the precepts and arrived at the conclusion logically. This arises because of how majority votes work.
This post summarizes Philip Pettit's 2002 paper outlining the issue and its implication for any deliberative democracy. It additionally summarizes Pettit and List's impossibility result on judgement aggregation rules from the Stanford Encyclopedia of Philosophy.
Introduction to the Discursive Dilemma
The Doctrinal Paradox
Three judges have to decide by majority whether a defendant broke a contract.
Legal doctrine dictates that if the following two premises are true:
(a) the defendant was contractually obliged not to do action X, and
(b) the defendant did action X,
then the conclusion is that the defendant broke his contract. Let's call this conclusion (c).
The judges get to decide which of the premises, (a) and (b), are true.
This is the simple conjunctive formula , and, taking one possible way the judges vote, we can construct a truth table for the same.
A | B | ||
---|---|---|---|
Judge 1 | True | True | True |
Judge 2 | True | False | False |
Judge 3 | False | True | False |
Majority | True | True | False |
In this permutation of votes, because the majority voted false for (c), the judges vote to pardon the defendant.
But here's the paradox: if the majority of the judges thought (a) was true and (b) was true, then it should have implied that the majority thought (c) was true.
You now have the unfortunate case where the majority voted true for all the precepts, but also rejected the conclusion.
This demonstrates collective inconsistency arising from individual consistency.
Generalizations
A similar paradox can be constructed for disjunctive propositions . It scales to arbitrary groups of people.
In fact, Pettit identifies the following minimal conditions:
a. there is a conclusion to be decided among a group of people by reference to a conjunction (or disjunction) of independent or separable premises—the conclusion will be endorsed if relevant prem- ises are endorsed, and otherwise it will be rejected;
b. each member of the group forms a judgment on each of the premises and a corresponding judgment on the conclusion;
c. each of the premises is supported by a majority of members but those majorities do not coincide with one another;
d. the intersection of those majorities will support the conclu- sion, and the others reject it, in view of a; and
e. the intersection of the majorities is only a minority in the group as a whole.
Implications
Although cute-sounding at first, this paradox underscores a very important issue regarding voting procedures in organizations in practice.
First, organizations must choose to vote directly for the proposition at stake, or to vote for individual conditions and have the decision inferred from that. As the foregoing paradox demonstrates, the outcomes are very different in both - in the first procedure, the judges voted to pardon, but, under the second procedure, the judges would have had to convict.
Second, it is not the case that organizations can always choose to vote directly and ignore preserving collective consistency, as is usually argued about electoral voting. Pettit identifies two examples of organizations where collective consistency is absolutely necessary:
-
A committee that has been tasked to evaluate the merits of a case and arrive at a recommendation accordingly. This includes awards panels, juries, trusts acting on external instructions, and expert policy bodies.
-
Political or activist movements that seek to hold ethically or philosophically consistent positions, where members may desert if the movement does not appear to be holistically consistent.
Because of this generalization, the doctrinal paradox has been dubbed the discursive dilemma, given that the issue at stake needn't have anything to do with legal doctrine at all.
Finally, it demonstrates that the root of this issue arises from the specific scheme of majority vote collection. It opens the door to thinking about alternate voting schemes that try to satisfy both individual and collective rationality.
An Impossibility Result For Collective Rationality In the Best Case
The more general problem of arriving at a way to collect votes on propositions is known as judgement aggregation.
Typically, we might want to be able to ensure that we can have a procedure for judgement aggregation that meets a few nice properties:
-
Plurality: There are no constraints on which available options people can vote for. In other words, if there are possible options, it is safe to assume that people may vote for any of the options.
-
Complete collective consistency: The judgment preserves collective consistency for all propositions. In other words, independent of the proposition, the majority judgement will always faithfully evaluate the proposition consistently.
-
Anonymity: If the proportion of people who voted stays the same, then the judgement stays the same, independent of who exactly voted. In other words, it doesn't matter which subpopulation voted, only the relative propotion.
-
Systematicity: This is fairly technical, but is a slightly stronger version of independence of irrelevant alternatives, which states that a preference between two options shouldn't be changed by introducing a third option. You might prefer to , but introducing shouldn't make you suddenly prefer to , although you may prefer to or to .
Unfortunately, List and Pettit jointly proved in 2002 that arriving at a scheme like this is impossible. In order to arrive at a collectively consistent scheme, we must sacrifice some of the other properties.
An Exercise for the Interested Reader
So how would you preserve collective consistency and individual consistency?
If you liked this post, you will love Amartya Sen's Collective Choice and Social Welfare. Originally written in 1970 and expanded in 2017, this is a thorough development of the many paradoxes in collective choice algorithms (voting schemes, ways to aggregate individual utility, and so on.)
My sense is the AI alignment community has not taken these sorts of results seriously. Preference aggregation is non-trivial, so "aligning" an AI to individual preferences means something much different than "aligning" an AI to societal preferences. Different equally-principled ways of aggregating preferences will give different results, which means that someone somewhere will not get what they want. Hence an AI agent will always have some type of politics if only by virtue of its preference aggregation method, and we should be investigating which types we prefer.
I thought Incomplete Contracting and AI Alignment addressed this situation nicely:
How do we make a choice about the "right" politics/preference aggregation method for an AI? I don't think there is or can be an a-priori answer here, so we need something else to break the tie. One strategy is to ask what the consequences of that each type of political system will be in the actual world, rather than an abstract behind-the-veil scenario. But more fundamentally I don't know that we can do better than what humans have always done, which is group discussions with the intention of coming to a workable agreement. Perhaps an AI agent can and should participate in such discussions. It's not just the formal process that makes voting systems work, but the perceived legitimacy and therefore good-faith participation of the people who will be governed by it, and this is what such discussion creates.
Yeah. I was more trying to argue that, compared to Bayesian ideas, voting doesn't win you all that much.