As the author, I think this has generally stood the test of time pretty well. There are various changes I'd make if I were doing a rewrite today; but overall, these are minor.
Aside from those generally-minor changes, I think that the key message of this piece remains important to the purpose of Less Wrong. That is to say: making collective decisions, or (equivalently) statements about collective values, is a tough problem; it's important for rationalists; and studying existing theory on this topic is useful.
Here are the specific changes I'd make if I were going to rewrite this today:
I think this post should be included in the best posts of 2018 collection. It does an excellent job of balancing several desirable qualities: it is very well written, being both clear and entertaining; it is informative and thorough; it is in the style of argument which is preferred on LessWrong, by which I mean makes use of both theory and intuition in the explanation.
This post adds to the greater conversation by displaying rationality of the kind we are pursuing directed at a big societal problem. A specific example of what I mean that distinguishes this post from an overview that any motivated poster might write is the inclusion of Warren Smith's results; Smith is a mathematician from an unrelated field who has no published work on the subject. But he had work anyway, and it was good work which the author himself expanded on, and now we get to benefit from it through this post. This puts me very much in mind of the fact that this community was primarily founded by an autodidact who was deeply influenced by a physicist writing about probability theory.
A word on one of our sacred taboos: in the beginning it was written that Politics is the Mindkiller, and so it was for years ...
I haven't mentioned futarchy ("markets for predictions, votes for values") at all here. Futarchy, of course, would not be classed as a voting method in my typology; it's something bigger than even than my concept of a voting system. I think that the main useful thing I can say about futarch is: if you're going to spend energy on it, you should spend some energy on understanding lower-level voting methods too.
ETA: I guess I think I can expand on that a bit. The issue is that when you embed one system inside another one, there's some inevitable bleed-over in terms of incentives. For this meta-reason, as well as for the simple reason of direct applicability, one lesson from voting theory that I think would apply to futarchy is that "patching a problem can make it worse, because strategic outcomes can be the 'opposite' of unstrategic ones." That's an intuition that for me was hard-won, and I wouldn't expect it to be easy for you to learn it just by hearing it. If that's true and you still want to learn it, spend some time thinking about honest and strategic outcomes in various voting methods in the five levels of pathology I mentioned above. The playable exploration I mention at the end will substantially focus on these ideas, in a way I hope will be visually and interactively easy to digest.
Most elections aren't big. Most elections are about questions like a random group electing their chairman. It's a lot easier to convince a random group to use a different voting system than to change the way voting is done on a national election.
I think the most likely way to create change in the national way of voting is to first change the way smaller groups run their elections.
Your post doesn't really analyses the desirable features of an electoral system and I think that's a common failure mode for people who think theoretically about voting systems without any practical involvement into implementing them.
Running elections costs time and given that most groups need to elect their chairman every year minimizing the amount of time it takes to make that election is desirable.
The role of an election is not just about finding the optimal candidate but about establishing that the candidate was picked in a trustworthy way. This means that "our tech person runs a server and we trust the results of the voting software" is not a satisfactory strategy. Complex counting rules also make fraud easier.
Exciting stuff! It would not shock me in the least for this to be a common point-of-reference for various rationalist experiments (i.e. houses, startups, etc). I have a bunch of questions:
1) Do these voting systems have anything to say about voting for referendums, or is the literature constrained to candidates? Is there any sense in which a batch of competing referendums would not be treated the same as single-winner races?
2) You mention the practical concerns of laypeople being able to grasp the process and counting the votes a few times - is there a body of research that focuses on these elements, or is this work being done by the Center for Election Science?
3) You mention the use of a Condorcet method being used by Debian in internal voting - is there a sense of which methods are suitable for use in organizational voting, as distinct from political voting, ie some of them are unusually sensitive to scale? Further, how is deployment in an organization weighed as evidence regarding deployment in elections?
4) Can you recommend a source which discusses voting theory from the perspective of information aggregation, in the same way as has been done for markets?
I might’ve missed this, as I didn’t read some of the sections of this post quite as closely as I could have, but here’s a question which, I think, is critical:
Has there been any research aimed at determining which voting method is easiest to understand (for people who aren’t specialists in the field[1], or, in other words, for the general public)? (And, by extension: how hard is each method to understand? How do they compare, in this regard?)
Comprehensibility by the general public seems to me to be a sine qua non of any voting method we’re to consider for adoption into the political process.
[1] Or, as we might say, “voting theory nerds”.
I've been moderately hard on IRV, and its supporters, in the above. I want to state here for the record that though IRV is my least-favorite serious* reform proposal, I think that it would solve a substantial fraction of the voting-system dysfunction of FPTP; that if IRV were more of a focus of this essay than it is, there are other decent pro-IRV arguments I could include; and that there are IRV supporters whose intellect and judgement I respect even as I disagree with them on this.
*That excludes Borda. #SorryNotSaari
I've curated this post for these reasons:
My biggest hesitation(s) with curating this:
Overall though the opinionated style of the writi...
I'm quite interested in voting systems, but I was surprised to discover that the general consensus is that score beats approval! I checked it out and it seems to be a robust finding that in real life people understand & are happier with score, but this surprised me.
I'd think that since there are so many options for score, it'd be a bit overwhelming and hard to figure out how to optimize. Whereas with approval it's basically "vote for the minor candidates you like better than the major ones; and also vote for your least unfavorite major cand...
There are two things I would add to this:
The other big disadvantage of Borda is teaming, where nominating more candidates makes a party more likely to win (opposite of vote-splitting).
It seems that to me the advantage of party-less systems in multi-winner elections is that, if your voting system succeeds so well that parties are no longer playing such a big role, you don't have to later change it to be party-less to complete the transition. :)
...also I was going to ask a question about rated-runoff methods (i.e., did anyone consider the most obvious...
Thanks for writing this! Very informative.
I saw a paper (which I don't remember enough about to find, unfortunately) in which a bunch of voting theorists got together and voted on the best voting method, using approval voting (I think?), and the winner was approval voting. I wonder if anyone also saw this and saved it so they can tell me how to find it?
...There's another way that Arrow's theorem was an important foundation, particularly for rationalists. He was explicitly thinking about voting methods not just as real-world ways of electing politicians, but as theoretical possibilities for reconciling values. In this more philosophical sense, Arrow's theorem says something depressing about morality: if morality is to be based on (potentially revealed) preferences rather than interpersonal comparison of (subjective) utilities, it cannot simply be a democratic matter; "the greatest good for the greatest numbe
Ah, right, here's the rub:
A generalization of Gibbard–Satterthwaite is Gibbard's Theorem, which says that whether ranked or not no mechanism can
So the crux of the issue isn't ordinal vs cardinal (preferences vs utility, ranked vs scored). Rather, the crux of the issue is strategy-proofness: Arrow and related theorems are about the difficulty of strategy-proof implementations of the utillitarian ideal.
I'm curious about the question of wasted votes. In the link you provide to explain PLACE, it talks about wasted votes a lot. My understanding is this:
1) Under the loose definition, any vote not for the winning candidate is wasted.
2) Under the strict definition, any vote not needed to win is also wasted.
This is deeply unintuitive to me. Just inferring from the above, this suggests that what should be done is to calculate the least-worst smallest-majority candidate, and then everyone else stays home. The direct implication is that votes are intrinsicall...
What are the improved Condorcet methods you're thinking of? I do recall seeing that Ranked Pairs and Schulze have very favorable strategy-backfire to strategy-works ratios in simulations, but I don't know what you're thinking of for sure. If those are it, then if you approach it right, Schulze isn't that hard to work through and demonstrate an election result (wikipedia now has an example).
I nominate this post because it does a good job of succinctly describing different ways to approach one of the core problems of civilization, which is to say choosing policy (or choosing policy choosers). A lot of our activity here is about avoiding catastrophe somehow; we have spent comparatively little time on big-picture things that are incremental improvements.
Anecdotally, this post did a good job of jogging loose a funk I was in regarding the process of politics. Politics is a notoriously *ugh field* kind of endeavor in the broader culture, and a part...
Not the best place to put this comment, but there's a confusing mistake on the PLACE FAQ where the pie candidate shows a voting option for "Other Ice Cream Candidate" instead of "Other Pie Candidate."
Does anyone have an electorama account to remedy that?
Due to its complexity, [SODA] is probably not a practical proposal, though.
I don't follow, how is that complexity a problem? I googled SODA, stumbled onto a sample ballot here: https://wiki.electorama.com/wiki/SODA_voting_(Simple_Optionally-Delegated_Approval)#Sample_Ballot it doesn't seem like it even matters if voters fully understand the (very succinct) instructions, the thickest voters will just treat it like an approval voting system, which is pretty adequate, isn't it?
Having the candidates' delegations be in there wont really impo...
Many of these methods can (I think) select multiple winners. For large elections, this is usually pretty unlikely, but still possible. What's your preferred method of dealing with that possibility? And have you looked into maximal lotteries? http://dss.in.tum.de/files/brandt-research/fishburn_slides.pdf
(I had several hundred karma on the old Less Wrong site under a nym. Is there both a reason to want to reclaim that and a way to do so?)
It seems like a lot of the challenges in designing a voting system stem from wanting to give each geographic region "their" representative, while not letting people "throw away" their vote.
If we abandon the first part (which is totally reasonable here in the 21st century, with the takeover of digital communication and virtual communities), there is a clean solution to the second part.
Specifically, remove all the names from the ballot, and have people only vote for their preferred party, then allow each party that gets more than [small]% of the vote to desi...
Kenneth Arrow, proved that the problem that Condorcet (and Llull) had seen was in fact a fundamental issue with any ranked voting method. He posed 3 basic "fairness criteria" and showed that no ranked method can meet all of them:
Ranked unanimity, Independence of irrelevant alternatives, Non-dictatorial
I've been reading up on voting theory recently and Arrow's result - that the only voting system which produces a definite transitive preference ranking, that will pick the unanimous answer if one exists, and doesn't change dependi...
I would weakly support this post's inclusion in the Best-of-2018 Review. It's a solid exposition of an important topic, though not a topic that is core to this community.
I think voting theory is pretty useful, and this is the best introduction I know of. I've linked it to a bunch of people in the last two years who were interested in getting a basic overview over voting theory, and it seemed to broadly be well-received.
his last name is last name is great source of puns
Typo: his last name is a great source of puns.
The greater the number of voters the less time it makes sense as an individual to spend researching the options. It seems a good first step would be to randomly reduce the number of voters to an amount that would maximize the overall quality of the decision. Any thoughts on this?
It seems that to lift up one candidate to the top of our ballots we are implicitly expressing a judgement that they're better than every other candidate in the contest. Problem is, most of us know nothing about most of the candidates in most contests. We shouldn't be expressing things like "A > B > all others". We should prefer to say "A > B" and leave leave every other comparison grey. Let other voters, who actually know about C and D and E fill in the rest of the picture. There might be candidates who are better tha...
Re SODA: The setup appears to actively encourage candidates to commit to a preference order. Naively, I would prefer a modification along the following lines; could you comment?
(1) Candidates may make promises about their preference order among other candidates; but this is not enforced (just like ordinary pre-election promises). (2) The elimination phase runs over several weeks. In this time, candidates may choose to drop out and redistribute their delegated votes. But mainly, the expected drop-outs will negotiate with expected survivors, in order to get...
Thanks for this introduction to voting systems! I always wanted to know more about them, but never actually got to study it more - so usually I just suggested approval voting as a reasonable default. (Hence we use it in CZEA)
Btw what do you think of this voting method? https://web.d21.me/en/ (pushed by somewhat eccentric mathematician-trader-billionaire Janecek)
(Btw I would suggest this post gets to curated)
Dear Jameson, as you say the theme is extremely important, but I miss more about Storable Votes: one period Arrovian results deeply change in dynamic voting scenarios. I have recently written two articles about this: one has been published in Journal of Economic Interaction and Coordination, the other is still a pre-print:
https://www.lesswrong.com/posts/5wqFoHBBgpdHeCLS6/storable-votes-with-a-pay-as-you-win-mechanism-a
I also suggest you to read the Casella and Mace review about “vote trading” (there a is Journal version, here you have the pre-print):
https:...
Hi Jameson, brilliant post. I have some questions regarding the candidates in an election :
1. What are the implicit and explicit assumptions we make about candidates?
2. Would it be possible to incentivize preferable behavior from voters by making the candidates play an anonymous game prior to the election? For example lets say we have 100 candidates that we want to narrow to 10. If we, for example, made the 100 candidates answer 5 pertinent questions in a digital form and had 1000 voters rank order(or other voting function of choice) the responses of 10 random candidates, we could take the top 10 performing candidates and then have a non-anonymous round of voting.
Just found this and I have question and comment.
Q. Here (NZ), local body elections are usually STV for both mayor and councillors. It was seen as a way to get around vote-splitting leading to an unfavoured winner largely. There is always idle tea-time discussions about strategic voting without anyone getting sufficient interested to really analyse it. Your comment about it strategic voting in preference system revived my curiousity. How do you game an STV system? The best we could manage is that it seem best to rank all the candidates, rather than just ran...
"Because LNH is incompatible with FBC, it is impossible to completely avoid the chicken dilemma without creating a center squeeze vulnerability, but systems like STAR voting or 3-2-1 minimize it."
Unfortunately, the main weakness of STAR voting is in the automatic runoff system it introduces. Automatic runoff systems (of which IRV is the most known) sacrifice numerous benefits and exhibit various pathologies in the name of speed. STAR voting, while far better than current plurality systems, is unfortunately still introducing an unnecessarily layer of compli...
Hey
Could you dive into the strategic approach challenges you see with quadratic voting a bit further? The way I see it, a decentralised blockchain which rewards consensus can ensure honesty using simple game-theoretic principles. Quadratic voting is especially useful in reputation scoring, where both the magnitude and the diversity of the votes are important to ensure robustness.
More specifically, I'm referring to the quadratic voting method proposed in the Capital Restricted Liberal Radicalism paper. I'm assuming you're referring to the sam...
This post introduced me to a whole new way of thinking about institutional/agency design. The most merit, for me, was pointing out this field existed. The subject is close to one of the core subjects of my thoughts, which is how to design institutions that align selfish behavior with altruistic outcomes on different hierarchical levels, from the international, to the cultural, national, communal, relational, and as far down as the subagent level.
Hello,
Thank you for that very insightful article.
Sorry to post a question so long after its publication, I hope it will reach you. I'm wondering how all these methods stands for anonimity of voters, I'm explaining: in most election, there is a lot of candidats (at leat, it's the case in France, where for example the last 1st round of Presidential election had 11 voters. That leads to a lot of unique permutation (for example, with Approval, we can encode 2^11 = 2048 states, for a fully sorted algo we get 11!, etc). In France, votes are orga...
Another view of voting is where an objective opinion or belief is being formed about ANYTHING that can be named. It might be a belief about utility where the utility can only be seen subjectively but an objective value would be useful in planning, for instance. Another example is determination of what a given set of users might buy given prior page views, or what the value of a given piece of intelligence is to a given audience at a given time and related to a given topic. Such a voting structure is needed in crowd sourcing and in 'training'/...
>>>> Borda: "My method is made for honest men!"
This is why it would miserably fail. We live in a fallen world. We would like to live in a rational one. We do not. We will not until irrational behavior is punished in some horrible fashion. Preferably punished by the Darwin Gods w/o any human intervention that irrational monkeys with car keys can blame.
I see two problems conflated here: (1) how to combine many individual utilities, and (2) how to efficiently extract information about those utilities from each voter.
In this view, a particular voting system is just a way to collect information to build an accurate model of the voter's utility, which would then be used in some simple way. Of course, actually reducing something like ranked voting to utility maximization would be hard. But that's not a point against this perspective.
Problem (1) is presumably solved by simple utilitarianism though th...
What is voting theory?
Voting theory, also called social choice theory, is the study of the design and evaulation of democratic voting methods (that's the activists' word; game theorists call them "voting mechanisms", engineers call them "electoral algorithms", and political scientists say "electoral formulas"). In other words, for a given list of candidates and voters, a voting method specifies a set of valid ways to fill out a ballot, and, given a valid ballot from each voter, produces an outcome.
(An "electoral system" includes a voting method, but also other implementation details, such as how the candidates and voters are validated, how often elections happen and for what offices, etc. "Voting system" is an ambiguous term that can refer to a full electoral system, just to the voting method, or even to the machinery for counting votes.)
Most voting theory limits itself to studying "democratic" voting methods. That typically has both empirical and normative implications. Empirically, "democratic" means:
In order to be considered "democratic", voting methods generally should meet various normative criteria as well. There are many possible such criteria, and on many of them theorists do not agree; but in general they do agree on this minimal set:
Why is voting theory important for rationalists?
First off, because democratic processes in the real world are important loci of power. That means that it's useful to understand the dynamics of the voting methods used in such real-world elections.
Second, because these real-world democratic processes have all been created and/or evolved in the past, and so there are likely to be opportunities to replace, reform, or add to them in the future. If you want to make political change of any kind over a medium-to-long time horizon, these systemic reforms should probably be part of your agenda. The fact is that FPTP, the voting method we use in most of the English-speaking world, is absolutely horrible, and there is reason to believe that reforming it would substantially (though not of course completely) alleviate much political dysfunction and suffering.
Third, because understanding social choice theory helps clarify ideas about how it's possible and/or desirable to resolve value disputes between multiple agents. For instance, if you believe that superintelligences should perform a "values handshake" when meeting, replacing each of their individual value functions by some common one so as to avoid the dead weight loss of a conflict, then social choice theory suggests both questions and answers about what that might look like. (Note that the ethical and practical importance of such considerations is not at all limited to "post-singularity" examples like that one.)
In fact, on that third point: my own ideas of ethics and of fun theory are deeply informed by my decades of interest in voting theory. To simplify into a few words my complex thoughts on this, I believe that voting theory elucidates "ethical incompleteness" (that is, that it's possible to put world-states into ethical preference order partially but not fully) and that this incompleteness is a good thing because it leaves room for fun even in an ethically unsurpassed world.
What are the branches of voting theory?
Generally, you can divide voting methods up into "single-winner" and "multi-winner". Single-winner methods are useful for electing offices like president, governor, and mayor. Multi-winner methods are useful for dividing up some finite, but to some extent divisible, resource, such as voting power in a legislature, between various options. Multi-winner methods can be further subdivided into seat-based (where a set of similar "seats" are assigned one winner each) or weighted (where each candidate can be given a different fraction of the voting power).
What are the basics of single-winner voting theory?
(Note: Some readers may wish to skip to the summary below, or to read the later section on multi-winner theory and proportional representation first. Either is valid.)
Some of the earliest known work in voting theory was by Ramon Llull before his death in 1315, but most of that was lost until recently. Perhaps a better place to start would be in the French Academy in the late 1700s; this allows us to couch it as a debate (American Chopper meme?) between Jean-Charles de Borda and Nicolas de Condorcet.
In my view, Borda was the clear loser there. And most voting theorists today agree with me. The one exception is the mathematician Donald Saari, enamored with the mathematical symmetry of the Borda count. This is totally worth mentioning because his last name is a great source of puns.
But Condorcet soon realized there was a problem with his proposal too: it's possible for A to beat B pairwise, and B to beat C, while C still beats A. That is, pairwise victories can be cyclical, not transitive. Naturally speaking, this is rare; but if there's a decision between A and B, the voters who favor B might have the power to artificially create a "poison pill" amendment C which can beat A and then lose to B.
How would a Condorcet cycle occur? Imagine the following election:
1: A>B>C
1: B>C>A
1: C>A>B
(This notation means that there's 1 voter of each of three types, and that the first voter prefers A over B over C.) In this election, A beats B by 2 to 1, and similarly B beats C and C beats A.
Fast-forward to 1950, when theorists at the RAND corporation were inventing game theory in order to reason about the possibility of nuclear war. One such scientist, Kenneth Arrow, proved that the problem that Condorcet (and Llull) had seen was in fact a fundamental issue with any ranked voting method. He posed 3 basic "fairness criteria" and showed that no ranked method can meet all of them:
Arrow's result was important in and of itself; intuitively, most people might have guessed that a ranked voting method could be fair in all those ways. But even more important than the specific result was the idea of an impossibility proof for voting.
Using this idea, it wasn't long until Gibbard and Satterthwaite independently came up with a follow-up theorem, showing that no voting system (ranked or otherwise) could possibly avoid creating strategic incentives for some voters in some situations. That is to say, there is no non-dictatorial voting system for more than two possible outcomes and more than two voters in which every voter has a single "honest" ballot that depends only on their own feelings about the candidates, such that they can't sometimes get a better result by casting a ballot that isn't their "honest" one.
There's another way that Arrow's theorem was an important foundation, particularly for rationalists. He was explicitly thinking about voting methods not just as real-world ways of electing politicians, but as theoretical possibilities for reconciling values. In this more philosophical sense, Arrow's theorem says something depressing about morality: if morality is to be based on (potentially revealed) preferences rather than interpersonal comparison of (subjective) utilities, it cannot simply be a democratic matter; "the greatest good for the greatest number" doesn't work without inherently-subjective comparisons of goodness. Amartya Sen continued exploring the philosophical implications of voting theory, showing that the idea of "private autonomy" is incompatible with Pareto efficiency.
Now, in discussing Arrow's theorem, I've said several times that it only applies to "ranked" voting systems. What does that mean? "Ranked" (also sometimes termed "ordinal" or "preferential") systems are those where valid ballots consist of nothing besides a transitive preferential ordering of the candidates (partial or complete). That is, you can say that you prefer A over B or B over A (or in some cases, that you like both of them equally), but you cannot say how strong each preference is, or provide other information that's used to choose a winner. In Arrow's view, the voting method is then responsible for ordering the candidates, picking not just a winner but a second place etc. Since neutrality wasn't one of Arrow's criteria, ties can be broken arbitrarily.
This excludes an important class of voting methods from consideration: those I'd call rated (or graded or evaluational), where you as a voter can give information about strength of preference. Arrow consciously excluded those methods because he believed (as Gibbard and Satterthwaite later confirmed) that they'd inevitably be subject to strategic voting. But since ranked voting systems are also inevitably subject to strategy, that isn't necessarily a good reason. In any case, Arrow's choice to ignore such systems set a trend; it wasn't until approval voting was reinvented around 1980 and score voting around 2000 that rated methods came into their own. Personally, for reasons I'll explain further below, I tend to prefer rated systems over purely ranked ones, so I think that Arrow's initial neglect of ranked methods got the field off on a bit of a wrong foot.
And there's another way I feel that Arrow set us off in the wrong direction. His idea of reasoning axiomatically about voting methods was brilliant, but ultimately, I think the field has been too focused on this axiomatic "Arrovian" paradigm, where the entire goal is to prove certain criteria can be met by some specific voting method, or cannot be met by any method. Since it's impossible to meet all desirable criteria in all cases, I'd rather look at things in a more probabilistic and quantitative way: how often and how badly does a given system fail desirable criteria.
The person I consider to be the founder of this latter, "statistical" paradigm for evaluating voting methods is Warren Smith. Now, where Kenneth Arrow won the Nobel Prize, Warren Smith has to my knowledge never managed to publish a paper in a peer-reviewed journal. He's a smart and creative mathematician, but... let's just say, not exemplary for his social graces. In particular, he's not reluctant to opine in varied fields of politics where he lacks obvious credentials. So there's plenty in the academic world who'd just dismiss him as a crackpot, if they are even aware of his existence. This is unfortunate, because his work on voting theory is groundbreaking.
In his 2000 paper on "Range Voting" (what we'd now call Score Voting), he performed systematic utilitarian Monte-Carlo evaluation of a wide range of voting systems under a wide range of assumptions about how voters vote. In other words, in each of his simulations, he assumed certain numbers of candidates and of voters, as well as a statistical model for voter utilities and a strategy model for voters. Using the statistical model, he assigned each virtual voter a utility for each candidate; using the strategy model, he turned those utilities into a ballot in each voting method; and then he measured the total utility of the winning candidate, as compared to that of the highest-total-utility candidate in the race. Nowadays the name for the difference between these numbers, scaled so that the latter would be 100% and the average randomly-selected candidate would be 0%, is "Voter Satisfaction Efficiency" (VSE).
Smith wasn't the first to do something like this. But he was certainly the first to do it so systematically, across various voting methods, utility models, and strategic models. Because he did such a sensitivity analysis across utility and strategic models, he was able to see which voting methods consistently outperformed others, almost regardless of the specifics of the models he used. In particular, score voting, in which each voter gives each candidate a numerical score from a certain range (say, 0 to 100) and the highest total score wins, was almost always on top, while FPTP was almost always near the bottom.
More recently, I've done further work on VSE, using more-realistic voter and strategy models than what Smith had, and adding a variety of "media" models to allow varying the information on which the virtual voters base their strategizing. While this work confirmed many of Smith's results — for instance, I still consistently find that FPTP is lower than IRV is lower than approval is lower than score — it has unseated score voting as the undisputed highest-VSE method. Other methods with better strategy resistance can end up doing better than score.
Of course, something else happened in the year 2000 that was important to the field of single-winner voting theory: the Bush-Gore election, in which Bush won the state of Florida and thus the presidency of the USA by a microscopic margin of about 500 votes. Along with the many "electoral system" irregularities in the Florida election (a mass purge of the voter rolls of those with the same name as known felons, a confusing ballot design in Palm Beach, antiquated punch-card ballots with difficult-to-interpret "hanging chads", etc.) was one important "voting method" irregularity: the fact that Ralph Nader, a candidate whom most considered to be ideologically closer to Gore than to Bush, got far more votes than the margin between the two, leading many to argue that under almost any alternative voting method, Gore would have won. This, understandably, increased many people's interest in voting theory and voting reform. Like Smith, many other amateurs began to make worthwhile progress in various ways, progress which was often not well covered in the academic literature.
In the years since, substantial progress has been made. But we activists for voting reform still haven't managed to use our common hatred for FPTP to unite behind a common proposal. (The irony that our expertise in methods for reconciling different priorities into a common purpose hasn't let us do so in our own field is not lost on us.)
In my opinion, aside from the utilitarian perspective offered by VSE, the key to evaluating voting methods is an understanding of strategic voting; this is what I'd call the "mechanism design" perspective. I'd say that there are 5 common "anti-patterns" that voting methods can fall into; either where voting strategy can lead to pathological results, or vice versa. I'd pose them as a series of 5 increasingly-difficult hurdles for a voting method to pass. Because the earlier hurdles deal with situations that are more common or more serious, I'd say that if a method trips on an earlier hurdle, it doesn't much matter that it could have passed a later hurdle. Here they are:
Note that there's a general pattern in the pathologies above: the outcome of honest voting and that of strategic voting are in some sense polar opposites. For instance, under honest voting, vote-splitting destabilizes major parties; but under strategic voting, it makes their status unassailable. This is a common occurrence in voting theory. And it's a reason that naive attempts to "fix" a problem in a voting system by adding rules can actually make the original problem worse.
(I wrote a separate article with further discussion of these pathologies)
Here are a few of the various single-winner voting systems people favor, and a few (biased) words about the groups that favor them:
FPTP (aka plurality voting, or choose-one single-winner): Universally reviled by voting theorists, this is still favored by various groups who like the status quo in countries like the US, Canada, and the UK. In particular, incumbent politicians and lobbyists tend to be at best skeptical and at worst outright reactionary in response to reformers.
IRV (Instant runoff voting), aka Alternative Vote or RCV (Ranked Choice Voting... I hate that name, which deliberately appropriates the entire "ranked" category for this one specific method): This is a ranked system where to start out with, only first-choice votes are tallied. To find the winner, you successively eliminate the last-place candidate, transferring those votes to their next surviving preference (if any), until some candidate has a majority of the votes remaining. It's supported by FairVote, the largest electoral reform nonprofit in the US, which grew out of the movement for STV proportional representation (see the multi-winner section below for more details). IRV supporters tend to think that discussing its theoretical characteristics is a waste of time, since it's so obvious that FPTP is bad and since IRV is the reform proposal with by far the longest track record and most well-developed movement behind it. Insofar as they do consider theory, they favor the "later-no-harm" criterion, and prefer to ignore things like the favorite betrayal criterion, summability, or spoiled ballots. They also don't talk about the failed Alternative Vote referendum in the UK.
Approval voting: This is the system where voters can approve (or not) each candidate, and the candidate approved by the most voters wins. Because of its simplicity, it's something of a "Schelling point" for reformers of various stripes; that is, a natural point of agreement as an initial reform for those who don't agree on which method would be an ideal end state. This method was used in Greek elections from about 1860-1920, but was not "invented" as a subject of voting theory until the late 70s by Brams and Fishburn. It can be seen as a simplistic special case of many other voting methods, in particular score voting, so it does well on Warren Smith's utilitarian measures, and fans of his work tend to support it. This is the system promoted by the Center for Election Science (electology.org), a voting reform nonprofit that was founded in 2012 by people frustrated with FairVote's anti-voting-theory tendencies. (Full disclosure: I'm on the board of the CES, which is growing substantially this year due to a significant grant by the Open Philanthropy Project. Thanks!)
Condorcet methods: These are methods that are guaranteed to elect a pairwise beats-all winner (Condorcet winner) if one exists. Supported by people like Erik Maskin (a Nobel prize winner in economics here at Harvard; brilliant, but seemingly out of touch with the non-academic work on voting methods), and Markus Schulze (a capable self-promoter who invented a specific Condorcet method and has gotten groups like Debian to use it in their internal voting). In my view, these methods give good outcomes, but the complications of resolving spoil their theoretical cleanness, while the difficulty of reading a matrix makes presenting results in an easy-to-grasp form basically impossible. So I personally wouldn't recommend these methods for real-world adoption in most cases. Recent work in "improved" Condorcet methods has showed that these methods can be made good at avoiding the chicken dilemma, but I would hate to try to explain that work to a layperson.
Bucklin methods (aka median-based methods; especially, Majority Judgment): Based on choosing a winner with the highest median rating, just as score voting is based on choosing one with the highest average rating. Because medians are more robust to outliers than averages, median methods are more robust to strategy than score. Supported by French researchers Balinski and Laraki, these methods have an interesting history in the progressive-era USA. Their VSE is not outstanding though; better than IRV, plurality, and Borda, but not as good as most other methods.
Delegation-based methods, especially SODA (simple optionally-delegated approval): It turns out that this kind of method can actually do the impossible and "avoid the Gibbard-Satterthwaite theorem in practice". The key words there are "in practice" — the proof relies on a domain restriction, in which voters honest preferences all agree with their favorite candidate, and these preference orders are non-cyclical, and voters mutually know each other to be rational. Still, this is the only voting system I know of that's 100% strategy free (including chicken dilemma) in even such a limited domain. (The proof of this is based on complicated arguments about convexity in high-dimensional space, so Saari, it doesn't fit here.) Due to its complexity, this is probably not a practical proposal, though.
Rated runoff methods (in particular STAR and 3-2-1): These are methods where rated ballots are used to reduce the field to two candidates, who are then compared pairwise using those same ballots. They combine the VSE advantages of score or approval with extra resistance to the chicken dilemma. These are currently my own favorites as ultimate goals for practical reform, though I still support approval as the first step.
Quadratic voting: Unlike all the methods above, this is based on the universal solvent of mechanism design: money (or other finite transferrable resources). Voters can buy votes, and the cost for n votes is proportional to n². This has some excellent characteristics with honest voters, and so I've seen that various rationalists think it's a good idea; but in my opinion, it's got irresolvable problems with coordinated strategies. I realize that there are responses to these objections, but as far as I can tell every problem you fix with this idea leads to two more.
TL; DR?
What are the basics of multi-winner voting theory?
Multi-winner voting theory originated under parliamentary systems, where theorists wanted a system to guarantee that seats in a legislature would be awarded in proportion to votes. This is known as proportional representation (PR, prop-rep, or #PropRep). Early theorists include Henry Droop and Charles Dodgson (Lewis Carroll). We should also recognize Thomas Jefferson and Daniel Webster's work on the related problem of apportioning congressional seats across states.
Because there are a number of seats to allocate, it's generally easier to get a good answer to this problem than in the case of single-winner voting. It's especially easy in the case where we're allowed to give winners different voting weights; in that case, a simple chain of delegated voting weight guarantees perfect proportionality. (This idea has been known by many names: Dodgson's method, asset voting, delegated proxy, liquid democracy, etc. There are still some details to work out if there is to be a lower bound on final voting weight, but generally it's not hard to find ways to resolve those.)
When seats are constrained to be equally-weighted, there is inevitably an element of rounding error in proportionality. Generally, for each kind of method, there are two main versions: those that tend to round towards smaller parties (Sainte-Laguë, Webster, Hare, etc.) and those that tend to round towards larger ones (D'Hondt, Jefferson, Droop, etc.).
Most abstract proportional voting methods can be considered as greedy methods to optimize some outcome measure. Non-greedy methods exist, but algorithms for finding non-greedy optima are often considered too complex for use in public elections. (I believe that these problems are NP-complete in many cases, but fast algorithms to find provably-optimal outcomes in all practical cases usually exist. But most people don't want to trust voting to algorithms that nobody they know actually understands.)
Basically, the outcome measures being implicitly optimized are either "least remainder" (as in STV, single transferable vote), or "least squares" (not used by any real-world system, but proposed in Sweden in the 1890s by Thiele and Phragmen). STV's greedy algorithm is based on elimination, which can lead to problems, as with IRV's center-squeeze. A better solution, akin to Bucklin/median methods in the single-winner case, is BTV (Bucklin transferable vote). But the difference is probably not a big enough deal to overcome STV's advantage in terms of real-world track record.
Both STV and BTV are methods that rely on reweighting ballots when they help elect a winner. There are various reweighting formulas that each lead to proportionality in the case of pure partisan voting. This leads to an explosion of possible voting methods, all theoretically reasonable.
Because the theoretical pros and cons of various multi-winner methods are much smaller than those of single-winner ones, the debate tends to focus on practical aspects that are important politically but that a mathematician would consider trivial or ad hoc. Among these are:
Prop-rep methods would solve the problem of gerrymandering in the US. I believe that PLACE is the most viable proposal in that regard: maintains the locality and ballot simplicity of the current system, is relatively non-disruptive to incumbents, and maximizes breadth of voter choice to help increase turnout.
Oh, I should also probably mention that I was the main designer, in collaboration with dozens of commenters on the website Making Light, of the proportional voting method E Pluribus Hugo, which is now used by the Hugo Awards to minimize the impact and incentives of bloc voting in the nominations phase.
Anticlimactic sign-off
OK, that's a long article, but it does a better job of brain-dumping my >20 years of interest in this topic than anything I've ever written. On the subject of single-winner methods, I'll be putting out a playable exploration version of all of this sometime this summer, based off the work of the invaluable nicky case (as well as other collaborators).
I've now added a third article on this topic, in which I included a paragraph at the end asking people to contact me if they're interested in activism on this. I believe this is a viable target for effective altruism.