Follow-up to "Deterministic Strategies Can Be Sub-optimal"
The Ultimatum Game is a simple experiment. Two people have been allocated $10. One person decides how to divide the profits, and the other decides whether to Accept that allocation or to Deny it, in which case both participants get $0. Suppose you are the person whose job it is to choose whether to Accept or Deny an offer. What strategy could you use to maximize your returns?
Yudkowsky offers the following solution. (NB: the original text splits $12, because sci-fi; I have changed the numbers inline, without brackets. Let me know if that offends.)
It goes like this:
When somebody offers you a 6:4 split, instead of the 5:5 split that would be fair, you should accept their offer with slightly less than 5/6 probability. Their expected value from offering you 6:4, in this case, is 6 * slightly less than 5/6, or slightly less than 5. This ensures they can't do any better by offering you an unfair split; but neither do you try to destroy all their expected value in retaliation. It could be an honest mistake, especially if the real situation is any more complicated than the original Ultimatum Game.
If they offer you 7:3, accept with probability slightly-more-less than 5/7, so they do even worse in their own expectation by offering you 7:3 than 6:4.
It's not about retaliating harder, the harder they hit you with an unfair price - that point gets hammered in pretty hard to the kids, a Watcher steps in to repeat it. The circumstances under which you should ever go around carrying out counterfactual threats in real life are much more fraught and complicated than this, and nobody's going to learn about them realistically for several years yet. This setup isn't about retaliation, it's about what both sides have to do, to turn the problem of dividing the gains, into a matter of fairness; to create the incentive setup whereby both sides don't expect to do any better by distorting their own estimate of what is 'fair'.
To be explicit: assume that you have in some way "locked in" some curve f(x), which tells you to hit "Accept" with probability f(x) when offered to let your conspirator keep x dollars out of the $10 you are to split. You want to maximize your expected value, as does your conspirator: so, you should positively incentivize your conspirator to give you money.
Consider the following instantiation of this algorithm:
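Here is a sketch, with a form I am assuming purely for illustration (the function name and the curve itself are mine): hit "Accept" with probability equal to the fraction of the $10 being offered to you.

```python
def accept_prob_v1(x):
    """Probability of hitting Accept when my conspirator proposes to keep x
    of the $10: equal to the fraction being offered to me.  (Assumed form.)"""
    return (10 - x) / 10

for x in (5, 6, 7, 8, 9):
    p = accept_prob_v1(x)
    print(f"they keep {x}: accept {p:.2f}, their EV {x * p:.2f}, my EV {(10 - x) * p:.2f}")
# Their EV falls (2.50, 2.40, 2.10, 1.60, 0.90) as they try to keep more than half,
# so the incentive points the right way - but a lot of value is burned even at 5:5.
```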
Note that there are many possible curves f that would work here. For now, let's not examine the "greedy" half of the algorithm (where your conspirator is offering you more than they are taking themselves), and model another instantiation:
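Again a sketch with an assumed form, this time tracking the "slightly less than 5/x" behavior from the quote: accept any fair-or-better split outright, and accept an unfair split with probability just under 5/x, where x is what your conspirator keeps (EPSILON and the function name are my choices).

```python
EPSILON = 0.1   # my choice; any small positive slope keeps the incentive strict

def accept_prob_v2(x):
    """Accept a fair-or-better split outright; accept an unfair split with
    probability (5 - EPSILON * (x - 5)) / x, i.e. just under 5/x."""
    if x <= 5:
        return 1.0
    return (5 - EPSILON * (x - 5)) / x

for x in (5, 6, 7, 8, 9):
    p = accept_prob_v2(x)
    print(f"they keep {x}: accept {p:.2f}, their EV {x * p:.2f}, my EV {(10 - x) * p:.2f}")
# Their EV: 5.00, 4.90, 4.80, 4.70, 4.60 - still strictly falling the more they
# keep, but far less total value is destroyed than under the curve above.
```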
Note that this maintains a positive incentive for your conspirator to give you more money, while not destroying as much value as the prior algorithm.
I work at a company which does year-end performance reviews. I was promoted last year, and am not a particular "flight risk". However, I still want to positively incentivize my boss to accurately "rate" me - i.e., if I performed above average I would like to be given the rating (and raise) for an above-average performance, even if that means increasing the company's exposure to losing a more flight-prone but poorer-performing employee. So I published a curve to my boss demonstrating that I would stay with 100% chance if I got the highest rating I could get, would stay with 90% chance if I got an average rating, would stay with 70% chance if I got below average, and would stay with 50% chance if I got put on a performance improvement plan.
This was received well enough, because I run the Risk Analytics team at a FinTech company, so my entire stack is capable of working with uncertainty. In particular, I highlighted that even an average grade (which would put me in the top 70th percentile) would have me staying with 90% chance, which is still better retention than the industry average. I ended up getting an average grade, and rolling a 6 on my d10, so I am staying with my company.
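For concreteness, the published flight curve is just a small lookup table plus one die roll. A minimal sketch of that mechanic (the rating labels and names are mine, the probabilities are the ones above, and I am assuming the convention that a d10 roll at or under the stay threshold means staying):

```python
import random

# Stay probabilities published to my boss, keyed by year-end rating.
# (The rating labels are illustrative; the probabilities are the ones quoted above.)
FLIGHT_CURVE = {
    "top": 1.0,             # highest possible rating: stay for sure
    "average": 0.9,
    "below_average": 0.7,
    "pip": 0.5,             # performance improvement plan
}

def stays(rating, roll=None):
    """Resolve the curve with a d10: stay iff the roll is at or under the
    published stay probability (0.9 -> stay on 1-9, leave only on a 10)."""
    if roll is None:
        roll = random.randint(1, 10)   # roll the d10
    return roll <= round(FLIGHT_CURVE[rating] * 10)

print(stays("average", roll=6))   # the roll described above: True, so I stay
```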
Traditional negotiations work by hemming and hawing. Yudkowsky offers a new solution: publish your flight curve and let your conspirator follow their own incentives. Increase your legibility so that people don't have to track your subtle indications that you are happy or unhappy with an offer.
Yudkowsky's newest novel is here: https://www.glowfic.com/posts/4582
The particular curve you describe doesn't work - even if someone gave in to your threat entirely, they'd offer you 2/3 of the 10$ (this maximizes their EV at ~1.5$), but then you'd have to reject a third of the time so you'd wind up with an EV of less than 5.
You could definitely fix that particular flaw in your system. And what you'd wind up with is something that gets analyzed a lot like the original game except that you've stolen first player position and are offering something 'unfair'. So as usual for this game, your 'unfair' strategy would work perfectly against a pure-CDT agent (they'll cave to any unfair setup since the confrontational alternative is getting 0), and work against some real humans while other real humans will say screw you. The 'ideal agent', however, does not reward threats (because being the kind of agent who never rewards threats is a pretty good shield against anyone bothering to threaten you in the first place, while being the kind of agent who does reward threats is asking to be threatened). So if you use a strategy like the one you suggest against them, they will compute an offer (or a probability of an offer) such that their EV is maximized subject to your EV being strictly less than 5$: in this way you would be better off if you'd just done the fair thing to begin with.
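Here is a rough sketch of that last step - maximizing their EV subject to your EV staying strictly under $5. The "greedy" published curve it runs against is entirely an assumption for illustration (accept an offer of y with probability min(1, (y/6)^3)), as are the function names; only the constrained search is the point.

```python
def accept_prob_greedy(y):
    """Assumed greedy acceptance curve: certain acceptance only once y >= 6."""
    return min(1.0, (y / 6.0) ** 3)

def constrained_best_offer(total=10.0, fair_share=5.0):
    """The proposer keeps x and offers total - x.  A non-threat-rewarding agent
    maximizes its own EV subject to the responder's EV staying strictly below
    the fair share, so the published greedy curve never pays off."""
    best = None
    for cents in range(0, int(total * 100) + 1):   # search in 1-cent steps
        x = cents / 100.0
        p = accept_prob_greedy(total - x)
        proposer_ev, responder_ev = x * p, (total - x) * p
        if responder_ev < fair_share and (best is None or proposer_ev > best[1]):
            best = (x, proposer_ev, responder_ev)
    return best

keep, their_ev, my_ev = constrained_best_offer()
print(f"they keep {keep:.2f}: their EV {their_ev:.2f}, my EV {my_ev:.2f}")
# With this curve they keep ~4.27: their EV ~3.72, my EV ~4.99 - strictly less
# than the 5.00 I'd have gotten by just playing fair in the first place.
```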