I'd strongly agree with that, mainly because, while points with this property exist, they are not necessarily unique. The non-uniqueness is a big issue.
It was a typo! And it has been fixed.
It is because beach/beach is the surplus-maximizing result. Any Pareto-optimal bargaining solution where money is involved will involve the surplus-maximizing result being played, and a side payment occurring.
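As a toy example with made-up payoffs: say beach/beach gives utilities $(4, 2)$ (total surplus $6$), and every other outcome has total surplus at most $4$. Then any Pareto-optimal deal plays beach/beach and transfers some amount $t$ of money, landing at $(4 - t,\; 2 + t)$; playing anything else shrinks the total pie being split, so both players can be made better off by switching to beach/beach and adjusting $t$.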
I have a reduction of this problem to a (hopefully) simpler problem. First up, establish the notation used.
$[n]$ refers to the set $\{1, 2, \dots, n\}$. $n$ is the number of candidates. Use $\Delta_n$ as an abbreviation for the space $\Delta([n])$; it's the space of probability distributions over the candidates. View $\Delta_n$ as embedded in $\mathbb{R}^n$, and set the origin at the center of $\Delta_n$.
At this point, we can note that the following are in bijection:
1: Functions of type $[n] \to \mathbb{R}$
2: Affine functions of type $\Delta_n \to \mathbb{R}$
3: Functions o...
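Concretely, the correspondence between 1 and 2: a function $f : [n] \to \mathbb{R}$ extends to the affine function $F(p) = \sum_{i \in [n]} p_i \, f(i)$ on $\Delta_n$, and an affine function $F : \Delta_n \to \mathbb{R}$ restricts to a function on the vertices (the deterministic distributions); these two operations are mutually inverse.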
What I mean is, the players hit each other up, are like "yo, let's decide on which ROSE point in the object-level game we're heading towards"
Of course, they don't manage to settle on what equilibrium to go for in the resulting bargaining game, because, again, multiple ROSE points might show up in the bargaining game.
But, the ROSE points in the bargaining game are in a restricted enough zone (what with that whole "must be better than the random-dictator point" thing) to seriously constrain the possibilities in the object-level game. "Worst ROSE point ...
I think I have a contender for something which evades the conditional-threat issue stated at the end, as well as obvious variants and strengthenings of it, and which would be threat-resistant in a dramatically stronger sense than ROSE.
There's still a lot of things to check about it that I haven't done yet. And I'm unsure how to generalize to the n-player case. And it still feels unpleasantly hacky, according to my mathematical taste.
But the task at least feels possible, now.
EDIT: it turns out it was still susceptible to the conditional-threat issue, but t...
For 1, it's just intrinsically mathematically appealing (continuity is always really nice when you can get it), and also because of an intuition that if your foe experiences a tiny preference perturbation, you should be able to use small conditional payments to replicate their original preferences/incentive structure and start negotiating with that, instead.
I should also note that nowhere in the visual proof of the ROSE value for the toy case is continuity used. Continuity just happens to appear.
For 2, yes, it's part of game setup. The buttons are of whate...
My preferred way of resolving it is treating the process of "arguing over which equilibrium to move to" as a bargaining game, and just find a ROSE point from that bargaining game. If there's multiple ROSE points, well, fire up another round of bargaining. This repeated process should very rapidly have the disagreement points close in on the Pareto frontier, until everyone is just arguing over very tiny slices of utility.
This is imperfectly specified, though, because I'm not entirely sure what the disagreement points would be, because I'm not sure how the "...
So, if you are limited to only pure strategies, for some reason, then yes, Chinese would be on the Pareto frontier.
But if you can implement randomization, then Chinese is not on the Pareto frontier, because both sides agree that "flip a coin, Heads for Sushi, Tails for Italian" is just strictly better than Chinese.
The convex shape consists of all the payoff pairs you can get if you allow randomization.
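A quick numeric check of this, with hypothetical payoff pairs (the numbers are mine, just for illustration):

```python
# Hypothetical (you, friend) payoffs for each restaurant.
options = {"Sushi": (3.0, 1.0), "Italian": (1.0, 3.0), "Chinese": (1.8, 1.8)}

# "Flip a coin, Heads for Sushi, Tails for Italian":
coin_flip = tuple(
    0.5 * s + 0.5 * i for s, i in zip(options["Sushi"], options["Italian"])
)
print(coin_flip)  # (2.0, 2.0)

# Strictly better than Chinese for both players, so Chinese is off the
# Pareto frontier once randomization is allowed.
assert all(c > ch for c, ch in zip(coin_flip, options["Chinese"]))
```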
Alright, this is kind of a Special Interest, so here's your relevant thought dump.
First up, the image is kind of misleading, in the sense that you can always tack on extra orders of magnitude. You could tack on another thousand orders of magnitude and make it look even longer, or just go "this is 900 OOMs of literally nothing happening, let's clip that off and focus on the interesting part".
Assuming proton decay is a thing (that free protons decay with a ridiculously long half-life)....
ok, I was planning on going "as a ludicrous upper bound, here's the numb...
Yeah, "transferrable utility games" are those where there is a resource, and the utilities of all players are linear in that resource (in order to redenominate everyone's utilities as being denominated in that resource modulo a shift factor). I believe the post mentioned this.
Task completed.
Agreed. The bargaining solution for the entire game can be very different from adding up the bargaining solutions for the subgames. If there's a subgame where Alice cares very much about victory in that subgame (interior decorating choices) and Bob doesn't care much, and another subgame where Bob cares very much about it (food choice) and Alice doesn't care much, then the bargaining solution of the entire relationship game will end up being something like "Alice and Bob get some relative weights on how important their preferences are, and in all the subgam...
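A toy version of that weighted arrangement (all numbers mine, for illustration): each subgame goes to whoever cares more about it once the fixed relationship-level weights are applied.

```python
# Hypothetical bargaining weights on Alice's and Bob's utilities,
# fixed once for the whole relationship game.
w_alice, w_bob = 1.0, 1.2

# (alice_utility, bob_utility) for each option in each subgame.
subgames = {
    "decor": {"alice_pick": (5, -1), "bob_pick": (-4, 1)},
    "food":  {"alice_pick": (1, -4), "bob_pick": (-1, 5)},
}

for name, opts in subgames.items():
    winner = max(opts, key=lambda o: w_alice * opts[o][0] + w_bob * opts[o][1])
    print(name, "->", winner)  # decor -> alice_pick, food -> bob_pick
```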
Actually, they apply anyways in all circumstances, not just after the rescaling and shifting is done! Scale-and-shift invariance means that no matter how you stretch and shift the two axes, the bargaining solution always hits the same probability-distribution over outcomes, so monotonicity means "if you increase the payoff numbers you assign for some or all of the outcomes, the Pareto frontier point you hit will give you an increased number for your utility score over what it'd be otherwise" (no matter how you scale-and-shift). And independence of ir...
If you're looking for curriculum materials, I believe that the most useful reference would probably be my "Infra-exercises", a sequence of posts containing all the math exercises you need to reinvent a good chunk of the theory yourself. Basically, it's the textbook's exercise section, and working through interesting math problems and proofs on one's own has a much better learning feedback loop and retention of material than slogging through the old posts. The exercises are short on motivation and philosophy compared to the posts, though, much like how a fu...
So, if you make Nirvana infinite utility, yes, the fairness criterion becomes "if you're mispredicted, you have any probability at all of entering the situation where you're mispredicted" instead of "have a significant probability of entering the situation where you're mispredicted", so a lot more decision-theory problems can be captured if you take Nirvana as infinite utility. But, I talk in another post in this sequence (I think it was "the many faces of infra-beliefs") about why you want to do Nirvana as 1 utility instead of infinite utility.
Parfit's Hi...
So, the flaw in your reasoning: after updating on being in the city, the agent doesn't go "logically impossible, infinite utility". We just go "alright, off-history measure gets converted to 0 utility", a perfectly standard update. So it updates to (0,0) (ie, there's 0 probability I'm in this situation in the first place, and my expected utility for not getting into this situation in the first place is 0, because of probably dying in the desert).
As for the proper way to do this analysis, it's a bit finicky. There's something called "acausal form...
Omega and hypercomputational powers aren't needed, just decent enough prediction about what someone would do. I've seen Transparent Newcomb being run on someone before, at a math camp. They were predicted to not take the small extra payoff, and they didn't. And there was also an instance of acausal vote trading that I managed to pull off a few years ago, and I've put someone in a counterfactual mugging sort of scenario where I did pay out due to predicting they'd take the small loss in a nearby possible world. 2/3 of those instances were cases where I was s...
The key piece that makes any Lobian proof tick is the "proof of X implies X" part. For Troll Bridge, X is "crossing implies bridge explodes".
For standard logical inductors, that Lobian implication holds because, if a proof of X showed up, every trader betting in favor of X would get free money, so there could be a trader that just names a really really big bet in favor of X (it's proved, after all). The agent then ends up believing X, and so doesn't cross, and so crossing implies the bridge explodes.
For this particular variant of a logical inductor, there's an...
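A minimal toy sketch of the standard-inductor argument above (this is not a real logical inductor, just the incentive structure, and all the numbers and names are mine):

```python
# Once X ("crossing implies bridge explodes") is proved, betting on X is
# free money, so some trader names a huge bet and the market price of X
# gets pushed up toward 1.
def price_of_X(x_is_proved: bool, prior_price: float = 0.5) -> float:
    return 0.99 if x_is_proved else prior_price

# The agent acts on the market's beliefs: it only crosses if X looks unlikely.
def crosses(p_X: float) -> bool:
    return p_X < 0.5

# A proof of X appears => the agent believes X => it doesn't cross,
# which is exactly the "proof of X implies X" step the Lobian argument needs.
assert not crosses(price_of_X(x_is_proved=True))
```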
Said actions or lack thereof cause a fairly low utility differential compared to the actions in other, non-doomy hypotheses. Also I want to draw a critical distinction between "full Knightian uncertainty over meteor presence or absence", where your analysis is correct, and "ordinary probabilistic uncertainty between a high-Knightian-uncertainty hypothesis, and a low-Knightian-uncertainty one that says the meteor almost certainly won't happen" (where the meteor hypothesis will be ignored unless there's a meteor-inspired modification to what you do that's al...
Something analogous to what you are suggesting occurs. Specifically, let's say you assign 95% probability to the bandit game behaving as normal, and 5% to "oh no, anything could happen, including the meteor". As it turns out, this behaves similarly to the ordinary bandit game being guaranteed, as the "maybe meteor" hypothesis assigns all your possible actions a score of "you're dead" so it drops out of consideration.
The important aspect which a hypothesis needs, in order for you to ignore it, is that no matter what you do you get the same outcome, whether ...
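A toy numeric version of the 95/5 mixture just described (the expected utilities are made up):

```python
# Hypothetical expected utilities under the "bandit game behaves as normal"
# hypothesis.
normal = {"pull_A": 0.7, "pull_B": 0.4}

# Under the "maybe meteor" hypothesis, every action scores "you're dead".
meteor_worst_case = 0.0

def mixed_score(action: str) -> float:
    return 0.95 * normal[action] + 0.05 * meteor_worst_case

# The meteor term is the same constant for every action, so it can't change
# which action wins: the hypothesis drops out of consideration.
assert max(normal, key=mixed_score) == max(normal, key=normal.get)
```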
Well, taking worst-case uncertainty is what infradistributions do. Did you have anything in mind that can be done with Knightian uncertainty besides taking the worst-case (or best-case)?
And if you were dealing with best-case uncertainty instead, then the corresponding analogue would be assuming that you go to hell if you're mispredicted (and then, since best-case things happen to you, the predictor must accurately predict you).
This post is still endorsed, it still feels like a continually fruitful line of research. A notable aspect of it is that, as time goes on, I keep finding more connections and crisper ways of viewing things which means that for many of the further linked posts about inframeasure theory, I think I could explain them from scratch better than the existing work does. One striking example is that the "Nirvana trick" stated in this intro (to encode nonstandard decision-theory problems), has transitioned from "weird hack that happens to work" to "pops straight out...
Availability: Almost all times between 10 AM and PM, California time, regardless of day. Highly flexible hours. Text over voice is preferred, I'm easiest to reach on Discord. The LW Walled Garden can also be nice.
Amendment made.
A note to clarify for confused readers of the proof. We started out by assuming , and . We conclude by how the agent works. But the step from there to (ie, inconsistency of PA) isn't entirely spelled out in this post.
Pretty much, that follows from a proof by contradiction. Assume con(PA) ie , and it happens to be a con(PA) theorem that the agent can't prove in advance what it will do, ie, . (I can spell this out in more detail if anyone wants) However, com...
In the proof of Lemma 3, it should be
"Finally, since , we have that .
Thus, and are both equal to .
instead.
Any idea of how well this would generalize to stuff like Chicken, or to games with more than 2 players or 2 moves?
I was subclinically depressed, acquired some bupropion from Canada, and it's been extremely worthwhile.
I don't know; we're hunting for it. Relaxations of dynamic consistency would be extremely interesting if found, and I'll let you know if we turn up anything nifty.
Looks good.
Re: the dispute over normal bayesianism: For me, "environment" denotes "thingy that can freely interact with any policy in order to produce a probability distribution over histories". This is a different type signature than a probability distribution over histories, which doesn't have a degree of freedom corresponding to which policy you pick.
But for infra-bayes, we can associate a classical environment with the set of probability distributions over histories (for various possible choices of policy), and then the two distinct notions becom...
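A rough sketch of the two type signatures being contrasted (Python stand-in types; the names are mine, not from any library):

```python
from typing import Callable, Generic, TypeVar

T = TypeVar("T")

class Distribution(Generic[T]):
    """Placeholder for 'probability distribution over T'."""

class History: ...
class Policy: ...

# A classical Bayesian hypothesis: one fixed distribution over histories,
# with no slot for which policy gets played.
Hypothesis = Distribution[History]

# An "environment": something you can feed any policy into, getting a
# distribution over histories out.
Environment = Callable[[Policy], Distribution[History]]
```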
I'd say this is mostly accurate, but I'd amend number 3. There's still a sort of non-causal influence going on in pseudocausal problems, you can easily formalize counterfactual mugging and XOR blackmail as pseudocausal problems (you need acausal specifically for Transparent Newcomb, not vanilla Newcomb). But it's specifically a sort of influence that's like "reality will adjust itself so contradictions don't happen, and there may be correlations between what happened in the past, or other branches, and what your action is now, so you can exploit this by ac...
Re point 1, 2: Check this out. For the specific case of 0 to even bits, ??? to odd bits, I think Solomonoff can probably get that, but not more general relations.
Re: point 3, Solomonoff is about stochastic environments that just take your action as an input, and aren't reading your policy. For infra-Bayes, you can deal with policy-dependent environments without issue, as you can consider hard-coding in every possible policy to get a family of stochastic environments, and UDT behavior naturally falls out as a result from this encoding. There's still some op...
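A minimal sketch of that hard-coding construction (the function names are mine):

```python
# A policy-dependent environment reads the whole policy. Fixing the policy
# argument turns it into an ordinary stochastic environment, so ranging
# over all policies yields a family of stochastic environments.
def fix_policy(policy_dependent_env, policy):
    def stochastic_env(history):
        # Now just: history -> distribution over next observations.
        return policy_dependent_env(policy, history)
    return stochastic_env

# family = {pi: fix_policy(env, pi) for pi in all_policies}
```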
Ah. So, low expected utility alone isn't too much of a problem. The amount of weight a hypothesis has in a prior after updating depends on the gap between the best-case values and worst-case values. Ie, "how much does it matter what happens here". So, the stuff that withers in the prior as you update are the hypotheses that are like "what happens now has negligible impact on improving the worst-case". So, hypotheses that are like "you are screwed no matter what" just drop out completely, as if it doesn't matter what you do, you might as well pick actions t...
"mixture of infradistributions" is just an infradistribution, much like how a mixture of probability distributions is a probability distribution.
Let's say we've got a prior $\zeta$, a probability distribution over indexed hypotheses $H_i$.
If you're working in a vector space, you can take any countable collection of sets in said vector space, and mix them together according to a prior giving a weight to each set. Just make the set of all points which can be made by the process "pick a point from each set, and mix the points together according to ...
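A toy numeric version of that "pick a point from each set, then mix" construction, using finite sets of distributions over two outcomes (in general you'd take all points of each convex set, not just these representatives):

```python
from itertools import product

def mix_sets(sets, weights):
    """All points obtainable by picking one distribution from each set and
    averaging the picks with the prior weights."""
    mixed = set()
    for picks in product(*sets):  # one point chosen from each set
        point = tuple(
            sum(w * p[i] for w, p in zip(weights, picks))
            for i in range(len(picks[0]))
        )
        mixed.add(point)
    return mixed

A = [(1.0, 0.0), (0.5, 0.5)]  # one "set" of distributions
B = [(0.0, 1.0)]              # another
print(mix_sets([A, B], [0.5, 0.5]))  # {(0.5, 0.5), (0.25, 0.75)}
```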
The concave functional view is "the thing you do with a probability distribution is take expectations of functions with it. In fact, it's actually possible to identify a probability distribution with the function mapping a function to its expectation. Similarly, the thing we do with an infradistribution is taking expectations of functions with it. Let's just look at the behavior of the function we get, and neglect the view of everything as a set of a-measures."
As it turns out, this view makes proofs a whole lot cleaner a...
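In symbols, for the crisp case: a probability distribution is identified with its expectation functional, and a crisp infradistribution (a set $H$ of distributions) with the worst-case expectation over the set,
$$\mu \;\longmapsto\; \big(f \mapsto \mathbb{E}_{\mu}[f]\big), \qquad \psi_H(f) \;=\; \inf_{\mu \in H} \mathbb{E}_{\mu}[f],$$
and $\psi_H$ is concave and monotone, being an infimum of monotone linear functionals.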
Sounds like a special case of crisp infradistributions (ie, all partial probability distributions have a unique associated crisp infradistribution)
Given some partial probability distribution $p$, we can consider the (nonempty) set of probability distributions equal to $p$ where $p$ is defined. This set is convex (clearly, a mixture of two probability distributions which agree with $p$ about the probability of an event will also agree with $p$ about the probability of an event).
Convex (compact) sets of probability distributions = crisp infradistributions....
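A quick numeric check of that convexity claim (toy numbers mine): two distributions agreeing with a partial assignment of probability 0.3 to an event still agree after mixing.

```python
mu1 = {"a": 0.3, "b": 0.5, "c": 0.2}  # assigns the event {"a"} probability 0.3
mu2 = {"a": 0.3, "b": 0.1, "c": 0.6}  # also assigns {"a"} probability 0.3
lam = 0.4

mix = {o: lam * mu1[o] + (1 - lam) * mu2[o] for o in mu1}

# The mixture is still a probability distribution agreeing with the
# partial assignment, so the set of agreeing distributions is convex.
assert abs(sum(mix.values()) - 1.0) < 1e-12
assert abs(mix["a"] - 0.3) < 1e-12
```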
You're completely right that hypotheses with unconstrained Murphy get ignored because you're doomed no matter what you do, so you might as well optimize for just the other hypotheses where what you do matters. Your "-1,000,000 vs -999,999 is the same sort of problem as 0 vs 1" reasoning is good.
Again, you are making the serious mistake of trying to think about Murphy verbally, rather than thinking of Murphy as the personification of the "inf" part of the definition of expected value, and writing actual equations.
There's actually an upcoming post going into more detail on what the deal is with pseudocausal and acausal belief functions, among several other things, I can send you a draft if you want. "Belief Functions and Decision Theory" is a post that hasn't held up nearly as well to time as "Basic Inframeasure Theory".
If you use the Anti-Nirvana trick, your agent just goes "nothing matters at all, the foe will mispredict and I'll get -infinity reward" and rolls over and cries since all policies are optimal. Don't do that one, it's a bad idea.
For the concave expectation functionals: Well, there's another constraint or two, like monotonicity, but yeah, LF duality basically says that you can turn any (monotone) concave expectation functional into an inframeasure. Ie, all risk aversion can be interpreted as having radical uncertainty over some aspects of how the environment...
Maximin, actually. You're maximizing your worst-case result.
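In symbols, with $\Pi$ the set of policies and $\mathcal{E}$ the set of options the adversary ranges over:
$$\pi^* \in \operatorname*{arg\,max}_{\pi \in \Pi} \; \inf_{e \in \mathcal{E}} \; \mathbb{E}_{\pi, e}[U].$$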
It's probably worth mentioning that "Murphy" isn't an actual foe where it makes sense to talk about destroying resources lest Murphy use them, it's just a personification of the fact that we have a set of options, any of which could be picked, and we want to get the highest lower bound on utility we can for that set of options, so we assume we're playing against an adversary with perfectly opposite utility function for intuition. For that last paragraph, translating it back out from the "Murphy" t...
I found this Quanta magazine article about it which seems to indicate that it fits the CMB spectrum well but required a fair deal of fiddling with gravity to do so, but I lamentably lack the physics capabilities to evaluate the original paper.
If there's something wrong with some theory, isn't it quite odd that looking around at different parts of the universe seems to produce such a striking level of agreement on how much missing mass there is? If there was some out-of-left-field thing, I'd expect it to have confusing manifestations in many different areas, with astronomers angsting about dramatically inconsistent measurements; I would not expect the CMB to end up explained away (and the error bars on those measurements are really really small) by the same 5:1 mix of non-baryonic matter vs baryon...
Yes, pink is gas and purple is mass, but also the gas there makes up the dominant component of the visible mass in the Bullet Cluster, far outweighing the stars.
Also, physicists have come up with a whole lot of possible candidates for dark matter particles. The supersymmetry-based ones took a decent kicking at the LHC, and I'm unsure of the motivations for some of the other ones, but the two that look most promising (to me, others may differ in opinion) are axions and sterile neutrinos, as those were conjectured to plug holes in the Standard Model, so they...
I'd go with number 2, because my snap reaction was "ooh, there's a "show personal blogposts" button?"
EDIT: Ok, I found the button. The problem with that button is that it looks identical to the other tags, and is at the right side of the screen when the structure of "Latest" draws your eyes to the left side of the screen. I'd make it a bit bigger and put it on the left side of the screen.
So, first off, I should probably say that a lot of the formalism overhead involved in this post in particular feels like the sort of thing that will get a whole lot more elegant as we work more things out, but "Basic inframeasure theory" still looks pretty good at this point and worth reading, and the basic results (ability to translate from pseudocausal to causal, dynamic consistency, capturing most of UDT, definition of learning) will still hold up.
Yes, your current understanding is correct, it's rebuilding probability theory in more generality to be sui...
So, we've also got an analogue of KL-divergence for crisp infradistributions.
We'll be using $H_1$ and $H_2$ for crisp infradistributions, and $\mu_1$ and $\mu_2$ for probability distributions associated with them. $D_{KL}(H_1 \Vert H_2)$ will be used for the KL-divergence of infradistributions, and $D_{KL}(\mu_1 \Vert \mu_2)$ will be used for the KL-divergence of probability distributions. For crisp infradistributions, the KL-divergence is defined as
I'm not entirely sure why it's like this, but it has the basic properties yo...
It is currently disassembled in my garage, will be fully tested when the 2.0 version is built, and the 2.0 version has had construction stalled for this year because I've been working on other projects. The 1.0 version did remove CO2 from a room as measured by a CO2 meter, but the size and volume made it not worthwhile.
That original post lays out UDT1.0; I don't see anything about precomputing the optimal policy within it. The UDT1.1 fix of optimizing the global policy, instead of figuring out the best thing to do on the fly, was first presented here; note that the 1.1 post that I linked came chronologically after the post you linked.