Great post! Agree with the points raised but would like to add that restricting the expressivity isn’t the only way that we can try to make the world model more interpretable by design. There are many ways that we can decompose a world model into components, and human concepts correspond to some of the components (under a particular decomposition) as opposed to the world model as a whole. We can backpropagate desiderata about ontology identification to the way that the world model is decomposed.
For instance, suppose that we’re trying to identify the ...
I think one pattern that needs to hold in the environment for subgoal corrigibility to make sense is that the world is modular, but that the modularity structure can be broken or changed.
For one, modularity is the main thing that enables general purpose search: if we can optimize for a goal by optimizing for just a few instrumental subgoals while ignoring the influence of pretty much everything else, that reflects some degree of modularity in the problem space.
Secondly, if the modularity structure of the environment stays constant no matter what (...
Imagine that I'm watching the video of the squirgle, and suddenly the left half of the TV blue-screens. Then I'd probably think "ah, something messed up the TV, so it's no longer showing me the squirgle" as opposed to "ah, half the squirgle just turned into a big blue square". I know that big square chunks turning a solid color is a typical way for TVs to break, which largely explains away the observation; I think it much more likely that the blue half-screen came from some failure of the TV rather than an unprecedented behavior of the squirgle.
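To make the explaining-away arithmetic concrete, here's a toy Bayes calculation (all the priors and likelihoods below are made up purely for illustration):

```python
# Hypotheses for the observation "left half of the screen is solid blue":
#   H_tv:       the TV broke (blue half-screens are a typical failure mode)
#   H_squirgle: the squirgle itself turned half blue (unprecedented)
p_tv = 0.01                  # prior: TVs occasionally break mid-broadcast
p_squirgle = 1e-6            # prior: squirgles have never been seen doing this

p_obs_given_tv = 0.5         # blue half-screen is a common way for a TV to fail
p_obs_given_squirgle = 1.0   # if the squirgle did turn blue, we'd surely see it

# Unnormalized posteriors (Bayes' rule, ignoring other hypotheses):
post_tv = p_tv * p_obs_given_tv
post_squirgle = p_squirgle * p_obs_given_squirgle

print(post_tv / post_squirgle)  # ~5000:1 odds in favor of "the TV broke"
```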
My me...
Noted, that does seem a lot more tractable than using natural latents to pin down the details of CEV on their own.
Natural latents are about whether the AI's cognition routes through the same concepts that humans use.
We can imagine the AI maintaining predictive accuracy about humans without using the same human concepts. For example, it can use low-level physics to simulate the environment, which would be predictively accurate, but that cognition doesn't make use of the concept "strawberry" (in principle, we can still "single out" the concept of "strawberry" within it, but that information comes mostly from us, not from the physics simulation)
Natural latents are equiva...
I think the fact that natural latents are much lower-dimensional than all of physics makes them suitable for specifying the pointer to CEV as an equivalence class over physical processes (many quantum field configurations can correspond to the same human, and we want to ignore differences within that equivalence class).
IMO the main bottleneck is accounting for the reflective aspects of CEV, because one constraint on natural latents is that they should be redundantly represented in the environment.
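For reference, here's a rough sketch of the two conditions on a natural latent as I understand them (my paraphrase; the real versions are approximate, with error bounds):

```latex
% A latent \Lambda over observables X_1, ..., X_n is "natural" when:
% 1. Mediation: the X_i are independent given \Lambda.
P(X_1, \dots, X_n \mid \Lambda) = \prod_{i=1}^{n} P(X_i \mid \Lambda)
% 2. Redundancy: \Lambda can be recovered while ignoring any one X_i.
\forall i: \quad \Lambda \approx f_i(X_{\neq i})
```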
like infinite-state Turing machines, or something like this:
Interesting, I'll check it out!
Then we've converged almost completely, thanks for the conversation.
Thanks! I enjoyed the conversation too.
So you're saying that conditional on GPS working, both capabilities and inner alignment problems are solved or solvable, right?
Yes, I think inner alignment is basically solved conditional on GPS working; for capabilities, I think we still need some properties of the world model in addition to GPS.
...While I agree that formal proof is probably the case with the largest divide in practice, the verification/generation gap applies to a whole lot of info
This is what I was trying to say: the tradeoff in certain applications, like automating AI interpretability/alignment research, is not that harsh. And a lot of the methods that make personal intent/instruction-following AGIs feasible let you extract optimization that is powerful enough, and safe enough, to use iterative methods to solve the problem.
Agreed
...People at OpenAI are absolutely trying to integrate search into LLMs; see the example where their Q* algorithm reportedly aced a math test:
Re the issue of Goodhart failures, maybe a kind of crux is how much we expect General Purpose Search to be aimable by humans.
I also expect general purpose search to be aimable; in fact, it's selected to be aimable so that the AI can recursively retarget GPS on instrumental subgoals.
which means we can apply a lot of optimization pressure, because I view future AI as likely to be quite correctable, even in the superhuman regime.
I think there’s a fundamental tradeoff between optimization pressure & correctability, because if we apply...
We can also get more optimization if we have better tools to aim General Purpose Search, so that we can correct the model if it goes wrong.
Yes, I think having an aimable general purpose search module is the most important bottleneck for solving inner alignment
I think things can still go wrong if we apply too much optimization pressure to an inadequate optimization target because we won’t have a chance to correct the AI if it doesn’t want us to (I think adding corrigibility is a form of reducing optimization pressure, but it's still desirable).
...Good point. I agree that a wrong model of the user's preferences is my main concern, and most alignment thinkers'. And that it can happen with personal intent alignment as well as value alignment.
This is why I prefer instruction-following to corrigibility as a target. If it's aligned to follow instructions, it doesn't need nearly as much of a model of the user's preferences to succeed. It just needs to be instructed to talk through its important actions before executing them, like "Okay, I've got an approach that should work. I'll engineer a gene dri
Yes, I think synthetic data could be useful for improving the world model. It's arguable that allowing humans to select/filter synthetic data for training counts as a form of active learning, because the AI is gaining information about human preferences through its own actions (generating synthetic data for humans to choose). If we have some way of representing uncertainty over human values, we can let our AI argmax over synthetic data with the objective of maximizing information gain about human values (when the synthetic data is filtered).
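A minimal sketch of that argmax, assuming a toy discrete posterior over candidate value hypotheses and a made-up accept/reject model for the human filter (all names and numbers here are stand-ins, not a real proposal):

```python
import numpy as np

# Toy setup: 'hypotheses' are candidate human value functions, and
# accept_prob[h, d] is the chance a human with values h accepts
# synthetic datapoint d during filtering. All numbers are illustrative.
rng = np.random.default_rng(0)
n_hyp, n_data = 5, 20
prior = np.full(n_hyp, 1 / n_hyp)              # uncertainty over human values
accept_prob = rng.uniform(0, 1, (n_hyp, n_data))

def entropy(p):
    p = p[p > 0]
    return -(p * np.log(p)).sum()

def expected_info_gain(d):
    # Expected entropy reduction from observing accept/reject on datapoint d.
    gain = entropy(prior)
    for outcome_prob in (accept_prob[:, d], 1 - accept_prob[:, d]):
        p_outcome = (prior * outcome_prob).sum()
        if p_outcome > 0:
            posterior = prior * outcome_prob / p_outcome
            gain -= p_outcome * entropy(posterior)
    return gain

# Argmax over synthetic data: show the human the most informative datapoint.
best = max(range(n_data), key=expected_info_gain)
print(f"most informative datapoint to show the human: {best}")
```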
I think using...
Interesting! I have to read the papers in more depth but here are some of my initial reactions to that idea (let me know if it’s been addressed already):
Good point!
But more recently I've been thinking that neither will be a real issue, because Instruction-following AGI is easier and more likely than value aligned AGI. The obvious solution to both alignment stability and premature/incorrect/mis-specified value lock-in is to keep a human in the loop by making AGI whose central goal is to follow instructions (or similar personal intent alignment) from authorized user(s).
I think this argument also extends to value-aligned AI, because the value-aligned AGI will keep humans in the loop insofar as we want t...
Yes, I plan to write a sequence about it some time in the future, but here are some rough high-level sketches:
Thanks! I recall reading the steering subsystems post a while ago & it matched a lot of my thinking on the topic. The idea of using variables in the world model to determine the optimization target also seems similar to your "Goals selected from learned knowledge" approach (the targeting process is essentially a mapping from learned knowledge to goals).
Another motivation for the targeting process (which might also be an advantage of GLSK) I forgot to mention is that we can allow the AI to update its goals as it updates its knowledge (eg about what...
Right! I'm pleased that you read those posts and got something from them.
I worry less about value lock-in and more about The alignment stability problem, which is almost the opposite.
But more recently I've been thinking that neither will be a real issue, because Instruction-following AGI is easier and more likely than value aligned AGI. The obvious solution to both alignment stability and premature/incorrect/mis-specified value lock-in is to keep a human in the loop by making AGI whose central goal is to follow instructions (or similar personal intent align...
I intended 'local' (aka not global) to be a necessary but not sufficient condition for predictions made by smaller maps within the territory to be possible (because global prediction runs into problems of embedded agency).
I'm mostly agnostic about what the other necessary conditions are & what the sufficient conditions are
Yes, there's no upper bound for what counts as "local" (except global), but there is an upper bound for the scale at which agents' predictions can outpace the territory (eg humans can't predict everything in the galaxy)
I meant upper bound in the second sense
Yes, by "locally outpace" I simply meant outpace at some non-global scale; there will of course be some tighter upper bound on that scale when it comes to real-world agents.
I don't think we disagree?
The point was exactly that although we can't outpace the territory globally, we can still do it locally (by throwing out info we don't care about, like solar flares).
That by itself is not that interesting. The interesting part is: given that different embedded maps throw out different info & retain some info, is there any info that's convergently retained by a wide variety of maps? (aka natural latents)
The rest of the disagreement seems to boil down to terminology
The claim was that a subprogram (map) embedded within a program (territory) cannot predict the *entire* execution trace of that program faster than the program itself, given computational irreducibility.
"there are many ways of escaping them in principle, or even in practice (by focusing on abstract behavior of computers)."
Yes, I think this is the same point as my point about coarse-graining (outpacing the territory "locally" by throwing away some info).
I agree, just changed the wording of that part
Embedded agency & computational irreducibility imply that the smaller map cannot outpace the full time evolution of the territory because it is a part of it, which may or may not be important for real-world agents.
In cases where the response time of the map does matter to some extent, embedded maps often need to coarse-grain over the territory to "locally" outpace it.
We may think of natural latents as coarse-grainings that are convergent across a wide variety of embedded maps.
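A toy illustration of "locally outpacing by coarse-graining" (the dynamics are a deliberately simple stand-in, chosen so the coarse statistic is exactly conserved):

```python
import random

def step(state):
    # Territory dynamics: every particle shifts one cell right (periodic).
    n = len(state)
    return [state[(i - 1) % n] for i in range(n)]

state = [random.random() < 0.3 for _ in range(1000)]

# Full map: simulate everything to answer a question about time T = 10_000.
full = state
for _ in range(10_000):
    full = step(full)
print(sum(full))   # particle count at time T, via O(N * T) work

# Coarse map: throw away positions, keep only the count. Since these
# dynamics conserve particle number, the coarse map answers the same
# question in O(1), outpacing the territory on the statistic we care about.
print(sum(state))  # same answer, no simulation
```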
Presumably there can be a piece of paper somewhere with the laws of physics & initial conditions of our universe written on it. That piece of paper can "fully capture" the entire territory in that sense.
But no agents within our universe can compute all consequences of the territory using that piece of paper (given computational irreducibility), because that computation would be part of the time evolution of the universe itself
I think that bit is about embedded agency
Would there be a problem when speculators can create stocks in the conditional case? As in, if a decision C harms me, can I create and sell loads and loads of C stock, and not have to actually go through the trade when C is not enforced (due to the low price I've caused)?
<3!