Comment Permalink

First off, kudos for discussing non-VoI reasons to run these experiments. Real decisions have many factors.

The eyeballed estimate of how much the experimental design reduces the value from perfect information should be replaced by a decision tree. If the experiment can't give you enough data to change your position, then it's not material.

Using the first example, where W is melatonin works and "W" is the experiment saying that melatonin works, it looks like you provided P(W)=.8, P("W"|W)=.95, and P("W"|~W)=.05. I assumed that >90% corresponded to a point estimate of 95%, and that the test was symmetric, which should get thought about more if you're doing this seriously.

In the case where you get "W", you update and P(W|"W")=99% and you continue taking melatonin. But in the case where you get "~W", you update and P(W|"~W")=17%. Given the massive RoI you calculated for melatonin, it sounds like it's worth taking even if there's only a 17% chance that it's actually effective. Rather than continuing blindly on, you'd probably continue the test until you had enough data to be sure / pin down your RoI calculation, but you should be able to map that out now before you start the experiment.

There's a question of prior information here- from what you're written, it sounds like you should be more than 80% sure that melatonin worked for you. You might be interested in a different question- "melatonin still works for me"- which it might be reasonable to have an 80% prior on. If the uncertainty is about the value of taking melatonin, it seems like you could design a better experiment that narrows your uncertainty there (by looking for cognitive costs, or getting a better estimate of time saved, etc.).

A brief terminology correction: the "value of perfect information" would be $41, not $205 (i.e. it includes the 20% estimate that melatonin doesn't work). If you replace that with "value of a perfect negative result" you should be fine.

In 3, you're considering adding a new supplement, not stopping a supplement you already use. The "I don't try Adderall" case has value $0, the "Adderall fails" case is worth -$40 (assuming you only bought 10 pills, and this number should be increased by your analysis time and a weighted cost for potential permanent side effects), and the "Adderall succeeds" case is worth $X-40-4099, where $X is the discounted lifetime value of the increased productivity due to Adderall, minus any discounted long-term side effect costs. If you estimate Adderall will work with p=.5, then you should try out Adderall if you estimate that .5(X-4179)>0 -> X>4179. (Adderall working or not isn't binary, and so you might be more comfortable breaking down the various "how effective Adderall is" cases when eliciting X, by coming up with different levels it could work at, their values, and then using a weighted sum to get X. This can also give you a better target with your experiment- "this needs to show a benefit of at least Y from Adderall for it to be worth the cost, and I've designed it so it has a reasonable chance of showing that.")

One thing to notice is that the default case matters a lot. This asymmetry is because you switch decisions in different possible worlds- when you would take Adderall but stop you're in the world where Adderall doesn't work, and when you wouldn't take Adderall but do you're in the world where Adderall does work (in the perfect information case, at least). One of the ways you can visualize this is that you don't penalize tests for giving you true negative information, and you reward them for giving you true positive information. (This might be worth a post by itself, and is very Litany of Gendlin.)

The rest is similar. I definitely agree with the last line: possibly a way to drive it home is to talk about dividing by ln(1.05), which is essentially multiplying by 20.5. If you can make a one-time investment that pays off annually until you die, that's worth 20.5 times the annual return, and multiplying the value of something by 20 can often move it from not worth thinking about to worth thinking about.

83

83

83