One possible answer to the argument "attempting to build FAI based on Eliezer's ideas seems infeasible and increases the risk of UFAI without helping much to increase the probability of a good outcome, and therefore we should try to achieve a positive Singularity by other means" is that it's too early to decide this. Even if our best current estimate is that trying to build such an FAI increases risk, there is still a reasonable chance that this estimate will turn out to be wrong after further investigation. Therefore, the counter-argument goes, we ought to mount a serious investigation into the feasibility and safety of Eliezer's design (as well as other possible FAI approaches), before deciding to either move forward or give up.
(I've been given to understand that this is a standard belief within SI, except possibly for Eliezer, which makes me wonder why nobody gave this counter-argument in response to my post linked above. ETA: Carl Shulman did subsequently give me a version of this argument here.)
This answer makes sense to me, except for the concern that even seriously investigating the feasibility of FAI is risky, if the team doing so isn't fully rational. For example, they may be overconfident about their abilities and thereby overestimate the feasibility and safety, or fall prey to the sunk cost fallacy once they have developed lots of FAI-relevant theory in the attempt to study feasibility, or become too attached to their status and identity as FAI researchers, or some team members may disagree with a consensus of "give up", leave to form their own AGI teams, and take the dangerous knowledge developed with them.
So the question comes down to: how rational is such an FAI feasibility team likely to be, and is that enough for the benefits to exceed the costs? I don't have a lot of good ideas about how to answer this, but the question seems really important to bring up. I'm hoping this post will prompt SI people to tell us their thoughts, and maybe other LWers have ideas they can share.
Good point. I need to keep this in mind more in the future. I guess at this point I understand my disagreements with Eliezer and even Paul relatively well, but I don't know where the core of the disagreement between the two of us lies. Do you have a better idea of this? Are there one or two things that you think I'm wrong about, such that if I were to change my mind about them, we'd be in much better agreement on overall strategy? If not, then I guess we need to talk a bunch more to figure it out.
Ah, availability bias on my part, since I follow LW more closely than MIRI's blog. I wonder why you don't post/crosspost those things here, though (Eliezer did post his open FAI questions here), given that the average LW post draws way more comments than the average MIRI blog post, and probably has a higher readership, or at least an additional nonoverlapping readership. I mean, why not take advantage of all of MIRI's (multiple years of) hard work in building up this community?
No, that's not clear. An almost-Friendly AI could be worse than a completely indifferent AI, since "bad" (as opposed to "neutral", e.g. suffering maximizers as opposed to paperclip maximizers) utility functions are close in design space to "good" utility functions. I think somebody made this argument before on LW, but I forget who/where. There's also an argument that if acausal control/trading is possible, a more "philosophically sophisticated" UFAI can be worse, because it is harder for FAIs to control or provides more competition for FAIs in the acausal economy.
I think what may be useful is "AI safety" (for lack of a better term) research that is done explicitly under the assumption that we may have to deploy a not-quite-Friendly AI to head off a greater danger, which would involve approaches quite different from Eliezer's current one. I made some suggestions along these lines previously, but again this isn't MIRI's current focus.
In that case, it seems like the cost of the strategic information you're seeking is really high (both in terms of resources and in terms of potential negative impact), and I'm having trouble understanding your current strategy of seeking this information. Again, I'm not sure where the core of our disagreement lies, so perhaps I should wait for your reply on that before going further.
Quick note: I'm glad we're doing this. Let's keep going.
Is it easy for you to sum up your core disagreements with Eliezer and with Paul? That would be pretty useful to my own strategic thinking.
As for where our core strategic disagreements are… I skimmed through your posts that looked fairly strategy-relevant: Cynical explanations of FAI critics, W...