User Comment Replies

Any “huge R&D center” constraint is trivialized in a future where agile, powerful robots will be ubiquitous and an AGI can use robots to create an underground lab in the middle of nowhere, using its superintelligence to be undetectable in all ways that are physically possible. An AGI will also be able to use robots and 3D printers to fabricate purpose-built machines that enable it to conduct billions of physical experiments a day. Sure, it would be harder to construct something like a massive particle accelerator, but 1) that isn’t needed to make killer nanobots 2) even that isn’t impossible for a sufficiently intelligent machine to create covertly and quickly.

AGI Ruin: A List of Lethalities

Keenmaster3y43

But we can refuse to be satisified with instructions that look like "cut the red one, then blue, etc...". We should request that the AI writing the textbook explain from first principles why that will work, in a way that is maximally comprehensible by a human or team of humans.

3Tapatakt3y

Did you mean "in a way that maximally convinces a human or a team of humans that they understand everything"? I don't think this is a good idea.

AGI Ruin: A List of Lethalities

Keenmaster3y*10

It seems like the solution space to the existential threat of AGI can be described as follows:

Solutions which convey a credible threat* to all AGI that we will make it physically impossible** for them to either achieve X desirable outcome and/or prevent Y undesirable outcome where the value of X or cost of Y exponentially exceeds the value obtained by eradicating humanity, if they decide to eradicate humanity, such that even a small chance of the threat materializing makes eradication a poor option***.

*Probably backed by a construction of some kind (e.g. E... (read more)

LESSWRONG
LW

All of Keenmaster's Comments + Replies