You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

cousin_it comments on The virtual AI within its virtual world - Less Wrong Discussion

6 Post author: Stuart_Armstrong 24 August 2015 04:42PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (34)

You are viewing a single comment's thread.

Comment author: cousin_it 24 August 2015 08:22:57PM *  1 point [-]

Yeah, this should work correctly, assuming that the AI's prior specifies just one mathematical world, rather than e.g. a set of possible mathematical worlds weighted by simplicity. I posted about something similar five years ago.

The application to "fake cancer" is something that hadn't occurred to me, and it seems like a really good idea at first glance.

Comment author: Stuart_Armstrong 25 August 2015 10:27:12AM 1 point [-]

Thanks, that's useful. I'll think how to formalise this correctly. Ideally I want a design where we're still safe if a) the AI knows, correctly, that pressing a button will give it extra resources, but b) still doesn't press it because its not part of its description.