Stuart_Armstrong comments on High impact from low impact - Less Wrong
You are viewing a comment permalink. View the original post to see all comments and the full post content.
You are viewing a comment permalink. View the original post to see all comments and the full post content.
Comments (12)
The approach I'm trying to get is to be able to make the AI do stuff without having to define hard concepts. "Deflect the meteor but without having undue impact on the world" is a hard concept.
"reduced impact" seems easier, and "false belief" is much easier. It seems we can combine the two in this way to get something we want without needing to define it.