ChristianKl comments on Boxing an AI? - Less Wrong

2 Post author: tailcalled 27 March 2015 02:06PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (39)

You are viewing a single comment's thread.

Comment author: ChristianKl 30 March 2015 08:40:58PM 0 points [-]

This lets us adjust their morality until the AIs act sensibly.

The difficult thing isn't to have the AI act sensibly in the medium term. The difficult thing is to have it's values stay stable under self modification and to complex problems right like not wireheading everyone right.

Comment author: tailcalled 30 March 2015 08:49:49PM 0 points [-]

This would definitely let you test the values-stable-under-self-modification. Just plonk the AI in an environment where it can self-modify and keep track of its values. Since this is not dependent on morality, you can just give it easily-measurable values.