You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

SilentCal comments on Tackling the subagent problem: preliminary analysis - Less Wrong Discussion

5 Post author: Stuart_Armstrong 12 January 2016 12:26PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (16)

You are viewing a single comment's thread. Show more comments above.

Comment author: SilentCal 14 January 2016 07:21:44PM 2 points [-]

That's essentially what these posts are to me, except instead of a video game it's pen-and-paper with Stuart Armstrong as DM :).

It might be worth the extra motivation of writing up a framing with evil AI designers applying the proposed controls. I'll consider doing this on future posts.

Comment author: Gunnar_Zarncke 14 January 2016 09:28:57PM 0 points [-]

Awesome! Stuart Armstrong be our Dungeon Master! :-) I haven't seen you write up your responses to our DM though. I'd like to see them.

Comment author: SilentCal 15 January 2016 06:57:45PM 1 point [-]

I've made a few shots, e.g. at http://lesswrong.com/r/discussion/lw/mfq/presidents_asteroids_natural_categories_and/cjkr and http://lesswrong.com/lw/m25/high_impact_from_low_impact/cah1. There's no explicit role-playing, but I was very much in the mindset of trying to break the protection scheme.

I haven't been keeping up with these posts as well lately.