Gunnar_Zarncke comments on Tackling the subagent problem: preliminary analysis - Less Wrong

5 Post author: Stuart_Armstrong 12 January 2016 12:26PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (16)

You are viewing a single comment's thread. Show more comments above.

Comment author: SilentCal 14 January 2016 07:21:44PM 2 points [-]

That's essentially what these posts are to me, except instead of a video game it's pen-and-paper with Stuart Armstrong as DM :).

It might be worth the extra motivation of writing up a framing with evil AI designers applying the proposed controls. I'll consider doing this on future posts.

Comment author: Gunnar_Zarncke 14 January 2016 09:28:57PM 0 points [-]

Awesome! Stuart Armstrong be our Dungeon Master! :-) I haven't seen you write up your responses to our DM though. I'd like to see them.

Comment author: SilentCal 15 January 2016 06:57:45PM 1 point [-]

I've made a few shots, e.g. at http://lesswrong.com/r/discussion/lw/mfq/presidents_asteroids_natural_categories_and/cjkr and http://lesswrong.com/lw/m25/high_impact_from_low_impact/cah1. There's no explicit role-playing, but I was very much in the mindset of trying to break the protection scheme.

I haven't been keeping up with these posts as well lately.