Roko comments on A Master-Slave Model of Human Preferences - Less Wrong

58 Post author: Wei_Dai 29 December 2009 01:02AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (80)

You are viewing a single comment's thread. Show more comments above.

Comment deleted 29 December 2009 11:11:31PM *  [-]
Comment author: Vladimir_Nesov 29 December 2009 11:15:09PM *  1 point [-]

If the agent has a well-defined "predictive module" which has a "map" (probability distribution over the environment given an interaction history), and some "other stuff", then you can clamp the predictive module down to the truth, and then perform what I said before:

Yeah, maybe. But it doesn't.

Comment deleted 30 December 2009 02:03:05PM [-]
Comment deleted 30 December 2009 02:18:12PM *  [-]
Comment author: Vladimir_Nesov 01 January 2010 04:44:46PM 2 points [-]

What is left of the time cube guy once you subtract off his false beliefs and delusions? Not much, probably.

Beware: you are making a common sense-based prediction about what would be the output of a process that you don't even have the right concepts for specifying! (See my reply to your other comment.)

Comment author: SilasBarta 10 January 2010 04:10:10PM 1 point [-]

Wow. Too bad I missed this when it was first posted. It's what I wish I'd said when justifying my reply to Wei_Dai's attempted belief/values dichotomy here and here.

Comment deleted 10 January 2010 06:09:24PM *  [-]
Comment author: SilasBarta 10 January 2010 08:25:13PM 0 points [-]

Indeed. Most of the FAI's job could consist of saying, "Okay, there's soooooo much I have to disentangle and correct before I can even begin to propose solutions. Sit down and let's talk."