cousin_it comments on Imagine a world where minds run on physics - Less Wrong

12 Post author: cousin_it 31 October 2010 07:09PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (29)

You are viewing a single comment's thread. Show more comments above.

Comment author: cousin_it 04 November 2010 06:38:36PM 0 points [-]

I'm afraid I still don't understand your reasoning. How are "goals" different from "values", in your terms?

Comment author: red75 04 November 2010 08:26:13PM 0 points [-]

Goal is what an agent optimizes for at a given point in time. Value is the initial goal of an agent (in your toy model at least).

In my root post it seems to be optimal for agent A to self-modify into agent A', which optimizes for G2, thus agent A' succeeds in optimizing world according to its values (goal of agent A). But original goal doesn't influence its optimization procedure anymore. Thus if we'll analyze agent A' (without knowledge of world's history), we'll be unable to infer its values (its original goal).

Comment author: cousin_it 04 November 2010 08:30:29PM 1 point [-]

Yes, that seems to be correct.