You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

dlthomas comments on CEV-inspired models - Less Wrong Discussion

7 Post author: Stuart_Armstrong 07 December 2011 06:35PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (41)

You are viewing a single comment's thread. Show more comments above.

Comment author: dlthomas 08 December 2011 06:07:15PM *  0 points [-]

It only causes me to wonder, what if we are literally too stupid to even comprehend the best possible CEV that can be offered to us?

I think this doesn't matter, if we can

1) successfully define the CEV concept itself,

2) define a suitable reference class,

3) build a superintelligence, and

4) ensure that the superintelligence continues to pursue the best CEV it can find for the appropriate reference class.

Comment author: TheOtherDave 08 December 2011 06:27:41PM 1 point [-]

Well, it would be helpful if we could also:
2.5) work out a reliable test for whether a given X really is an instance of the CEV concept for the given reference class

Which seems to depend on having some kind of understanding.

Lacking that, we are left with having to trust that whatever the SI we've built is doing is actually what we "really want" it to do, even if we don't seem to want it to do that, which is an awkward place to be.