paulfchristiano comments on Proper value learning through indifference - Less Wrong

16 Post author: Stuart_Armstrong 19 June 2014 09:39AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (50)

You are viewing a single comment's thread. Show more comments above.

Comment author: paulfchristiano 30 October 2015 02:56:41PM 1 point [-]

Yes, we can't build models today that reliably make these kinds of inferences. But if we consider a model which is architecturally identical, yet improved far enough to make good predictions, it seems like it would be able to make this kind of inference.

As Stuart points out, the hard part is pointing to the part of the model that you want to access. But for that you don't have to define "freely, unpressured and unmanipulated." For example, it would be sufficient to describe any environment that is free of pressure, rather than defining pressure in a precise way.