TheOtherDave comments on The Human's Hidden Utility Function (Maybe) - Less Wrong
At a glance, I might be more comfortable embracing an extrapolation of the combination of the model-based system's preferences and the Pavlovian system's preferences.
Admittedly, a first step in extrapolating the Pavlovian system's preferences might be to represent its various targets as goals in a model, thereby leaving the extrapolator with a single system to extrapolate, but given that 99% of the work takes place after this point, I'm not sure how much I care. Much more important is not to lose track of the Pavlovian system's preferences accidentally.