RichardKennaway comments on Work harder on tabooing "Friendly AI" - Less Wrong

18 Post author: ChrisHallquist 20 May 2012 08:51AM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (51)

You are viewing a single comment's thread. Show more comments above.

Comment author: RichardKennaway 12 June 2012 07:07:40AM 2 points [-]

It doesn't give you a utility-based model. A model is a structure whose parts correspond to parts of the thing modelled, and which interact in the same way as in the thing modelled. This post-hoc utility function does not correspond to anything.

What next? Label with 1 everything that happens and 0 everything that doesn't and call that a utliity-based model of the universe?

Comment author: timtyler 12 June 2012 09:50:54AM *  -2 points [-]

Here, I made it pretty clear from the beginning that I was starting with an existing model - and then modifying it. A model with a few bits strapped onto it is still a model.

Comment author: RichardKennaway 12 June 2012 10:10:08AM 4 points [-]

If I stick a hamburger on my car, the car is still a car -- but the hamburger plays no part in what makes it a car.

Comment author: timtyler 12 June 2012 10:30:23AM -1 points [-]

AFAICS, I never made the corresponding claim - that the utility function was part of what made the model a model.

Comment author: RichardKennaway 12 June 2012 12:02:47PM 2 points [-]

How else can I understand your words "utility-based models"? This is no more a utility-based model than a hamburger on a car is a hamburger-based car.

Comment author: timtyler 12 June 2012 11:28:09PM *  -1 points [-]

Well, I would say "utilitarian", but that word seems to be taken. I mean that the model calculates utilities associated with its possible actions - and then picks the action with the highest utility.

Comment author: RichardKennaway 13 June 2012 08:57:21AM 2 points [-]

But that is exactly what this wrapping in a post-hoc utility function doesn't do. The model first picks an action in whatever way it does, then labels that with utility 1.