You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Slider comments on Assessors that are hard to seduce - Less Wrong Discussion

5 Post author: Stuart_Armstrong 09 March 2015 02:19PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (12)

You are viewing a single comment's thread. Show more comments above.

Comment author: Slider 10 March 2015 01:11:49PM 0 points [-]

Wouldn't we know that in this context this would be true (rather than untrue as you write)? Also the degree that the assessor is properly shielded from tampereing means it will become closer to imaginary (no need to mention assessor implementation details, but then it seems to work like "magic" lessening the evidecne to believe in it existence). Also it seems that things that make people turn on religion are valued and here we are counting on the AI not pulling those same stunts.

Hiding the assessor among multiple plausible targets might make the AI play mafia on people (such as trying to get the assessor replaced when it can't (no longer) satisfy it's demands, inhopes that the replacement has easier attitudes or atleast possibility to have flaws to exploit).

Comment author: Stuart_Armstrong 10 March 2015 02:36:56PM 0 points [-]

These can be defined in counterfactual ways, if needed. There need not actually be an assessor, just a small probability of one.

Comment author: Slider 11 March 2015 01:46:25AM 0 points [-]

Wouldn't that be the equivalent of thinking that a Pascal's wager will keep it in check?

Comment author: Stuart_Armstrong 11 March 2015 10:25:22AM 0 points [-]