Tetronian comments on Best career models for doing research? - Less Wrong

27 Post author: Kaj_Sotala 07 December 2010 04:25PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (999)

You are viewing a single comment's thread. Show more comments above.

Comment author: [deleted] 09 December 2010 07:15:14PM 8 points [-]

Let me try to rephrase: correct FAI theory shouldn't have dangerous ideas. If we find that the current version does have dangerous ideas, then this suggests that we are on the wrong track. The "Friendly" in "Friendly AI" should mean friendly.

Comment author: Eliezer_Yudkowsky 09 December 2010 07:20:10PM 8 points [-]

Pretty much correct in this case. Roko's original post was, in fact, wrong; correctly programmed FAIs should not be a threat.

Comment author: Vladimir_Nesov 09 December 2010 07:25:12PM 10 points [-]

(FAIs shouldn't be a threat, but a theory to create a FAI will obviously have at least potential to be used to create uFAIs. FAI theory will have plenty of dangerous ideas.)

Comment author: XiXiDu 09 December 2010 07:40:41PM 5 points [-]

I want to highlight at this point how you think about similar scenarios:

I do think that TORTURE is the obvious option, and I think the main instinct behind SPECKS is scope insensitivity.

That isn't very reassuring. I believe that if you had the choice of either letting a Paperclip maximizer burn the cosmic commons or torture 100 people, you'd choose to torture 100 people. Wouldn't you?

...correctly programmed FAIs should not be a threat.

They are always a threat to some beings. For example beings who oppose CEV or other AI's. Any FAI who would run a human version of CEV would be a potential existential risk to any alien civilisation. If you accept all this possible oppression in the name of what is subjectively friendliness, how can I be sure that you don't favor torture for some humans that support CEV, in order to ensure it? After all you already allow for the possibility that many beings are being oppressed or possible killed.

Comment author: wedrifid 09 December 2010 07:44:16PM 3 points [-]

They are always a threat to some beings. For example beings who oppose CEV or other AI's. Any FAI who would run a human version of CEV would be a potential existential risk to any alien civilisation.

This seems to be true and obviously so.

Comment author: Vladimir_Nesov 09 December 2010 07:43:08PM *  1 point [-]

...correctly programmed FAIs should not be a threat.

They are always a threat to some beings.

Narrowness. You can parry almost any statement like this, by posing a context outside its domain of applicability.

Comment deleted 09 December 2010 11:23:52PM *  [-]
Comment author: Vladimir_Nesov 09 December 2010 11:38:11PM *  5 points [-]

From my point of view, and as I discussed in the post (this discussion got banned with the rest, although it's not exactly on that topic), the problem here is the notion of "blackmail". I don't know how to formally distinguish that from any other kind of bargaining, and the way in which Roko's post could be wrong that I remember required this distinction to be made (it could be wrong in other ways, but that I didn't notice at the time and don't care to revisit).

(The actual content edited out and posted as a top-level post.)

Comment author: cousin_it 09 December 2010 11:48:18PM *  1 point [-]

(I seem to have a talent for writing stuff, then deleting it, and then getting interesting replies. Okay. Let it stay as a little inference exercise for onlookers! And please nobody think that my comment contained interesting secret stuff; it was just a dumb question to Eliezer that I deleted myself, because I figured out on my own what his answer would be.)

Thanks for verbalizing the problems with "blackmail". I've been thinking about these issues in the exact same way, but made no progress and never cared enough to write it up.

Comment author: Perplexed 10 December 2010 01:39:10AM 4 points [-]

Perhaps the reason you are having trouble coming up with a satisfactory characterization of blackmail is that you want a definition with the consequence that it is rational to resist blackmail and therefore not rational to engage in blackmail.

Pleasant though this might be, I fear the universe is not so accomodating.

Elsewhere VN asks how to unpack the notion of a status-quo, and tries to characterize blackmail as a threat which forces the recipient to accept less utility than she would have received in the status quo. I don't see any reason in game theory why such threats should be treated any differently than other threats. But it is easy enough to define the 'status-quo'.

The status quo is the solution to a modified game - modified in such a way that the time between moves increases toward infinity and the current significance of those future moves (be they retaliations or compensations) is discounted toward zero. A player who lives in the present and doesn't respond to delayed gratification or delayed punishment is pretty much immune to threats (and to promises).

Comment author: David_Gerard 09 December 2010 11:30:12PM *  4 points [-]

Another pointless flamewar.

On RW it's called Headless Chicken Mode, when the community appears to go nuts for a time. It generally resolves itself once people have the yelling out of their system.

The trick is not to make any decisions based on the fact that things have gone into headless chicken mode. It'll pass.

[The comment this is in reply to was innocently deleted by the poster, but not before I made this comment. However, I think I'm making a useful point here, so would prefer to keep this comment.]

Comment deleted 09 December 2010 08:08:04PM *  [-]
Comment author: Eliezer_Yudkowsky 09 December 2010 08:22:03PM 1 point [-]

No, the rationale for deletion was not based on the possibility that his exact, FAI-based scenario could actually happen.

Comment author: wedrifid 09 December 2010 08:31:28PM 2 points [-]

No, the rationale for deletion was not based on the possibility that his exact, FAI-based scenario could actually happen.

What was the grandparent?

Comment author: ata 09 December 2010 08:38:16PM *  3 points [-]

Hm? Did my comment get deleted? I still see it.

Comment author: komponisto 09 December 2010 09:10:29PM 2 points [-]

I noticed you removed the content of the comment from the record on your user page. I would have preferred you not do this; those who are sufficiently curious and know about the trick of viewing the user page ought to have this option.

Comment author: ata 09 December 2010 09:15:13PM *  1 point [-]

There was nothing particularly important or interesting in it, just a question I had been mildly curious about. I didn't think there was anything dangerous about it either, but, as I said elsewhere, I'm willing to take Eliezer's word for it if he thinks it is, so I blanked it. Let it go.

Comment author: komponisto 09 December 2010 09:33:58PM *  7 points [-]

I'm willing to take Eliezer's word for it if he thinks it is, so I blanked it

I know why you did it. My intention is to register disagreement with your decision. I claim it would have sufficed to simply let Eliezer delete the comment, without you yourself taking additional action to further delete it, as it were.

Let it go.

I could do without this condescending phrase, which unnecessarily (and unjustifiably) imputes agitation to me.

Comment author: ata 10 December 2010 11:11:17PM *  2 points [-]

I could do without this condescending phrase, which unnecessarily (and unjustifiably) imputes agitation to me.

Sorry, you're right. I didn't mean to imply condescension or agitation toward you; it was written in a state of mild frustration, but definitely not at or about your post in particular.

Comment author: Vladimir_Nesov 09 December 2010 09:14:37PM 1 point [-]

Only if you disagree with correctness of moderator's decision.

Comment author: komponisto 09 December 2010 09:17:41PM *  1 point [-]

Disagreement may be only partial. One could agree to the extent of thinking that viewing of the comment ought to be restricted to a more narrowly-filtered subset of readers.

Comment author: Vladimir_Nesov 09 December 2010 09:20:02PM 0 points [-]

Yes, this is a possible option, depending on the scope of moderator's decision. Banning comments from a discussion, even if they a backed up and publicly available elsewhere, is still an effective tool in shaping the conversation.

Comment author: waitingforgodel 09 December 2010 08:44:38PM 1 point [-]

yep, this one is showing as deleted

Comment author: wedrifid 09 December 2010 08:46:05PM 0 points [-]

Weird. I see:

09 December 2010 08:08:04PM* Comment deleted [-]

How does Eliezer's delete option work exactly? It stays visible to the author? Now I'm curious.

Comment deleted 09 December 2010 09:04:55PM *  [-]
Comment author: wedrifid 09 December 2010 09:06:30PM *  15 points [-]

Whats frustrating is I would have had no idea it was deleted- and just assumed it wasn't interesting to anyone, had I not checked after reading the above. I'd much rather be told to delete the relevant portions of the comment- lets at least have precise censorship!

Wow. Even the people being censored don't know it. That's kinda creepy!

his comment led me to discover that quite a long comment I made a little bit ago had been deleted entirely.

How did you work out that it had been deleted? Just by logging out, looking and trying to remember where you had stuff posted?

Comment author: Vladimir_Nesov 09 December 2010 09:09:23PM 17 points [-]

I think it's a standard tool: trollish comments look like being ignored to the trolls. But I think it's impolite to delete comments made in good faith without notification and usable guidelines for cleaning up and reposting. (Hint hint.)

Comment author: Jack 09 December 2010 09:15:09PM *  3 points [-]

How did you work out that it had been deleted? Just by logging out, looking and trying to remember where you had stuff posted?

I only made one comment on the subject and I was rather confused that it was being ignored. I also knew I might have said too much about the Roko post and actually included a sentence saying that if I crossed the line I'd appreciate being told to edit it instead of having the entire thing deleted. So I just checked that one comment in particular. If other comments of mine have been deleted I wouldn't know about it, though this was the only comment in which I have discussed the Roko post.

Comment author: [deleted] 09 December 2010 09:08:02PM 3 points [-]

I doubt that this is a deliberate feature.

Comment author: ata 09 December 2010 08:47:46PM *  -1 points [-]

Yes, I've been told that it was deleted but that I still see it since I'm logged in.

In that case I won't repeat what I said in it, partly because it'll just be deleted again but mainly because I actually do trust Eliezer's judgment on this. (I didn't realize that I was saying more than I was supposed to.) All I'll say about it is that it did not actually contain the question that Eliezer's reply suggests he thought it was asking, but it's really not important enough to belabor the point.

Comment deleted 09 December 2010 08:50:59PM *  [-]
Comment deleted 09 December 2010 09:04:05PM [-]
Comment author: Eliezer_Yudkowsky 09 December 2010 09:00:45PM -1 points [-]

Reposting comments deleted by the authors or by moderators will be considered hostile behavior and interfering with the normal and intended behavior of the site and its software, and you will be asked to leave the Less Wrong site.

-- Eliezer Yudkowsky, Less Wrong Moderator.

Comment author: Jack 09 December 2010 07:21:18PM 1 point [-]

This is certainly the case with regard to the kind of decision theoretic thing in Roko's deleted post. I'm not sure if it is the case with all ideas that might come up while discussing FAI.