bogdanb — LessWrong

LESSWRONG
LW

bogdanb — LessWrong

You might want to know that I took a look through the site, and was curious, but I just closed the page the moment the “Calculate your contribution” form refused to show me the pricing options unless I gave it an email address.

Replying toWhere I agree and disagree with Eliezer

bogdanb3y

Where I agree and disagree with Eliezer

I’m not sure I understand your weighting argument. Some capabilities are “convergently instrumental” because they are useful for achieving a lot of purposes. I agree that AIs construction techniques will target obtaining such capabilities, precisely because they are useful.

But if you gain a certain convergently instrumental capability, it then automatically allows you to do a lot of random stuff. That’s what the words mean. And most of that random stuff will not be safe.

I don’t get what the difference is between “the AI will get convergently instrumental capabilities, and we’ll point those at AI alignment” and “the AI will get very powerful and we’ll just ask it to be aligned”, other than... (read more)

Replying toWhere I agree and disagree with Eliezer

bogdanb4y

Where I agree and disagree with Eliezer

Exactly. You can’t generalize from “natural” examples to adversarial examples. If someone is trying hard to lie to you about something, verifying what they say can very well be harder than finding the truth would have been absent their input, particularly when you don’t know if and what they want to lie about.

I’m not an expert in any of these and I’d welcome correction, but I’d expect verification to be at least as hard as “doing the thing yourself” in cases like espionage, hacking, fraud and corruption.

Replying toWhere I agree and disagree with Eliezer

bogdanb4y

Where I agree and disagree with Eliezer

AI accelerates the timetable for things we know how to point AI at

It also accelerates the timetable for random things that we don’t expect and don’t even try to point the AI at but that just happen to be easier for incrementally-better AI to do.

Since the space of stuff that helps alignment seems much smaller than the space of dangerous things, you’d expect most things the AI randomly accelerates without us pointing it at will be dangerous.

Replying toParable: The Bomb that doesn't Explode

bogdanb4y

Parable: The Bomb that doesn't Explode

See above. Don’t become a munitions engineer, and, being aware that someone else will take that role, try to prevent anyone from taking that role. (Hint: That last part is very hard.)

The conclusions might change if planet-destroying bombs are necessary for some good reason, or if you have the option of safely leaving the planet and making sure nobody that comes with you will also want to build planet-destroying bombs. (Hint: That last part is still hard.)

Replying toBragging Thread May 2015

bogdanb11y

Bragging Thread May 2015

For what it’s worth, the grammar and spelling was much better than is usual for even the native English part of the Internet. That’s probably fainter praise than it deserves, I don’t remember actually noticing any such fault, which probably means there are few of them.

The phrasing and wording did sound weird, but I guess that’s at least one reason why you’re writing, so congratulations and I hope you keep it up! I’m quite curious to see where you’ll take it.

Replying toHarry Potter and the Methods of Rationality discussion thread, February 2015, chapter 113

bogdanb11y

Harry Potter and the Methods of Rationality discussion thread, February 2015, chapter 113

Indeed, the only obvious “power” Harry has that is (as far as we know) unique to him is Partial Transfiguration. I’m not sure if Voldie “knows it not”; as someone mentioned last chapter, Harry used it to cut trees when he had his angry outburst in the Forbidden Forest, and in Azkhaban as well. In the first case Voldie was nearby, allegedly to watch out for Harry, but far enough that to be undetectable via their bond, so it’s possible he didn’t see what exact technique Harry used. In Azkhaban as well he was allegedly unconscious.

I can’t tell if he could have deduced the technique only by examining the results. (At... (read more)

Replying toHarry Potter and the Methods of Rationality discussion thread, February 2015, chapter 104

bogdanb11y

Harry Potter and the Methods of Rationality discussion thread, February 2015, chapter 104

Well, we only know that Harry feels doom when near Q and/or his magic, and that in one case in Azkhaban something weird happened when Harry’s Patronus interacted with what appeared to be an Avada Kedavra bolt, and that Q appears to avoid touching Harry.

Normally I’d say that faking the doom sensations for a year, and faking being incapacitated while trying to break someone out of Azkhaban, would be too complicated. But in this case...

Replying toThe Great Filter is early, or AI is hard

bogdanb11y

The Great Filter is early, or AI is hard

Both good points, thank you.

Replying toThe Great Filter is early, or AI is hard

bogdanb11y

The Great Filter is early, or AI is hard

Thank you, that was very interesting!

Harry Potter and the Methods of Rationality discussion thread, part 13, chapter 81

bogdanb

14y

This is a new thread to discuss Eliezer Yudkowsky’s Harry Potter and the Methods of Rationality and anything related to it. This thread is intended for discussing chapter 81, which should be published later today. The previous thread passed 400 comments as of the time of this writing, so it will pass 500 comments soon after the next chapter is posted, if not before. I suggest refraining from commenting here until chapter 81 is posted; comment in the 12th thread until you read chapter 81. After chapter 81 is posted, I suggest all discussion of previous guesses be kept here, with links to comments in the previous thread.

There is now a site... (read 200 more words →)

1112

Musings on probability

bogdanb

16y

I read this comment, and after a bit of rambling I realized I was as confused as the poster. A bit more thinking later I ended up with the “definition” of probability under the next heading. It’s not anything groundbreaking, just a distillation (specifically, mine) of things discussed here over the time. It’s just what my brain thinks when I hear the word.

But I was surprised and intrigued when I actually put it in writing and read it back and thought about it. I don’t remember seeing it stated like that (but I probably read some similar things).

It probably won’t teach anyone anything, but it might trigger a similar “distillation” of “mind... (read 1736 more words →)