Probabilistic Negotiation
Follow-up to *Deterministic Strategies Can Be Sub-optimal*.

The Ultimatum Game is a simple experiment. Two people have been allocated $10. One person decides how to divide the money, and the other decides whether to Accept that allocation or to Deny it, in which case both participants get $0. Suppose you are the person whose job it is to choose whether to Accept or Deny an offer. What strategy could you use to maximize your returns?

Yudkowsky offers the following solution (NB: the original text splits $12, because sci-fi; I have changed the numbers inline, without brackets, let me know if that offends):

> It goes like this:
>
> When somebody offers you a 6:4 split, instead of the 5:5 split that would be fair, you should accept their offer with slightly less than 5/6 probability. Their expected value from offering you 6:4, in this case, is 6 * slightly less than 5/6, or slightly less than 5. This ensures they can't do any better by offering you an unfair split; but neither do you try to destroy all their expected value in retaliation. It could be an honest mistake, especially if the real situation is any more complicated than the original Ultimatum Game.
>
> If they offer you 7:3, accept with probability slightly-more-less than 5/7, so they do even worse in their own expectation by offering you 7:3 than 6:4.
>
> It's not about retaliating harder, the harder they hit you with an unfair price - that point gets hammered in pretty hard to the kids, a Watcher steps in to repeat it. The circumstances under which you should ever go around carrying out counterfactual threats in real life are much more fraught and complicated than this, and nobody's going to learn about them realistically for several years yet. This setup isn't about retaliation, it's about what both sides have to do, to turn the problem of dividing the gains, into a matter of fairness; to create the incentive setup whereby both sides don't expect to do any better by distorting their own estimate of what is 'fair'.
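To make the arithmetic concrete, here is a minimal Python sketch of that responder strategy (the `EPSILON` margin and the function names are mine, chosen for illustration): accept any offer at or above the fair split, and accept an unfair split with probability slightly less than fair_share / proposer_share, so the proposer's expected take stays strictly below the fair $5.

```python
import random

TOTAL = 10               # total dollars to split
FAIR_SHARE = TOTAL / 2   # $5 each is the fair outcome
EPSILON = 0.01           # the "slightly less than" margin; illustrative choice

def acceptance_probability(proposer_share: float) -> float:
    """Probability of accepting, given how much the proposer keeps."""
    if proposer_share <= FAIR_SHARE:
        return 1.0  # fair (or generous) offers are always accepted
    # Accept with slightly less than FAIR_SHARE / proposer_share, so the
    # proposer's expected value lands slightly below the fair $5.
    return FAIR_SHARE / proposer_share - EPSILON

def respond(proposer_share: float) -> bool:
    """Randomized Accept/Deny decision."""
    return random.random() < acceptance_probability(proposer_share)

# Proposer's expected value for a few splits:
for share in (5, 6, 7, 8):
    ev = share * acceptance_probability(share)
    print(f"{share}:{TOTAL - share} split -> proposer EV = ${ev:.2f}")
# 5:5 -> $5.00, 6:4 -> $4.94, 7:3 -> $4.93, 8:2 -> $4.92
```

One nice property of a fixed `EPSILON`: the proposer's penalty is `proposer_share * EPSILON`, which grows with their greed, so a 7:3 offer really does have lower expected value than 6:4, matching the "slightly-more-less" condition in the quote.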
RE: GPT getting dumber, that paper is horrendous.
The code-gen portion was completely thrown off by Markdown syntax (the authors mistook back-ticks for single quotes, afaict); a sketch of the failure mode is below. I think the update to make there is that it is decent evidence that there was some RLHF on ChatGPT outputs. If you remember the "a human being will die if you don't reply with pure JSON" tweet, even that final JSON code was wrapped in a markdown fence. My modal guess is that the markdown was inserted via a kludge to make the ChatGPT UX better, and then RLHF was done on that kludged output; code sections are often mislabeled as to what language they contain. My secondary guess...
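As a concrete illustration of the back-tick problem (hypothetical, not the paper's actual harness; all names here are mine): if an evaluator feeds the raw chat reply straight to `exec`, a correct but markdown-fenced answer scores as "not directly executable", while stripping the fence first recovers it.

```python
import re

# A typical ChatGPT-style reply: correct code, wrapped in a markdown fence.
response = """```python
def is_prime(n):
    if n < 2:
        return False
    return all(n % d for d in range(2, int(n ** 0.5) + 1))
```"""

def naive_is_executable(text: str) -> bool:
    """Score the raw reply as-is, treating any SyntaxError as a failure."""
    try:
        exec(text, {})
        return True
    except SyntaxError:
        return False

def strip_markdown_fences(text: str) -> str:
    """Remove a ```lang ... ``` wrapper before evaluating."""
    match = re.search(r"```(?:\w+)?\n(.*?)```", text, re.DOTALL)
    return match.group(1) if match else text

print(naive_is_executable(response))                         # False
print(naive_is_executable(strip_markdown_fences(response)))  # True
```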