LESSWRONG
LW

Jesse Richardson — LessWrong

Proposal: Liquid Prediction Markets for AI Forecasting

Jesse Richardson

9mo

Background

Polymarket is a prediction market platform where people can trade contracts representing the probability of events occurring (e.g., "Who will win the 2024 Presidential election").
Trading volume on Polymarket is heavily influenced by liquidity rewards, a program which pays traders to place competitive bids and offers in various markets, thereby increasing liquidity and encouraging more trades. Polymarket currently spends an average of $17K/day on liquidity rewards, and can range from $1/day to as much as $5K/day on various individuals.
Polymarket pays for the liquidity rewards itself (thanks, VC money!), and chooses how high to set the rewards on each market.

The problem

Despite significant interest in AI forecasting, there is a lack of liquid high-quality prediction

... (read 712 more words →)

2028 Should Not Be AI Safety's First Foray Into Politics

Jesse Richardson

I liked the idea in this comment that it could be impactful to have someone run for President in 2028 on an AI notkilleveryoneism platform. Even better would be for them to run on a shared platform with numerous candidates for Congress, ideally from both parties. I don't think it's particularly likely to work, or even get off the ground, but it seems worthwhile to explore, given that we don't know what the state of play will be by then. In my view, either the 2024 or the 2028 US Presidential election is probably the most important election in human history, and it's too late to affect the former.

My suggestion is that,... (read 381 more words →)

Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?

Yoshua Bengio

Yoshua Bengio, Jesse Richardson, dwk, mattmacdermott

A new paper by Yoshua Bengio and the Safe Artificial Intelligence For Humanity (SAIFH) team argues that the current push towards building generalist AI agents presents catastrophic risks, creating a need for more caution and an alternative approach. We propose such an approach in the form of Scientist AI, a non-agentic AI system that aims to be the foundation for safe superintelligence. (Note that this paper is intended for a broad audience, including readers unfamiliar with AI safety.)

Abstract

The leading AI companies are increasingly focused on building generalist AI agents—systems that can autonomously plan, act, and pursue goals across almost all tasks that humans can perform. Despite how useful these systems might be,

... (read 3097 more words →)

To an LLM, everything looks like a logic puzzle

Jesse Richardson

I keep seeing this meme doing the rounds where people present ChatGPT with a common logic problem or riddle, only with some key component changed to make it trivial. ChatGPT has seen the original version a million times, so it gives the answer to the original, not the actually correct and obvious answer.

Screenshot of ChatGPT 4o dialog

You:

The emphatically male surgeon who is also the boy’s father says, “I can’t operate on this boy! He’s my son!” How is this possible?

ChatGPT:

The surgeon is the boy's mother.

The idea is to show that ChatGPT isn't intelligent, it's just reciting what it's seen before in similar contexts to the one it's seeing now and there's no actual reasoning taking place. My issue with this is it's pretty clear to me that most humans fail in very similar ways, just at a slightly higher level of complexity.

The clearest way... (read 340 more words →)

I've tried to give (see on the post) a different description of an equivalence relation that I find intuitive and I think gives the space V as we want it, but it may not be fully correct.

Yep I see what you mean, I've changed the setup back to what you wrote with V_1 and V_0. My main concern is the part where we quotient V_1 by an equivalence relation to get V, I found this not super intuitive to follow and I'd ideally love to have a simpler way to express it.

The main part I don't get right now: I see that 1/(c(v+ + w−))*(v+ + w−) and 1/(c(v+ + w−))*(v- + w+) are convex combinations of elements of L and are therefore in L, however it seems to me that these two things being the same corresponds to v+ + w- = v- + w+, which is... (read more)

Replying toWhy Not Subagents?

Jesse Richardson3y

Why Not Subagents?

You recognise this in the post and so set things up as follows: a non-myopic optimiser decides the preferences of a myopic agent. But this means your argument doesn’t vindicate coherence arguments as traditionally conceived. Per my understanding, the conclusion of coherence arguments was supposed to be: you can’t rely on advanced agents not to act like expected-utility-maximisers, because even if these agents start off not acting like EUMs, they’ll recognise that acting like an EUM is the only way to avoid pursuing dominated strategies. I think that’s false, for the reasons that I give in my coherence theorems post and in the paragraph above. But in any case, your argument doesn’t

... (read more)

Replying toWhy Not Subagents?

Jesse Richardson3y

Why Not Subagents?

Can you explain more how this might work?

Replying toWhy Not Subagents?

Jesse Richardson3y

Why Not Subagents?

Epistemic Status: Really unsure about a lot of this.

It's not clear to me that the randomization method here is sufficient for the condition of not missing out on sure gains with probability 1.

Scenario: B is preferred to A, but preference gap between A & C and B & C, as in the post.

Suppose both your subagents agree that the only trades that will ever be offered are A->C and C->B. These trades occur with a Poisson distribution, with $λ$ = 1 for the first trade and $λ$ = 3 for the second. Any trade that is offered must be immediately declined or accepted. If I understand your logic correctly, this would mean randomizing the preferences such... (read more)

Replying toWhy Not Subagents?

Jesse Richardson3y

Why Not Subagents?

Something I have a vague inkling about based on what you and Scott have written is that the same method by which we can rescue the Completeness axiom i.e. via contracts/commitments may also doom the Independence axiom. As in, you can have one of them (under certain premises) but not both?

This may follow rather trivially from the post I linked above so it may just come back to whether that post is 'correct', but it might also be a question of trying to marry/reconcile these two frameworks by some means. I'm hoping to do some research on this area in the next few weeks, let me know if you think it's a dead end I guess!

Replying toWhy Not Subagents?

Jesse Richardson3y

Why Not Subagents?

Really enjoyed this post, my question is how does this intersect with issues stemming from other VNM axioms e.g. Independence as referenced by Scott Garrabrant?

https://www.lesswrong.com/s/4hmf7rdfuXDJkxhfg/p/Xht9swezkGZLAxBrd

It seems to me that you don't get expected utility maximizers solely from not-strong-Incompleteness, as there are other conditions that are necessary to support that conclusion.

Replying toWhy Not Subagents?

Jesse Richardson3y

Why Not Subagents?

Hi EJT, I'm starting research on incomplete preferences / subagents and would love to see this entry too if possible!

Replying toThe Pointers Problem: Human Values Are A Function Of Humans' Latent Variables

Jesse Richardson3y

The Pointers Problem: Human Values Are A Function Of Humans' Latent Variables

Furthermore, human values are over the “true” values of the latents, not our estimates - e.g. I want other people to actually be happy, not just to look-to-me like they’re happy.

I'm not sure that I'm convinced of this. I think when we say we value reality over our perception it's because we have no faith in our perception to stay optimistically detached from reality. If I think about how I want my friends to be happy, not just appear happy to me, it's because of a built-in assumption that if they appear happy to me but are actually depressed, the illusion will inevitably break. So in this sense I care not just... (read more)

Replying toDoes Agent-like Behavior Imply Agent-like Architecture?

Jesse Richardson3y

Does Agent-like Behavior Imply Agent-like Architecture?

Would it perhaps be helpful to think of agent-like behavior as that which takes abstractions as inputs, rather than only raw physical inputs? e.g. an inanimate object such as a rock only interacts with the world on the level of matter, not on the level of abstraction. A rock is affected by wind currents according to the same laws, regardless of the type of wind (breeze, tornado, hurricane), while an agent may take different actions or assume different states dependent on the abstractions the wind has been reduced to in its world model.