Legionnaire

Legionnaire10moQuick Take

Months ago I suggested that you could manipulate the popular LLMs by mass publishing ideological text online. Well this has now been done by Russia.

Replying toSo how well is Claude playing Pokémon?

Legionnaire1y

So how well is Claude playing Pokémon?

Me and my college educated wife recently got stuck playing Lego Star wars... Our solution was to go to Google it. Some of these games are poorly designed and very unintuitive as others have said. Especially a game this old. Seems like they should give Claude some limited Google searches at least.

The earliest Harry Potter games had help hotlines you could call, which we had to do once when I was 9.

It's hilarious it thinks the game might be broken sometimes, like an angry teenager claiming lag when he loses a firefight in CoD.

Replying toA Bear Case: My Predictions Regarding AI Progress

Legionnaire1y

A Bear Case: My Predictions Regarding AI Progress

It will not meaningfully generalize beyond domains with easy verification

Why can't we make every domain have automated verification? (I wont claim easy, but easy enough to do with finite resources) Agency, for instance, is verifiable in competitive games of arbitrary difficulty and scale. Just check who won. DeepMind has already done this to some degree with language models and virtual agents a year ago. https://deepmind.google/discover/blog/sima-generalist-ai-agent-for-3d-virtual-environments/

Every other trait we care about is instrumental in agency to some degree, and the games can be customized to focus on various aspects as well, just like you focus a class in school.

Replying toHave LLMs Generated Novel Insights?

LegionnaireFeb 25, 2025

Have LLMs Generated Novel Insights?

It's hard to see what a novel insight is exactly. Any example can be argued against. Can you give an example of one? Or of one you've personally had?

Various LLMs can spot issues in code bases that are not public. Do all of these count?

Legionnaire1y

Well that puts my concern to rest. Thanks!

Replying toNumberwang: LLMs Doing Autonomous Research, and a Call for Input

Legionnaire1y

Numberwang: LLMs Doing Autonomous Research, and a Call for Input

Would also love to take the tests. If possible you could grab human test subjects from certain areas: a less wrong group, a reddit group, etc.

Legionnaire1yQuick Take

Who is aligning lesswrong? As lesswrong becomes more popularized due to AI growth, I'm concerned the quality of lesswrong discussion and posts has decreased since creating and posting have no filter. Obviously no filter has been a benefit while lesswrong was a hidden gem, only visible to those who can see its value. But as it becomes more popular, i think it should be obvious this site would drop in value if it trended towards reddit. Ideally existing users prevent that, but obviously that will tend to drift if new users can just show up. Are there methods in place for this issue?

Specific example: lots of posts seem like rehashes of things that have already been plainly discussed, and the quick takes section, and discussion on Discord, do a great job of cutting down on this particular issue. So maintaining high quality posts is not a pipe dream!

Legionnaire1y

LLMs can be very good at coming up with names with some work:

A few I liked:
Sacrificial Contest
Mutual Ruin Game
Sacrificial Spiral
Universal Loss Competition
Collective Sacrifice Trap
Competition Deadlock
Competition Spiral
Competition Stalemate
Destructive Contest
Destructive Feedback Competition
Conflict Feedback Spiral

Legionnaire1yQuick Take

Potential political opportunity: LLMs are trained on online data and will continue to be. If I want to make sure they are against communism by default, I could: Auto generate a bunch of public github repositories, Fill them with text I generate using gpt4o mini which is $15 per 4 million letters which I have prompted to be explicitly pro free markets and against communism. Entwine them by posting links to each other and the rest of the internet: highlight, share, fork, and star them to increase likelihood they are included in the dataset.

-6

-2

Legionnaire2yQuick Take

Speculation: LLM Self Play into General Agent?
Suppose you got a copy of GPT4 post fine tuning + hardware to train it. How would the following play out?
1. Give it the rules and state of a competitive game, such as automatically generated tic-tac-toe variants.
2. Prompt it to use chain of thought to consider the best next move and select it.
3. Provide it with the valid set of output choices (like a json format determining action and position, similar to AutoGPT)
4. Run two of these against each other continuously, training on the results of the victor which can be objectively measured by the game's rules.
5. Benchmark it against a tiny subset of those variants... (read more)

Legionnaire's Shortform

Legionnaire

This is a special post for quick takes (aka "shortform"). Only the owner can create top-level comments.

Making 2023 ACX Prediction Results Public

Legionnaire

They say you shouldn't roll your own encryption, which is why I'm posting this here, so it can be unrolled if it's too unsafe.

Problem: Astral Codex Ten finished scoring the 2023 prediction results, but the primary identifier most used for people's score was their email address. Since people wouldn't want those published, what's an easy way to get people their score?

You could email everyone, but then you have to interact with an email server, and then nobody can do cool analysis of the scores and whatever other data is the document.

My proposal:

There are ~10,000 email addresses. Hash the passwords using a hash that only maps to ~10 million values.
Replace the emails in

... (read 237 more words →)

The Moral Copernican Principle

Legionnaire

You ever see people arguing about whether some facet of another culture is good or bad, when suddenly one of them declares the moral high ground can't exist because obviously moral relativity is true? Well that line of reasoning is nonsense, but I don't think people know how to respond to it very well, so it often wins the argument. Consider this article a countermeasure you can reach for.

Moral absolutists say things like "killing is always wrong" and believe that aliens and AI will converge on our moral beliefs.^[1]
Moral relativists say things like "it's just their culture, who are we to say" and believe it's not wrong for other cultures to have... (read 305 more words →)

Why will AI be dangerous?

Legionnaire

LessWrong should offer a short pitch on why AI will be dangerous, and this aims to be that. $^{1}$

Many people think, "Why would humanity make dangerous AI? Seems like a stupid idea. Can't we just make the safe kind?" No. Humanity will make dangerous AI for the same reason we made every other technology dangerous: it's more useful.

A knife sharp enough to cut fruit can cut your finger. Electrical outlets with enough power to run your refrigerator can stop your heart. A car with enough horsepower to carry your family up a hill can easily kill pedestrians. Useful systems must be dangerous because useful implies they can have large effects on their environment.... (read 266 more words →)

LESSWRONG
LW

LESSWRONG
LW

Why will AI be dangerous?

The Moral Copernican Principle

Making 2023 ACX Prediction Results Public

Legionnaire's Shortform

Legionnaire

Legionnaire's Shortform

Making 2023 ACX Prediction Results Public

The Moral Copernican Principle

Why will AI be dangerous?

Legionnaire

Why will AI be dangerous?

The Moral Copernican Principle

Making 2023 ACX Prediction Results Public

Legionnaire's Shortform

Legionnaire

Legionnaire's Shortform

Making 2023 ACX Prediction Results Public

The Moral Copernican Principle

Why will AI be dangerous?