Lukas Petersson

Towards A Happy Future With AI Employers

Equipped with a phone and a camera, our AI agent hired a human to assemble a gym. Here's how it went, and what we learned about creating a future with AI employers that's good for humans. In a previous post, we introduced Bengt, our office AI. We gave him a...

Feb 1612

Should LLMs accept invites to Epstein's island?

I got LLMs to say some pretty crazy stuff using context injection jailbreaking. I wrote a post about it (https://lukaspetersson.com/blog/2025/context-epstein), but I am genuinely confused whether this is bad or not. Would love to hear your opinions. Specifically, I inserted tool-call messages into their context so that from their POV...

Dec 14, 20255

LLM robots can't pass butter (and they are having an existential crisis about it)

TLDR: Andon Labs, evaluates AI in the real world to measure capabilities and to see what can go wrong. For example, we previously made LLMs operate vending machines, and now we're testing if they can control robots at offices. There are two parts to this test: 1. We deploy LLM-controlled...

Oct 28, 2025105

You can't eval GPT5 anymore

The GPT-5 API is aware of today's date (no other model provider does this). This is problematic because the model becomes aware that it is in a simulation when we run our evals at Andon Labs. Here are traces from gpt-5-mini. Making it aware of the "system date" is a...

Sep 18, 2025168

AI misbehaviour in the wild from Andon Labs' Safety Report

We released our first Safety Report with AI misbehaviour in the wild. I think Andon Labs' AI vending machines provide a unique opportunity to study AI safety on real-life data, and we intend to share alarming incidents of AI misbehavior from these deployments periodically. As of August 2025, we've found...

Aug 28, 202539

The Same Heaven

Ikigai There is a village in Japan with an unusual density of 100 year olds. This caught the interest of a group of scientists. To their surprise, the people in the village didn’t live healthier lives; they didn’t exercise more, eat healthier food nor did they sleep more. The one...

Apr 7, 20257

Linguistic Imperialism in AI: Enforcing Human-Readable Chain-of-Thought

Revisiting AI Doom Scenarios Traditional AI doom scenarios usually assumed AI would inherently come with agency and goals. This seemed likely back when AlphaGo and other reinforcement learning (RL) systems were the most powerful AIs. When large language models (LLMs) finally brought powerful AI capabilities, these scenarios didn't quite fit:...

Feb 21, 20255

LESSWRONG
LW

LESSWRONG
LW

Lukas Petersson

Lukas Petersson

You can't eval GPT5 anymore

LLM robots can't pass butter (and they are having an existential crisis about it)

AI Safety as a YC Startup

AI misbehaviour in the wild from Andon Labs' Safety Report

Lukas Petersson

You can't eval GPT5 anymore

LLM robots can't pass butter (and they are having an existential crisis about it)

AI Safety as a YC Startup

AI misbehaviour in the wild from Andon Labs' Safety Report

Towards A Happy Future With AI Employers

Should LLMs accept invites to Epstein's island?

LLM robots can't pass butter (and they are having an existential crisis about it)

You can't eval GPT5 anymore

AI misbehaviour in the wild from Andon Labs' Safety Report

The Same Heaven

Linguistic Imperialism in AI: Enforcing Human-Readable Chain-of-Thought