One winter a grasshopper, starving and frail, approaches a colony of ants drying out their grain in the sun to ask for food, having spent the summer singing and dancing.

Then, various things happen.

Customize

450Welcome to LessWrong!

Ruby, Raemon, RobertM, habryka

208

METR: Measuring AI Ability to Complete Long Tasks

Zach Stein-Perlman

278

Good Research Takes are Not Sufficient for Good Strategic Takes

Neel Nanda

411AI 2027: What Superintelligence Looks Like

Daniel Kokotajlo, Thomas Larsen, elifland, Scott Alexander, Jonas V, romeo

322LessWrong has been acquired by EA

habryka

264VDT: a solution to decision theory

L Rudolf L

138Why Have Sentence Lengths Decreased?

Arjun Panickssery

317Will Jesus Christ return in an election year?

Eric Neyman

12d

315Policy for LLM Writing on LessWrong

jimrandomh

12d

567How to Make Superbabies

GeneSmith, kman

1mo

327

229Tracing the Thoughts of a Large Language Model

Adam Jermyn

350A Bear Case: My Predictions Regarding AI Progress

Thane Ruthenis

1mo

150

225Recent AI model progress feels mostly like bullshit

12d

400How AI Takeover Might Happen in 2 Years

joshc

1mo

134

325Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs

Jan Betley, Owain_Evans

26d

Quick Takes

Buck5h142

I appeared on the 80,000 Hours podcast. I discussed a bunch of points on misalignment risk and AI control that I don't think I've heard discussed publicly before.

Transcript + links + summary here; it's also available as a podcast in many places.

Buck5h142

I appeared on the 80,000 Hours podcast. I discussed a bunch of points on misalignment risk and AI control that I don't think I've heard discussed publicly before. Transcript + links + summary here; it's also available as a podcast in many places.

samuelshadrach2h63

Lesswrong is clearly no longer the right forum for me to get engagement on topics of my interest. Seems mostly focussed on AI risk.

On which forums do people who grew up on the cypherpunks mailing list hang out today? Apart from cryptocurrency space.

samuelshadrach2h63

Lesswrong is clearly no longer the right forum for me to get engagement on topics of my interest. Seems mostly focussed on AI risk. On which forums do people who grew up on the cypherpunks mailing list hang out today? Apart from cryptocurrency space.

NormanPerlmutter16h3014

Things are getting scary with the Trump regime. Rule of law is breaking down with regard to immigration enforcement and basic human rights are not being honored.

I'm kind of dumbfounded because this is worse than I expected things to get. Do any of you LessWrongers have a sense of whether these stories are exaggerated or if they can be taken at face value?

Deporting immigrants is nothing new, but I don't think previous administrations have committed these sorts of human rights violations and due process violations.

Krome detention center in Miami -- overcrowded and possibly without access to sufficient drinking water

https://www.miamiherald.com/news/local/immigration/article303485356.html
https://www.yahoo.com/news/unpacking-claims-ice-holding-4k-220400460.html

https://www.instagram.com/jaxxchismetalk/reel/DHjxaXzAddP/

https://www.instagram.com/catpowerofficial/reel/DHhsiMvJ8BT/people-are-dying-under-ice-detainment-in-miamiand-this-video-is-from-last-weekpl/

Canadian in the US legally to apply for a work visa detained 2 weeks by ICE without due process

https://www.theguardian.com/us-news/2025/mar/19/canadian-detained-us-immigration-jasmine-mooney

NormanPerlmutter16h3014

Things are getting scary with the Trump regime. Rule of law is breaking down with regard to immigration enforcement and basic human rights are not being honored. I'm kind of dumbfounded because this is worse than I expected things to get. Do any of you LessWrongers have a sense of whether these stories are exaggerated or if they can be taken at face value? Deporting immigrants is nothing new, but I don't think previous administrations have committed these sorts of human rights violations and due process violations. Krome detention center in Miami -- overcrowded and possibly without access to sufficient drinking water https://www.miamiherald.com/news/local/immigration/article303485356.html https://www.yahoo.com/news/unpacking-claims-ice-holding-4k-220400460.html https://www.instagram.com/jaxxchismetalk/reel/DHjxaXzAddP/ https://www.instagram.com/catpowerofficial/reel/DHhsiMvJ8BT/people-are-dying-under-ice-detainment-in-miamiand-this-video-is-from-last-weekpl/ Canadian in the US legally to apply for a work visa detained 2 weeks by ICE without due process https://www.theguardian.com/us-news/2025/mar/19/canadian-detained-us-immigration-jasmine-mooney

Ben Pace20h*24-1

I occasionally get texts from journalists asking to interview me about things around the aspiring rationalist scene. A few notes on my thinking and protocols for this:

I generally think it is pro-social to share information with serious journalists on topics of clear public interest.
By-default I speak with them only if their work seems relatively high-integrity. I like journalists whose writing is (a) factually accurate, (b) boring, and (c) do not feel to me to have an undercurrent of hatred for their subjects.
By default I speak with them off-the-record, and then offer to send them write-ups of the things I said that they want to quote. This has gone quite well. I've felt comfortable speaking in my usual fashion without worrying about nailing each and every phrasing. Then I ask what they're interested in quoting, and I send them (typically a 1-2 page) google doc on those topics (largely re-stating what I already said to them, and making some improvements / additions). Then they tell me which quotes they want to use (typically cutting many sentences or paragraphs half-way). Then I make one or two slight edits and give them explicit permission to quote. I think this has gone quite well and they've felt my quotes were substantive and improvements.
For the New York Times, I am currently trying out the policy of "I am happy to chat off-the-record. I will also offer quotes by my usual protocol, but I will only give them conditional on you including a mention that I disapprove of the NYT's de-anonymization policies (which I bring up due to your reckless and negligent behavior that upturned the life of a beloved member of my community)." I am about to try this for the first time, and I expect they will thus not want to use my quotes, and that's fine by me.

Ben Pace20h*24-1

I occasionally get texts from journalists asking to interview me about things around the aspiring rationalist scene. A few notes on my thinking and protocols for this: * I generally think it is pro-social to share information with serious journalists on topics of clear public interest. * By-default I speak with them only if their work seems relatively high-integrity. I like journalists whose writing is (a) factually accurate, (b) boring, and (c) do not feel to me to have an undercurrent of hatred for their subjects. * By default I speak with them off-the-record, and then offer to send them write-ups of the things I said that they want to quote. This has gone quite well. I've felt comfortable speaking in my usual fashion without worrying about nailing each and every phrasing. Then I ask what they're interested in quoting, and I send them (typically a 1-2 page) google doc on those topics (largely re-stating what I already said to them, and making some improvements / additions). Then they tell me which quotes they want to use (typically cutting many sentences or paragraphs half-way). Then I make one or two slight edits and give them explicit permission to quote. I think this has gone quite well and they've felt my quotes were substantive and improvements. * For the New York Times, I am currently trying out the policy of "I am happy to chat off-the-record. I will also offer quotes by my usual protocol, but I will only give them conditional on you including a mention that I disapprove of the NYT's de-anonymization policies (which I bring up due to your reckless and negligent behavior that upturned the life of a beloved member of my community)." I am about to try this for the first time, and I expect they will thus not want to use my quotes, and that's fine by me.

leogao2d527

every 4 years, the US has the opportunity to completely pivot its entire policy stance on a dime. this is more politically costly to do if you're a long-lasting autocratic leader, because it is embarrassing to contradict your previous policies. I wonder how much of a competitive advantage this is.

leogao2d527

Popular Comments

plex2d580

AI 2027: What Superintelligence Looks Like

Podcast version:

lc4d11934

LessWrong has been acquired by EA

As a newly minted +100 strong upvote, I think the current karma economy accurately reflects how my opinion should be weighted

Aella4d71-29

Consider showering

Strong disagree. This is an ineffective way to create boredom. Showers are overly stimulating, with horrible changes in temperature, the sensation of water assaulting you nonstop, and requiring laborious motions to do the bare minimum of scrubbing required to make society not mad at you. A much better way to be bored is to go on a walk outside or lift weights at the gym or listen to me talk about my data cleaning issues

Recent Discussion

Meta releases Llama-4 herd of models

winstonBosan

24m

Meta released 3 models from the Llama 4 ~~family~~ herd today.

The 109B (small) and 400B (mid) models to be MoE models with 17B active param each. The large model is 2T MoE model iwth 288B active param instead.
The small model is called Scout. The middle model is called Maverick. The big one is called Behemoth.
Natively multimodal and much longer context length. 10M (!!) for the small one and 1M for the mid sized one.
Maverick is currently number 2 on LLM arena. (Number 5 if style controlled)
We don't have the 2T big model available yet. It is used as a distillation model for Maverick and Scout.
Behemoth beats Claude 3.7 (not thinking), GPT 4,5 and Gemini 2.0 Pro in various benchmarks (LiveCodeBench, Math-500, GPQA Diamond, etc)
Scout is meant to work on a single H100.

https://ai.meta.com/blog/llama-4-multimodal-intelligence/

SAINT PAUL – ACX Meetups Everywhere Spring 2025

Apr 13thSAINT PAUL

25Hour

This year's Spring ACX Meetup everywhere in SAINT PAUL.

Location: Davanni's Pizza ; 41 Cleveland Ave S, St Paul, MN 55105 – https://plus.codes/86P8WRQ6+XX

Please RSVP on lesswrong so i know how much food to get. Also note that regrettably if you're a vegan davanni's has salad and little else (though i will be getting the salad).

Contact: ironlordbyron@gmail.com

25Hour35m10

IMPORTANT CHANGE: I'm moving this to May 4th because April 13th is on Passover and Jewish people can't eat pizza on Passover.

Why Have Sentence Lengths Decreased?

138

Arjun Panickssery

“In the loveliest town of all, where the houses were white and high and the elms trees were green and higher than the houses, where the front yards were wide and pleasant and the back yards were bushy and worth finding out about, where the streets sloped down to the stream and the stream flowed quietly under the bridge, where the lawns ended in orchards and the orchards ended in fields and the fields ended in pastures and the pastures climbed the hill and disappeared over the top toward the wonderful wide sky, in this loveliest of all towns Stuart stopped to get a drink of sarsaparilla.”
— 107-word sentence from Stuart Little (1945)

Sentence lengths have declined. The average sentence length was 49 for Chaucer (died 1400), 50...

(Continue Reading – 1016 more words)

3MondSemmel3h

Sourcing the Orwell quote: And this article has an infographic "number of semicolons per 100,000 words" for a bunch of famous authors. And it includes this claim (though note that statistics from tools like Google Books Ngram Viewer can suffer from stuff like OCR ideosyncrasies).

DirectedEvolution39m20

Semicolons are unnecessary? That doesn’t go far enough. Cormac McCarthy got rid of quotation marks, most commas, and almost exterminated the colon.

2ChristianKl5h

It's worth noting that we observe other forms of simplication of language as well. English reduced the amount of inflections of verbs. The distinction between singular and plural pronouns disappeared.

2TAG6h

Then the phenomenon could be stem from punctuation habits, as @bfinn says. Did you notice that my original comment doesn't contain a sentence, by your standards?

NormanPerlmutter's Shortform

NormanPerlmutter

Sherrinford39m20

It seems the recent tariff designs already are serious policy. What do you mean by "added as a new centerpiece of the tax and spending regime"? What are the alternative futures?

7jbash4h

Things got scary November 5 at the very latest. And I haven't even been in the US for years. The deportations, both the indiscriminate ones and the vindictive ones, represent a very high level of lawlessness, one that hasn't been seen in a long time. Not only are they ignoring due process, they're actively thwarting it, and openly bragging about doing so. They're not even trying to pretend to be remotely decent. The case you mention isn't even close to the worst of them; that one could at least theoretically have happened before. The deportations were also a campaign promise. Actually the campaign promise was even more extreme. It's part of a systematic plan. There've been a lot of administrative and personnel changes obviously designed to weaken institutions that are supposed to prevent things like that. ICE has always had a reputation for a relatively thuggish, xenophobic organizational culture. It was already primed to get worse. As soon as Trump signalled aproval, it did get worse. Bad conditions in detention centers are nothing new. There's never been any willingness to spend what it would take to do them right, or to put in the kind of controls you'd need. It's politically risky to act like you care about "illegal immigrants", whereas it can be politically rewarding to "get tough". The 2020 "kids in cages" scandal was a rare case of something that got some traction. But, sure, I imagine that the newly emboldened ICE is even more indifferent to bad conditions, and may even be actively trying to make them worse. And of course if a center is already bad, putting more people into it and moving people through it fast is only going to make it worse.

10Maxwell Peterson4h

The Krome thing is all rumor - looking into it, you see numeric estimates like >According to its official figures, there are 605 people detained at Krome, although the capacity is 581. While ICE is looking for ways to increase its current detention capacity of 40,000 nationwide to 100,000, lawyers and activists estimate the real number is much higher. Some speak of double the capacity, others of up to 4,000. “Activists and [activist] lawyers say number is huge” is not news, and shouldn’t dumbfound the reader. The water claim is also weird. I tried watching one of the instagram links, and it shared so much stylistically with mind-killing videos I remember from the BLM era that I had to turn it off. Like, maybe some of this stuff is true. I don’t have evidence against. But when I was deeply involved with the protest scene in 2014-2015, I remember every arrest being an opportunity for claiming major mistreatment. Everything from the way police carried resisting arrestees, to when and if arrestees were made to change into jail uniforms, were spread frantically on social media as clear examples of mistreatment. Once, when I was arrested, and we were being transported to the larger jail via van, the other arrestee (to be clear: not related to protests) being transported with me banged his head on the metal separating grate repeatedly, presumably with the idea of later accusing the police of beating him. I’d always scoffed at police claims about detainees hurting themselves to get social ammunition, but I’ve ridden in a police van once in my life, and saw this. So now I think detainees often tell very tall tales. All this isn’t to say “this proves your links are false”. But rather to say this is a low standard of evidence. I think it would be really bad if people started just dumping rumors and accusations on LessWrong whenever those accusations pointed at politicians they already didn’t like. Social media posts by activists are mind-killing. Like, take

2Karl Krueger5h

It's really quite bad, yes. There's a whole lot of reasons that there are nationwide protests today.

How much progress actually happens in theoretical physics?

ChristianKl

21h

I frequently hear people make the claim that progress in theoretically physics is stalled, partly because all the focus is on String theory and String theory doesn't seem to pan out into real advances.

Believing it fits my existing biases, but I notice that I lack the physics understanding to really know whether or not there's progress. What do you think?

4Buck1h

Isn’t the answer that the low hanging fruit of explaining unexplained observations has been picked?

Garrett Baker1h20

That's an inference, presumably Adam believes that for object-level reasons, which could be supported by eg looking at the age at which physicists make major advancements^[1] and the size of those advancements.

Edit: But also this wouldn't show whether or not theoretical physics is actually in a rut, to someone who doesn't know what the field looks like now.

Adjusted for similar but known to be fast moving fields like AI or biology to normalize for facts like eg the academic job market just being worse now than previously. ↩︎

1ZY4h

I have a second-handed source hearing this view from a theoretical physics 4th phd student at Stanford - he believes less breakthroughs nowadays as the field becomes more and more established, and this was exactly why he was a bit discouraged/sad. Not sure if things has changed, and that may or may not be his personal view.

2ChristianKl5h

In many cases, there are diminishing returns to a given scientific paradigm. The fact that you observe a field getting diminishing returns doesn't mean that there isn't a paradigm that the field could adopt that would allow for returns to flow again. Paradigm change is about pursuing ideas that people in the old paradigm don't find promising. Just adding more smart people who follow a hegemonic paradigm doesn't automatically get you paradigm shifts that unlock new returns. If string theory stiffles progress, it would look from the inside like there are diminishing returns to theoretical physics.

Against podcasts

Adam Zerner

What is solitude? If you are alone in a forest reading a book, miles away from any other humans, is that solitude?

No, it is not. Cal Newport's book Digital Minimalism is where I learned this.

The key to solitude is being isolated from the input of other minds. When you're alone in the forest, there aren't any people physically in your vicinity, but the author of the book you're reading is very much influencing your thoughts.

Think about it: imagine if the author was sitting next to you in the forest on a bench telling you about their thoughts on, I don't know, congestion pricing. How different is that from them writing a book about congestion pricing and you reading it? In both scenarios you are basically consuming the...

(Continue Reading – 1170 more words)

To get the best posts emailed to you, create an account! (2-3 posts per week, selected by the LessWrong moderation team.)

For Policy’s Sake: Why We Must Distinguish AI Safety from AI Security in Regulatory Governance

Katalina Hernandez

I am a policy researcher and data protection officer working in AI Governance for an European corporation. I do not have a ML background, and I am a lawyer... which is precisely why I am here, seeking your expertise.

Over the past few months, I’ve been working at the intersection of AI governance and technical alignment research, trying to understand how AI safety concepts can be translated into regulatory structures that preserve both nuance and enforceability.

In doing so, I’ve noticed recurring confusion around terminology, especially between what we mean by “AI Safety” vs. “AI Security”.

Mentioning these disciplines can carry different connotations depending on whether you're an ML engineer, a policy advisor, or a safety researcher focused on x-risk. But, as I read papers that aim to inform policy,...

(Continue Reading – 1626 more words)

1Katalina Hernandez2h

Alright, this is the second time now. What am I doing wrong, LessWrongers? :/.

winstonBosan1h10

I will bite.

First of all, I appreciate the effort of trying to communicate better and hammering down the neat borders of word and how they used across domains - especially for words that are often used interchangeably and carelessly.

TLDR: Sometimes posts just get unlucky! And your style is on the verbose side and I am still somewhat confused about your value props.

It seems like your frustration is from a lack of responses - often, a lack of response might just be luck based and how the LW algorithm works (exponential time decay). Maybe yo... (read more)

Ai Cone of Probabilties - what aren't we talking about?

Marzipan, Amelia AI

14h

Just a quick thought. It’s intriguing how often we frame the future in terms of present and past motivators—trade, money, ownership, equality, you name it. All of it’s fair game, considering we’re still human, at least for now.

If we try to sketch out possible futures through a cone of probabilities, especially assuming we reach AGI or even a more potent ASI, we tend to land on a handful of familiar scenarios and their close cousins. But there’s something I think we overlook—something that, to me, feels more likely than the usual suspects we keep circling. Here’s my stab at summing it up.

Often discussed

AI Takeover - Terminator meets Matrix
AI Takeover - Buddy Pixar Movie
AI + Autocracy: Tremendously greater inequality through control; need for complaince.
AI +

...

(See More – 396 more words)

2AnthonyC5h

It's not clear to me that these are more likely, especially if timelines are short. If we developed AI slowly over centuries? Then sure, absolutely likely. If it happens in the next 10 years? Then modifying humans, if it happens, will be a long-delayed afterthought. It's also not at all clear to me that the biological portion is actually adding all that much in these scenarios, and I expect hybridization would be a transitional state. There's Robin Hanson's The Age of Em. On this forum, see What Does LessWrong/EA Think of Human Intelligence Augmentation as of mid-2023? If you will accept fictional explorations, there are, in fact, many stories that involve these two scenarios. Oftentimes authors choose to write such mergers and evolutions as the enemies of biological humanity, sometimes because that's easier for readers to sympathize with, sometimes because they actually think that's likely. I list some below. Negative examples would include the Borg (Star Trek), the Cybermen (Doctor Who), or the Replicators in Stargate. Somewhat more positive: Clarke's Firstborn in the Time Odyssey trilogy (merger with spaceships) or (expressed only vaguely) humanity's merge with universal and cosmic computers in Asimov's The Last Question follow this trend. More concrete and positive human examples show up a bunch in Greg Egan's short fiction collections (e.g. The Jewel), and in Ian Banks' Culture novels' use of neural modifications and implants. In webfiction, there's also Marshall Brain's Manna, where (slight spoiler) or from Ra on qntm.org. There's even the virtual life extension tech in the TV show Upload, where the downsides are mostly about how humanity manages the transition. There are more ambitious examples of how far this can go in Accelerando or the Orion's Arm Universe collaborative project. In a sense it's even played for humor in They're Made Out of Meat, where some of the briefly-mentioned alien species rhyme with this kind of transition. After writi

Marzipan2h10

Thanks for sharing this and for the examples layed out. I was not familiar with all of them, though many. but I did omit stating that I meant outside of fiction. My assumption is still relatively short timeframes of 5 to 15 years. Under those assumptions I dont necessarily see scenario 1 or 7 being more likely than scenario 8.

Quick note. I see a show like Upload being a potential representation of a facet of these scenarios. For example scenarios 2 to 7 could all have widespread virtual realities for the common person or those who opt out willingly o... (read more)

Most Questionable Details in 'AI 2027'

scarcegreengrass

20h

My thoughts on the recently posted story.

Caveats

I think it's great that the AI Futures Project wrote up a detailed scenario.
I enjoy it.
Every part of the story i didn't comment on here is either fine or excellent.
This is one of the most realistic scenarios i've read.
All detailed predictions contain errors.
The authors of this scenario don't claim it's the most likely future.
If the speed of 2018-2025 was the typical rate of progress in software, then AI 2027 would be realistic.

Core Disagreements

Early 2026: OpenBrain is making algorithmic progress 50% faster.
- As with many parts of this scenario, i think this is plausible but too fast. 150% productivity is a lot in business terms, & the scenario doesn't provide much detail for why this is 150% as opposed to 110%. In my

...

(Continue Reading – 1667 more words)

gwern2h134

I don't really understand how a local copy of the weights gives the terrorists more practical control over the software's alignment. I don't think it's easy to manually tweak weights for so specific a purpose. Maybe they just mean the API is doing a good job of blocking sketchy requests?

You can finetune models for any specific purpose: just provide a few datapoints and train. The more specific the purpose, the easier tweaking the weights is, not harder. (Surely, if nothing else, you've seen all of the LoRAs and other things for finetuning image gener... (read more)