Of course they are fitting an exponential curve, and only one thing happens when you do that. (Newborn on track to swallow the sun by 2040.) You can get a hyperbolic curve to fit about equally as well [citation needed] and predict negative infinity resources on Jan 2 2028. I wish they had defended this choice a bit more clearly. Like plot binomial and sigmoid best fit for comparison, to show it really does look like an exponential. (Y axis can be something arbitrary, like the price of land measured in gold.) An exponential makes sense, when an output is an input, so I would agree with it, but you can say the same thing about a puppy's cells & organs.

Reply

Recent AI model progress feels mostly like bullshit

lemonhope8d151

Almost every time I use Claude Code (3.7 I think) it ends up cheating at the goal. Optimizing performance by replacing the API function with a constant, deleting test cases, ignoring runtime errors with silent try catch, etc. It never mentions these actions in the summary. In this narrow sense, 3.7 is the most misaligned model I have ever used.

Reply

How to Make Superbabies

lemonhope22d40

23andMe (and all their data) seems to be for sale at a cheap discount.

Reply

Why Were We Wrong About China and AI? A Case Study in Failed Rationality

lemonhope25d61

I think Alibaba has not made any crazy developments yet. So let's consider DeepSeek. I think almost nobody had heard of DeepSeek before v3. Before v3, predicting strong AI progress in China would probably sound like "some AI lab in China will appear from nowhere and do something great. I don't know who or what or when or where, but it will happen soon." That was roughly my opinion, at least in my memory. Maybe making that kind of prediction does not match the tastes of people who are good at predicting things? Awfully vague claim to make I guess.

There was time between v3 and r1 where folks could have more loudly commented DeepSeek was ascendant. What would this have accomplished? I suppose it would have shown some commitment to truth and awareness of reality. I am guessing people who are against the international AI race are a bit lazy to point out stuff that would accelerate the race. I guess at some point the facts can't be avoided.

Reply

lemonhope's Shortform

lemonhope1mo*10

[This comment is no longer endorsed by its author]Reply

Do safety-relevant LLM steering vectors optimized on a single example generalize?

lemonhope1mo20

Holy cow

Reply

Annapurna's Shortform

lemonhope2mo1617

So unbelievably convenient I don't even believe it

Reply

How to Make Superbabies

lemonhope2mo50

Could you do all the research on a boat in the ocean? Excuse the naive question.

Reply

How to Make Superbabies

lemonhope2mo20

Women/girls with big heads tend to hit their heads but you can solve that with bigger arms.

Reply

How to Make Superbabies

lemonhope2mo120

use of a genotyping pipeline poorly suited to ancient DNA which meant that 80% of the genetic variants they "analysed" were likely completely artefactual and did not exist.

Brutal!! I didn't know this gotcha existed. I hope there aren't too many papers silently gotch'd by it. Sounds like the type of error that could easily be widespread and unnoticed, if the statistical trace it leaves isn't always obvious.

Reply