This is a special post for quick takes by shawnghu. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.
Is anyone else noticing that Claude (Sonnet 3.5 new, the default on claude.ai) is a lot worse at reasoning recently? In the past five days or so its rate of completely elementary reasoning mistakes, which persist despite repeated clarification in different ways, seems to have skyrocketed for me.

Maybe they are preparing to switch from merely encouraging their main model to do CoT (the older technique) to a full RL-based reasoning model. I recently saw this, just before the GUI aborted and said the model was over capacity:

In that case it wouldn't make sense anymore to have the non-reasoning model attempt CoT.

I have also seen this.
