User Comment Replies

Recent AI model progress feels mostly like bullshit

vire18d1-2

That's an interesting point: why didn't we see major improvements in LLMs at coding, for instance, despite them achieving reasoning at a level that allows them to become a GM on Codeforces?

I'd say this is a fundamental limitation of reinforcement learning. Using reinforcement learning alone is stupid. Look at humans: we do much more than that. We make observations about our failures and update; we develop our own heuristics for what it means to be good at something and then try to figure out how to make ourselves better by reasoning about it watching ot...

vire's Shortform

vire19d10

I'll understand if this offends some people here who are researchers and don't make much profit, but I'm assuming a functioning society where good research is adequately rewarded.

vire's Shortform

vire19d10

How to fix universities: tie their profits to the competency of their graduating students by having them take a percentage of those students' future profits for the next x years.

vire's Shortform

vire21d00

The new Dwarkesh AMA episode is really great. I'd recommend that everyone watch it.

vire's Shortform

vire26d10

Does listening to music regularly lead to a lower probability of achieving one's goals?


All of vire's Comments + Replies