All of vire's Comments + Replies

vire1-2

That's an interesting point, why didn't we see major improvements in LLMs for instance when coding... Despite them achieving reasoning on the level that allows them become a GM on codeforces.

I'd say this is a fundamental limitation of reinforcement learning. Using purely reinforcement learning is stupid. Look at humans, we do much more than that. We make observations about our failures and update, we develop our own heuristics for what it means to be good at something and then try to figure out how to make ourselves better by reasoning about it watching ot... (read more)

vire10

I'll understand if this offends some people here who are researchers and don't have much profit but I'm assuming a functioning society where good research is adequately rewarded.

vire10

How to fix universities: make their profits tied to competency of leaving students by taking a percentage of their future profits for the next x years.

1vire
I'll understand if this offends some people here who are researchers and don't have much profit but I'm assuming a functioning society where good research is adequately rewarded.
vire00

The new Dwarkesh AMA episode is really great. Would recommend everyone to watch it.

vire10

Does listening to music regularly lead to lower probability of achieving goals?