User Comment Replies

Open Thread Fall 2024

Declarative and procedural knowledge are two different memory systems. Spaced repetition is good for declarative knowledge, but for procedural (like playing music) you need lots of practice. Other examples include math and programming - you can learn lots of declarative knowledge about the concepts involved, but you still need to practice solving problems or writing code.

Edit: as for why practice every day - the procedural system requires a lot more practice than the declarative system does.

6cubefox5mo

Do we actually know procedural knowledge is linear rather than logarithmic, unlike declarative knowledge?

Debating with More Persuasive LLMs Leads to More Truthful Answers

Dan Valentine1y30

"More persuasive" here means a higher win rate in debate, which I think is the same thing it would mean in any debate context? I agree the limitation to inference time rather than training is definitely important to keep in mind. I think that best-of-N using the judge as a preference model is a reasonable approximation of moderate amounts of RL training, but doing actual training would allow us to apply a lot more optimization pressure and get a wider spread of Elos. There has been some good debate RL work done in a similar setting here, and I'd love to see more research done with debate-trained models.

2the gears to ascension1y

Right, but it wasn't actually optimized on persuasiveness by a gradient; the optimization is weak inference-time stuff. I'm not saying the word is used wrong, just that I was surprised by it not being a gradient.

Debating with More Persuasive LLMs Leads to More Truthful Answers

Dan Valentine1yΩ7100

Thanks for the feedback Ryan!

I like this paper, but I think the abstract is somewhat overstated.

This is good to know. We were trying to present an accurate summary in the abstract while keeping it concise, which is a tricky balance. Seems like we didn’t do a good enough job here, so we’ll update the abstract to caveat the results a bit more.

Hidden passage debate on QuALITY is actually pretty narrow as far as domains go and might have pretty different properties from future cases.

Yep, agreed! QuALITY is a great testbed for debate, but we definite...

7ryan_greenblatt1y

Thanks for the response! I think I agree with everything you said and I appreciate the level of thoughtfulness. Great! I appreciate the inclusion of negative results here. Yep, I'd be interested in this setup, but maybe where we ban egregious jailbreaks or similar.

Mississauga, Ontario, Canada – ACX Meetups Everywhere Fall 2023

Dan Valentine2y10

Seems weird for this to be the same time and date as the Toronto meetup. Lots of people who might have been interested in going will probably be at the one in Toronto instead.

Dan Valentine2y10

The bottleneck in this scenario becomes brain health, as receiving a brain transplant is not very useful. I’m not sure how much of an obstacle this will be in practice.

Dan Valentine2y20

For a high level look at quantum physics I’d recommend Something Deeply Hidden by Sean Carroll. I feel like I understand many worlds much better after reading it. If you like audiobooks this one is great too.

2Adam Zerner2y

I don't think I'm motivated enough to read something book-length on quantum physics, at least not in the next few years. Thanks for the recommendation though. If there was something blog-post-length that did a good job of communicating the big picture ideas I'd be interested in that.

Apply to the Redwood Research Mechanistic Interpretability Experiment (REMIX), a research program in Berkeley

Dan Valentine2y10

My employer isn’t gonna allow me to take a couple months off to go do this thing I personally am very interested in

Have you considered asking them about it? I've worked at several software jobs where this would have been no problem. I've also seen a few people take sabbaticals and there was no issue with it; their teammates generally thought it was really cool. One guy I know took a 1-year sabbatical to live in a van and drive around Europe.

This is all anecdotal and your situation may be different of course. I just wanted to add this data point as it seemed like you may be prematurely dismissing sabbaticals as some crazy thing that never happens in real life.

Why hasn't deep learning generated significant economic value yet?

Dan Valentine3y170

The worst part is, for most of these, time lost is gone forever. It's just a slowdown. Like the Thai floods simply permanently set back hard drive progress and made them expensive for a long time, there was never any 'catchup growth' or 'overhang' from it.

Isn’t this great news for AI safety due to giving us longer timelines?

MIRI announces new "Death With Dignity" strategy

Dan Valentine3y10

I found your earlier comment in this thread insightful and I think it would be really valuable to know what evidence convinced you of these timelines. If you don't have time to summarize in a post, is there anything you could link to?

1Not Relevant3y

Note also that I would still endorse these actions (since they’re still necessary even with shorter timelines) but they need to be done much faster and so we need to be much more aggressive.

2Not Relevant3y

Yep, just posted it: https://www.lesswrong.com/posts/wrkEnGrTTrM2mnmGa/it-s-time-for-ea-leadership-to-pull-the-fast-takeoff-fire

Faerie Ring: A Small Gather.Town Call

Dan Valentine4y10

How long do you expect the event to last for? I'd love to join but this week I'll have to leave after the first hour.

2hamnox4y

I don't expect these to go longer than 1.5 hours, though it's possible people want to hang out on the line.

Dublin SSC Meetup - Death and Self

Dan Valentine5y10

Update: Black Sheep is fully booked tomorrow, so the location has changed to Kimchi Hophouse!

1jmh5y

Sounds like a great place for some interesting discussion.

All of Dan Valentine's Comments + Replies