[Link] Why I’m excited about AI-assisted human feedback

janleike

29 [Link] Why I’m excited about AI-assisted human feedback

by janleike

6th Apr 2022

AI Alignment Forum

1 min read

0

29 Ω 17

This is a link post for https://aligned.substack.com/p/ai-assisted-human-feedback

I'm writing a sequence of posts on the approach to alignment I'm currently most excited about. This first post argues for recursive reward modeling and the problem it's meant to address (scaling RLHF to tasks that are hard to evaluate).

RLHFAI

Frontpage

29 Ω 17

New Comment

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

29

[Link] Why I’m excited about AI-assisted human feedback

29

Ω 17

29

Ω 17