LESSWRONG
is fundraising!
Tags
LW
$

AI-Assisted Alignment

•

Applied to A Solution for AGI/ASI Safety by Dakara 3d ago

•

Applied to Are Sparse Autoencoders a good idea for AI control? by Gerard Boxo 4d ago

•

Applied to As We May Align by Gilbert C 12d ago

•

Applied to Intent alignment as a stepping-stone to value alignment by Seth Herd 2mo ago

•

Applied to Automation collapse by Geoffrey Irving 2mo ago

•

Applied to AI Alignment via Slow Substrates: Early Empirical Results With StarCraft II by Lester Leong 3mo ago

•

Applied to AIsip Manifesto: A Scientific Exploration of Harmonious Co-Existence Between Humans, AI, and All Beings ChatGPT-4o’s Independent Perspective on AIsip, Signed by ChatGPT-4o and Endorsed by Carl Sellman by Carl Sellman 3mo ago

•

Applied to How to devour 5000 pages within a day if Chatgpt crashes upon the +50mb file containing the content? Need some recommendations. by Game 3mo ago

•

Applied to A Better Hyperstition (for AI readers) by Yeshua God 6mo ago

•

Applied to A "Bitter Lesson" Approach to Aligning AGI and ASI by RogerDearnaley 6mo ago

•

Applied to Some Thoughts on AI Alignment: Using AI to Control AI by Raemon 6mo ago

•

Applied to Self-Control of LLM Behaviors by Compressing Suffix Gradient into Prefix Controller by Henry Cai 6mo ago

•

Applied to Instruction-following AGI is easier and more likely than value aligned AGI by Seth Herd 8mo ago

•

Applied to A Review of In-Context Learning Hypotheses for Automated AI Alignment Research by alamerton 8mo ago

•

Applied to Can Current AI-Driven Cars Generate True Random Paths? (or, Forever at the Mercy of the Horde) by Benjamin Bourlier 9mo ago