Robust Agents

• Applied to Automated monitoring systems by hiki_t 23d ago
• Applied to On agentic generalist models: we're essentially using existing technology the weakest and worst way you can use it by Yuli_Ban 4mo ago
• Applied to Introduction to Modern Dating: Strategic Dating Advice for beginners by Jesper Lindholm 5mo ago
• Applied to Beyond the Board: Exploring AI Robustness Through Go by AdamGleave 6mo ago
• Applied to [Aspiration-based designs] 2. Formal framework, basic algorithm by Jobst Heitzig 9mo ago
• Applied to [Aspiration-based designs] 1. Informal introduction by Jobst Heitzig 9mo ago
• Applied to AISC project: SatisfIA – AI that satisfies without overdoing it by Jobst Heitzig 1y ago
• Applied to Desiderata for an AI by Nathan Helm-Burger 1y ago
• Applied to Even Superhuman Go AIs Have Surprising Failure Modes by AdamGleave 1y ago
• Applied to Robustness to Scale by RaemonTest2 2y ago
• Applied to A multi-disciplinary view on AI safety research by Roman Leventov 2y ago
• Applied to Temporally Layered Architecture for Adaptive, Distributed and Continuous Control by Roman Leventov 2y ago
• Applied to Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning by Roman Leventov 2y ago
• Applied to Sets of objectives for a multi-objective RL agent to optimize by Ben Smith 2y ago
• Applied to Leveraging Legal Informatics to Align AI by John Nay 2y ago
• Applied to Can we achieve AGI Alignment by balancing multiple human objectives? by Ben Smith 2y ago