This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
is fundraising!
Tags
LW
$
Login
Robust Agents
•
Applied to
Automated monitoring systems
by
hiki_t
23d
ago
•
Applied to
On agentic generalist models: we're essentially using existing technology the weakest and worst way you can use it
by
Yuli_Ban
4mo
ago
•
Applied to
Introduction to Modern Dating: Strategic Dating Advice for beginners
by
Jesper Lindholm
5mo
ago
•
Applied to
Beyond the Board: Exploring AI Robustness Through Go
by
AdamGleave
6mo
ago
•
Applied to
[Aspiration-based designs] 2. Formal framework, basic algorithm
by
Jobst Heitzig
9mo
ago
•
Applied to
[Aspiration-based designs] 1. Informal introduction
by
Jobst Heitzig
9mo
ago
•
Applied to
AISC project: SatisfIA – AI that satisfies without overdoing it
by
Jobst Heitzig
1y
ago
•
Applied to
Desiderata for an AI
by
Nathan Helm-Burger
1y
ago
•
Applied to
Even Superhuman Go AIs Have Surprising Failure Modes
by
AdamGleave
1y
ago
•
Applied to
Robustness to Scale
by
RaemonTest2
2y
ago
•
Applied to
A multi-disciplinary view on AI safety research
by
Roman Leventov
2y
ago
•
Applied to
Temporally Layered Architecture for Adaptive, Distributed and Continuous Control
by
Roman Leventov
2y
ago
•
Applied to
Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long Learning
by
Roman Leventov
2y
ago
•
Applied to
Sets of objectives for a multi-objective RL agent to optimize
by
Ben Smith
2y
ago
•
Applied to
Leveraging Legal Informatics to Align AI
by
John Nay
2y
ago
•
Applied to
Can we achieve AGI Alignment by balancing multiple human objectives?
by
Ben Smith
2y
ago