This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Reinforcement Learning
•
Applied to
Speedrun ruiner research idea
by
lukehmiles
1mo
ago
•
Applied to
The theory of Proximal Policy Optimisation implementations
by
salman.mohammadi
1mo
ago
•
Applied to
Measuring Learned Optimization in Small Transformer Models
by
J Bostock
1mo
ago
•
Applied to
[Aspiration-based designs] 2. Formal framework, basic algorithm
by
Jobst Heitzig
2mo
ago
•
Applied to
[Aspiration-based designs] 1. Informal introduction
by
Jobst Heitzig
2mo
ago
•
Applied to
Skepticism About DeepMind's "Grandmaster-Level" Chess Without Search
by
Arjun Panickssery
3mo
ago
•
Applied to
Krueger Lab AI Safety Internship 2024
by
Joey Bream
4mo
ago
•
Applied to
Interpreting the Learning of Deceit
by
RogerDearnaley
5mo
ago
•
Applied to
Refinement of Active Inference agency ontology
by
Roman Leventov
5mo
ago
•
Applied to
Utility ≠ Reward
by
Oliver Sourbut
5mo
ago
•
Applied to
Planning in LLMs: Insights from AlphaGo
by
jco
5mo
ago
•
Applied to
Reinforcement Learning using Layered Morphology (RLLM)
by
MiguelDev
6mo
ago
•
Applied to
AISC project: SatisfIA – AI that satisfies without overdoing it
by
Jobst Heitzig
6mo
ago
•
Applied to
We have promising alignment plans with low taxes
by
Seth Herd
6mo
ago
•
Applied to
Wireheading and misalignment by composition on NetHack
by
pierlucadoro
7mo
ago
•
Applied to
VLM-RM: Specifying Rewards with Natural Language
by
ChengCheng
7mo
ago