This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Wikitags
LW
Login
Agent Foundations
Settings
Applied to
Lectures on statistical learning theory for alignment researchers
by
Vanessa Kosoy
8d
ago
Applied to
Synthesizing Standalone World-Models, Part 4: Metaphysical Justifications
by
Thane Ruthenis
19d
ago
Applied to
Crisp Supra-Decision Processes
by
Brittany Gelb
22d
ago
Applied to
Proof Section to Crisp Supra-Decision Processes
by
Brittany Gelb
22d
ago
Applied to
Natural Latents: Latent Variables Stable Across Ontologies
by
Kabir Kumar
1mo
ago
Applied to
Re-imagining AI Interfaces
by
Harsha G.
1mo
ago
Applied to
Proof Section to an Introduction to Credal Sets and Infra-Bayes Learnability
by
Vanessa Kosoy
2mo
ago
Applied to
An Introduction to Credal Sets and Infra-Bayes Learnability
by
Vanessa Kosoy
2mo
ago
Applied to
Agent foundations: not really math, not really science
by
Alex_Altair
2mo
ago
Applied to
Apply for the 2025 Dovetail fellowship
by
Alex_Altair
2mo
ago
Applied to
Directly Try Solving Alignment for 5 weeks
by
Kabir Kumar
3mo
ago
Applied to
Unbounded Embedded Agency: AEDT w.r.t. rOSI
by
Cole Wyeth
3mo
ago
Applied to
No, Futarchy Doesn’t Have This EDT Flaw
by
Mikhail Samin
3mo
ago
Applied to
New Paper: Ambiguous Online Learning
by
Vanessa Kosoy
3mo
ago
Applied to
A New Framework for AI Alignment: A Philosophical Approach
by
niscalajyoti
3mo
ago
Applied to
Clarifying “wisdom”: Foundational topics for aligned AIs to prioritize before irreversible decisions
by
Anthony DiGiovanni
4mo
ago
Applied to
S-Expressions as a Design Language: A Tool for Deconfusion in Alignment
by
Johannes C. Mayer
4mo
ago
1622