LESSWRONG

Agent Foundations

Applied to Lectures on statistical learning theory for alignment researchers by Vanessa Kosoy 8d ago
Applied to Synthesizing Standalone World-Models, Part 4: Metaphysical Justifications by Thane Ruthenis 19d ago
Applied to Crisp Supra-Decision Processes by Brittany Gelb 22d ago
Applied to Proof Section to Crisp Supra-Decision Processes by Brittany Gelb 22d ago
Applied to Natural Latents: Latent Variables Stable Across Ontologies by Kabir Kumar 1mo ago
Applied to Re-imagining AI Interfaces by Harsha G. 1mo ago
Applied to Proof Section to an Introduction to Credal Sets and Infra-Bayes Learnability by Vanessa Kosoy 2mo ago
Applied to An Introduction to Credal Sets and Infra-Bayes Learnability by Vanessa Kosoy 2mo ago
Applied to Agent foundations: not really math, not really science by Alex_Altair 2mo ago
Applied to Apply for the 2025 Dovetail fellowship by Alex_Altair 2mo ago
Applied to Directly Try Solving Alignment for 5 weeks by Kabir Kumar 3mo ago
Applied to Unbounded Embedded Agency: AEDT w.r.t. rOSI by Cole Wyeth 3mo ago
Applied to No, Futarchy Doesn’t Have This EDT Flaw by Mikhail Samin 3mo ago
Applied to New Paper: Ambiguous Online Learning by Vanessa Kosoy 3mo ago
Applied to A New Framework for AI Alignment: A Philosophical Approach by niscalajyoti 3mo ago
Applied to Clarifying “wisdom”: Foundational topics for aligned AIs to prioritize before irreversible decisions by Anthony DiGiovanni 4mo ago
Applied to S-Expressions as a Design Language: A Tool for Deconfusion in Alignment by Johannes C. Mayer 4mo ago