Summary. In this post, we present the formal framework we adopt during the sequence, and the simplest form of the type of aspiration-based algorithms we study. We do this for a simple form of aspiration-type goals: making the expectation of some variable equal to some given target value. The algorithm...
Sequence Summary. This sequence documents research by SatisfIA, an ongoing project on non-maximizing, aspiration-based designs for AI agents that fulfill goals specified by constraints ("aspirations") rather than maximizing an objective function . We aim to contribute to AI safety by exploring design approaches and their software implementations that we believe...
Summary: The post describes a method that allows us to use an untrustworthy optimizer to find satisficing outputs. Acknowledgements: Thanks to Benjamin Kolb (@benjaminko), Jobst Heitzig (@Jobst Heitzig) and Thomas Kehrenberg (@Thomas Kehrenberg) for many helpful comments. Introduction Imagine you have black-box access to a powerful but untrustworthy optimizing system,...
This is a report on our work in AISC Virtual 2023. For AISC 2023, our team looked into the foundations of soft optimization. Our goal at the beginning was to investigate variations of the original quantilizer algorithm, in particular by following intuitions that uncertainty about goals can motivate soft optimization....
Introduction A recent post (AGI Ruin: A List of Lethalities) gives an argument for humanity’s likely inability to survive the invention of superintelligent AGI. The argument goes very roughly like this: 1. We haven’t solved the alignment problem (citation needed?). 2. Given access only to an unaligned AI, we can’t...
Epistemic status: Pretty confident, seems obviously correct. Summary: We should promote widespread use of reusable silicone masks to stop the spread of SARS-CoV-2. Masks and transmission of SARS-CoV-2 SARS-CoV-2 spreads mainly through respiratory droplets. Most of the discussion of mask usage to prevent transmission has so far been centered on...