Alex_Altair

Proof Explained: Touchette-Lloyd Theorem

This is a sequel to our previous post on the Touchette-Lloyd theorem[1]. The previous post contained some introductory material and motivation for the theorem. Here, we will walk through the proof of the theorem and explore its applications in a few worked examples. It isn't strictly necessary to read that...

Apr 11

How much information does an optimal policy contain about its environment?

This post is an informal explainer of our paper which can be found on arXiv. This work was funded by the Advanced Research + Invention Agency (ARIA) Safeguarded AI Programme through project code MSAI-SE01-P005. Introduction There is an intuition that a powerful agent might have to contain some kind of...

Feb 19

Upcoming Dovetail fellow talks & discussion

As the current Dovetail research fellowship comes to a close, the fellows are giving talks on their projects. All are welcome to join! Unlike the previous cohort talks, these talks will be scheduled one at a time. This is partly because there are too many to do all in one...

Jan 26

When bits of optimization imply bits of modeling: the Touchette-Lloyd theorem

This post is about one of the results described in the 2004 paper 'Information-theoretic approach to the study of control systems' by Hugo Touchette and Seth Lloyd.[1] The paper compares 'open-loop' and 'closed-loop' controllers (which we here call 'blind' and 'sighted' policies) for the task of reducing entropy and quantifies...

Dec 15, 2025

Answering a child's questions

I recently had a conversation with a friend of a friend who has a very curious child around 5 years of age. I offered to answer some of their questions, since I love helping people understand the world. They sent me eight questions, and I answered them by hand-written letter....

Dec 6, 2025

A review of Red Heart, the new alignment novel by Max Harms

I recently read Red Heart, a spy novel taking place in the core of a Chinese AGI project. Disclaimer that the author is my friend, and that I’m ideologically incentivized to promote stuff about AI safety! That said, I think you should read it. If nothing else, it’s a fun...

Nov 19, 2025

I store some memories spatially and I don't know why

Every so often, I have this conversation: > Them: So you know how the other day we talked about whether we should leave for our trip on that Sunday or Monday? > Me: …doesn’t sound familiar… > Them: And you said it depended on what work you had left to...

Nov 18, 2025


Introduction to abstract entropy

Should you make stone tools?

Somebody invented a better bookmark

An Intuitive Explanation of Solomonoff Induction
