Alex_Altair

Proof Explained: Touchette-Lloyd Theorem

This is a sequel to our previous post on the Touchette-Lloyd theorem[1]. The previous post contained some introductory material and motivation for the theorem. Here, we will walk through the proof of the theorem and explore its applications in a few worked examples. It isn't strictly necessary to read that...

Apr 11

How much information does an optimal policy contain about its environment?

by Alfred Harwood, Alex_Altair, and JoseFaustino

This post is an informal explainer of our paper, which can be found on arXiv. This work was funded by the Advanced Research + Invention Agency (ARIA) Safeguarded AI Programme through project code MSAI-SE01-P005. Introduction: There is an intuition that a powerful agent might have to contain some kind of...

Feb 19

Upcoming Dovetail fellow talks & discussion

As the current Dovetail research fellowship comes to a close, the fellows are giving talks on their projects. All are welcome to join! Unlike the previous cohort talks, these talks will be scheduled one at a time. This is partly because there are too many to do all in one...

Jan 26

When bits of optimization imply bits of modeling: the Touchette-Lloyd theorem

by Alfred Harwood and Alex_Altair

This post is about one of the results described in the 2004 paper 'Information-theoretic approach to the study of control systems' by Hugo Touchette and Seth Lloyd.[1] The paper compares 'open-loop' and 'closed-loop' controllers (which we here call 'blind' and 'sighted' policies) for the task of reducing entropy and quantifies...

Dec 15, 2025

Answering a child's questions

I recently had a conversation with a friend of a friend who has a very curious child around 5 years of age. I offered to answer some of their questions, since I love helping people understand the world. They sent me eight questions, and I answered them by handwritten letter....

Dec 6, 2025

A review of Red Heart, the new alignment novel by Max Harms

I recently read Red Heart, a spy novel taking place in the core of a Chinese AGI project. Disclaimer that the author is my friend, and that I’m ideologically incentivized to promote stuff about AI safety! That said, I think you should read it. If nothing else, it’s a fun...

Nov 19, 2025

I store some memories spatially and I don't know why

Every so often, I have this conversation:
> Them: So you know how the other day we talked about whether we should leave for our trip on that Sunday or Monday?
> Me: …doesn’t sound familiar…
> Them: And you said it depended on what work you had left to...

Nov 18, 2025

Introduction to abstract entropy

Should you make stone tools?

Somebody invented a better bookmark

An Intuitive Explanation of Solomonoff Induction
