Thomas Kehrenberg

An introduction to modular induction and some attempts to solve it

The current crop of AI systems appears to have world models to varying degrees of detailedness, but we cannot understand these world models easily as they are mostly giant floating-point arrays. If we knew how to interpret individual parts of the AIs’ world models, we would be able to specify...

Dec 23, 202512

The optimizer won’t just guess your intended semantics

A desirable property of an AI’s world model is that you as its programmer have an idea what’s going on inside. It would be good if you could point to a part of the world model and say, “This here encodes the concept of a strawberry; here is how this...

Mar 6, 202520

Vector Planning in a Lattice Graph

You want to get to your sandwich: Well, that’s easy. Apparently we are in some kind of grid world, which is presented to us in the form of a lattice graph, where each vertex represents a specific world state, and the edges tell us how we can traverse the world...

Apr 23, 202420

Sunlight is yellow parallel rays plus blue isotropic light

When you look up the color temperature of daylight, most sources will say 6500K, but if you buy an LED with that color temperature, it will not look like the sun in the sky. It will seem bluer (or, less yellow-y). Yet, 6500K is arguably the correct number. What is...

Mar 1, 202381

Extensionality and the univalence axiom of type theory

This is the final part of my introduction to dependent type theory. The big theme of this article is equality though that may not be immediately obvious. Let’s start by finally discussing function extensionality and propositional extensionality. What’s the deal with extensionality? Whenever we define concepts, there are basically two...

Jan 19, 20236

Set-like mathematics in type theory

This is the fourth entry in my series on type theory. We will introduce a lot of notation that is reminiscent of set theory, to make everyone feel more at home, and then we’ll discuss the axiom of choice. Subtypes Our next topic is “subsets” within types. I mentioned in...

Jan 3, 20235

A few thoughts on my self-study for alignment research

In June, I received a grant from the LTFF for a 6-months period of self-study aimed at mastering the necessary background for AI alignment research. The following is advice I would give to people who are attempting something similar. I have tried to keep it short. Basic advice You’ll naturally...

Dec 30, 20227

LESSWRONG
LW

LESSWRONG
LW

Thomas Kehrenberg

Thomas Kehrenberg

Sunlight is yellow parallel rays plus blue isotropic light

Exploring Finite Factored Sets with some toy examples

Basic building blocks of dependent type theory

The optimizer won’t just guess your intended semantics

Thomas Kehrenberg

Sunlight is yellow parallel rays plus blue isotropic light

Exploring Finite Factored Sets with some toy examples

Basic building blocks of dependent type theory

The optimizer won’t just guess your intended semantics

An introduction to modular induction and some attempts to solve it

The optimizer won’t just guess your intended semantics

Vector Planning in a Lattice Graph

Sunlight is yellow parallel rays plus blue isotropic light

Extensionality and the univalence axiom of type theory

Set-like mathematics in type theory

A few thoughts on my self-study for alignment research