Rahul Chand

Singular Learning Theory for Dummies

In this post, I will cover Jesse Hoogland's work on Singular Learning Theory. The post is mostly meant as a dummies guide, and therefore won't be adding anything meaningfully new to his work. As a helpful guide at each point I try to mark the math difficulty of each section, some of it is trivial (I), some of it can be with followed with enough effort (II) and some of it out of our(my) scope and we blindly believe them to be true (III). Background > Statistical learning theory is lying to you: "overparametrized" models actually aren't overparametrized, and generalization is not just a question of broad basins Singular Learning Theory (SLT) tries to explain how neural networks generalize. One motivation for this is that the previous understanding of neural network generalization isn't quite correct. So what is the previous theory and why is it wrong? Broad Basins Below are excerpts from "Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes"[1] that summarize the logic behind broad basins theory of generalization According to SLT there are two issues here. First, finding volume of a basin in the loss landscape by approximating it via hessian isn't accurate and second, the reason why models generalize has more to do with their symmetries than with with the fact that intialization drops the NNs in a good "broad basin" area. Starting with SLT A lot of basic maths for SLT is already covered in the original blog post. Here I will try to go over small questions that one might have (atleast i had) to get a clearer picture What we know so far? (Math difficulty I) Screenshot from the blog 1. We define the truth, model and prior. The model is just the likelihood. These are standard bayesian terms. 2. Based on the above we know how to formalize the posterior & model evidence, p(w|D_n) & p(D_n) in terms of the likelihood and prior. Usually we stop at the posterior part, we are happy to get the p(w|D_n) for the model, but here we are

5Oct 15, 2024

Rahul Chand

Message

SB-1047, ChatGPT and AI's Game of Thrones

Part-I (The Sin of Greed) On 30 November 2022, OpenAI released ChatGPT. According to Sam Altman, it was supposed to be a demo[1] to show the progress in language models. By December 4, in just 5 days it had gained 1 million users, for comparison it took Instagram 75 days,...

Nov 24, 2024-2

Introduction to Choice set Misspecification in Reward Inference

In classical RL, we have an agent with a set of States (S), a set of action (A), and given some reward function (R), the aim is to find out the optimal policy (pi) which maximizes the following. This is the cummulative rewards we get by sampling actions using our...

Oct 29, 20242

Monosemanticity & Quantization

In this post, I will cover Anthropic's work on monosemanticity[1]. Starting with a brief introduction to the motivation and methodology. Then move on to my ablation experiments where I train a sparse autoencoder on "gelu-2l"[2] and its quantized versions to see what insights I can gain. Introduction The holy grail...

Oct 22, 20241

Singular Learning Theory for Dummies

Oct 15, 20245

AGI & Consciousness - Joscha Bach

In this post, I cover Joscha Bach' views on consciousness, how it relates to intelligence, and what role it can play to get us closer to AGI. The post is divided into three parts, first I try to cover why Joscha is interested in understanding consciousness, next, I go over...

Oct 8, 20241

AGI Farm

This post discusses Joe Carlsmith’s views on how to approach the problem of AI risk as interspecies interaction and how humans can use it navigate future AI development better. The essay is divided into three parts. First I give my understanding of Carlsmith's views, then I build upon some of...

Oct 1, 20241

LESSWRONG
LW

LESSWRONG
LW

Rahul Chand

Rahul Chand

Rahul Chand

Singular Learning Theory for Dummies

Introduction to Choice set Misspecification in Reward Inference

AGI & Consciousness - Joscha Bach

Monosemanticity & Quantization

Rahul Chand

SB-1047, ChatGPT and AI's Game of Thrones

Introduction to Choice set Misspecification in Reward Inference

Monosemanticity & Quantization

Singular Learning Theory for Dummies

AGI & Consciousness - Joscha Bach

AGI Farm

SB-1047, ChatGPT and AI's Game of Thrones

Introduction to Choice set Misspecification in Reward Inference

Monosemanticity & Quantization

Singular Learning Theory for Dummies

AGI & Consciousness - Joscha Bach

AGI Farm

Singular Learning Theory for Dummies

Introduction to Choice set Misspecification in Reward Inference

AGI & Consciousness - Joscha Bach

Monosemanticity & Quantization