Low-ish effort post just sharing something I found fun. No AI-written text outside the figures. I was recently nerd-sniped by proportional representation voting, and so when playing around with claude code I decided to have it build a simulation. Hot take: * If you're electing a legislature and want it...
Stumbled across a book in the new section of the library: "AI For Humanity," by Andeed Ma, James Ong (founder of the think tank AIII, which is also the sound I make when thinking about AI risk), and Siok Siok Tan. It's a mass-market-ish book about, well, AI for humanity,...
Quick psychology experiment Right now, if I offered you a bet[1] that was a fair coin flip, on tails you give me $100, heads I give you $110, would you take it? Got an answer? Good. Hover over the spoiler to see what other people think: About 90% of undergrads...
In the oceans of the planet Water, a species of intelligent squid-like aliens - we'll just call them the People - debate about what it means to be fleeb. Fleeb is a property of great interest to the People, or at least they think so, but they also have a...
EDIT 1/27: This post neglects the entire sub-field of estimating uncertainty of learned representations, as in https://openreview.net/pdf?id=e9n4JjkmXZ. I might give that a separate follow-up post. Introduction Suppose you've built some AI model of human values. You input a situation, and it spits out a goodness rating. You might want to...
A mostly finished post I'm kicking out the door. You'll get the gist. I There's a tempting picture of alignment that centers on the feeling of "As long as humans stay in control, it will be okay." Humans staying in control, in this picture, is something like humans giving lots...
I read Quine's Word and Object on vacation last week. Overall it was fine, but there were two things that might be worth quick mentions. Quine, Supervised Learning Supremacist One important facet of the book is Quine's picture of how humans learn language. It's not quite that Quine is a...