Zvi

Monthly Roundup #41: April 2025

AI continues to accelerate and dominate the schedule, which is why this is a bit late, but we do occasionally need to pay our respects to the Goddess of Everything Else. There are cool or interesting things everywhere. Also maddening things. But did you hear, for example, that they’re making some...

Apr 24

AI #165: In Our Image

This was the week of Claude Opus 4.7. The reception was more mixed than usual. It clearly has the intelligence and chops, especially for coding tasks, and a lot of people including myself are happy to switch over to it as our daily driver. But others don’t like its personality,...

Apr 23

Opus 4.7 Part 3: Model Welfare

It is thanks to Anthropic that we get to have this discussion in the first place. Only they, among the labs, take the problem seriously enough to attempt to address it at all. They are also the ones that make the models that matter most. So the people who...

Apr 22

Opus 4.7 Part 2: Capabilities and Reactions

Claude Opus 4.7 raises a lot of key model welfare-related concerns. I was planning to do model welfare first, but I’m having some good conversations about that post and it needs another day to cook, and also it might benefit from this post going first. So I’m going to...

Apr 21

Opus 4.7 Part 1: The Model Card

Less than a week after completing coverage of Claude Mythos, here we are again as Anthropic gives us Claude Opus 4.7. So here we are, with another 232 pages of light reading. This post covers the first six sections of the Model Card. It excludes section seven, model welfare, because...

Apr 20

AI #164: Pre Opus

This is a day late because, given the discourse around Dwarkesh Patel’s interview with Jensen Huang, I pushed the weekly to Friday. This week’s coverage focused on the most important model in a while, Claude Mythos, which was a large jump in cybersecurity capabilities, especially in its ability to autonomously...

Apr 17

On Dwarkesh Patel’s Podcast With Nvidia CEO Jensen Huang

Some podcasts are self-recommending on the ‘yep, I’m going to be breaking this one down’ level. This was one of those. So here we go. As usual for podcast posts, the baseline bullet points describe key points made, and then the nested statements are my commentary. Some points are dropped....

Apr 16


An Unexpected Victory: Container Stacking at the Port of Long Beach

Slack

The Online Sports Gambling Experiment Has Failed

OpenAI: The Battle of the Board
