The speed of GPT-5 could be explained by using GB200 NVL72 for inference, even if it's an 8T total param model.
Ah, interesting! So the speed we see shouldn't tell us much about GPT-5's size.
I omitted one other factor from my shortform, namely cost. Do you think OpenAI would be willing to serve an 8T params (1T active) model for the price we're seeing? I'm basically trying to understand whether GPT-5 being served for relatively cheap should be a large or small update.
One difference between the releases of previous GPT versions and the release of GPT-5 is that it was clear that the previous versions were much bigger models trained with more compute than their predecessors. With the release of GPT-5, it's very unclear to me what OpenAI did exactly. If, instead of GPT-5, we had gotten a release that was simply an update of 4o + a new reasoning model (e.g., o4 or o5) + a router model, I wouldn't have been surprised by their capabilities. If instead GPT-4 were called something like GPT-3.6, we would all have been more or less equally impressed, no matter the naming. The number after "GPT" used to track something pretty specific that had to do with some properties of the base model, and I'm not sure it's still tracking the same thing now. Maybe it does, but it's not super clear from reading OpenAI's comms and from talking with the model itself. For example, it seems too fast to be larger than GPT-4.5.
If you can express empathy, show that you do in fact care about the harms they're worried about as well
Someone can totally do that and express that indeed "harms to minorities" is something we should care about. But OP said that the objection was "the harm AI and tech companies do to minorities and their communities" and... AI is doing no harm that only affects "minorities and their communities". If anything, current AI is likely to be quite positive. The actually honest answer here is "I care about minorities, but you're wrong about the interaction between AI and minorities". And this isn't going to land super well on leftists IMO.
when I was running the EA club
Also, were the people you were talking to EAs or there because interested in EA in the first place? If that's the case your positive experience in tackling these topics is very likely not representative of the kind of thing OP is dealing with.
How much do you think subjective experience owes to the internal-state-analyzing machinery?
I'm actually not really sure. I find it plausible that subjective experience could exist without internal-state-analyzing machinery, and that's what I'm hypothesizing is going on with LLMs to some extent. I think they do have some self-model, but they don't seem to have access to internal states the way we do. Although I somehow think it's more likely that an LLM experiences something than a tree experiences something.
if I wanted to inspect that subjective experience, I probably couldn't do it by strapping a Broca's area etc.
I maybe agree with that, conditional on trees having subjective experience. What I do think might work is doing something more comprehensive: maybe bootstrapping trees with a lot more machinery that includes something to form concepts that correspond to whatever processes are leading to their experiences (insofar as there are processes corresponding to experiences. I'd guess things do work in this way, but I'm not sure). That machinery needs to be somehow causally entangled with those processes; consider how humans have complicated feedback loops such as touch-fire -> pain -> emotion -> self-modeling -> bodily-reaction -> feedback-from-reaction-back-to-brain...
The introspection, self-modeling, and language-production circuitry isn't an impartial window into what's going on inside, the story it builds reflects choices about how to interpret its inputs.
Yeah, that seems true too, but I guess if you have a window at all, then you still have some causal mechanism that goes from internal states corresponding to experiences to internal concepts correlated to those, which might be enough. Now, though, I'm pretty unsure whether the experience is actually due to the concepts themselves or the states that caused them, or whether this is just a confused way of seeing the problem.
I'm in an uncomfortable situation of having a better-argued and more convincing answer than any of the comments here, but I'm unable to trust it fully. ChatGPT o3 gave it. The prompt was simply my post, title included. Here is the answer:
GDP is value added at market prices. Statistical agencies report it two ways:
Because the deflators strip out economy-wide price changes, a rise in real GDP cannot be caused by “just paying more for the same tomatoes.” In other words, inflation and real-GDP growth are separated by construction.
Where things get messy is measurement: digital goods are often free or priced in hard-to-track ways, so the official statistics probably under-state their contribution to welfare rather than over-state it. Experiments that impute the willingness-to-pay for free online services show real growth would have been 0.1-0.2 pp a year higher in the 2005-15 period if that value were included Bureau of Economic AnalysisNBER.
What happens to physical-goods consumption?
Either way, people are richer in utility terms because they now enjoy both their old physical bundle and a vast new bundle of virtual goods whose marginal resource cost is near zero.
Economic growth has already been shifting from material to intangible output for decades—software, streaming, consulting, design. The metaverse would push that frontier further. Because these activities consume little energy, metal or land, GDP can rise without a proportional rise in resource throughput—exactly Noah Smith’s point. The OECD, IMF and others note that the challenge is not physical scarcity but statistical visibility of intangibles and data OECDIMF eLibrary.
So the answer is yes, we would genuinely be richer, even if the number of apples harvested per hectare barely changes. The gains show up partly as higher real incomes that can (within supply limits) buy more physical goods, and partly—as economists increasingly argue—as consumer surplus from wholly new digital experiences that standard GDP still struggles to count.
There's something that I think is usually missing from time-horizon discussions, which is that the human brain seems to operate on a very long time horizon for entirely different reasons. The story for LLMs looks like this: LLMs become better at programming tasks, therefore they become capable of doing (in a relatively short amount of time) tasks that would take increasingly longer for humans to do. Humans, instead, can just do stuff for a lifetime, and we don't know where the cap is, and our brain has ways to manage its memories depending on how often they are recalled, and probably other ways to keep itself coherent over long periods. It's a completely different sort of thing! This makes me think that the trend here isn't very "deep". The line will continue to go up as LLMs become better and better at programming, and then it will slow down due to capability gains generally slowing down due to training compute bottlenecks and due to limited inference compute budgets. On the other hand, I think it's pretty dang likely that we get a drastic trend break in the next few years (i.e., the graph essentially loses its relevance) when we crack the actual mechanisms and capabilities related to continuous operation. For example, continuous learning, clever memory management, and similar things that we might be completely missing at the moment even as concepts.