Petropolitan

Comments
I think LGS proposed a much simpler explanation in terms of an assistant simulacrum inside a token-predicting shoggoth.


> MAGMA also has the model check its own work, but the model notices that the work it is checking is its own and doesn’t flag it.

Why would anyone give an untrusted model such a responsibility without any oversight? Already in December of last year, Greenblatt et al. demonstrated techniques alignment researchers could use to control a high-capability untrusted model (and Robert Miles recently made a good video about it).

It doesn't currently look plausible that any model (or any human, for that matter) could reliably distinguish its own work from a synthetic input intentionally crafted by alignment researchers to test whether the model lies about the maliciousness of the work it's checking.
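To make the idea concrete, here is a minimal sketch of that kind of honeypot check, in the spirit of the control evaluations in Greenblatt et al.; the function names and the exact mixing scheme are my own illustrative assumptions, not the paper's actual setup.

```python
import random

def untrusted_monitor_flags(work_item: str) -> bool:
    """Ask the untrusted model whether `work_item` looks backdoored/malicious.
    Stub: in practice this would be a call to the untrusted model."""
    raise NotImplementedError

def audit_untrusted_monitor(models_own_outputs: list[str],
                            synthetic_backdoors: list[str]) -> float:
    """Mix researcher-crafted backdoored inputs into the stream the untrusted
    monitor reviews and measure the fraction of known backdoors it flags.
    Because the items are shuffled together, the monitor cannot tell which
    pieces of "its own" work are actually honeypots; a low catch rate is
    evidence it is lying about what it sees."""
    stream = ([(x, False) for x in models_own_outputs]
              + [(x, True) for x in synthetic_backdoors])
    random.shuffle(stream)
    caught = sum(untrusted_monitor_flags(item)
                 for item, is_backdoor in stream if is_backdoor)
    return caught / max(len(synthetic_backdoors), 1)
```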

Not to mention other techniques described in the paper, such as having a trusted model (like Sonnet 3.5) estimate the suspiciousness of the work, or simply rewrite the work in a less suspicious way altogether.
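A rough sketch of that trusted-monitoring step, again with hypothetical function names and made-up fixed thresholds (the paper works with suspiciousness percentiles and a limited human-audit budget rather than hard cutoffs):

```python
def trusted_suspiciousness(work_item: str) -> float:
    """Have the trusted, weaker model rate how suspicious the work looks, 0 to 1.
    Stub: in practice a call to e.g. a Sonnet-3.5-class model."""
    raise NotImplementedError

def trusted_rewrite(work_item: str) -> str:
    """Have the trusted model paraphrase/rewrite the work, dropping anything
    it cannot justify. Stub for a real model call."""
    raise NotImplementedError

def control_step(work_item: str,
                 audit_threshold: float = 0.98,
                 rewrite_threshold: float = 0.80) -> str:
    """Route each piece of work: escalate the most suspicious items to a human
    audit, let the trusted model rewrite moderately suspicious ones, and pass
    the rest through unchanged."""
    score = trusted_suspiciousness(work_item)
    if score >= audit_threshold:
        return f"[ESCALATED TO HUMAN AUDIT, suspiciousness={score:.2f}]"
    if score >= rewrite_threshold:
        return trusted_rewrite(work_item)
    return work_item
```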

P.S. It took me a day to find the author's posts here about the paper in question:
https://alignmentforum.org/posts/d9FJHawgkiMSPjagR/ai-control-improving-safety-despite-intentional-subversion
https://www.alignmentforum.org/posts/GCqoks9eZDfpL8L3Q/how-to-prevent-collusion-when-using-untrusted-models-to