I think this is a very promising method for improving the steering of LLMs, which is great for reducing risk from model-originating harms like deception.

The flip side is that it also increases misuse potential.

This is yet another way the safety gap could widen between closed-weight models with locked-down controls and open-weight models.