qxcv

Tensor Trust: An online game to uncover prompt injection vulnerabilities

TL;DR: Play this online game to help CHAI researchers create a dataset of prompt injection vulnerabilities. RLHF and instruction tuning have succeeded at making LLMs practically useful, but in some ways they are a mask that hides the shoggoth beneath. Every time a new LLM is released, we see just...

Sep 1, 202330

LESSWRONG
LW

LESSWRONG
LW

qxcv

Tensor Trust: An online game to uncover prompt injection vulnerabilities

qxcv

qxcv

Tensor Trust: An online game to uncover prompt injection vulnerabilities