veered's Shortform

veered

7th Apr 2023

This is a special post for quick takes by veered. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.

veered (2y ago)

For GPT-style LLMs, is it possible to prove statements like the following?

Choose some tokens $A$, $B$ and a fixed $\mathrm{LLM}$:

There does not exist a prefix of tokens $P$ such that $\mathrm{LLM}(P + A) \to B$

More generally, is it possible to prove interesting universal statements? Sure, you can brute force it for LLMs with a finite context window but that's both infeasible and boring. And you can specifically construct contrived LLMs where this is possible but that's also boring.

I suspect that it's not possible/practical in general because the LLM can do arbitrary computation to predict the next token, but maybe I'm wrong.
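The brute-force check mentioned above can be sketched on a toy scale. This is only an illustration under stated assumptions: the "LLM" is modeled as a deterministic function from a token tuple to its single next token (i.e. greedy decoding, which is one possible reading of $\to$), and `find_prefix`, `last_token_model`, and `sum_model` are hypothetical names invented for this example, not real models or library functions.

```python
from itertools import product
from typing import Callable, Optional, Sequence, Tuple

Token = int
# Assumption: the model is a deterministic next-token function (greedy decoding).
Model = Callable[[Tuple[Token, ...]], Token]


def find_prefix(
    model: Model,
    vocab_size: int,
    a: Sequence[Token],
    b: Token,
    context_len: int,
) -> Optional[Tuple[Token, ...]]:
    """Return some prefix P with model(P + A) == B, or None if no such
    prefix fits in the context window. Exhaustive, so exponential in length."""
    a = tuple(a)
    max_prefix_len = context_len - len(a)
    for n in range(max_prefix_len + 1):
        # Enumerate every token sequence of length n over the vocabulary.
        for prefix in product(range(vocab_size), repeat=n):
            if model(prefix + a) == b:
                return prefix
    return None


def last_token_model(tokens: Tuple[Token, ...]) -> Token:
    """Contrived toy model: the output depends only on the last token."""
    return (tokens[-1] + 1) % 3 if tokens else 0


def sum_model(tokens: Tuple[Token, ...]) -> Token:
    """Toy model whose output depends on the whole context."""
    return sum(tokens) % 3


if __name__ == "__main__":
    # The prefix cannot influence last_token_model, so no P yields B = 0.
    print(find_prefix(last_token_model, 3, (1,), 0, context_len=4))
    # For sum_model a suitable prefix exists and is found.
    print(find_prefix(sum_model, 3, (1,), 0, context_len=4))
```

A `None` result here is a proof by exhaustion of the universal statement for that toy model. The first toy model is exactly the "contrived" kind of case: the statement holds for a trivial structural reason. For a real model the same loop is correct in principle but far too large to run.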

JBlack (2y ago)

Yes, in general statements like this are theoretically possible to prove, but not remotely practical. There might be some specific $(A, B, \mathrm{LLM})$ triples for which you can prove such a statement, but I expect that none of these generalize to actually useful statements.
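To put a rough number on "not remotely practical", here is a back-of-the-envelope count of the exhaustive search space. The symbols $V$ (the vocabulary) and $n$ (the context length in tokens) are introduced here for illustration and are not in the original comments. The number of candidate prefixes $P$ that still leave room for $A$ is

$$
N \;=\; \sum_{k=0}^{n-|A|} |V|^{k} \;=\; \frac{|V|^{\,n-|A|+1}-1}{|V|-1}.
$$

With purely illustrative values $|V| = 5 \times 10^{4}$, $n = 2048$, and $|A| = 1$, this gives $N > |V|^{2047} > 10^{9600}$ forward passes, so a proof by enumeration is out of reach even though it exists in principle.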

No GPT-style architecture is (in itself) capable of truly universal computation, but in practice the functions such architectures can implement are far beyond our ability to adequately analyze.
