For example, if I want to experiment with a steering technique, it would be useful to have a language model that is small and capable, but not so heavily finetuned that it becomes inflexible. (Or, ideally, a model that has both a finetuned and a non-finetuned variant.)
I've seen some people use GPT-2. Is that recommended? Are there any alternatives?
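To make the setup concrete, here's a minimal sketch of the kind of experiment I mean: activation-addition steering on GPT-2 via a forward hook. The layer index, coefficient, and contrast prompts are arbitrary placeholders, and adding the vector at every position is just one simple variant:

```python
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tok = GPT2TokenizerFast.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
model.eval()

LAYER, COEFF = 6, 4.0  # arbitrary illustrative choices

def resid_at(prompt: str) -> torch.Tensor:
    """Mean residual-stream activation after block LAYER for `prompt`."""
    ids = tok(prompt, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, output_hidden_states=True)
    # hidden_states[0] is the embedding output, so block i's output is [i+1]
    return out.hidden_states[LAYER + 1].mean(dim=1)

# Contrastive steering vector from a pair of placeholder prompts
steer = resid_at("I love this") - resid_at("I hate this")

def hook(module, inputs, output):
    # GPT-2 blocks return a tuple; the hidden states are the first element.
    # This simple variant adds the vector at every position.
    return (output[0] + COEFF * steer,) + output[1:]

handle = model.transformer.h[LAYER].register_forward_hook(hook)
ids = tok("The movie was", return_tensors="pt").input_ids
gen = model.generate(ids, max_new_tokens=20, do_sample=False,
                     pad_token_id=tok.eos_token_id)
print(tok.decode(gen[0], skip_special_tokens=True))
handle.remove()
```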
Pythia is meant for this — EleutherAI designed the suite for exactly this kind of research, with a range of sizes and intermediate training checkpoints, all trained on the same data in the same order.
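E.g., a quick loading sketch (the size and checkpoint step are just examples; the suite runs from 70M to 12B parameters, each with a "-deduped" twin):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "EleutherAI/pythia-410m"  # append "-deduped" for the
                                 # deduplicated-data variant
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

# Intermediate training checkpoints are published as branches, so you
# can compare behaviour across training, e.g. an early checkpoint:
early = AutoModelForCausalLM.from_pretrained(name, revision="step3000")
```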
Aren't the larger Pythias pretty undertrained?