[ Question ]

What are the most important papers/post/resources to read to understand more of GPT-3?

by adamShimi

2nd Aug 2020

AI Alignment Forum

I'm far more used to thinking about weird maths, distributed algorithms, or abstract philosophical problems than about concrete machine learning architectures. But based on everything I see about GPT-3, it seems like a good idea to learn more about it, if only to participate in the discussion without spouting nonsense.

So I'm asking what you think are the must-reads on GPT-3 specifically, and perhaps any prerequisites for understanding them.

GPT, Machine Learning (ML), AI

2 Answers sorted by
top scoring

Peter Jin

Aug 03, 2020

nostalgebraist's blog is a must-read on GPT-x, including GPT-3. Perhaps start here ("the transformer... 'explained'?"), which helps to contextualize GPT-x within the history of machine learning.

(Though I should note that nostalgebraist holds a contrarian "bearish" position on GPT-3 in particular; for the "bullish" case, read Gwern instead.)

adamShimi (5y)

Thanks for the answer! I knew about the "transformer explained" post, but I was not aware of its author's position on GPT-3.

Juraj Vitko

Aug 04, 2020

Here's a list of resources that may be of use to you. The GPT-3 paper isn't very specific about implementation details because the changes that led to it were rather incremental (especially from GPT-2, and more so the farther back we look in the Transformer lineage). So the scope of what you need to read to understand GPT-3 is broader than one might expect.

adamShimi (5y)

Thanks! I'll try to read that.
