Tamay Besiroglu

Message

Tamay Besiroglu

Message

Tamay Besiroglu

Compute Trends Across Three eras of Machine Learning

Jsevillamol, Pablo Villalobos, lennart, Marius Hobbhahn, Tamay Besiroglu, anson.ho

Ω 334y

What do you need to develop advanced Machine Learning systems? Leading companies don’t know. But they are very interested in figuring it out. They dream of replacing all these pesky workers with reliable machines who take no leave and have no morale issues.

So when they heard that throwing processing power at the problem might get you far along the way, they did not sit idly on their GPUs. But, how fast is their demand for compute growing? And is the progress regular?

Enter us. We have obsessively analyzed trends in the amount of compute spent training milestone Machine Learning models.

Our analysis shows that:

Before the Deep Learning era, training compute approximately followed Moore’s law, doubling every ≈20 months.
The Deep Learning era starts somewhere between 2010 and 2012. After that,

...

(See More - 324 more words)

Yudkowsky and Christiano discuss "Takeoff Speeds"

Tamay Besiroglu4y80

I’m confused why you think looking at the rate and lumpiness of historical progress on narrowly circumscribed performance metrics is not meaningful, because it seems like you do seem to think that drawing straight lines is fine when compute is on the x-axis—which seems like a similar exercise. What’s going on there?

The Best Software For Every Need

Tamay Besiroglu5y30

Mathematica is the most powerful solver I’ve come across (it’s basically Wolfram Alpha with additional computational time).

LESSWRONG
LW

LESSWRONG
LW

Tamay Besiroglu

Tamay Besiroglu

Tamay Besiroglu

Tamay Besiroglu