J L

J L has not written any posts yet.

Replying to Understanding “Deep Double Descent”

J L · 4y

Apologies if this is obvious, but why the focus on SGD? I'm assuming it isn't meant as shorthand for optimization algorithms in general, given the emphasis on SGD's specific inductive bias, and the Deep Double Descent paper mentions that the phenomena hold across most natural choices of optimizer.

LESSWRONG