I think that rather than ML engineering (recreating GPT, learning PyTorch, etc.) it's more effective for an AI safety researcher to learn one or several general theories of ML, deep learning, or specifically transformers, such as:
I've personally learned (well, at least, read the corresponding paper in full, making sure that I understand or "almost" understand every part of it) from the list above: the circuit theory (Olah et al. 2020) and the mathematical framework for transformers (Elhage et al. 2021). However, this is a very "low variance" choice: if AI safety researchers know any of these theories, it's exactly these two because these papers are referenced in the AGI Safety Fundamentals Alignment curriculum. I think it would be more useful for the community for more people to get acquainted with more different theories of ML or DL so that the community as a whole has a more diversified understanding and perspective. Of course, it would be ideal if some people learned all these theories and were able to synthesise them, but in practice, we can hardly expect that such super-scholars will appear because everyone has so little time and attention.
The list above is copied from the post "A multi-disciplinary view on AI safety research". See also the section "Weaving together theories of cognition and cognitive development, ML, deep learning, and interpretability through the abstraction-grounding stack" in this post, which is relevant to this question.
TL;DR: I'm trying to either come up with a new promising AIS direction or decide (based on my inside view and not based on trust) that I strongly believe in one of the existing proposals. Is there some ML background that I better get? (and if possible: why do you think so?)
I am not asking how to be employable
I know there are other resources on that, and I'm not currently trying to be employed.
Examples of seemingly useful things I learned so far (I want more of these)
(Please correct me if something here was wrong)
I'm asking because I'm trying to decide what to learn next in ML
if anything.
My background
Thanks!
I don't intend to advance ML capabilities