Killing Recurrent Memory Over Self Attention?
Should we kill recurrent memory over self attention ❓ Spending most of my time on time series problems, I often think about the consequence of memory and the sequential nature we are exposed to in the physical world. Memory is the idea for a learning algorithm to store a representation...
Jun 6, 20233