All of Einar Urdshals's Comments + Replies

The code is currently not public. We intend to make it public once we have finished a few more projects with the same codebase. One of the things we would like to look at is varying the amount of noise. I don't have great intuitions for what the loss landscape of a model trained on a finite random dataset will look like.

As to the translational symmetry of the circuits, the measure just sums the absolute difference between adjacent elements parallel to the diagonal, does the same for elements perpendicular to the diagonal and takes the difference of the two... (read more)

Thanks for the suggestion! You can access the still images that have been used to generate the gifs here. We have also added the link to the still images to the post!