This algorithm won't get caught in a loop like the one you mentioned, because it uses the same process as the one described in the AutoML-Zero paper. In the article, they 'found a better algorithm and iterated' without any problem whatsoever, using the processes described in figures 1 and 2. Please check the paper for that.
About your second point: that's exactly the aim of the experiment, to know if a strictly-better agent can be found with an automatic process. If we don't get there using substantial computation within a... (read more)
Hey! Thanks for your comment.
This algorithm won't get caught in a loop like the one you mentioned, because it uses the same process as the one described in the AutoML-Zero paper. In the article, they 'found a better algorithm and iterated' without any problem whatsoever, using the processes described in figures 1 and 2. Please check the paper for that.
About your second point: that's exactly the aim of the experiment, to know if a strictly-better agent can be found with an automatic process. If we don't get there using substantial computation within a... (read more)