https://www.theguardian.com/technology/2017/dec/07/alphazero-google-deepmind-ai-beats-champion-program-teaching-itself-to-play-four-hours
https://arxiv.org/abs/1712.01815
I'm posting this slightly late; the paper is from December 5.
I'd be interested to learn if AlphaZero could be applied to other closed-environment tasks, such as designing hardware in a simulator.
Noted!
Sorry, I couldn't find the previous link here when I searched for it.