AlphaGo Zero Explained In One Diagram | by David Foster – Medium

The AlphaGo Zero Cheat Sheet (high-res link below)

Download the AlphaGo Zero cheat sheet

Recently Google DeepMind announced AlphaGo Zero an extraordinary achievement that has shown how it is possible to train an agent to a superhuman level in the highly complex and challenging domain of Go, tabula rasa that is, from a blank slate, with no human expert play used as training data.

It thrashed the previous reincarnation 1000, using only 4TPUs instead of 48TPUs and a single neural network instead of two.

The paper that the cheat sheet is based on was published in Nature and is available here. I highly recommend you read it, as it explains in detail how deep learning and Monte Carlo Tree Search are combined to produce a powerful reinforcement learning algorithm.

Hopefully you find the AlphaGo Zero cheat sheet useful let me know if you find any typos or have questions about anything in the document.

If you would like to learn more about how our company, Applied Data Science develops innovative data science solutions for businesses, feel free to get in touch through our website or directly through LinkedIn.

and if you like this, feel free to leave a few hearty claps 🙂

Applied Data Science is a London based consultancy that implements end-to-end data science solutions for businesses, delivering measurable value. If youre looking to do more with your data, lets talk.

Read more here:
AlphaGo Zero Explained In One Diagram | by David Foster - Medium

Related Posts

Comments are closed.