MuZero figures out chess, rules and all – Chessbase News
12/12/2019 Just imagine you had a chess computer, the auto-sensor kind. Would someone who had no knowledge of the game be able to work it out, just by moving pieces? Or imagine you are a very powerful computer. By looking at millions of images of chess games, would you be able to figure out the rules and learn to play the game proficiently? The answer is yes, because that has just been done by Google's DeepMind team. For chess, Go, shogi and 57 Atari games. It is interesting, and slightly disturbing. | Graphic: DeepMind
In 1980 the first chess computer with an auto response board, the Chafitz ARB Sargon 2.5, was released. It was programmed by Dan and Kathe Spracklen and had a sensory board and magnet pieces. The magnets embedded in the pieces were all the same kind, so that the board could only detect whether there was a piece on the square or not. It would signal its moves with LEDs located on the corner of each square.
Chafitz ARB Sargon 2.5 | Photo: My Chess Computers
Some years after the release of this computer I visited the Spracklens in their home in San Diego, and one evening had an interesting discussion, especially with Kathe. What would happen, we wondered, if we set up a Sargon 2.5 in a jungle village where nobody knew chess? If we left the people alone with the permanently switched-on board and pieces, would they be able to figure out the game? If they lifted a piece, the LED on that square would light up; if they put it on another square, that LED would light up briefly. If the move was legal, there would be a reassuring beep; the square of a piece of the opposite colour would light up, and if they picked up that piece another LED would light up. If the original move wasn't legal, the board would make an unpleasant sound.
Our question was: could they figure out, by trial and error, how chess was played? Kathe and I discussed it at length, over the Sargon board, and in the end came to the conclusion that it was impossible: they could never figure out the game without human instructions. Chess is far too complex.
Now, three decades later, I have to modify our conclusion somewhat: maybe humans indeed cannot learn chess by pure trial and error, but computers can...
You remember how AlphaGo and AlphaZero were created by Google's DeepMind division. The programs Leela and Fat Fritz were generated using the same principle: tell an AI program the rules of the game, how the pieces move, and then let it play millions of games against itself. The program draws its own conclusions about the game and starts to play master-level chess. In fact, it can be argued that these programs are the strongest entities to have ever played chess, human or computer.
Now DeepMind has come up with a fairly atrocious (but scientifically fascinating) idea: instead of telling the AI software the rules of the game, just let it play, using trial and error. Let it teach itself the rules of the game, and in the process learn to play it professionally. DeepMind combined a tree-based search (where a tree is a data structure used for locating information from within a set) with a learned model. They called the project MuZero. The program must predict the quantities most relevant to game planning, not just for chess, but also for 57 different Atari games. The result: MuZero, we are told, matches the performance of AlphaZero in Go, chess, and shogi.
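To make the idea of searching with a learned model concrete, here is a minimal Python sketch. It is a toy under loud assumptions, not DeepMind's method: `toy_model` is a hypothetical hand-written stand-in for a learned model, the "game" is a single-player walk along a number line, and the search is a plain exhaustive depth-limited lookahead rather than the Monte Carlo tree search MuZero uses. The point it illustrates is only that the planner never consults any rules; it asks the model what would happen.

```python
from typing import Callable, Sequence, Tuple

# A model maps (state, action) to (predicted next state, predicted reward).
Model = Callable[[int, int], Tuple[int, float]]


def toy_model(state: int, action: int) -> Tuple[int, float]:
    """Hypothetical stand-in for a learned model (not a real network).

    Action 1 moves one step right, action 0 one step left.
    Arriving at position 3 is predicted to yield a reward of 1.
    """
    next_state = state + (1 if action == 1 else -1)
    reward = 1.0 if next_state == 3 else 0.0
    return next_state, reward


def best_return(model: Model, state: int, depth: int,
                actions: Sequence[int] = (0, 1)) -> float:
    """Best total predicted reward reachable within `depth` steps."""
    if depth == 0:
        return 0.0
    best = float("-inf")
    for action in actions:
        next_state, reward = model(state, action)
        best = max(best, reward + best_return(model, next_state, depth - 1, actions))
    return best


def plan(model: Model, state: int, depth: int,
         actions: Sequence[int] = (0, 1)) -> int:
    """Pick the first action whose predicted lookahead return is highest."""
    def score(action: int) -> float:
        next_state, reward = model(state, action)
        return reward + best_return(model, next_state, depth - 1, actions)
    return max(actions, key=score)


# Starting at position 0 with a three-step horizon, the planner heads right.
print(plan(toy_model, state=0, depth=3))
```

Swapping `toy_model` for a function fitted from experience would leave `plan` unchanged, which is the appeal of the approach.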
And this is how MuZero works (description from VentureBeat):
"Fundamentally, MuZero receives observations (images of a Go board or Atari screen) and transforms them into a hidden state. This hidden state is updated iteratively by a process that receives the previous state and a hypothetical next action, and at every step the model predicts the policy (e.g., the move to play), value function (e.g., the predicted winner), and immediate reward (e.g., the points scored by playing a move)."
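The loop described in that quote can be sketched in a few lines of Python. This is a toy illustration, not DeepMind's implementation: the function names `represent`, `dynamics` and `predict` are hypothetical, the real system uses neural networks in their place, and the arithmetic inside them is arbitrary, chosen only so that the example runs.

```python
from typing import List, Tuple

HiddenState = Tuple[float, ...]


def represent(observation: List[float]) -> HiddenState:
    """Turn a raw observation (e.g. board pixels) into a hidden state."""
    return (sum(observation), float(len(observation)))


def dynamics(state: HiddenState, action: int) -> Tuple[HiddenState, float]:
    """Take the previous hidden state and a hypothetical action;
    return the next hidden state and the predicted immediate reward."""
    next_state = (state[0] + action, state[1])
    reward = 1.0 if action == 1 else 0.0  # arbitrary toy rule
    return next_state, reward


def predict(state: HiddenState, num_actions: int = 2) -> Tuple[List[float], float]:
    """Return a policy (here simply uniform) and a value estimate."""
    policy = [1.0 / num_actions] * num_actions
    value = state[0] / (1.0 + abs(state[0]))  # squashed into (-1, 1)
    return policy, value


def unroll(observation: List[float], actions: List[int]):
    """Encode the observation once, then step through hypothetical actions
    entirely inside the hidden state, predicting at every step."""
    state = represent(observation)
    trace = []
    for action in actions:
        state, reward = dynamics(state, action)
        policy, value = predict(state)
        trace.append((action, reward, policy, value))
    return trace


trace = unroll([0.0, 1.0, 0.0], [1, 0, 1])
for step in trace:
    print(step)
```

Note that after the first call to `represent`, the loop never looks at the board again: every later step works only with the hidden state, which is why no rulebook is needed.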
Evaluation of MuZero throughout training in chess, shogi, Go, and Atari. The y-axis shows Elo rating. | Image: DeepMind
As the DeepMind researchers explain, one form of reinforcement learning (the technique in which rewards drive an AI agent toward goals) involves models. This form models a given environment as an intermediate step, using a state transition model that predicts the next step and a reward model that anticipates the reward. If you are interested in this subject you can read the article on VentureBeat, or visit the DeepMind site. There you can read this paper on the general reinforcement learning algorithm that masters chess, shogi and Go through self-play. Here's an abstract:
The game of chess is the longest-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. By contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go by reinforcement learning from self-play. In this paper, we generalize this approach into a single AlphaZero algorithm that can achieve superhuman performance in many challenging games. Starting from random play and given no domain knowledge except the game rules, AlphaZero convincingly defeated a world champion program in the games of chess and shogi (Japanese chess), as well as Go.
That refers to the earlier AlphaZero development, which has now been extended to MuZero. It turns out that it is possible not just to become highly proficient at a game by playing it a million times against yourself, but also to work out the rules of the game by trial and error.
I have only just learned about this development and need to think about the consequences and discuss it with experts. My first, somewhat flippant reaction to a member of the DeepMind team: "What next? Show it a single chess piece and it figures out the whole game?"