  1. What is the significance of move 37? (to a non go player)

    Feb 26, 2023 · I have seen (and googled) information for Game 2, Move 37 in the AlphaGo vs. Lee Sedol match. However, it is difficult to find information concerning this move that doesn't rely on an …

  2. deep learning - What is the input to AlphaGo's neural network ...

    Jun 8, 2020 · AlphaGo Zero only uses the black and white stones from the Go board as its input, whereas previous versions of AlphaGo included a small number of hand-engineered features. What …

  3. Did Alphago zero actually beat Alphago 100 games to 0?

    Oct 21, 2020 · tl;dr: Did AlphaGo Zero and AlphaGo play 100 repetitions of the same sequence of boards, or were there 100 different games? Background: AlphaGo was the first superhuman Go player, but it …

  4. Newest 'alphago' Questions - Artificial Intelligence Stack Exchange

    For questions related to DeepMind's AlphaGo, which is the first computer Go program to beat a human professional Go player without handicaps on a full-sized 19x19 board. AlphaGo was introduced in …

  5. What is the difference between DQN and AlphaGo Zero?

    The earlier AlphaGo version had four separate networks: three variations of the policy network, used during play at different stages of planning, and one value network. … Is designed around self-play …

  6. Why is Monte Carlo used as the tree search algorithm for AlphaGo?

    Apr 9, 2019 · The paper that introduced AlphaGo, Mastering the game of Go with deep neural networks and tree search, motivates the use of MCTS. Monte Carlo tree search (MCTS) uses Monte Carlo …

  7. Difference between AlphaGo's policy network and value network

    Mar 29, 2016 · If anyone else stumbles upon this old question, like me, you'll be pleased to know that AlphaGo's successor, "AlphaGo Zero", as well as its successor "AlphaZero" do indeed get rid of the …

  8. What kind of policy evaluation and policy improvement AlphaGo, …

    Jul 17, 2020 · I'm trying to find out what kind of policy improvement and policy evaluation AlphaGo, AlphaGo Zero, and AlphaZero are using. By looking into their respective papers and SI, I can …

  9. How does Alpha Go Zero MCTS work in parallel?

    Sep 25, 2023 · To understand how AlphaGo Zero performs parallel simulations think of each simulation as a separate agent that interacts with the search tree. Each agent starts from the root node and …

  10. reinforcement learning - How does policy network learn in AlphaZero ...

    May 25, 2021 · In AlphaGo's paper, they take into account whether the outcome of the game was a win or a loss when training the policy network with reinforcement learning.