I am not an expert in LLM reasoning, but I think it is because of RL. You cannot use Alpha...

ozten · on Dec 21, 2024

Nope. AlphaZero taught itself to play games like chess, shogi, and Go through self-play, starting from random moves. It was not given any strategies or human gameplay data but was provided with the basic rules of each game to guide its learning process.

demirbey05 · on Dec 24, 2024

Yes, it's reinforcement learning, but you need to create a policy, and each policy is specialized for a specific task.

sgt101 · on Dec 21, 2024

I thought that AlphaZero could play three games? Go, Chess and Shogi?

demirbey05 · on Dec 24, 2024

I think I meant Catan :)