A Go playing program implemented in Tensorflow roughly according to the architecture of AlphaGo. JoshieGo has strong policy/value network, and uses APV-MCTS to search during live play. On a single GTX1080 given 20 seconds per move, JoshieGo achieve strength around 3~4 amateur dan on OGS and Tencent's Fox Go Server.
Compared to many other AlphaGo-related but failed projects, JoshieGo does not aim to faithfully replicate AlphaGo's original paper, especially in the training of value network which is crucial for final performance. Rather, JoshieGo aims to build practical and strong Go engine with acceptable time and affordable hardware. The value network of JoshieGo is trained and fine-tuned purely with publicly available data using supervised learning without obvious overfitting. Policy network and MCTS are implemented as described in AlphaGo paper, with the exception that we do not use a fast roll-out policy.
Tensorflow == 1.0.0
Numpy >= 1.11.1
OpenCV (optional, visualization use only)
-
Edit IP Address in play.py
-
run python play.py --is_server=1
-
run python play.py
Games played by JoshieGo can be found by searching username "JoshieBot" on Tencent's Fox Go Server.
- Surprisingly, JoshieGo can not read ladder. Feature planes indicating successful ladder capture and ladder escape will be added.
- Wrap feature extraction code in C to boost performance.
- Implement distributed APV-MCTS.