mcts

Star

Here are 402 public repositories matching this topic...

hijkzzz / Awesome-LLM-Strawberry

Star

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

reinforcement-learning mathematics coding mcts strawberry llm chain-of-thought openai-o1

Updated Feb 26, 2025

suragnair / alpha-zero-general

Star

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

reinforcement-learning deep-learning neural-network tensorflow keras pytorch mcts othello gomoku monte-carlo-tree-search gobang alphago tf alphago-zero alpha-zero alphazero self-play

Updated Jan 1, 2025
Jupyter Notebook

junxiaosong / AlphaZero_Gomoku

Star

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

board-game reinforcement-learning tensorflow pytorch mcts gomoku rl monte-carlo-tree-search self-learning gobang alphago alphago-zero alphazero

Updated Apr 24, 2024
Python

werner-duvaud / muzero-general

Star

MuZero

machine-learning reinforcement-learning deep-learning neural-network deep-reinforcement-learning python3 pytorch gym mcts rl tensorboard residual-network monte-carlo-tree-search self-learning alphago model-based-rl alphazero muzero muzero-general

Updated Sep 3, 2024
Python

opendilab / LightZero

Star

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Updated Feb 28, 2025
Python

chauvinSimon / My_Bibliography_for_Research_on_Autonomous_Driving

Star

Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"

Updated Dec 15, 2020

s-casci / tinyzero

Star

Easily train AlphaZero-like agents on any environment you want!

reinforcement-learning mcts alphazero

Updated Jan 11, 2024
Python

hrpan / tetris_mcts

Star

MCTS project for Tetris

game reinforcement-learning deep-learning tetris mcts tetris-bots

Updated Oct 9, 2024
Python

dylandjian / SuperGo

Star

A student implementation of Alpha Go Zero

machine-learning reinforcement-learning python3 pytorch mcts alphago alphago-zero

Updated Aug 1, 2018
Python

DataCanvasIO / Hypernets

Star

A General Automated Machine Learning framework to simplify the development of End-to-end AutoML toolkits in specific domains.

reinforcement-learning keras mcts hyperparameter-optimization evolutionary-algorithms nas monte-carlo-tree-search hyperparameter-tuning automl neural-architecture-search nasnet enas autodl

Updated Jul 19, 2024
Python

QueensGambit / CrazyAra

Star

A Deep Learning UCI-Chess Variant Engine written in C++ & Python 🦜

python open-source machine-learning chess-engine deep-learning mxnet artificial-intelligence mcts gluon lichess convolutional-neural-network alphago python-chess alphazero crazyhouse mcgs

Updated Feb 26, 2025
Jupyter Notebook

vgarciasc / mcts-viz

Star

Visualization of MCTS algorithm applied to Tic-tac-toe.

visualization mcts tictactoe p5js

Updated Aug 25, 2021
JavaScript

sungyubkim / Deep_RL_with_pytorch

Star

A pytorch tutorial for DRL(Deep Reinforcement Learning)

deep-reinforcement-learning pytorch dqn mcts uct c51 iqn hedge ppo a2c gail counterfactual-regret-minimization qr-dqn random-network-distillation soft-actor-critic self-imitation-learning

Updated Apr 24, 2023
Jupyter Notebook

initial-h / AlphaZero_Gomoku_MPI

Star

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

algorithm tensorflow parallel deep-reinforcement-learning mcts gomoku tree-search tensorlayer alphago mpi4py dirichlet-distribution alphazero alphazero-gomoku

Updated Jan 20, 2020
Python

thuxugang / doudizhu

Star

AI斗地主

reinforcement-learning ai card-game dqn mcts doudizhu

Updated Jun 13, 2018
Python

kaesve / muzero

Star

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

reinforcement-learning deep-learning tensorflow deep-reinforcement-learning tf2 mcts alphazero tensorflow2 muzero

Updated Mar 28, 2021
Jupyter Notebook

akolishchak / doom-net-pytorch

Star

Reinforcement learning models in ViZDoom environment

agent learning reinforcement-learning pytorch doom behavior-tree mcts vizdoom reinforcement ppo doomnet-track1

Updated Mar 9, 2022
Python

zjeffer / chess-deep-rl

Star

Research project: create a chess engine using Deep Reinforcement Learning

machine-learning chess-engine chess reinforcement-learning ai deep-learning neural-network deep-reinforcement-learning artificial-intelligence mcts neural-networks alphazero

Updated Jun 29, 2024
Jupyter Notebook

manyoso / allie

Star

Allie: A UCI compliant chess engine

chess-engine chess neural-network mcts deepmind alphabeta alphazero

Updated Apr 8, 2021
C++

PuYuuu / vehicle-interaction-decision-making

Star

The decision-making of multiple vehicles at intersection bases on level-k game and MCTS

mcts game-theory level-k

Updated Jan 24, 2025
C++

Improve this page

Add a description, image, and links to the mcts topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the mcts topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mcts

Here are 402 public repositories matching this topic...

hijkzzz / Awesome-LLM-Strawberry

suragnair / alpha-zero-general

junxiaosong / AlphaZero_Gomoku

werner-duvaud / muzero-general

opendilab / LightZero

chauvinSimon / My_Bibliography_for_Research_on_Autonomous_Driving

s-casci / tinyzero

hrpan / tetris_mcts

dylandjian / SuperGo

DataCanvasIO / Hypernets

QueensGambit / CrazyAra

vgarciasc / mcts-viz

sungyubkim / Deep_RL_with_pytorch

initial-h / AlphaZero_Gomoku_MPI

thuxugang / doudizhu

kaesve / muzero

akolishchak / doom-net-pytorch

zjeffer / chess-deep-rl

manyoso / allie

PuYuuu / vehicle-interaction-decision-making

Improve this page

Add this topic to your repo