[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
-
Updated
Jun 5, 2024 - Python
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
MiniZero: An AlphaZero and MuZero Training Framework
datasets for computer go
An implementation of the MuZero algorithm by Google Deepmind. Research paper here: https://arxiv.org/abs/1911.08265
A C++ pytorch implementation of MuZero
Trains a deep reinforcement learning agent in simulation testbed environments with the DRLA library.
Trains deep reinforcement learning agents in Atari environments via the DRLA library.
C++ Deep Reinforcement Learning Agent library
Deep Q Learning blackbox strategies for casino games
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
A Notebook implementation of the Pseudocode from the original Muzero paper
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction
GenesisZERO : potential applications for MCTS agents with LLMs for Sequential decision-making
A PyTorch implementation of DeepMind's MuZero agent
Generalized AI to perform a multitude of tasks written in python3
Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and observation spaces, including both discrete and continuous variations.
Add a description, image, and links to the muzero topic page so that developers can more easily learn about it.
To associate your repository with the muzero topic, visit your repo's landing page and select "manage topics."