A robust variant of MuZero
-
Updated
Mar 22, 2021 - Python
A robust variant of MuZero
Materials for AlphaGo
Deep Q Learning blackbox strategies for casino games
Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction
An implementation of the MuZero algorithm by Google Deepmind. Research paper here: https://arxiv.org/abs/1911.08265
Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023
A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.
Trains deep reinforcement learning agents in Atari environments via the DRLA library.
A Notebook implementation of the Pseudocode from the original Muzero paper
muzero Algorithm Reinforcement Learning for Chinese XiangQi
Simple Muesli RL algorithm implementation (PyTorch)
Trains a deep reinforcement learning agent in simulation testbed environments with the DRLA library.
GenesisZERO : potential applications for MCTS agents with LLMs for Sequential decision-making
Meta-learning experiments for the game of minichess and related rule variants.
Add a description, image, and links to the muzero topic page so that developers can more easily learn about it.
To associate your repository with the muzero topic, visit your repo's landing page and select "manage topics."