muzero

Star

Here are 40 public repositories matching this topic...

hayashimasa / Robust_MuZero

Star

A robust variant of MuZero

deep-reinforcement-learning pytorch robust-control muzero

Updated Mar 22, 2021
Python

BIGBALLON / Toward-AGZ

Star

Materials for AlphaGo

deep-learning deep artificial-intelligence alphago alphago-zero muzero

Updated Mar 10, 2022

rystrauss / dopamax

Star

Reinforcement learning in pure JAX.

reinforcement-learning dqn mcts ppo podracer alphazero jax muzero brax anakin dopamax

Updated Dec 30, 2023
Python

souvikshanku / tic-tac-toe-zero

Star

MuZero - tic-tac-toe

tic-tac-toe muzero

Updated Jun 2, 2024
Python

AntoniovanDijck / BlackJackRL

Star

Deep Q Learning blackbox strategies for casino games

machine-learning deep-neural-networks reinforcement-learning deep-learning tensorflow blackjack deep-reinforcement-learning torch reinforcement-learning-algorithms deep-q-network mlx rlx q-learning-algorithm muzero

Updated Mar 22, 2024
Jupyter Notebook

ChukwumaChukwuma / enyimba_ai

Star

Applying AlphaZero Self-Play Tactics to LLaMA for Enhanced Chatbot Interaction

machine-learning natural-language-processing reinforcement-learning ai chatbot artificial-intelligence strategy policy-evaluation alphazero muzero prompt-engineering llms generative-ai rlhf llama2

Updated Jan 5, 2024
Python

CogitoNTNU / MuZero

Star

An implementation of the MuZero algorithm by Google Deepmind. Research paper here: https://arxiv.org/abs/1911.08265

muzero

Updated May 8, 2024
Python

seawee1 / efficientalphazero

Star

AlphaZero for singleplayer environments implemented efficiently using Ray

mcts ray alphago alphazero muzero

Updated Apr 4, 2023
Python

fpga-tom / pyzero

Star

muzero

Updated Jan 6, 2020
C++

sail-sg / rosmo

Star

Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023

reinforcement-learning atari arcade-learning-environment model-based-rl jax model-based-reinforcement-learning bsuite muzero dm-haiku offline-rl offline-reinforcement-learning rl-unplugged muzero-unplugged

Updated Jul 18, 2023
Python

jianzhnie / RLZero

Star

A clean and easy implementation of MuZero, AlphaZero and Self-Play reinforcement learning algorithms for any game.

reinforcement-learning multi-agent mcts alpha-zero self-play muzero

Updated Mar 11, 2024
Python

benborder / drla-atari

Star

Trains deep reinforcement learning agents in Atari environments via the DRLA library.

reinforcement-learning cpp deep-reinforcement-learning pytorch atari ppo dreamer libtorch muzero

Updated Apr 27, 2024
C++

abrahamabel / Muzero-GDM_Pseudo_Code

Star

A Notebook implementation of the Pseudocode from the original Muzero paper

python jupyter-notebook mcts muzero muzero-pseudocode

Updated Jan 14, 2024
Jupyter Notebook

liudengfeng / mrlxq

Star

muzero Algorithm Reinforcement Learning for Chinese XiangQi

reinforcement-learning ray rllib muzero xiqngqi

Updated May 1, 2023
Python

svenssona / muzero

Star

Learning how muzero works

reinforcement-learning muzero

Updated Sep 4, 2021
Jupyter Notebook

Itomigna2 / Muesli-cartpole

Star

Simple Muesli RL algorithm implementation (PyTorch)

reinforcement-learning deep-learning colab muesli model-based-rl cartpole-v1 muzero

Updated Jan 24, 2023
Jupyter Notebook

benborder / drla-sim

Star

Trains a deep reinforcement learning agent in simulation testbed environments with the DRLA library.

reinforcement-learning cpp deep-reinforcement-learning pytorch cartpole connect4 ppo dreamer libtorch muzero

Updated Apr 27, 2024
C++

abrahamabel / GenesisZero

Star

GenesisZERO : potential applications for MCTS agents with LLMs for Sequential decision-making

reinforcement-learning deep-reinforcement-learning gym reinforcement-learning-algorithms monte-carlo-tree-search gym-environment reinforcement-learning-agent alphazero mcts-algorithm muzero large-language-models llm llms stochastic-muzero muzero-stochastic llm-agent

Updated Dec 16, 2023

Atze00 / muzero-cartpole

Star

reinforcement-learning cartpole replay muzero

Updated Dec 28, 2020
Python

mdhiebert / meta-minichess

Star

Meta-learning experiments for the game of minichess and related rule variants.

meta-learning alphazero muzero minichess meta-minichess gym-minichess

Updated Oct 9, 2021
Python

Improve this page

Add a description, image, and links to the muzero topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the muzero topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

muzero

Here are 40 public repositories matching this topic...

hayashimasa / Robust_MuZero

BIGBALLON / Toward-AGZ

rystrauss / dopamax

souvikshanku / tic-tac-toe-zero

AntoniovanDijck / BlackJackRL

ChukwumaChukwuma / enyimba_ai

CogitoNTNU / MuZero

seawee1 / efficientalphazero

fpga-tom / pyzero

sail-sg / rosmo

jianzhnie / RLZero

benborder / drla-atari

abrahamabel / Muzero-GDM_Pseudo_Code

liudengfeng / mrlxq

svenssona / muzero

Itomigna2 / Muesli-cartpole

benborder / drla-sim

abrahamabel / GenesisZero

Atze00 / muzero-cartpole

mdhiebert / meta-minichess

Improve this page

Add this topic to your repo