# reinforce

Here are 103 public repositories matching this topic...

MehranTaghian / policy-gradient-methods

Implementation of some of the policy gradient methods in PyTorch.

pytorch policy-gradient reinforce actor-critic ppo online-supervised-learning gradient-bandit batch-reinforce

Updated Jul 27, 2022
Python

dmsovetov / reinforcement

Deep reinforcement learning experiments

reinforcement-learning qlearning pytorch deeplearning a3c reinforce a2c

Updated Mar 11, 2024
Python

yueying-teng / pong_with_policy_gradients

reinforcement-learning pong pytorch gym policy-gradient reinforce

Updated Aug 30, 2023
Python

vigneshramk / A2C-Reinforce-Behavior-Cloning

reinforcement-learning-algorithms reinforce imitation-learning pytorch-rl a2c

Updated Jul 5, 2018
Python

Bharath2 / SimpleRL

Simple yet efficient implementations of model-free reinforcement learning algorithms in PyTorch

reinforcement-learning dqn rl reinforce ddpg sac ppo

Updated Nov 15, 2021
Python

CS486-RL-Poker-Agent / bismuth

An RL agent using policy gradient to learn no-limit Texas hold'em.

reinforcement-learning poker policy-gradient reinforce

Updated Mar 11, 2024
Python

huiwenzhang / rl-benchmark

Simple and compact implementations of reinforcement learning benchmark algorithms

dqn reinforce actor-critic ppo

Updated Jun 9, 2018
Python

siddk / rl-kitchen-sink

PyTorch Implementations of Standard Deep RL Algorithms (including REINFORCE, A2C, PPO)

reinforcement-learning pytorch reinforcement-learning-algorithms reinforce pytorch-rl ppo a2c

Updated Sep 11, 2018
Python

smbyun0214 / RL-Testbed

reinforcement-learning reinforce ddpg trpo ppo pendulum-v0

Updated Aug 8, 2021
Python

akashkmr27089 / Behavior_Cloning

Behaviour cloning on an OpenAI Gym environment

python reinforcement-learning openai-gym policy-gradient reinforce lunar-lander lunarlander-v2 behaviour-clonning daggr

Updated Jun 29, 2020
Jupyter Notebook

mew-two-github / CS6700-Project

Implementation of REINFORCE for the OpenAI Gym Acrobot environment, epsilon-greedy Q-learning for the OpenAI Gym Taxi environment, and TD(0) for a custom game-show environment, KBC.

reinforcement-learning q-learning policy-gradient reinforcement-learning-algorithms reinforce temporal-differencing-learning reinforcement-learning-agent open-ai-gym reinforcement-learning-environments

Updated Dec 1, 2021
Python

tairtahar / Reinforce

REINFORCE is a gradient-based reinforcement learning algorithm for policy learning. It can be applied to both continuous and discrete environments.

python reinforcement-learning pytorch reinforce gradient-policy

Updated Nov 6, 2021
Jupyter Notebook
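
For readers unfamiliar with the algorithm named in the tairtahar / Reinforce entry, here is a minimal sketch of the REINFORCE gradient estimate. It is not taken from any repository listed here. It assumes a state-independent softmax policy over discrete actions (a deliberate simplification), and the function names `discounted_returns`, `softmax`, and `reinforce_gradient` are hypothetical, chosen only for illustration.

```python
import math


def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * r_{t+1} + ... for every step t."""
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the next one.
    for r in reversed(rewards):
        running = r + gamma * running
        returns.append(running)
    returns.reverse()
    return returns


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_gradient(logits, actions, rewards, gamma=0.99):
    """Estimate the gradient of expected return w.r.t. the policy logits.

    Assumption: the policy ignores the state, so the gradient of
    log pi(a) with respect to logit i is (1 if i == a else 0) - pi(i).
    """
    probs = softmax(logits)
    grad = [0.0] * len(logits)
    for action, g in zip(actions, discounted_returns(rewards, gamma)):
        for i in range(len(logits)):
            indicator = 1.0 if i == action else 0.0
            grad[i] += g * (indicator - probs[i])
    return grad
```

A gradient-ascent step would add a small multiple of this estimate to the logits. Practical implementations typically replace the logits with a neural network that conditions on the state and let an autodiff library compute the same quantity.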

Anjali001 / Reinforcement-Learning

reinforcement-learning policy-gradient reinforce greedy-algorithm td-learning sarsa-learning td-lambda exploration-exploitation epsilon-greedy-exploration ucb-algorithm

Updated May 16, 2022
Jupyter Notebook

dknathalage / deep-rl

My implementations of popular reinforcement learning methods based on other developers and research papers.

pytorch a3c reinforce proximal-policy-optimization a2c pytorch-implementation reinfrocement-learning

Updated Aug 22, 2020
Python

Directorman9 / Gym-minigrid-games

This notebook trains an agent to navigate a maze and reach a desired destination. It uses Gym-MiniGrid's fourRoom-v0 environment as the maze. The agent is trained with the vanilla policy gradient (REINFORCE) reinforcement learning algorithm.

reinforcement-learning gym minigrid reinforce vanilla-policy-gradient

Updated May 1, 2022

dhananjaisharma10 / Policy-Gradient-Methods

Reinforcement Learning: Policy Gradient Methods

policy-gradient reinforce actor-critic

Updated Jan 30, 2020
Python

thebrownfrog / snake-first-RL-project

Bot averaging >60 apples on a 10x10 map with 2 apples on the map at a time

reinforcement-learning reinforce reinforce-algorithm

Updated May 29, 2024
Python

jbecke / Open-AI-Gym-ABCs

Simple, well-commented PyTorch implementations of the REINFORCE and actor-critic RL methods.

reinforcement-learning pytorch reinforce actor-critic pytorch-tutorial

Updated Jul 4, 2018
Python

gianluca-maselli / REINFORCE

deep-reinforcement-learning pytorch reinforce

Updated Apr 4, 2023
Python

Twice22 / Reinforcement-Learning

My reports for the reinforcement learning class given at the ENS

reinforcement-learning policy-gradient reinforce policy-iteration value-iteration ucb1

Updated Jan 16, 2018
Jupyter Notebook
