Skip to content

Applied various Reinforcement Learning (RL) algorithms to determine the optimal policy for diverse Markov Decision Processes (MDPs) specified within the OpenAI Gym library

License

Notifications You must be signed in to change notification settings

Anshumaan-Chauhan02/RL-Project

Repository files navigation

Reinforcement Learning (RL) Project

Project Description

We employ the Actor-Critic, Reinforce with Baseline, and Episodic n-step SARSA algorithms to acquire an optimal policy for distinct Markov Decision Processes (MDPs), specifically, MountainCar-v0, Acrobot-v0, and CarPole-v1 from the OpenAI Gym library. Systematic experimentation has been conducted on both hyperparameters and model architecture, leading in the presentation of results for the most effective configuration.

Technical Skills

Python OpenAI Gym PyTorch Matplotlib Jupyter Notebook

Dependencies

OpenAI Gym
  !pip install gym
PyTorch (Check CPU/GPU Compatibility)
  https://pytorch.org/get-started/locally/
NumPy
  !pip install numpy
Matplotlib
  !pip install matplotlib

File Contents

  • Actor Critic Final.py
    • Contains the implementation of the Actor-Critic algorithm, a reinforcement learning technique combining policy (Actor) and value function (Critic) approximation to enhance learning efficiency.
  • REINFORCE with Baseline Final.py:
    • Encompasses the implementation of the REINFORCE algorithm with Baseline, a policy gradient method incorporating a baseline to reduce variance in gradient estimates.
  • Semi-Gradient-SARSA Final.py
    • Houses the implementation of the Semi-Gradient-SARSA algorithm, a temporal difference learning method applied in reinforcement learning scenarios for updating Q-values and optimizing policy.

About

Applied various Reinforcement Learning (RL) algorithms to determine the optimal policy for diverse Markov Decision Processes (MDPs) specified within the OpenAI Gym library

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages