Overview

This repository contains the projects that I completed as part of the Deep Reinforcement Learning course offered by HuggingFace. The course covered various aspects of deep reinforcement learning, including Q-learning, policy gradients, actor-critic methods, and deep deterministic policy gradients (DDPG).

The projects in this repository demonstrate the application of these techniques to different environments. Each project includes README.md file that explains the project detailes.

Projects

Vizdoom Health Gathering Supreme: This project uses a Proximal Policy Optimization (PPO) algorithm to train an agent to gather health in the Vizdoom environment.
Reinforce Pixelcopter PLE v0: This project uses a policy gradient algorithm to train an agent to fly a pixelcopter in the PLE environment.
A2C PandaReachDense-v2: This project uses an Advantage Actor-Critic (A2C) algorithm to train an agent to reach a goal with a panda arm in the MuJoCo environment.
DQN SpaceInvadersNoFrameskip-v4: This project uses a Deep Q-Network (DQN) algorithm to train an agent to play Space Invaders in the Atari environment.
Q-Taxi-v3: This project uses a Q-learning algorithm to train an agent to navigate a taxi in the OpenAI Gym environment.
A2C AntBulletEnv-v0: This project uses an A2C algorithm to train an agent to control an ant in the Bullet environment.
PPO Pyramids: This project uses a PPO algorithm to train an agent to navigate a pyramid in the Gym environment.
PPO SnowballTarget: This project uses a PPO algorithm to train an agent to throw snowballs at a target in the Gym environment.
CartPole-v1: This project uses a Q-learning algorithm to train an agent to balance a pole on a moving cart.
Q-FrozenLake-v1-4x4-noSlippery: This project uses a Q-learning algorithm to train an agent to navigate a frozen lake.
PPO Huggy: This project uses a PPO algorithm to train an agent to hug a teddy bear in the Gym environment.
PPO LunarLander-v2: This project uses a PPO algorithm to train an agent to land a lunar lander safely.

Requirements

Python 3
NumPy
PyTorch

You can find the original repos here: https://huggingface.co/arhamk

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
CartPole-v1		CartPole-v1
Reinforce-Pixelcopter-PLE-v0		Reinforce-Pixelcopter-PLE-v0
a2c-AntBulletEnv-v0		a2c-AntBulletEnv-v0
a2c-PandaReachDense-v2		a2c-PandaReachDense-v2
dqn-SpaceInvadersNoFrameskip-v4		dqn-SpaceInvadersNoFrameskip-v4
ppo-Huggy		ppo-Huggy
ppo-LunarLander-v2		ppo-LunarLander-v2
ppo-Pyramids		ppo-Pyramids
ppo-SnowballTarget		ppo-SnowballTarget
q-FrozenLake-v1-4x4-noSlippery		q-FrozenLake-v1-4x4-noSlippery
q-Taxi-v3		q-Taxi-v3
vizdoom_health_gathering_supreme		vizdoom_health_gathering_supreme
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CartPole-v1

CartPole-v1

Reinforce-Pixelcopter-PLE-v0

Reinforce-Pixelcopter-PLE-v0

a2c-AntBulletEnv-v0

a2c-AntBulletEnv-v0

a2c-PandaReachDense-v2

a2c-PandaReachDense-v2

dqn-SpaceInvadersNoFrameskip-v4

dqn-SpaceInvadersNoFrameskip-v4

ppo-Huggy

ppo-Huggy

ppo-LunarLander-v2

ppo-LunarLander-v2

ppo-Pyramids

ppo-Pyramids

ppo-SnowballTarget

ppo-SnowballTarget

q-FrozenLake-v1-4x4-noSlippery

q-FrozenLake-v1-4x4-noSlippery

q-Taxi-v3

q-Taxi-v3

vizdoom_health_gathering_supreme

vizdoom_health_gathering_supreme

LICENSE

LICENSE

README.md

README.md

Repository files navigation

Overview

Projects

Requirements

About

Releases

Packages

License

arham-kk/HF-RL-Course

Folders and files

Latest commit

History

Repository files navigation

Overview

Projects

Requirements

About

Topics

Resources

License

Stars

Watchers

Forks