proximal-policy-optimization

Here are 190 public repositories matching this topic...

mohith-sakthivel / sufficient-ppo

Clean and flexible implementation of PPO (built on top of stable-baselines3)

reinforcement-learning pytorch proximal-policy-optimization ppo openai-baselines ppo2 stable-baselines3

Updated Jul 9, 2021
Python

escribano89 / unir_tfm_reinforcement_learning

Repository for content related to the master's thesis developed in the Master's in Artificial Intelligence at the Universidad Internacional de La Rioja (UNIR).

machine-learning reinforcement-learning deep-learning robotics deep-reinforcement-learning artificial-intelligence policy-gradient proximal-policy-optimization td3 reinforcement-learning-environments ppo-pytorch

Updated Oct 18, 2022
Jupyter Notebook

sunoh-kim / deep-reinforcement-learning

This repository contains my assignment solutions for the Deep Reinforcement Learning course (430.729_003) offered by Seoul National University (Spring 2020).

deep-reinforcement-learning imitation-learning deep-q-learning deep-deterministic-policy-gradient proximal-policy-optimization

Updated Apr 10, 2022
Jupyter Notebook

Ezgii / PPO-on-pendulum

Training a PPO agent to balance a pendulum in a fully observable environment.

reinforcement-learning openai-gym pytorch pendulum proximal-policy-optimization ppo

Updated May 31, 2023
Python

alexsasu / Flappy-Bird-Agent

Reinforcement learning agent for playing Flappy Bird, as part of a university project

reinforcement-learning flappy-bird proximal-policy-optimization

Updated Feb 25, 2024
C#

blahBlahhhJ / ProjectProcgen

A PyTorch project to easily run experiments on OpenAI's Procgen Benchmark

reinforcement-learning pytorch proximal-policy-optimization

Updated May 20, 2021
Python

ruchitapaithankar15 / marioAI-Gaming-Reinforcement-Learning

Built and trained a model to play Super Mario using OpenAI Gym and an NES emulator. Optimized the model using preprocessing techniques and vectorization. The reinforcement learning algorithm used is PPO (Proximal Policy Optimization).

python reinforcement-learning ai openai-gym nes-emulator openai proximal-policy-optimization

Updated Mar 21, 2023
Jupyter Notebook

pmistry9597 / Reinforcement-Learning-Algo-Demo

A demonstration of some prominent reinforcement learning algorithms

reinforcement-learning openai-gym policy-gradient deep-q-network proximal-policy-optimization

Updated Mar 28, 2023
Python

ays-dev / lunarlander-pytorch

Single-file implementation of the PPO deep reinforcement learning algorithm on the LunarLander-v2 environment

python machine-learning deep-neural-networks reinforcement-learning deep-learning torch python3 pytorch gym proximal-policy-optimization ppo lunar-lander

Updated Jul 13, 2023
Python

nslyubaykin / relax_ppo_example

Example PPO implementation with ReLAx

reinforcement-learning gae policy-gradient reinforcement-learning-algorithms continuous-control proximal-policy-optimization ppo generalized-advantage-estimation discrete-control

Updated Aug 29, 2022
Jupyter Notebook

GenerativeAIAffiliates / AskAboutSymptomsGPT

Ask About Symptoms is an LLM with an in-depth understanding of health. Siraj Raval, the creator of the original version (known as DoctorGPT), says it works offline, is cross-platform, and keeps health data private. We are learning how to build this in our community.

python ios cmake deep-learning cplusplus compiler xcode conda pip tensor source-code quantization submodule fine-tuning proximal-policy-optimization tvm hugging-face llama2

Updated Aug 12, 2023
Jupyter Notebook

arthur-x / SimplyPPO

SimplyPPO replicates Proximal Policy Optimization in a minimal amount of code (~250 lines) written in clean, readable PyTorch style, using as few additional tricks and hyperparameters as possible (PyBullet benchmarks included).

proximal-policy-optimization pybullet-benchmarks

Updated Apr 19, 2023
Python

GiorgiaAuroraAdorni / CAT-optimal-hybrid-solver

The CAT Optimal Hybrid Solver is a tool for tackling the Cross Array Task (CAT), an activity designed to assess algorithmic thinking skills in K-12 education.

reinforcement-learning clustering problem-solving depth-first-search random-search computational-thinking proximal-policy-optimization hybrid-approach

Updated Oct 17, 2023
Python

Raiszo / ppo-4Quadrotor

PPO implementation for the cable suspended load quadrotor

reinforcement-learning quadcopter tensorflow proximal-policy-optimization cable-suspended-load

Updated Jan 9, 2020
Python

1jsingh / rl_reacher

Train double-jointed arms to reach target locations using Proximal Policy Optimization (PPO) in PyTorch

pytorch ddpg proximal-policy-optimization ppo unity-environment reacher-environment

Updated May 3, 2019
Jupyter Notebook

MichaelFish199 / SonicTheHedgehog2-ReinforcmentLearning

This project implements an agent that plays the SonicTheHedgehog2 game from a ROM file using the Proximal Policy Optimization (PPO) algorithm from the stable-baselines3 library. The agent is trained to learn which action to take at each step of the game in order to complete the level and maximize the score.

game reinforcement-learning sonic-the-hedgehog proximal-policy-optimization rom-files stable-baselines3 rewards-and-scoring game-playing-agents