Simple reinforcement learning tutorials from 莫烦Python (Morvan Python): AI tutorials in Chinese
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, and TRPO. (More algorithms are still in progress.)
PyTorch C++ Reinforcement Learning
A PyTorch library for building deep reinforcement learning agents.
Recurrent and multi-process PyTorch implementation of the deep reinforcement learning Actor-Critic algorithms A2C and PPO
Trading Environment (OpenAI Gym) + PPO (TensorForce)
Deep Reinforcement Learning (PPO) in Autonomous Driving (Carla) [from scratch]
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Curiosity-driven Exploration by Self-supervised Prediction
Proximal Policy Optimization (PPO) algorithm for Contra
Implementation of Generative Adversarial Imitation Learning (GAIL) for classic environments from OpenAI Gym.
Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM)
Code for the paper "Reinforced Curriculum Learning for Autonomous Driving in CARLA" (ICIP 2021)
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.