A central location for my reinforcement learning experiments
Updated Nov 2, 2016 - Jupyter Notebook
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
An attempt to implement a strong RL agent
This is used for contributions to the Windows 10 TechNet content for IT professionals.
Reinforcement learning algorithms for OpenAI Gym games.
Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …
Reinforcement learning solutions
Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment
Implementation of OpenAI's Cart Pole, adjusted to incorporate reinforcement learning with policy gradients.
Reinforcement learning agents and environment for Easy21, a modified version of Blackjack
Autonomous Navigation using Deep Reinforcement Learning
My thesis work on exploring the performance impact of the exploration strategy
Example code of Logistic Regression, MLP, Policy Gradient for 2017 PyCon
Emotional dialogue generation
Deep models for small image classification datasets
Implementing Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning" using TensorFlow.