# policy-gradient

Here are 410 public repositories matching this topic...

greydanus / rlzoo

A central location for my reinforcement learning experiments

reinforcement-learning minimal notebook tensorflow openai-gym policy-gradient pong-game

Updated Nov 2, 2016
Jupyter Notebook

keon / CodeGAN

[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks

deep-learning paper recurrent-neural-networks policy-gradient rnn

Updated Jan 7, 2017
Python

fukuta0614 / chainer-SeqGAN

Implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient

reinforcement-learning policy-gradient chainer-seqgan

Updated Feb 4, 2017
Jupyter Notebook

Fritz449 / Asynchronous-RL-agent

An attempt to implement a strong RL agent

machine-learning reinforcement-learning deep-learning deep-reinforcement-learning q-learning dqn policy-gradient a3c

Updated Mar 6, 2017
Python

emphasismemorandum / windows-itpro-docs

This is used for contributions to the Windows 10 TechNet content for IT professionals.

legacy political-science policy-gradient articles implementation golden-master legal-writing

Updated Mar 7, 2017
PowerShell

vkramanuj / atari-rl

Reinforcement learning algorithms for OpenAI Gym games.

reinforcement-learning tensorflow policy-gradient

Updated Mar 12, 2017
Python

BangaloreSharks / SharkStock

Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement …

trading-bot deep-reinforcement-learning recurrent-neural-networks stock-market policy-gradient convolutional-neural-networks sentiment-classification

Updated Apr 27, 2017
Python

mbednarski / Chiron

Reinforcement learning solutions

machine-learning reinforcement-learning qlearning openai-gym gym policy-gradient reinforcement-learning-algorithms sarsa

Updated Apr 29, 2017
Python

yashbhutwala / pong-ai

Deep Q-Learning Networks vs. Policy Gradient Learning in OpenAI Gym's Pong Environment

python tensorflow numpy pong openai-gym policy-gradient deep-q-learning

Updated May 2, 2017
Python

angusfung / cartpole-AI-RL-policy-gradient

Implementation of OpenAI's Cart Pole, adjusted to incorporate reinforcement learning with policy gradients.

reinforcement-learning ai openai-gym policy-gradient cartpole

Updated May 6, 2017
Python

hartikainen / easy21

Reinforcement learning agents and environment for Easy21, a modified version of Blackjack

reinforcement-learning monte-carlo policy-gradient sarsa easy21

Updated May 7, 2017
Python

bhanuvikasr / Deep-RL-TORCS

Autonomous Navigation using Deep Reinforcement Learning

deep-reinforcement-learning policy-gradient deep-q-network actor-critic

Updated May 23, 2017
Python

TNieuwdorp / Thesis

My thesis work on exploring the performance impact of the exploration strategy

python machine-learning tensorflow openai-gym policy-gradient reinforce

Updated May 29, 2017
Python

nailo2c / pycon-2017-tutorial-rl

Example code for Logistic Regression, MLP, and Policy Gradient for PyCon 2017

neural-network tensorflow taiwan keras policy-gradient logistic-regression pycon2017

Updated May 31, 2017
Jupyter Notebook

Scitator / rl-course-experiments

reinforcement-learning deep-learning neural-network tensorflow monte-carlo genetic-algorithm deep-reinforcement-learning policy-gradient deep-q-network asynchronous-advantage-actor-critic temporal-differencing-learning

Updated Jun 7, 2017
Jupyter Notebook

xwhan / dialogue_generation

Emotional dialogue generation

tensorflow policy-gradient seq2seq-chatbot dialogue-systems

Updated Jun 13, 2017
Python

falcondai / mnist

deep models for small image classification datasets

tensorflow mnist supervised-learning policy-gradient convolutional-neural-networks gumbel-softmax

Updated Jun 21, 2017
Python

aravindreddypv / Pong-policy

reinforcement-learning pong openai-gym policy-gradient

Updated Jul 13, 2017
Python

KokoMind / A3C-TF

Implementing Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning" using TensorFlow

reinforcement-learning policy-gradient a3c

Updated Aug 7, 2017
Python

rameshjes / RobotLearning

reinforcement-learning policy-gradient dynamic-programming markov-decision-processes bandits sarsa-lambda

Updated Aug 16, 2017
Jupyter Notebook
