Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[meta] Deep-RL work #14

Open
stites opened this issue Aug 1, 2017 · 0 comments
Open

[meta] Deep-RL work #14

stites opened this issue Aug 1, 2017 · 0 comments

Comments

@stites
Copy link
Contributor

stites commented Aug 1, 2017

This is going to be a meta branch to discuss any deep reinforcement learning implementations of algorithms to write, as well as any structural changes to the folder structure if people want to work on this and things break.

Currently, in deep-rl, there exists:

  • an unused reporting server in exe (ideally to kick off a haskell-variant of tensorboard)
  • the beginnings of some mypy type stubs for certain libraries (although mypy sometimes fails)
  • a3c (py)
  • ddpg (py)
  • dqn (py, in progress tensorflow-hs)
  • deep recurrent q-nets (py)
  • a model-based network (py)
  • a q-network (py, tensorflow-hs, backprop)
  • reinforce (py, in-progress tensorflow-hs)

In a different repository, there exists tensorflow-less versions of:

  • a2c
  • actor-critic
  • CEM
  • reinforce

So there are a few things to do here. One is to finish porting the python to haskell variants (tensorflow-haskell, and backprop -- optionally grenade, but I've had it blow up in my face already). The other is to implement more python algorithms, but these will be subject to certain constraints since the ultimate goal is to port to haskell. To hit parity with rllab*, we would still need:

  • Truncated Natural Policy Gradient
  • Reward-Weighted Regression
  • Relative Entropy Policy Search
  • Trust Region Policy Optimization
  • Covariance Matrix Adaption Evolution Strategy

* hitting parity with rllab would be nice since it would allow us to benchmark with results from _Yan Duan, Xi Chen, Rein Houthooft, John Schulman, Pieter Abbeel. "Benchmarking Deep Reinforcement Learning for Continuous Control". _

@stites stites changed the title [meta ticket] Deep-RL work [meta] Deep-RL work Aug 1, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant