Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add SPR implementation to atari_100k lab #184

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

MaxASchwarzer
Copy link

I'd like to add support for SPR to the Atari 100k lab project. My implementation is mostly siloed to minimize changes to the existing algorithms. SPR needs a custom model-based style replay buffer that returns subtrajectories, and a version of noisy nets that allows noise to be toggled on and off within a function, both of which are included.

This implementation performs a bit better than the original version; I found median 0.45 over 100 seeds, compared to 0.395 for the original.

@psc-g @agarwl

@google-cla
Copy link

google-cla bot commented Sep 24, 2021

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project (if not, look below for help). Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed (or fixed any issues), please reply here with @googlebot I signed it! and we'll verify it.


What to do if you already signed the CLA

Individual signers
Corporate signers

ℹ️ Googlers: Go here for more info.

@google-cla google-cla bot added the cla: no label Sep 24, 2021
@MaxASchwarzer
Copy link
Author

@googlebot I signed it!

@google-cla google-cla bot added cla: yes CLA has been signed. and removed cla: no labels Sep 29, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cla: yes CLA has been signed.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant