Brax + PPO integration #313

vwxyzjn · 2022-11-06T19:56:24Z

Description

Test out the integration with Brax. It seems to work out of the box without having to implement observation normalization:
https://wandb.ai/costa-huang/cleanRL/runs/2aemjwey?workspace=user-costa-huang

Compilation takes ~400 seconds, and reaching 6000 rewards in Ant takes about 100 seconds on a GPU. In comparison, the official demo takes 30 seconds to compile and about 80 seconds to reach ~8000 rewards (using a TPU, I presume). Our compilation takes significantly longer, most likely because we didn't use jax.lax.scan or jax.lax.fori_loop, but once compilation finishes the SPS (steps per second) is about 600k.
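To illustrate why the missing jax.lax.scan is the likely cause of the long compilation, here is a minimal sketch, not the code in this PR. toy_step is a hypothetical stand-in for an environment step (it is not Brax's API), and the shapes are arbitrary. A Python for loop inside jax.jit is unrolled during tracing, so the compiled program grows with the number of steps, whereas jax.lax.scan traces the step function only once.

```python
import jax
import jax.numpy as jnp


def toy_step(state, action):
    # Hypothetical stand-in for an environment step; NOT Brax's API.
    next_state = 0.9 * state + 0.1 * action
    reward = -jnp.sum(next_state**2)
    return next_state, reward


@jax.jit
def rollout_unrolled(state, actions):
    # A Python loop is unrolled at trace time: toy_step is traced once per
    # iteration, so the graph XLA has to compile grows with the rollout length.
    rewards = []
    for t in range(actions.shape[0]):
        state, reward = toy_step(state, actions[t])
        rewards.append(reward)
    return state, jnp.stack(rewards)


@jax.jit
def rollout_scan(state, actions):
    # lax.scan traces toy_step once and lowers to a single loop construct,
    # which usually keeps compile time roughly independent of rollout length.
    return jax.lax.scan(toy_step, state, actions)


state0 = jnp.ones(4)
actions = jnp.zeros((16, 4))  # 16 steps, 4-dimensional actions (arbitrary)

final_unrolled, rewards_unrolled = rollout_unrolled(state0, actions)
final_scan, rewards_scan = rollout_scan(state0, actions)
```

Both versions compute the same rollout; only the way the loop is presented to the compiler differs. Whether this accounts for the full gap to the official demo has not been measured here.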

CC @joaogui1

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2022-11-06T19:56:26Z

The latest updates on your projects.

Name	Status	Preview	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	Nov 6, 2022 at 9:28PM (UTC)

Surya-77 · 2024-03-13T13:47:48Z

Hi @vwxyzjn,

I hope you're doing well. I was reviewing this PR (Brax + PPO integration #313) and noticed that it's currently closed. Were there any difficulties in merging this change into the main repository? Also, is there an updated version of this integration that addresses any issues or incorporates newer changes? Looking forward to your response.

Best regards,
Surya

Brax + PPO integration

5d4c95d

update dependencies

8223596

vercel bot deployed to Preview November 6, 2022 21:28 View deployment
