Releases: LucasAlegre/morl-baselines
Releases · LucasAlegre/morl-baselines
MORL-Baselines 1.0.0
This release marks the first stable version of MORL-Baselines. After having thoroughly tested the algorithms on various environments fixing bugs for the past few weeks. We feel the library is stable enough to deserve a proper release.
Features
- Over 10 MORL algorithms supported under the MO-Gymnasium API (multi & single policy, under SER and ESR criteria);
- Automated reporting to Weights and Biases dashboards... of various metrics (see screenshot below);
- Clean, documented, and tested code, and this is enforced by our CI hooks;
- Utility functions to help researchers build new algorithms, e.g.
ParetoArchive
,NatureCNN
,PrioritizedReplayBuffer
; - Performances have been tested and reported in a reproducible manner: see #43 and https://wandb.ai/openrlbenchmark/MORL-Baselines.
Example of our dashboards: Pareto front and multi-objective metrics are visible in real-time.
1.0.0-rc2 bugfixes and enhancements
What's Changed
- Change PQL to linearly decaying exploration by @ffelten in #48
- Refactor random seed by @LucasAlegre in #49
- Recover from solver error in OLS by @ffelten in #51
Full Changelog: 1.0.0-rc1...1.0.0-rc2
1.0.0-rc1 Stabilizing and performance assessment
First release candidate aiming at stabilizing and reporting the performances of the algorithms in the codebase. We aim to fix bugs as we encounter them when assessing performances and bumping RC numbers along the way. Once we have finished the performance assessments, we should be able to release 1.0.0.