Releases · LucasAlegre/morl-baselines

12 Jun 16:06

ffelten

1.0.0

867d3e9

Latest

This release marks the first stable version of MORL-Baselines. After having thoroughly tested the algorithms on various environments fixing bugs for the past few weeks. We feel the library is stable enough to deserve a proper release.

Features

Over 10 MORL algorithms supported under the MO-Gymnasium API (multi & single policy, under SER and ESR criteria);
Automated reporting to Weights and Biases dashboards... of various metrics (see screenshot below);
Clean, documented, and tested code, and this is enforced by our CI hooks;
Utility functions to help researchers build new algorithms, e.g. ParetoArchive, NatureCNN, PrioritizedReplayBuffer;
Performances have been tested and reported in a reproducible manner: see #43 and https://wandb.ai/openrlbenchmark/MORL-Baselines.

Example of our dashboards: Pareto front and multi-objective metrics are visible in real-time.

Assets 2

04 Apr 09:51

ffelten

1.0.0-rc2

50ffbc6

1.0.0-rc2 bugfixes and enhancements Pre-release

Pre-release

What's Changed

Change PQL to linearly decaying exploration by @ffelten in #48
Refactor random seed by @LucasAlegre in #49
Recover from solver error in OLS by @ffelten in #51

Full Changelog: 1.0.0-rc1...1.0.0-rc2

Contributors

ffelten and LucasAlegre

Assets 2

28 Mar 17:14

ffelten

1.0.0-rc1

345dd72

1.0.0-rc1 Stabilizing and performance assessment Pre-release

Pre-release

First release candidate aiming at stabilizing and reporting the performances of the algorithms in the codebase. We aim to fix bugs as we encounter them when assessing performances and bumping RC numbers along the way. Once we have finished the performance assessments, we should be able to release 1.0.0.

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Features

What's Changed

Contributors

Releases: LucasAlegre/morl-baselines

MORL-Baselines 1.0.0

Features

1.0.0-rc2 bugfixes and enhancements

What's Changed

Contributors

1.0.0-rc1 Stabilizing and performance assessment