Skip to content

Releases: HumanCompatibleAI/imitation

v1.0.0 -- first stable release

31 Oct 18:48
de589d4
Compare
Choose a tag to compare

We're pleased to announce the first stable release of imitation. Key improvements include:

  • Gymnasium compatibility, which has superceded Gym
  • Tuned hyperparameters and benchmark results for common algorithm-environment pairs (see release artifact attached).
  • New algorithm (beta): SQIL
    For more information, see the changelog below.

What's Changed

New Contributors

Full Changelog: v0.4.0...v1.0.0

v0.4.0

17 Jul 23:05
2743c28
Compare
Choose a tag to compare

What's Changed

  • Continuous Integration: Add support for Mac OS; remove dependency on MuJoCo
  • Preference comparison: improved logging, support for active learning based on variance of ensemble.
  • HuggingFace integration for model and dataset loading.
  • Benchmarking: add results and example configs.
  • Documentation: add notebook tutorials; other general improvements.
  • General changes: migrate to pathlib; add more type hints to enable mypy as well as pytype.

Full Changelog: v0.3.1...v0.4.0

v0.3.1

29 Jul 00:58
b71cedd
Compare
Choose a tag to compare

What's Changed

Main changes:

Minor changes:

Full Changelog: v0.3.0...v0.3.1

Major improvements

26 Jul 21:07
242b15b
Compare
Choose a tag to compare

New features:

Improvements:

  • Port MCE IRL from JAX to Torch, eliminating the JAX dependency. (thanks to @qxcv)
  • Refactor RewardNet code to be independent from AIRL, and shared across algorithms. (thanks to @ejnnr)
  • Add Windows support including continuous integration. (thanks to @taufeeque9)

First PyTorch release

23 Oct 23:07
e7e87b2
Compare
Choose a tag to compare
compute_train_stats: Fix logits passed in as proba (#273)

Led to an error when I was training.

Final TF1 release

01 Sep 01:39
Compare
Choose a tag to compare
v0.1.1

Final TF1 release

Initial release

09 May 19:46
cf51186
Compare
Choose a tag to compare

Prototype versions of AIRL, GAIL, BC, DAGGER.