Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Prevent more than one call to wandb.login Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug: `decay_factor` hparam in avalanche/ewc.py Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Use wandb when wandb.project is enabled Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * wip: Adding task-inference to Avalanche models Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Use task-inference mechanism only at test-time Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Improved is_classic_control_env and is_atari_env Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add is_monsterkong_env to utils.py Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add passing env per task for IncrementalRL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [WIP]: Adding modified MuJoCO Envs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * modified_size xml tweaking is better Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Finish adding ModifiedSizeEnv (not on-the-fly yet) Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [wip] Fixing bugs in size scaling for hopper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [wip] Writing an algo for updating the sizes Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Wrapping up work on the mujoco envs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Rework the task creation in RL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Upgrade metaworld version Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Minor tweak in formatting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix import bugs with mujoco envs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add ContinualHalfCheetah-v0 to ContinualRLSetting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Debugging docker setup Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Remove 'observe_state_directly' field Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add deprecation warning for observe_state_directly Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix tests for the current RL side of the tree Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix uncaught error when trying to import mujoco Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [cluster debugging] fix bug in envs/__init__.py Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix issues with Avalanche logging on Cluster Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [cluster debugging] fix stupid typo Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Added missing tasks.py file in incremental rl Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [The Great Renaming] Flatten and rename modules Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [The Great Renaming] SL setting tests pass Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix mujoco dependency during tests Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [The Great Renaming] Add ContinualSLSetting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Upgrade the 'smooth shuffling' code and tests Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Prevent iterating on closed ContinualSLEnvironment Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs with BaselineMethod in continual_sl Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fixed more failing tests, task-incremental bugs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix formatting issue in assumption file Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix all tests on SL side Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix imports in Avalanche methods Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add tests for Continuous/Discrete SL to Avalanche Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add missing RL settings, fix tests for SB3 Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix mujoco import bug and all Avalanche tests Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add entry for mjkey.txt to gitignore Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Skip SAC tests unless --slow is passed Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [docker debugging] Fix import error Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [docker debugging] Adding mujoco-py as extras-req. Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Save results to a dir and upload to wandb Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add presets for the RL sweep Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Set video_callable to True if using wandb Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs with benchmark schedules, gym Monitor Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix sweeps crashing after first run Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [Cluster debugging] Adding util scripts for eai Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Make all Avalanche methods target ContinualSL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [Docker debugging] Removing some scripts Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Monitor the training performance by default Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix hpo_sweep not correctly using wandb Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix task labels not given at test time in T-IL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs with PNN Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix wandb bug with saving the results to a dir Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Ignore exceptions when trying to save results Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug with PNN in Cifar100 with 10 tasks Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Avoid bug with Avalanche's default_logger Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Rebuild docker container before sweep Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Before RL Sweep Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug in ExperienceReplay, some SB3 Methods Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix sl_sweep.sh Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add the 'objective_name' property Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Added the InteractiveLogger to Avalanche methods Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fixing bug with HOME directory in eai jobs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug in ContinualRLSetting with half_cheetah Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [WIP] Rework ContinualRL to be based on Continual Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [wip] Fix bugs in sequoia/common Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Rework of ContinualRLSetting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [temp] starting to fix Discrete RL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Adding pixel variants for classic-control envs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Improving tests, Fixing Discrete RL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [wip] Fixing Discrete results Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix all tests for DiscreteTaskAgnosticRLSetting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix tests for IncrementalRLSetting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [wip] refactoring of the 'RL' settings Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix the children() bug in readme.py Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix mapping from mujoco env names to env specs Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * TraditionalRLSetting under IncremementalRLSetting Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Ugly commit: All tests in RL 'pass' Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Update README table and puml diagrams Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix tests for SL Settings Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix error in tests for EWC Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add TypedDictSpace, to replace NamedTupleSpace Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix BaseMethod bug w/ max_epochs >= 1 in Continual Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Replace NamedTupleSpace -> TypedDictSpace Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs with obs_space[0] -> obs_space["x"] Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Import the make_task functions in settings.rl Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Improving the tests for SettingProxy Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Removing the /tests folder Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix formatting and imports in examples Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix pytest.ini config, test everything by default Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Support dicts with extra keys in TypedDictSpace Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Reduce debug logging verbosity in RL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix little bugs left in the RL settings Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix issues with the EpisodeLimit wrapper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add RoundRobin, Concat and RandomMulti wrappers Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Adding MultiEnv wrappers Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix issues with IterableWrapper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug in MeasureSLPerformanceWrapper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * [temp] MeasureRlPerformance wrapper / EnvDataset Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug in ConvertToFromTensors wrapper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix pretty much every tests in RL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs with MeasureRLPerformanceWrapper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add on_task_switch_callback to ConcatEnvsWrapper Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Add test for equality in TypedDict test Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Adding support for metaworld's MT10 benchmark Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs caused by obs[0] instead of obs["x"] Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Update README in sequoia/settings, methods Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix tests for SB3 methods Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix more bugs with obs_space[0] -> obs_space["x"] Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug with multi-headed BaseModel in MultiTaskRL Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bug with avalanche.LwF with Multi-Task Model Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Use task_inference_forward pass when taskid=None Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * "fix" AssertionError in avalanche.Replay Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * Fix bugs in tests for avalanche.EWC Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com> * "Fix" bugs in avalanche.SynapticIntelligence Signed-off-by: Fabrice Normandin <fabrice.normandin@gmail.com>
- Loading branch information