[Feature Request] Store next observations and dones in RolloutBuffer #1273

taufeeque9 · 2023-01-11T14:35:48Z

🚀 Feature

Add next_observations and dones fields to the RolloutBuffer and the DictRolloutBuffer classes, similar to how it is done in the ReplayBuffer class.

Motivation

Currently, on-policy algorithms don't store the next observations and dones in their buffer when collecting rollouts in the collect_rollouts method, because none of the algorithms in stable-baselines3 requires these fields. However, these fields are required to be stored in the buffer to implement the original variant of the AIRL algorithm in imitation.

Pitch

No response

Alternatives

No response

Additional context

No response

Checklist

I have checked that there is no similar issue in the repo

araffin · 2023-01-12T09:45:11Z

> Add next_observations and dones fields to the RolloutBuffer and the DictRolloutBuffer classes, similar to how it is done in the ReplayBuffer class.

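dones are already stored in episode_starts (shifted by one step), and next_observations can be retrieved as observations[i+1] (except at episode ends, where observations[i+1] is the first observation of the next episode rather than the terminal observation)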
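The index shift described above can be sketched with plain NumPy arrays. This is only an illustration under stated assumptions: the arrays are assumed to be laid out as (n_steps, n_envs, ...), the function name derive_next_obs_and_dones is hypothetical, and last_obs and last_dones (the observation and done flags produced by the final step of the rollout) are assumed to be kept by the caller, since the stored arrays alone do not contain them.

```python
import numpy as np


def derive_next_obs_and_dones(observations, episode_starts, last_obs, last_dones):
    """Recover per-step next observations and dones from rollout arrays.

    Hypothetical helper, not part of any library.

    observations:   (n_steps, n_envs, *obs_shape)
    episode_starts: (n_steps, n_envs), true where a step begins a new episode
    last_obs:       (n_envs, *obs_shape), observation returned by the final step
    last_dones:     (n_envs,), done flags returned by the final step
    """
    observations = np.asarray(observations)
    episode_starts = np.asarray(episode_starts, dtype=bool)
    last_dones = np.asarray(last_dones, dtype=bool)

    # dones[t] equals episode_starts[t + 1]: a step ends an episode exactly
    # when the following step starts a new one. The last row needs last_dones.
    dones = np.concatenate([episode_starts[1:], last_dones[None]], axis=0)

    # next_observations[t] equals observations[t + 1]; the last row needs last_obs.
    next_observations = np.concatenate(
        [observations[1:], np.asarray(last_obs)[None]], axis=0
    )

    # Caveat: where dones[t] is True the environment has been reset, so
    # next_observations[t] is the first observation of the next episode,
    # not the terminal observation of the finished one.
    return next_observations, dones


# Usage with one environment and four steps.
obs = np.arange(4, dtype=np.float32).reshape(4, 1, 1)
starts = np.array([[1], [0], [1], [0]])
next_obs, dones = derive_next_obs_and_dones(
    obs, starts, last_obs=[[4.0]], last_dones=[True]
)
```

The caveat in the last comment is why this approach alone does not give the true terminal observation; that value has to be captured at step time.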

> Alternatives

Why not implement a custom buffer for your use case?
(You can fill it using a callback or a custom SB3 version.)
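A minimal sketch of such a custom buffer, written without any SB3 imports so it stays self-contained. The class name NextObsBuffer and its methods are hypothetical. It assumes that each step yields new_obs, dones, and infos for all environments, and that infos[i] carries a "terminal_observation" entry when environment i finishes an episode (as SB3's vectorized environments do on auto-reset). One way to fill it would be to call add() from a callback's step hook with the values the rollout loop exposes, which is also an assumption here rather than something shown in this thread.

```python
import numpy as np


class NextObsBuffer:
    """Hypothetical side buffer holding next observations and dones."""

    def __init__(self, n_steps, n_envs, obs_shape):
        self.n_steps = n_steps
        self.next_observations = np.zeros(
            (n_steps, n_envs, *obs_shape), dtype=np.float32
        )
        self.dones = np.zeros((n_steps, n_envs), dtype=bool)
        self.pos = 0

    def reset(self):
        """Start writing from the first row again (call once per rollout)."""
        self.pos = 0

    def add(self, new_obs, dones, infos):
        """Store one step of data for all environments."""
        if self.pos >= self.n_steps:
            raise ValueError("buffer is full; call reset() first")
        # Copy so that later in-place changes by the caller do not leak in.
        next_obs = np.array(new_obs, dtype=np.float32)
        for env_idx, done in enumerate(dones):
            terminal_obs = infos[env_idx].get("terminal_observation")
            if done and terminal_obs is not None:
                # Use the true terminal observation instead of the
                # post-reset observation of the next episode.
                next_obs[env_idx] = terminal_obs
        self.next_observations[self.pos] = next_obs
        self.dones[self.pos] = dones
        self.pos += 1


# Usage: two environments, the second one finishes its episode.
buf = NextObsBuffer(n_steps=2, n_envs=2, obs_shape=(1,))
buf.add(
    new_obs=np.array([[1.0], [0.0]]),
    dones=np.array([False, True]),
    infos=[{}, {"terminal_observation": np.array([9.0])}],
)
```

Keeping this buffer separate from the rollout buffer avoids changing the library's storage format, at the cost of holding a second observation array in memory.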

> However, these fields are required to be stored in the buffer to implement the original variant of the AIRL algorithm in imitation.

do you have a code example of that?

taufeeque9 added the enhancement New feature or request label Jan 11, 2023

taufeeque9 linked a pull request Jan 11, 2023 that will close this issue

Add next_observations and dones to RolloutBuffer #1267

Draft

16 tasks

araffin mentioned this issue Feb 13, 2023

[Feature Request] Add a next_observations field to RolloutBufferSamples #1328

Closed

1 task
