The rllib wrapper should provide a single-agent environment with action_space and observation_space attributes #69

Open
fteicht opened this issue Oct 6, 2021 · 0 comments
Labels
bug Something isn't working

🐛 Bug

Many algorithms built on OpenAI gym environments expect the environment to define the action_space and observation_space attributes. This is especially true of the single-agent algorithms in ray's rllib, such as AlphaZero.

See for instance ray's rllib's AlphaZero implementation which makes use of those attributes.

The current scikit-decide rllib wrapper provides only a multi-agent environment wrapper, AsRLlibMultiAgentEnv, which does not define the action_space and observation_space attributes (which is fine for rllib's multi-agent environments). scikit-decide's rllib wrapper should therefore additionally provide a single-agent environment wrapper that defines these attributes, for algorithms like rllib's AlphaZero.

To Reproduce

Define a scikit-decide RL domain and pass it to ray's rllib's AlphaZero algorithm.
The following exception will be thrown when solving the domain:
AttributeError: 'AsRLlibMultiAgentEnv' object has no attribute 'action_space'

Expected behavior

No exception should be thrown: an environment wrapper such as AsRLlibSingleAgentEnv (to be defined) should define the action_space and observation_space attributes.
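To make the request concrete, here is a rough sketch of what such a wrapper might look like. It is only an illustration under stated assumptions, not the scikit-decide implementation: DiscreteSpace and ToyDomain are hypothetical stand-ins (for a gym space and a scikit-decide domain) so that the snippet runs without gym, ray or scikit-decide installed. It also assumes the wrapped domain exposes get_action_space(), get_observation_space(), reset() and step(). A real wrapper would presumably subclass gym.Env and convert scikit-decide spaces and outcomes to their gym equivalents.

```python
from dataclasses import dataclass
from typing import Any, Dict, Tuple


@dataclass(frozen=True)
class DiscreteSpace:
    """Hypothetical stand-in for a gym discrete space with n values."""

    n: int

    def contains(self, x: Any) -> bool:
        return isinstance(x, int) and 0 <= x < self.n


class ToyDomain:
    """Hypothetical single-agent domain: walk along a line to the last cell."""

    def __init__(self, size: int = 5) -> None:
        self._size = size
        self._pos = 0

    def get_action_space(self) -> DiscreteSpace:
        return DiscreteSpace(2)  # 0 = move left, 1 = move right

    def get_observation_space(self) -> DiscreteSpace:
        return DiscreteSpace(self._size)

    def reset(self) -> int:
        self._pos = 0
        return self._pos

    def step(self, action: int) -> Tuple[int, float, bool, Dict[str, Any]]:
        move = 1 if action == 1 else -1
        # Clamp the position so the agent stays on the line.
        self._pos = min(max(self._pos + move, 0), self._size - 1)
        done = self._pos == self._size - 1
        reward = 1.0 if done else 0.0
        return self._pos, reward, done, {}


class AsRLlibSingleAgentEnv:
    """Sketch of the proposed single-agent wrapper (name taken from this issue)."""

    def __init__(self, domain: ToyDomain) -> None:
        self._domain = domain
        # The two attributes that single-agent algorithms look up on the env.
        self.action_space = domain.get_action_space()
        self.observation_space = domain.get_observation_space()

    def reset(self) -> int:
        return self._domain.reset()

    def step(self, action: int) -> Tuple[int, float, bool, Dict[str, Any]]:
        if not self.action_space.contains(action):
            raise ValueError(f"invalid action: {action!r}")
        return self._domain.step(action)


env = AsRLlibSingleAgentEnv(ToyDomain())
obs = env.reset()
done = False
while not done:
    obs, reward, done, info = env.step(1)  # always move right
```

The point of the sketch is that action_space and observation_space are set once in the constructor from the wrapped domain, so code that reads env.action_space no longer raises AttributeError.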

@fteicht fteicht added the bug Something isn't working label Oct 6, 2021