New simulation functionality #182

Open
monkey0head opened this issue Nov 8, 2022 · 0 comments

@monkey0head

Hi! You presented a paper on simulating industrial challenges with OBP at RecSys'22. I found it very interesting and would like to understand some of the details. I found the PR with the source code and the experiment notebooks, but I could not find any documentation describing the idea and details of the new functionality. Could you help me with a few questions?

  1. What kinds of reward functions do you have, and how are they trained? I found logistic_reward_function, linear_reward_function and others placed here. Unfortunately, I have not figured out what the training data for them is, or whether they are retrained every simulation round.
  2. What is the functionality of BanditEnvironmentSimulator?

It would be great if you could share any papers (other than the one from RecSys'22), schemas, demos, or tutorials that explain the details of your simulation framework.
