RPC Communication in Distributed RL Training #303

Open · May be fixed by #327

Sharad24 opened this issue Aug 31, 2020 · 2 comments

Sharad24 (Member) commented Aug 31, 2020

There are a few ways I can think of to do distributed training:

  1. Use PyTorch's distributed training infrastructure. This would require establishing communication protocols specific to the deep RL case, and would most likely all be in Python unless we find a way around that (a rough sketch of what this could look like follows below the list).
  2. Use Reverb
    • Use TF-based Datasets (@threewisemonkeys-as)
    • PyTorch wrapper for converting the NumPy arrays, etc. that are received (short-term, up for grabs)
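For what it's worth, here is a minimal, untested sketch of what option 1 could look like with torch.distributed.rpc: one learner rank fans rollout requests out to actor ranks and gathers the results. `collect_rollout` and the worker names are placeholders made up for illustration, not anything that exists in the library.

```python
# Hypothetical sketch of option 1 (PyTorch RPC). `collect_rollout` and the
# learner/actor worker names are placeholders, not library code.
import os

import torch
import torch.distributed.rpc as rpc
import torch.multiprocessing as mp


def collect_rollout(policy_weights, horizon=128):
    # Placeholder: a real actor would load `policy_weights` into its own
    # policy, step its own copy of the environment, and return the trajectory.
    return [torch.zeros(4) for _ in range(horizon)]


def run_learner(num_actors):
    # The learner fans out async RPCs to every actor and waits for rollouts.
    futures = [
        rpc.rpc_async(f"actor{i}", collect_rollout, args=({}, 128))
        for i in range(num_actors)
    ]
    trajectories = [fut.wait() for fut in futures]
    # ... compute the loss and update the central weights here ...
    return trajectories


def main(rank, world_size):
    os.environ.setdefault("MASTER_ADDR", "localhost")
    os.environ.setdefault("MASTER_PORT", "29500")
    name = "learner" if rank == 0 else f"actor{rank - 1}"
    rpc.init_rpc(name, rank=rank, world_size=world_size)
    if rank == 0:
        run_learner(num_actors=world_size - 1)
    rpc.shutdown()  # blocks until every worker is done with its RPCs


if __name__ == "__main__":
    world_size = 3  # 1 learner + 2 actors
    mp.spawn(main, args=(world_size,), nprocs=world_size)
```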
Sharad24 added the enhancement and Core c++ labels Aug 31, 2020
Sharad24 added this to To do in Distributed RL via automation Aug 31, 2020
threewisemonkeys-as (Member) commented

I agree that we should target 2 to begin with. We will still need Python multiprocessing here to run the actors and learners separately, right?

As for the structure and how it fits into the rest of the library, I was thinking of having DistributedOnPolicyTrainer and DistributedOffPolicyTrainer, which would act as the main process and spawn the multiple actors while maintaining and updating the central weights. In this case, the agent would only need to implement update_params (to be called in the main process) and select_action (to be called by each actor). The trajectories and weights would be transported through Reverb.
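Roughly (and untested) what that could look like, with trajectories flowing through Reverb. Everything here other than select_action and update_params is a made-up name, and broadcasting the updated weights back to the actors is left out for brevity:

```python
# Rough sketch of a DistributedOffPolicyTrainer on top of Reverb.
# Only agent.select_action / agent.update_params come from the proposal above;
# every other name is hypothetical. Weight broadcasting to actors is omitted.
import multiprocessing as mp

import numpy as np
import reverb


def make_server():
    # Central Reverb server holding a single uniform-replay table.
    return reverb.Server(tables=[
        reverb.Table(
            name="experience",
            sampler=reverb.selectors.Uniform(),
            remover=reverb.selectors.Fifo(),
            max_size=100_000,
            rate_limiter=reverb.rate_limiters.MinSize(1),
        )
    ])


def actor_loop(server_address, agent, env, num_steps):
    # Each spawned actor only needs the agent's select_action.
    client = reverb.Client(server_address)
    obs = env.reset()
    for _ in range(num_steps):
        action = agent.select_action(obs)
        next_obs, reward, done, _ = env.step(action)
        client.insert(
            [np.asarray(obs), np.asarray(action), np.asarray(reward),
             np.asarray(next_obs), np.asarray(done)],
            priorities={"experience": 1.0},
        )
        obs = env.reset() if done else next_obs


class DistributedOffPolicyTrainer:
    def __init__(self, agent, env_fn, num_actors=4):
        self.agent = agent
        self.env_fn = env_fn
        self.num_actors = num_actors

    def train(self, actor_steps=10_000, updates=1_000, batch_size=64):
        server = make_server()
        address = f"localhost:{server.port}"
        actors = [
            mp.Process(target=actor_loop,
                       args=(address, self.agent, self.env_fn(), actor_steps))
            for _ in range(self.num_actors)
        ]
        for p in actors:
            p.start()
        client = reverb.Client(address)
        for _ in range(updates):
            # Sampling returns NumPy arrays; the PyTorch conversion wrapper
            # mentioned above would sit here.
            batch = list(client.sample("experience", num_samples=batch_size))
            self.agent.update_params(batch)  # runs only in the main process
        for p in actors:
            p.join()
        server.stop()
```

Since the actors and the learner only share the Reverb server address, the same structure should work whether the actors are local processes or remote machines.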

I am holding off on #233 since a Reverb buffer wrapper would heavily depend on the structure we go with. Plus, it is not really useful in the non-distributed case.

github-actions bot commented Nov 3, 2020

Stale issue message

threewisemonkeys-as linked a pull request Nov 4, 2020 that will close this issue
Distributed RL automation moved this from To do to Done Nov 10, 2020
Distributed RL automation moved this from Done to In progress Nov 10, 2020