CEM #373

hades-rp2010 · 2020-10-05T21:53:24Z

codecov · 2020-10-05T21:56:18Z

Codecov Report

Merging #373 into master will decrease coverage by 0.03%.
The diff coverage is 90.09%.

@@            Coverage Diff             @@
##           master     #373      +/-   ##
==========================================
- Coverage   91.28%   91.25%   -0.04%     
==========================================
  Files          90       92       +2     
  Lines        3809     3910     +101     
==========================================
+ Hits         3477     3568      +91     
- Misses        332      342      +10

Impacted Files	Coverage Δ
genrl/agents/modelbased/base.py	`71.42% <71.42%> (ø)`
genrl/agents/modelbased/cem/cem.py	`94.87% <94.87%> (ø)`
genrl/agents/__init__.py	`100.00% <100.00%> (ø)`

lgtm-com · 2020-10-05T22:17:27Z

This pull request introduces 3 alerts when merging a90e8d0 into 52b0b4c - view on LGTM.com

new alerts:

3 for Unused import

lgtm-com · 2020-10-15T11:14:24Z

This pull request introduces 4 alerts when merging 3b2067d into 25eb018 - view on LGTM.com

new alerts:

4 for Unused import

sampreet-arthi

Looks good. Mostly just doubts. How's the performance of the agent now? (Does it hit 500?)

sampreet-arthi · 2020-10-16T19:01:57Z

genrl/agents/modelbased/base.py

+        raise NotImplementedError
+
+
+class ModelBasedAgent(ABC):


Can this inherit from the genrl/deep BaseAgent?

It can, and I think that's a better option (for now at least).

sampreet-arthi · 2020-10-16T19:05:27Z

genrl/agents/modelbased/cem/cem.py

+        # No need for this here
+        pass
+
+    def collect_rollouts(self, state: torch.Tensor):


Looks pretty similar to the OnPolicyAgent method. Shouldn't this return values and dones though? Not sure if this is a consequence of the algo.

sampreet-arthi · 2020-10-16T19:08:22Z

genrl/agents/modelbased/cem/cem.py

+        for i, done in enumerate(dones):
+            if done or timestep == self.rollout_size - 1:
+                self.rewards.append(self.env.episode_reward[i].detach().clone())
+                # self.env.reset_single_env(i)


Why is this commented out? This is necessary to reset environments immediately as they are set to done. (Not a good practice to do env.step() if the env is already returning done = True)

Since I break out of the action loop as soon as an env.step() returns done=True, and every planning session (the plan function) starts with env.reset(), I think this is redundant here, hence it's commented out.
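The control flow described in this reply can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: `ToyEnv` and this `plan` signature are hypothetical stand-ins, and the sketch covers only a single environment. Whether the same argument holds for several parallel environments depends on how `done` is aggregated across them, which the thread does not state.

```python
class ToyEnv:
    """Hypothetical single environment; not the genrl VectorEnv API."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0
        self.done = False

    def reset(self):
        self.t = 0
        self.done = False
        return 0.0

    def step(self, action):
        # Stepping a finished episode is the bad practice raised in the review.
        assert not self.done, "step() called after done=True"
        self.t += 1
        self.done = self.t >= self.horizon
        return float(self.t), 1.0, self.done


def plan(env, actions):
    """Run one planning session: reset first, stop at the first done."""
    env.reset()
    episode_reward = 0.0
    steps = 0
    for action in actions:
        _, reward, done = env.step(action)
        episode_reward += reward
        steps += 1
        if done:
            break  # no further step() until the next plan() resets the env
    return episode_reward, steps


env = ToyEnv(horizon=5)
first = plan(env, [0] * 8)   # more actions than the episode allows
second = plan(env, [0] * 3)  # fewer actions than the horizon, ends without done
```

Because every call to `plan` begins with `reset()` and the loop exits on the first `done`, the assertion inside `step` never fires in this sketch, which is the reason given for dropping the per-environment reset.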


sampreet-arthi · 2020-10-16T19:10:54Z

tests/test_deep/test_agents/test_cem.py

+
+
+def test_CEM():
+    env = VectorEnv("CartPole-v0", 1)


Why set it to 1? It does work with multiple envs right?

Yeah it does

sampreet-arthi · 2020-10-16T19:12:50Z

Also, forgot to mention the docs. The CEM agent code didn't have docstrings afair.

sampreet-arthi · 2020-10-16T19:21:03Z

tests/test_deep/test_agents/test_cem.py

+from genrl.trainers import OnPolicyTrainer
+
+
+def test_CEM():


Also please make this a class so the tests are easier to find/understand
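One possible shape for such a class, assuming pytest's default discovery rules (classes prefixed with `Test`, methods prefixed with `test_`). `StubVectorEnv` is a hypothetical placeholder so the sketch runs on its own. The real test would construct `VectorEnv` and the CEM agent instead.

```python
class StubVectorEnv:
    """Hypothetical stand-in for VectorEnv(env_name, n_envs)."""

    def __init__(self, name, n_envs):
        self.name = name
        self.n_envs = n_envs

    def reset(self):
        # One initial state per parallel environment.
        return [0.0] * self.n_envs


class TestCEM:
    """Groups the CEM tests so they are collected and reported together."""

    def _reset_states(self, n_envs):
        env = StubVectorEnv("CartPole-v0", n_envs)
        return env.reset()

    def test_single_env(self):
        assert len(self._reset_states(1)) == 1

    def test_multiple_envs(self):
        # Also covers the multi-environment case raised in the review.
        assert len(self._reset_states(2)) == 2
```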

hades-rp2010 · 2020-10-16T19:58:48Z

Also, forgot to mention the docs. The CEM agent code didn't have docstrings afair.

Yeah, I'll get that done too

lgtm-com · 2020-10-17T21:10:05Z

This pull request introduces 2 alerts when merging f5a189d into 25eb018 - view on LGTM.com

new alerts:

2 for Unused import

lgtm-com · 2020-10-21T16:47:10Z

This pull request introduces 2 alerts when merging 4b11c16 into 25eb018 - view on LGTM.com

new alerts:

2 for Unused import

hades-rp2010 added 13 commits September 1, 2020 15:49

Single actor critic shared params

1d49049

Shared layers for multi ACs

ef4a179

Merge branch 'master' of https://github.com/SforAiDl/genrl

2ecd086

Fix lint errors (1)

53450a8

Fixed tests

274aff9

Changes to dicstrings and classes

38f95f0

Renaming Multi -> Two and comments

835819e

Merge branch 'master' of https://github.com/SforAiDl/genrl

c94a9a1

Adding tutorial

bf71710

Small change

fc356b9

Index

844c53d

Up to date

d3830e0

CEM agent

a90e8d0

sampreet-arthi added this to In progress in Model Based RL Oct 12, 2020

hades-rp2010 added 2 commits October 13, 2020 21:19

Merge branch 'master' of https://github.com/SforAiDl/genrl into CEM

6cb6d5c

Training CEM without rollouts

3b2067d

Fix Codacy (1)

f86b046

hades-rp2010 changed the title ~~[WIP] CEM~~ CEM Oct 15, 2020

sampreet-arthi reviewed Oct 16, 2020


sampreet-arthi linked an issue Oct 16, 2020 that may be closed by this pull request

CEM for Model based agents #363

Open

sampreet-arthi reviewed Oct 16, 2020


Docstrings

f5a189d

hades-rp2010 requested a review from sampreet-arthi October 19, 2020 20:46

Adding device

4b11c16

