Bugs in PPO #6

moonblue333 · 2019-05-05T15:07:25Z

counter
for index in BatchSampler(SubsetRandomSampler(range(self.buffer_capacity), self.batch_size, True)):

yuntao-ma · 2019-11-25T11:09:34Z

How to solve bug 2?
It seems that "done" from the env hasn't been dealt with. Why?

Thanks.

HuangHaoyu1997 · 2020-03-08T17:04:29Z

@yuntao-ma
for index in BatchSampler(SubsetRandomSampler(range(self.buffer_capacity)), self.batch_size, True):

brezezee · 2020-07-27T13:58:12Z

Why can I train with this code to only get nan actions

xxx-007 · 2020-11-04T01:09:33Z

I get nan actions too

HzcIrving · 2020-11-28T09:58:49Z

I change the code to :
for index in BatchSampler(SubsetRandomSampler(range(self.buffer_capacity)), self.batch_size, True):
but there still exists a bug:
Traceback (most recent call last): File "E:/AAAFor_PHD/UUV_SCI_Modif/UUV_obs_env/PPO2/Demo/PPO_demo.py", line 195, in <module> main() File "E:/AAAFor_PHD/UUV_SCI_Modif/UUV_obs_env/PPO2/Demo/PPO_demo.py", line 175, in main next_state, reward, done, info = env.step(action) File "F:\Anaconda\envs\Obstacle_Avoid\lib\site-packages\gym\envs\classic_control\pendulum.py", line 49, in step u = np.clip(u, -self.max_torque, self.max_torque)[0] IndexError: invalid index to scalar variable.

haohaoqian · 2023-12-13T04:10:00Z

Transition = namedtuple('Transition',['state', 'aciton', 'reward', 'a_log_prob', 'next_state'])
'aciton' should be 'action'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bugs in PPO #6

Bugs in PPO #6

moonblue333 commented May 5, 2019

yuntao-ma commented Nov 25, 2019

HuangHaoyu1997 commented Mar 8, 2020

brezezee commented Jul 27, 2020

xxx-007 commented Nov 4, 2020

HzcIrving commented Nov 28, 2020

haohaoqian commented Dec 13, 2023

Bugs in PPO #6

Bugs in PPO #6

Comments

moonblue333 commented May 5, 2019

yuntao-ma commented Nov 25, 2019

HuangHaoyu1997 commented Mar 8, 2020

brezezee commented Jul 27, 2020

xxx-007 commented Nov 4, 2020

HzcIrving commented Nov 28, 2020

haohaoqian commented Dec 13, 2023