You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, thanks for sharing your wonderful code.
But I have met some errors when running it.
Inside the line 197~205 from dqn_learn.py, the size of target_Q_values and that of current_Q_values does not matched well. I have changed to next_max_q = next_max_q.unsqueeze(-1) for correcting sizes. Also I have changed to rew_batch[0] from line 203.
(IMO) After stacking records in replay buffer, queue action does not work properly. I have changed the line 158 to action = select_epilson_greedy_action(Q, recent_observations, t), however different action value has queued.
I am still working these but having troubles. Could you help make them right?
The text was updated successfully, but these errors were encountered:
Thanks for your question. But I won't be available for a few days.
I will revisit it when I have time.
Which pytorch version do you use? I haven't updated to latest version. It might be the problem.
Hi, thanks for sharing your wonderful code.
But I have met some errors when running it.
Inside the line 197~205 from
dqn_learn.py
, the size oftarget_Q_values
and that ofcurrent_Q_values
does not matched well. I have changed tonext_max_q = next_max_q.unsqueeze(-1)
for correcting sizes. Also I have changed torew_batch[0]
from line 203.(IMO) After stacking records in replay buffer, queue action does not work properly. I have changed the line 158 to
action = select_epilson_greedy_action(Q, recent_observations, t)
, however different action value has queued.I am still working these but having troubles. Could you help make them right?
The text was updated successfully, but these errors were encountered: