Unmatching size and error #3

tegg89 · 2017-08-21T08:33:49Z

Hi, thanks for sharing your wonderful code.
However, I ran into some errors when running it.

In lines 197~205 of dqn_learn.py, the sizes of target_Q_values and current_Q_values do not match. I changed the code to next_max_q = next_max_q.unsqueeze(-1) to correct the sizes. I also changed line 203 to use rew_batch[0].
(IMO) After records are stacked in the replay buffer, queuing the action does not work properly. I changed line 158 to action = select_epilson_greedy_action(Q, recent_observations, t), but a different action value gets queued.
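
The size mismatch between target_Q_values and current_Q_values can be reproduced in isolation. The following is a minimal sketch, not the code from dqn_learn.py: the tensor names mirror those in the report, while batch_size, num_actions, gamma, not_done_mask and the random stand-in tensors are assumptions made for illustration. It assumes rew_batch is 1-D with shape (batch_size,) and that current_Q_values is obtained with gather, giving shape (batch_size, 1). Under those assumptions, unsqueezing only next_max_q makes the sum with rew_batch broadcast to (batch_size, batch_size), so it may be safer to build the target in 1-D and add the trailing dimension once at the end.

```python
import torch

# Hypothetical sizes and discount factor, chosen only for illustration.
batch_size, num_actions = 4, 3
gamma = 0.99

# Random stand-ins for the network outputs and the replay-buffer batch.
q_values = torch.randn(batch_size, num_actions)        # Q(s, .)
next_q_values = torch.randn(batch_size, num_actions)   # target network Q(s', .)
act_batch = torch.randint(0, num_actions, (batch_size,))
rew_batch = torch.randn(batch_size)                    # assumed 1-D
not_done_mask = torch.ones(batch_size)                 # 1.0 where the episode continues

# Q(s, a) for the actions actually taken: shape (batch_size, 1).
current_Q_values = q_values.gather(1, act_batch.unsqueeze(1))

# Max over the action dimension drops that dimension: shape (batch_size,).
next_max_q = next_q_values.max(1)[0]

# Unsqueezing only next_max_q broadcasts against the 1-D rewards:
# (batch_size,) + (batch_size, 1) -> (batch_size, batch_size).
mismatched = rew_batch + gamma * next_max_q.unsqueeze(-1)

# Building the target in 1-D first, then adding the trailing dimension,
# yields the same shape as current_Q_values: (batch_size, 1).
target_1d = rew_batch + gamma * not_done_mask * next_max_q
target_Q_values = target_1d.unsqueeze(-1)

# Element-wise TD error, one value per sample in the batch.
td_error = target_Q_values - current_Q_values

print(current_Q_values.shape, mismatched.shape, target_Q_values.shape)
```

Printing the shape of each intermediate tensor, as in the last line, is a quick way to confirm which operand has gained or lost a dimension before the loss is computed.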

I am still working on these but am having trouble. Could you help fix them?

hungtuchen · 2017-08-22T13:30:29Z

Thanks for your question. But I won't be available for a few days.
I will revisit it when I have time.
Which PyTorch version are you using? I haven't updated to the latest version, which might be the problem.

tegg89 · 2017-08-28T05:04:44Z

@transedward Thanks for your reply. I tested with PyTorch 0.2.0.post1 (0.2.0.1) and Python 3.5.3 under Anaconda on Ubuntu 16.04.

praveen-palanisamy · 2017-11-01T02:49:27Z

@tegg89: Check out #8. Let us know whether it works.

hungtuchen mentioned this issue Sep 11, 2017

pytorch 0.2 #5

Open

praveen-palanisamy mentioned this issue Nov 1, 2017

Fixes for issues #3, #5 and #7. Agent learns better #8

Open
