
The learning rate of the inverse KKT method in the code is inconsistent with that in the paper. #3

Open
beijiguang94 opened this issue Oct 19, 2023 · 2 comments

@beijiguang94

Hi @wanxinjin. I noticed that in the paper the learning rate for the PDP, inverse KKT, and neural policy cloning methods in the imitation learning experiments is set to $\eta=10^{-4}$, but in scripts like "cartpole_inverseKKT.py" the parameter `lr` is set to 1e-7. Why is that?

@wanxinjin
Owner

Hi @beijiguang94, thanks for your interest.
I most likely changed the code afterward. Please use whatever learning rate is stable.
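In case it helps, one quick way to find a stable rate is to run a short trial for each candidate and keep the largest one whose loss stays finite. Below is a minimal sketch, assuming plain gradient descent; `loss_and_grad` is a hypothetical stand-in for the imitation-loss/gradient computation in scripts like cartpole_inverseKKT.py, not the repository's actual API.

```python
import numpy as np

def stable_lr_sweep(loss_and_grad, theta0, lrs=(1e-4, 1e-5, 1e-6, 1e-7), steps=200):
    """Try each candidate learning rate for a few steps and keep the
    largest one whose loss stays finite (i.e., does not diverge)."""
    best_lr, best_loss = None, np.inf
    for lr in lrs:
        theta = np.array(theta0, dtype=float)
        loss, diverged = np.inf, False
        for _ in range(steps):
            loss, grad = loss_and_grad(theta)
            if not np.isfinite(loss):
                diverged = True
                break
            theta = theta - lr * grad
        if not diverged and loss < best_loss:
            best_lr, best_loss = lr, loss
    return best_lr

# Toy check: quadratic loss 0.5 * ||theta||^2 with gradient theta.
# Swap in the inverse-KKT imitation loss to use this in practice.
f = lambda th: (0.5 * float(th @ th), th)
print(stable_lr_sweep(f, theta0=np.ones(4)))  # picks the largest stable rate
```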

@beijiguang94
Author

Thanks. When the methods use different learning rates, I suppose it would be fairer to compare their imitation losses with the x-axis changed from 'iterations' to 'consumed time' (wall-clock time). A sketch of what I mean follows.
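A minimal sketch of such a comparison, assuming plain gradient descent: `run_with_timing` is a hypothetical helper, and the toy quadratic stands in for each method's actual imitation loss; the point is just to record (elapsed time, loss) pairs and plot loss against consumed time rather than iteration count.

```python
import time
import numpy as np
import matplotlib.pyplot as plt

def run_with_timing(loss_and_grad, theta0, lr, max_time=0.5):
    """Gradient descent that logs (elapsed seconds, loss) pairs so that
    methods with different learning rates share a common time axis."""
    theta = np.array(theta0, dtype=float)
    times, losses = [], []
    start = time.perf_counter()
    while time.perf_counter() - start < max_time:
        loss, grad = loss_and_grad(theta)
        times.append(time.perf_counter() - start)
        losses.append(loss)
        theta = theta - lr * grad
    return times, losses

# Toy quadratic loss; replace with each method's imitation loss.
f = lambda th: (0.5 * float(th @ th), th)
for label, lr in [("lr = 1e-4", 1e-4), ("lr = 1e-7", 1e-7)]:
    t, l = run_with_timing(f, np.ones(4), lr)
    plt.plot(t, l, label=label)
plt.xlabel("consumed time (s)")
plt.ylabel("imitation loss")
plt.legend()
plt.show()
```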
