Why L-BFGS performs different in tensorflow.compat.v1 and pytorch #1698

gaohuizhang · 2024-04-01T00:06:27Z

Hi,

I tried to train the model with L-BFGS after 15000 iterations of Adam, but I got different results from tensorflow.compat.v1 and PyTorch, even if I use the exact same code, just a different backend.
This is the loss history using tensorflow.compat.v1

This is the loss history using Pytorch

Does anyone have any idea why this happens? Does this mean we have to use TensorFlow if we want to use the L-BFGS optimizer?

Thanks a lot

bakhtiyar-k · 2024-04-01T10:25:24Z

Hello, try to use the same seed for both cases dde.config.set_random_seed(1). Maybe the results are different due to different initialization

gaohuizhang · 2024-04-02T17:40:14Z

Thanks for your advice, but I tried several times and got similar results. I don't think it is caused by randomness of initialization.

praksharma · 2024-04-24T10:18:23Z

Because LBFGS is implemented differently in both libraries. You find the link the the particular implementation (papers) in the source code of tf and torch.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Why L-BFGS performs different in tensorflow.compat.v1 and pytorch #1698

Why L-BFGS performs different in tensorflow.compat.v1 and pytorch #1698

gaohuizhang commented Apr 1, 2024 •

edited

bakhtiyar-k commented Apr 1, 2024

gaohuizhang commented Apr 2, 2024

praksharma commented Apr 24, 2024

Why L-BFGS performs different in tensorflow.compat.v1 and pytorch #1698

Why L-BFGS performs different in tensorflow.compat.v1 and pytorch #1698

Comments

gaohuizhang commented Apr 1, 2024 • edited

bakhtiyar-k commented Apr 1, 2024

gaohuizhang commented Apr 2, 2024

praksharma commented Apr 24, 2024

gaohuizhang commented Apr 1, 2024 •

edited