Implementation of L-BFGS and VL-BFGS on GPU relying on PyTorch framework.
The VL variant is from Large-scale L-BFGS using MapReduce.
Here is an example of the speed-up (5 000 features and 200 000 samples):
Here is a more complete comparison according to the dimension: