Modifying AdaProx for LASSO #23

Open

pythonometrist opened this issue Dec 9, 2020 · 2 comments

Comments

@pythonometrist

This isn't an issue per se. I wanted to figure out whether I could use a similar approach for a simple LASSO regression in PyTorch. Working with proximal operators and SGD is straightforward (but then SGD has step-size issues). Adam keeps a memory of past gradients, but it isn't meant for non-differentiable convex problems (even though L1 regularization does improve results a fair bit). I wanted to see if AdaProx improves results.
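
To make the SGD-plus-prox baseline concrete, here is a minimal PyTorch sketch of proximal SGD for LASSO: take a plain SGD step on the squared loss, then apply elementwise soft-thresholding. The synthetic data and all names are illustrative only, not from this repo.

```python
import torch

# Illustrative synthetic LASSO problem: y = X @ w_true + noise, with sparse w_true
torch.manual_seed(0)
n, d = 200, 50
X = torch.randn(n, d)
w_true = torch.zeros(d)
w_true[:5] = torch.randn(5)
y = X @ w_true + 0.01 * torch.randn(n)

lam = 0.1   # l1 penalty strength
lr = 1e-2   # SGD step size
w = torch.zeros(d, requires_grad=True)
opt = torch.optim.SGD([w], lr=lr)

for _ in range(2000):
    opt.zero_grad()
    loss = 0.5 * ((X @ w - y) ** 2).mean()  # smooth part only; the l1 term is handled by the prox
    loss.backward()
    opt.step()
    with torch.no_grad():
        # proximal step: elementwise soft-thresholding with threshold lr * lam
        w.copy_(torch.sign(w) * torch.clamp(w.abs() - lr * lam, min=0.0))
```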

@pmelchior
Owner

Do you have an elementwise l1 penalty? If so, you can use operators.prox_soft. adaprox should work then.
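
For reference, the elementwise l1 prox suggested here is just soft-thresholding. Below is a minimal NumPy sketch of what such an operator computes; the function name and signature are illustrative, so check `operators.prox_soft` in this repo for the actual interface.

```python
import numpy as np

def soft_threshold(x, step, thresh):
    """Elementwise l1 prox: argmin_z 0.5*||z - x||^2 + step * thresh * ||z||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - step * thresh, 0.0)

x = np.array([-2.0, -0.05, 0.0, 0.3, 1.5])
print(soft_threshold(x, step=1.0, thresh=0.1))
# -> [-1.9 -0.   0.   0.2  1.4]
```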

@pythonometrist
Author

Thanks, let me dig into it and report back. I am going to evaluate how this compares with a smooth Huber loss for linear regression.
