L-1 procrustes #115

choltz95 · 2021-07-23T17:55:43Z

Hi, Thanks for the project! I'm interested in solving procrustes using a more robust metric of dispersion than F norm.

I saw a couple papers online that solve e.g. smooth surrogates of l1, but wondering if you have any pointers in particular.

FanwangM · 2021-07-23T21:03:51Z

Thanks for the nice words!

We don't have any implementation to support l1 norm-based Procrustes (I am not familiar with that either). But we may add this as a feature in the future depending on other project progress and time avaliability. I found some interesting papers,

Are you looking for the implementation of l1 norm-based Procrustes ?
@choltz95

choltz95 · 2021-07-23T21:49:21Z

Thank you so much for the resources! I really appreciate it.

I guess that second paper is the most general, but doesn't guarantee optimality.

I am happy to make a pull request some time if you think this would be relevant to your project.

Main pieces for projected gradient are in section 3.3 & Eq. 14 of this paper (same author as your #2.)

FanwangM · 2021-07-25T03:49:23Z

Wow! This is going to be a nice improvement!! Thanks.
Please let me know when you need a code review or any other problems. I will tag this issue as an enhancement.

PaulWAyers · 2022-08-19T15:33:28Z

@choltz95 were you going to tackle this issue?

choltz95 · 2022-08-24T19:45:23Z

Hi yes - sorry for the delayed response. Still potentially interested. I ended up doing iteratively reweighted least squares to solve the l1 problem for my application since I did not really need a high quality solution.

Let me take another look at the papers originally posted in this thread and get back to you.

choltz95 · 2022-08-27T20:10:36Z

Hey - just started working on a prototype. I implemented a preliminary version of PGD for the smoothened l1 problem (huber) from the Trendafilov & Watson paper in Jax here.

I saw that the convergence is mediocre and needs many iterations on even some very simple testcases. I tried some simple things like gridding over geometric step lengths and early stopping. Did you find any other papers that discuss nicer solutions to this problem? There could also be some bugs.

When I initialize using solution to l-2 procrustes, result looks a little more reasonable.

I am thinking of trying a projected admm-type algorithm to solve the problem directly very quickly. I had some success on a similar problem. I feel that neither is ideal since the other methods in your library all have nice solutions, but it seems there is no way to get around iterating or potentially bad solutions for this one.

FanwangM · 2022-08-28T22:29:38Z

Thank you for your efforts!
I have tried your Colab codes, but it seems that the implementation is giving the strange behavior with missing value in foc and all the error values are nan. Can you check why is that?

I think for the implementation, maybe a good check is to reproduce the results listed in the paper you mentioned. I will also do more literature reading to find if we can get better algorithms on this problem.

@choltz95

choltz95 · 2022-08-30T18:20:04Z

Sounds good. Yeah- those plots don't look right to me. Here is the result from my run:

Likely there is some issue with the implementation of the descent direction since the foc (eq 15, left plot) should ideally go to zero monotonically. I'll get back to you when I have some time to debug. I agree next step should be reproduce their result.

choltz95 · 2022-10-18T17:38:08Z

Hi, sorry for the late response. Quick follow-up. Took another look at my implementation. I think it may be possible that the implementation is okay - I reproduced the concrete numerical experiments in Example 2 of the paper to get similar error, even though the foc is nonzero. I think the issue may simply be that projected gradient method may not be ideal for this kind of problem with orthogonality constraint.

I am thinking this: since there is already some api support for weighted least squares procrustes (with weights on the rows of A / the "samples"), it is probably easier to do something naive like iterative reweighting: https://en.wikipedia.org/wiki/Iteratively_reweighted_least_squares. This way we remain within the procrustes api and require minimal effort while supporting some notion of robust / l1-procrustes. I think there could be some problems like convergence is usually pretty slow, solution won't change much from least squares, but maybe it's better then projected gradient.

Let me know if you would like me to proceed!

FanwangM · 2022-10-19T02:07:01Z

Looks like a good feature to have. Please go ahead and make a pull request. Can you tag me when you are done? Thank you for making our package grow! @choltz95

FanwangM added the enhancement New feature or request label Jul 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

L-1 procrustes #115

L-1 procrustes #115

choltz95 commented Jul 23, 2021

FanwangM commented Jul 23, 2021

choltz95 commented Jul 23, 2021 •

edited

FanwangM commented Jul 25, 2021

PaulWAyers commented Aug 19, 2022

choltz95 commented Aug 24, 2022

choltz95 commented Aug 27, 2022 •

edited

FanwangM commented Aug 28, 2022

choltz95 commented Aug 30, 2022 •

edited

choltz95 commented Oct 18, 2022 •

edited

FanwangM commented Oct 19, 2022

L-1 procrustes #115

L-1 procrustes #115

Comments

choltz95 commented Jul 23, 2021

FanwangM commented Jul 23, 2021

choltz95 commented Jul 23, 2021 • edited

FanwangM commented Jul 25, 2021

PaulWAyers commented Aug 19, 2022

choltz95 commented Aug 24, 2022

choltz95 commented Aug 27, 2022 • edited

FanwangM commented Aug 28, 2022

choltz95 commented Aug 30, 2022 • edited

choltz95 commented Oct 18, 2022 • edited

FanwangM commented Oct 19, 2022

choltz95 commented Jul 23, 2021 •

edited

choltz95 commented Aug 27, 2022 •

edited

choltz95 commented Aug 30, 2022 •

edited

choltz95 commented Oct 18, 2022 •

edited