This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

Svd #1248 (Open)

wants to merge 13 commits into master
Conversation

@kedevked (Contributor) commented Aug 26, 2018

Description

Compute the SVD of a matrix m:

  • Use the QR decomposition to compute eigenvalues and eigenvectors.
  • Use the solve function to compute v as the solution of (us)X = m.
  • Use a fixed number of iterations, which can be improved.
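For illustration, the approach described above can be sketched tfjs-free on plain arrays: a Gram-Schmidt QR decomposition plus a fixed-count QR iteration that accumulates the Q factors as approximate eigenvectors. This is a sketch of the technique, not the PR's code (which uses tf.linalg.qr); all names here are mine.

```typescript
type Mat = number[][];

// Matrix product of two conformable matrices.
function matMul(a: Mat, b: Mat): Mat {
  return a.map(row =>
    b[0].map((_, j) => row.reduce((s, v, k) => s + v * b[k][j], 0)));
}

// QR decomposition via classical Gram-Schmidt (fine for a small demo,
// not numerically robust).
function qr(a: Mat): [Mat, Mat] {
  const n = a.length;
  const q: Mat = Array.from({length: n}, () => new Array(n).fill(0));
  const r: Mat = Array.from({length: n}, () => new Array(n).fill(0));
  for (let j = 0; j < n; j++) {
    let v = a.map(row => row[j]);  // j-th column of A
    for (let i = 0; i < j; i++) {
      r[i][j] = v.reduce((s, _, k) => s + q[k][i] * a[k][j], 0);
      v = v.map((x, k) => x - r[i][j] * q[k][i]);
    }
    r[j][j] = Math.hypot(...v);
    for (let k = 0; k < n; k++) q[k][j] = v[k] / r[j][j];
  }
  return [q, r];
}

// Unshifted QR iteration with a fixed step count: A ← R·Q each step.
// The accumulated Q factors approximate the eigenvectors.
function eigenByQrIteration(m: Mat, steps = 5): {values: number[], vectors: Mat} {
  let z: Mat | undefined;
  for (let i = 0; i < steps; i++) {
    const [q, r] = qr(m);
    m = matMul(r, q);
    z = z ? matMul(z, q) : q;
  }
  return {values: m.map((row, i) => row[i]), vectors: z!};
}
```

For a symmetric positive definite input like [[2, 1], [1, 2]] (eigenvalues 3 and 1), the off-diagonal entries shrink by roughly the eigenvalue ratio per step, so five steps already get close; for harder spectra a fixed count of 5 is not enough, which is the convergence concern raised below.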


FEATURE: Singular-value decomposition (SVD) tensorflow/tfjs#110

For repository owners only:

Please remember to apply all applicable tags to your pull request.
Tags: FEATURE, BREAKING, BUG, PERF, DEV, DOC, SECURITY

For more info see: https://github.com/tensorflow/tfjs/blob/master/DEVELOPMENT.md



Commits in this PR:

  • method of Jordan-Gauss
  • triangularisation of the matrix
  • check if all diagonal values are not null
  • get the matrix solution of the equation
  • remove comments and console.log
  • Using Jordan-Gauss elimination to solve linear equation AX = B
  • Compute the determinant of a matrix
  • Compute the adjoint of a matrix
  • FEATURE solve linear equation
  • using dot instead of mul
  • adding unit test
  • errors in algorithms
  • as u*s*v is not close to m
  • solve the equation us * x = m to compute v
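Several of the commits above mention Gauss-Jordan elimination for AX = B. A rough, self-contained sketch of that technique (plain arrays, partial pivoting; this is not the PR's code and the names are mine):

```typescript
// Gauss-Jordan solver for A x = b with a square, nonsingular A.
function gaussJordanSolve(a: number[][], b: number[]): number[] {
  const n = a.length;
  // Build the augmented matrix [A | b].
  const m = a.map((row, i) => [...row, b[i]]);
  for (let col = 0; col < n; col++) {
    // Partial pivoting: swap in the row with the largest entry in this column.
    let piv = col;
    for (let r = col + 1; r < n; r++) {
      if (Math.abs(m[r][col]) > Math.abs(m[piv][col])) piv = r;
    }
    [m[col], m[piv]] = [m[piv], m[col]];
    if (m[col][col] === 0) throw new Error('singular matrix');
    // Normalize the pivot row, then eliminate the column everywhere else.
    const p = m[col][col];
    for (let j = col; j <= n; j++) m[col][j] /= p;
    for (let r = 0; r < n; r++) {
      if (r === col) continue;
      const f = m[r][col];
      for (let j = col; j <= n; j++) m[r][j] -= f * m[col][j];
    }
  }
  // The last column of the reduced matrix is the solution.
  return m.map(row => row[n]);
}
```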
@dsmilkov dsmilkov requested a review from caisq August 29, 2018 22:56
@DirkToewe (Contributor) commented Oct 26, 2018

Hi @kedevked,

It's awesome that You've started implementing some of the most crucial linear algebra operations. I'm afraid, however, that the implementation of the QR Algorithm iteration has to become a lot more complicated before it is fast enough and converges reliably. Some observations:

  • The Schur Decomposition of a real-valued matrix is usually not real-valued. There is a more lenient version of the Schur Decomposition: the Real Schur Decomposition, which only guarantees a block-triangular matrix and is real-valued for every real-valued input.

  • The unshifted QR Algorithm iteration likely never converges to the Real Schur Decomposition.

  • The QR Decomposition takes O(n³), so the QR Algorithm iteration - as implemented - will take O(n⁴). This can be improved upon by performing a Hessenberg Decomposition first, as a Hessenberg matrix can be QR-decomposed in O(n²).

  • It is usually recommended to balance the matrix before the Schur Decomposition. Aside from a few exceptions, this is supposed to improve the numeric stability.

There is only one viable Schur Decomposition algorithm for the non-symmetric eigenvalue problem that I could find: the doubly-shifted Francis QR Algorithm. There are two JavaScript implementations of the Francis Algorithm that I know of: Numeric.js and NDJS. The latter contains some literature references that I found very helpful while implementing it. Maybe they will help You as well.

Alternatively, I can offer to try and incorporate my implementation from NDJS into TFJS.

@kedevked (Contributor, Author) commented Oct 26, 2018

@DirkToewe, tfjs already supports QR decomposition using Householder transformations. I used that QR decomposition and implemented the algorithm described here: http://www.math.tamu.edu/~dallen/linear_algebra/chpt6.pdf. I don't really think that on top of that we need the Schur decomposition before computing the SVD.
Maybe @caisq could give some pointers on the direction to go.

@DirkToewe (Contributor)

So You're exporting eingen only for testing purposes? Got it! Point 3 from my bullet points still applies, except that You can use a bidiagonal decomposition instead of Hessenberg.

@kedevked (Contributor, Author)

The PR is about creating a new operator tf.svd that will return the SVD of a given matrix. Therefore, I am not exporting eigenvalues and vectors only for testing.
The third point would hold if I were implementing the QR decomposition. That's not the case.

@JasonShin

@kedevked hey, thanks for the SVD implementation! I see there is a TSLint error, and I made a PR to your fork that fixes it. I hope to use this API soon in my project!

@kedevked (Contributor, Author) commented Dec 1, 2018

That's great, @JasonShin. Hopefully @caisq can review it or give us a glimpse of how to proceed to make it part of the API.

@DirkToewe (Contributor)

Sorry to bring this up again but I am still wondering about a few points:

  1. I am not sure that the QR iteration - even for symmetric positive semidefinite matrices - is guaranteed to converge to a diagonal matrix without a shift.
  2. Currently the QR iteration in eingen_() uses a fixed number of 5 steps. The Francis QR Algorithm on average takes O(n) steps (the literature says ~2n; for random matrices it was up to 3n in my experiments). That should also be the order of steps necessary for symmetric positive semidefinite matrices. Without a shift I expect this number to be even larger.
  3. For checking convergence, You could check the off-diagonal norm every n steps.
  4. Calling dispose() manually makes backpropagation over the SVD impossible. Maybe there is a way to check whether or not backpropagation is desired?
  5. (Minor point) Computing the SV by computing the EV of AᵀA or AAᵀ computes the squares of the SV, which creates a minor risk of underflow (Math.fround(1e-23**2) == 0). SVD is often used to determine the rank or to solve rank-deficient linear least-squares problems. In both cases underflow matters.
  6. (Moot point) The QR Decomposition is already not the fastest in TFJS. Building a ~2n-step operation on top of it would mean ~5 minutes for a [100,100] matrix and ~1.2 hours for a [200,200] matrix according to my estimates.
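The convergence check suggested in point 3 could look something like the following: measure the Frobenius norm of the off-diagonal part and stop once it falls below a tolerance. A plain-array sketch, with names of my own choosing:

```typescript
// Frobenius norm of the off-diagonal entries of a matrix.
// When this is (near) zero, the QR iteration has converged
// and the diagonal holds the eigenvalues.
function offDiagonalNorm(m: number[][]): number {
  let s = 0;
  for (let i = 0; i < m.length; i++) {
    for (let j = 0; j < m[i].length; j++) {
      if (i !== j) s += m[i][j] * m[i][j];
    }
  }
  return Math.sqrt(s);
}
```

Evaluating this every n iterations (rather than every iteration) keeps the overhead of the check small relative to the O(n³) QR step.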

@JasonShin commented Dec 3, 2018

> The PR is about creating a new operator tf.svd that will return the SVD of a given matrix. Therefore, I am not exporting eigenvalues and vectors only for testing.

👍

@JasonShin

Python TensorFlow implements SVD as well, so why not have it in tfjs-core too?

@DirkToewe (Contributor) commented Dec 3, 2018

QR decomposition, QR iteration, and the (Francis implicitly shifted) QR method are three distinct algorithms in my understanding (see: Linear Algebra Handbook, chap. 42, "Symmetric Matrix Eigenvalue Techniques"). If I understand @kedevked's code correctly, the QR iteration is used to solve the symmetric positive semidefinite eigenvalue problem for AAᵀ, which is then used to solve the SV problem of A. Only one of my points was addressed at the QR decomposition (the one about performance). The other points are all directed at the implementation of svd_() and eingen_() in the PR.

Here are the lines of code that make me believe QR iteration is used:

```ts
// ...
function svd_(m: Tensor2D): {u: Tensor, s: Tensor, v: Tensor} {
  const mT = m.dot(transpose(m));
  const u = eingen(mT).vectors;
// ...

// ...
function eingen_(m: Tensor): {values: Tensor1D, vectors: Tensor} {
  let z;
  for (let i = 0; i < 5; i++) {
    const [x, y] = linalg.qr(m);
    m = y.dot(x);
    z = z ? z.dot(x) : x;
    y.dispose();
  }
// ...
```

eingen_ looks like QR iteration without a shift.

A few more points that came to my attention looking at the PR for the fourth time:

  • eingen(mT) is called twice in svd_().
  • In an SVD, U and V are both orthonormal, so solve_() is not necessary, since U⁻¹ = Uᵀ.
  • m.dot(transpose(m)) could be replaced by tf.matMul(m, m, false, true).
  • (Minor point) Sometimes AᵀA can be of significantly smaller size than AAᵀ.
  • (Minor point) Rank-deficient matrices are not supported by the current implementation.
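The second bullet in code form: if A = U Σ Vᵀ with orthonormal U, then V = Aᵀ U Σ⁻¹, so no linear solve is needed; scaling column j of Aᵀ U by 1/σⱼ suffices. A plain-array sketch under those assumptions (helper names are mine, not the PR's):

```typescript
type Mat = number[][];

const transpose = (m: Mat): Mat => m[0].map((_, j) => m.map(row => row[j]));
const matMul = (a: Mat, b: Mat): Mat =>
  a.map(row => b[0].map((_, j) => row.reduce((s, v, k) => s + v * b[k][j], 0)));

// Given A = U Σ Vᵀ with orthonormal U and nonzero singular values sigma,
// recover V = Aᵀ U Σ⁻¹ instead of solving (U Σ) X = A.
function vFromUSigma(a: Mat, u: Mat, sigma: number[]): Mat {
  const atU = matMul(transpose(a), u);                     // Aᵀ U
  return atU.map(row => row.map((x, j) => x / sigma[j]));  // scale column j by 1/σⱼ
}
```

Note this assumes all σⱼ are nonzero, which ties into the rank-deficiency bullet above: for σⱼ = 0 the corresponding column of V must be chosen differently.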

As always, I may very well be mistaken in all of this.

P.S.: To my knowledge, nobody is opposed to having SVD in TFJS, certainly not me. I was hoping that my first comment made that clear.

@kedevked (Contributor, Author) commented Dec 3, 2018

@DirkToewe, I had a look at the article you provided about shifting, but it does not give a way to choose μ. Are you considering implementing the shifting? I could continue the implementation if you can suggest an article with a concrete way to choose μ or to implement the shift. Thanks!

@DirkToewe (Contributor)

This is a rabbit hole that will probably lead You to the Golub-Reinsch algorithm, or to the Francis QR algorithm for nonsymmetric eigenvalue problems.

The best shift would be an actual eigenvalue, because then we would have convergence in exactly n steps. Of course, we don't know the eigenvalues; if we knew them, we wouldn't need the QR iteration in the first place. So instead of the actual eigenvalue, a good estimate of it is used. To get a good eigenvalue estimate, the matrix should be as close to triangular as possible (or diagonal in the symmetric case). The closest You can get to triangular with deterministic algorithms is the Hessenberg decomposition, which is usually done before the QR iteration. Given the Hessenberg matrix, the eigenvalues are usually estimated from the lowest, rightmost 2×2 block.

Here's the corresponding quote from Golub and Reinsch:

> The shift parameter s is determined by an eigenvalue of the lower 2×2 minor of M. Wilkinson [t3] has shown that for this choice of s, the method converges globally and almost always cubically.
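That shift choice can be sketched concretely: for a symmetric trailing 2×2 block [[a, b], [b, c]], Wilkinson's shift is the eigenvalue of the block closest to c, computed in a cancellation-safe form. A hypothetical helper, not code from the PR:

```typescript
// Wilkinson's shift: the eigenvalue of the trailing 2x2 block
// [[a, b], [b, c]] that is closest to c.
// Uses mu = c - sign(d) * b^2 / (|d| + sqrt(d^2 + b^2)), d = (a - c) / 2,
// which avoids subtracting nearly equal quantities.
function wilkinsonShift(a: number, b: number, c: number): number {
  const d = (a - c) / 2;
  if (d === 0 && b === 0) return c;        // block is already c·I
  const sign = d >= 0 ? 1 : -1;
  return c - sign * b * b / (Math.abs(d) + Math.hypot(d, b));
}
```

For example, the block [[4, 1], [1, 2]] has eigenvalues 3 ± √2; the shift picks 3 − √2 ≈ 1.586, the one nearer to c = 2.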

@kedevked (Contributor, Author) commented Dec 4, 2018

I will take the time to go through the suggested algorithms so as to come up with a sound implementation. Thanks!

@DirkToewe (Contributor)

For symmetric eigenvalue problems (like the SVD) there is also another, simpler algorithm: the (classic) Jacobi eigenvalue algorithm. It also takes O(n⁴) operations, and in my opinion it is much simpler to understand and implement than Francis QR or Golub-Reinsch. There are also O(n³) implementations of the Jacobi SVD, but they are really complicated.
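A minimal sketch of that classic Jacobi iteration for a symmetric matrix: each step applies one Givens rotation chosen to zero out the largest off-diagonal entry, until the matrix is (numerically) diagonal. Plain-array illustration only; the names are mine:

```typescript
type Mat = number[][];

// Classic Jacobi eigenvalue algorithm for a symmetric matrix:
// one rotation per step on the largest off-diagonal entry.
// Returns the eigenvalues in descending order.
function jacobiEigenvalues(a: Mat, tol = 1e-12, maxRotations = 100): number[] {
  const n = a.length;
  const m = a.map(row => [...row]);  // work on a copy
  for (let iter = 0; iter < maxRotations; iter++) {
    // Find the largest off-diagonal entry m[p][q].
    let p = 0, q = 1, best = 0;
    for (let i = 0; i < n; i++) {
      for (let j = i + 1; j < n; j++) {
        if (Math.abs(m[i][j]) > best) { best = Math.abs(m[i][j]); p = i; q = j; }
      }
    }
    if (best < tol) break;
    // Rotation angle that annihilates m[p][q]: tan(2θ) = 2m_pq / (m_pp - m_qq).
    const theta = 0.5 * Math.atan2(2 * m[p][q], m[p][p] - m[q][q]);
    const c = Math.cos(theta), s = Math.sin(theta);
    // Apply M ← Jᵀ M J on columns p, q ...
    for (let k = 0; k < n; k++) {
      const mkp = m[k][p], mkq = m[k][q];
      m[k][p] = c * mkp + s * mkq;
      m[k][q] = -s * mkp + c * mkq;
    }
    // ... and then on rows p, q.
    for (let k = 0; k < n; k++) {
      const mpk = m[p][k], mqk = m[q][k];
      m[p][k] = c * mpk + s * mqk;
      m[q][k] = -s * mpk + c * mqk;
    }
  }
  return m.map((row, i) => row[i]).sort((x, y) => y - x);
}
```

Accumulating the rotations (not shown) would yield the eigenvectors as well; applied to AᵀA, the square roots of these eigenvalues are the singular values of A.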
