This repository has been archived by the owner on Aug 15, 2019. It is now read-only.

Cholesky, LU, QR, triangularSolve, setDiag, diagPart, broadcastTo, conj #1356

Open

wants to merge 3 commits into base: master
Conversation

DirkToewe
Contributor

@DirkToewe DirkToewe commented Oct 27, 2018

matMul now supports broadcasting

Description

Linear algebra operations seem to be widely requested and help was wanted, so I thought I'd try and help:

  • qr() now has a "low-level" CPU implementation. At least on my machine, this massively improves performance and memory efficiency.
  • lu() and luSolve() allow solving linear equation systems (LES) via LU Decomposition.
  • cholesky() and choleskySolve() allow solving symmetric positive definite LES via Cholesky Decomposition.
  • triangularSolve allows solving triangular LES, or LES via QR Decomposition.
  • adjoint and conj are now implemented.
  • setDiag, diagPart and bandPart were implemented for symbolic backpropagation.
  • broadcastTo is now implemented.
  • matMul now supports (matrix) broadcasting.
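For readers unfamiliar with matrix broadcasting: the batch-dimension rule is presumably the NumPy one (an assumption on my part, not a statement about this PR's exact code): shapes are aligned right-to-left, and a dimension of size 1 stretches to match. A standalone sketch of that rule:

```javascript
// Sketch of NumPy-style shape broadcasting (an illustration, not the PR's
// code): align shapes right-to-left; a size-1 dimension stretches to match.
function broadcastShape(a, b) {
  const out = [];
  for (let i = 1; i <= Math.max(a.length, b.length); i++) {
    const x = a[a.length - i] ?? 1; // missing leading dims count as size 1
    const y = b[b.length - i] ?? 1;
    if (x !== y && x !== 1 && y !== 1) {
      throw new Error(`Shapes [${a}] and [${b}] are not broadcast-compatible`);
    }
    out.unshift(Math.max(x, y));
  }
  return out;
}

// A broadcast matMul of batch shapes [2, 1] and [3] pairs every matrix of
// the first operand with every matrix of the second:
console.log(broadcastShape([2, 1], [3])); // → [ 2, 3 ]
```

Under this rule, multiplying tensors of shape [2, 1, 4, 5] and [3, 5, 6] would yield shape [2, 3, 4, 6].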

P.S.: A proof of concept WebGL implementation of the Cholesky Decomposition can be found here. I will require some guidance if I am supposed to work it into TFJS, as a vertex shader and some non-Tensor uniform parameters are required. QR and LU Decomposition as well as triangularSolve should not be too difficult to implement either.
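For context on what the linked proof of concept computes, here is a minimal, unoptimized CPU sketch of the Cholesky factorization (my own illustration, not the PR's code and not the WebGL implementation):

```javascript
// Sketch only: Cholesky factorization A = L·Lᵀ of a symmetric positive
// definite matrix, given as an array of rows. Returns the lower-triangular L.
function cholesky(A) {
  const n = A.length;
  const L = Array.from({length: n}, () => new Float64Array(n));
  for (let i = 0; i < n; i++) {
    for (let j = 0; j <= i; j++) {
      let sum = A[i][j];
      for (let k = 0; k < j; k++) sum -= L[i][k] * L[j][k];
      if (i === j) {
        if (sum <= 0) throw new Error('Matrix is not positive definite');
        L[i][j] = Math.sqrt(sum); // diagonal entry
      } else {
        L[i][j] = sum / L[j][j];  // below-diagonal entry
      }
    }
  }
  return L;
}
```

A choleskySolve then reduces to a forward substitution with L followed by a back substitution with Lᵀ.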

P.P.S.: This is my first time with TypeScript and about my third time with JS, so I apologize if the code is not up to standard. The linting requirements aren't even remotely met yet. I will fix that if/once the commit is otherwise accepted.


For repository owners only:

Please remember to apply all applicable tags to your pull request.
Tags: FEATURE, BREAKING, BUG, PERF, DEV, DOC, SECURITY

For more info see: https://github.com/tensorflow/tfjs/blob/master/DEVELOPMENT.md



@dsmilkov
Contributor

dsmilkov left a comment

Hi! Thanks for the contribution. It will be hard to review this PR due to its size, scope, code style etc. Let's break it into lots of smaller PRs. Let's start with QR PR. Is the existing implementation much slower than the one proposed in this PR? Can you run a benchmark with different-sized matrices and compare the two QR implementations side-by-side?

Reviewable status: 0 of 1 approvals obtained

@dsmilkov
Contributor

dsmilkov commented Oct 29, 2018

Also let's do the PRs one by one - this will be a learning opportunity for the code style, how ops work in TF.js etc. And it will impact how the future PRs look.

@DirkToewe
Contributor Author

DirkToewe commented Oct 29, 2018

I'd hardly call it a proper benchmark, but You can find a quick performance comparison here, and here's the test code. The overhead of the TFJS API and the memory allocations is quite significant...

If it's easier for You, I'm going to carve out all but qr for now.

EDIT: Forgot to tf.tidy. Gonna fix that asap. done!

@dsmilkov
Copy link
Contributor

Yeah, that's quite a difference in performance! Let's do the qr first. Thanks!!

@ds604

ds604 commented Jan 28, 2019

Hello,
This thread came up while I was looking for information on linear algebra operations in tfjs (verifying operations against NumPy and Octave, and checking the status of different linear algebra functionality). I made a little polyfill using this port of JAMA to JavaScript (https://github.com/ghewgill/jsmat) to make it a little easier to go back and forth between tensors and JAMA matrices, and to make sure that things are working as they should. Maybe someone will find this helpful, for verification or other purposes.

<script src="https://cdn.jsdelivr.net/gh/ghewgill/jsmat@master/matrix.js"></script>
<script src="https://cdn.jsdelivr.net/npm/@tensorflow/tfjs@0.14.1"></script>
<script>
    ///////////////////////////////////////////////////
    // Tensor <-> Jama matrix

    // get data out of tensor, but keep shape information
    tf.Tensor.prototype.getData = function(){
        let data = this.dataSync()
        let [rows, columns] = this.shape

        // create matrix with proper shape
        let M = Array(rows).fill(0)
        return M.map((row,i) => data.slice(i*columns, (i+1)*columns))
    }

    Array.prototype.toJama = function(){
        return new Matrix(this)
    }

    Array.prototype.toTensor = function(){
        return tf.tensor(this)
    }

    ///////////////////////////////////////////////////
    // Jama operations that are not (yet) in tensorflow
    
    // inverse
    let A = tf.tensor([
        [1,2,3],
        [4,5,6],
        [7,8,10]
    ])
    .getData()
    .toJama()
    .inverse().A
    .toTensor()
    .print()

    // LU
    let B = tf.tensor([
        [1,2],
        [3,4]
    ])
    .getData()
    .toJama()
    .lu().LU
    .toTensor()
    .print()

    // SVD
    let C = tf.tensor([
        [1,2],
        [3,4]
    ])
    .getData()
    .toJama()
    .svd()

    tf.tensor(C.U).print()
    tf.tensor(C.s).print()
    tf.tensor(C.V).print()
</script>

@DirkToewe
Contributor Author

That's pretty cool @ds604. I wish I had known about that Jama port before.

Maybe that comment might be better placed in the tfjs/issues? Here in the pull request it might be missed by people who are - like You - looking for a working solution right now.

When it comes to using another linalg implementation for verification purposes, I'm not sure how to go about that. QR, LU, Schur, SVD and other decompositions are not unique. Two different solutions might be both correct. So comparing solutions might be hard. Maybe I am missing something?

My current approach for testing is to check the very definitions of the decompositions (Q @ R == A, L @ U == A @ P, ...) as well as the matrix properties (orthogonality Q @ Qᵀ == I, triangularity, ...).
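That property-based approach can be made concrete in a few lines of plain JS (an illustration of the testing idea, not the PR's test code). Here a known orthogonal Q (a rotation) and an upper-triangular R are multiplied to produce A, and then the checks a QR test suite would run against any implementation are applied:

```javascript
// Tiny dense helpers for 2-D arrays-of-rows (illustration only).
function matMul(A, B) {
  return A.map(row => B[0].map((_, j) =>
    row.reduce((s, a, k) => s + a * B[k][j], 0)));
}
const transpose = A => A[0].map((_, j) => A.map(row => row[j]));
const maxAbsDiff = (A, B) =>
  Math.max(...A.flatMap((row, i) => row.map((v, j) => Math.abs(v - B[i][j]))));

// Build A from a known orthogonal Q and upper-triangular R...
const c = Math.cos(0.3), s = Math.sin(0.3);
const Q = [[c, -s], [s, c]];
const R = [[2, 1], [0, 3]];
const A = matMul(Q, R);

// ...then run the definition-based checks described above:
const I = [[1, 0], [0, 1]];
const qrReconstructs   = maxAbsDiff(matMul(Q, R), A) < 1e-12;            // Q·R ≈ A
const qIsOrthogonal    = maxAbsDiff(matMul(transpose(Q), Q), I) < 1e-12; // QᵀQ ≈ I
const rIsTriangular    = Math.abs(R[1][0]) < 1e-12;
```

The point is that these checks pass for any valid QR of A, so they sidestep the non-uniqueness problem entirely.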

@ds604

ds604 commented Jan 29, 2019

Wow, nice project! I'll have to dive in, but it looks like you've closed the gap to a full JavaScript NumPy, with a native JavaScript feel. That's what I was looking for.

What I was speaking of as far as verification is just achieving results that are as similar as possible to the outcomes obtained in another environment. I'm not so much on the implementation end of things, so my perspective is more that of a user, taking into account what might matter from a "design" perspective: it's true that, with respect to correctness of the operations, the choices in the implementation shouldn't matter. But the reality is that they do wind up being outcomes that people (unwittingly or not, correctly or not) come to rely on. One abstraction layer's "stylistic" choices are another layer's structural properties.

I guess from the implementation point of view, barring a hard correctness constraint, in the presence of choice (non-uniqueness of solutions) the safe option is to defer to the "most popular" choice, or do what was done in the best-known prior reference version. I'm sure there are many challenges once the GPU comes into play, but again, speaking from the user's perspective: a significant portion of people bringing their functionality over to the web, upon seeing a difference in outcomes from what they had before, are more likely to conclude that something is "wrong" than to really understand that some differences don't actually matter, or to know what action to take if some tests start failing. Over time, I guess some of those choices in the implementations have actually come to matter in usage, or become things that people have grown familiar with and use like "waypoints."

So, I guess just consider the quirks of Matlab or Numpy guiding the implementation to be similar to the width of the horse's rear end determining the width of the solid rocket boosters. The implementation won't be any more or less correct, but will just fit a little better with what came before.

@moob123

moob123 commented Mar 7, 2019

Hello,
I was trying to follow threads like this to figure out what is likely to happen with more general linalg operation support in TensorFlow.js. I don't even know whom to address this question to, but if anyone here has some time, please write a few lines about it.
Thank you!
