Speed and scalability improvements for graph multiresolution #3

EiffL · 2017-06-04T19:14:05Z

The motivation for these modification is to improve the scalability of the code to be able to handle much larger graphs.

With the baseline code, I couldn't compute graph pyramids for graphs with about 40,000 nodes, the peak memory consumption went above 500GB ! Turns out this was due to 2 main problems:

In the kron_reduction function, the computation of the Schur complement was actually turning back the sparse input matrices into dense matrices. Plus, the computation of the Schur complement can be sped up by using a linear equation solver optimized to handle SDD matrices (symmetric diagonally dominant), which is the case of graph laplacians. There are even more efficient algorithms out there (starting with Cholesky decomposition) such as Koutis et al. (2011) arXiv:1102.4842 but I didn't find an out of the box python implementation and SuperLU seemed to do the job for me. I ended up rewriting bits of kron_reduction to ensure that the sparse matrices were not turned into dense matrices when not necessary, and I added pygsp.utils.splu_inv_dot as an optimized alternative to the use of scipy.sparse.spsolve .
In the graph_sparsify function, the computation of the resistance distances was done using blunt matrix inversion in pygsp.utils.resistance_distance which tries to invert it a potentially very large Laplacian matrix. After a kron reduction, this laplacian can have a lot of non zero elements, that's what was causing the worse of the memory consumption in my case. However, it turns out that the whole point of the Spielman-Srivastava spectral sparsification algorithm is to avoid having to compute this inverse, the algorithm only requires approximate distances and provides a procedure to compute them. I implemented pygsp.utils.approx_resistance_distance to compute these distances based on the algorithm described in arxiv:0803.0929 . It still requires a way of solving a linear inverse system, where I used again my customized pygsp.utils.splu_inv_dot but it should be noted that there are much faster ways of doing this. Including the near linear time algorithm described in arXiv:1209.5821v3

With these modifications, I can now compute graph multiresolutions in practice for graphs with about 50,000 nodes and 1e6 edges without running into memory issues on a standard desktop. The slowest point in the process is the sampling time for the edges in graph_sparsify, scipy.stats.rv_discrete takes about 1s to sample 10,000 deviates, in my case the algorithm needed 16e6 which takes about 30min just to sample random numbers whereas the rest of the algorithm only takes minutes, so it's really stupid, but at least it eventually works.

I tried to comment and document these modifications in the hope that they might be useful to others, everything seems to work for me but someone should take a close look to check that it doesn't break anything.

rodrigo-pena · 2017-06-15T13:05:50Z

Hi, thanks for the contribution. Could you check and correct the errors that were raised in the TravisCI checks? They seem to be mostly errors in the code examples in the documentation, raising errors such as "NameError: name 'sparse' is not defined", "NameError: name 'extract_submatrix' is not defined", or "NameError: name 'block' is not defined".

rodrigo-pena · 2017-06-15T20:11:25Z

There's still a silly error in the doctest (it's one that I often forget is a problem):

pygsp/pygsp/utils.py", line 261, in utils.py
Expected:
(8, 8)
"""
M = M.tocoo()
Got:
(8, 8)

There should be a newline after (8,8) in the documentation

EiffL · 2017-06-15T21:08:57Z

My bad, it should be fine now.

rodrigo-pena

Hi, could you check my comments on your pull request?

rodrigo-pena · 2017-06-16T08:11:29Z

pygsp/graphs/graph.py

+        if self.directed or not self.connected:
+            raise NotImplementedError('Focusing on connected non directed graphs first.')
+
+        start_nodes, end_nodes, weights = sparse.find(sparse.tril(self.W))


Do you know if this is faster/slower/same as what's done in adj2vec(), lines 23-24 in pygsp/data_handling.py? Indeed, I rather prefer your design, than the use of adj2vec we're making now for computing graph gradients. I might make the necessary adaptions to replace it by this call of your create_incidence_matrix soon.

rodrigo-pena · 2017-06-16T08:14:32Z

pygsp/operators/reduction.py

@@ -12,17 +12,26 @@
 logger = build_logger(__name__)


-def graph_sparsify(M, epsilon, maxiter=10):
+def graph_sparsify(M, epsilon, maxiter=10, fast=True):


I would set fast=False by default, so that the previous default behavior of this function remains unchanged. What do you think?

rodrigo-pena · 2017-06-16T08:18:02Z

pygsp/operators/reduction.py

@@ -295,12 +300,22 @@ def kron_reduction(G, ind):
        Graph structure or weight matrix
    ind : list
        indices of the nodes to keep
+    threshold: float
+        Threshold applied to the reduced Laplacian matrix to remove numerical
+        noise. (default: marchine precision)


typo: machine

rodrigo-pena · 2017-06-16T08:23:16Z

pygsp/operators/reduction.py

+    Lnew.eliminate_zeros()
+
+    # Enforces symmetric Laplacian
+    Lnew = (Lnew + Lnew.T) / 2.


Do we always want the Laplacian to be symmetric here? In the previous implementation, this line was called only under the conditional statement.

mdeff · 2017-08-10T13:42:25Z

Thanks for the contribution. :) Could you look at @rodrigo-pena's comments so that we can merge it ?

coveralls · 2018-04-20T01:21:59Z

Coverage increased (+1.7%) to 81.844% when pulling bf2183e on EiffL:faster into ef8823a on epfl-lts2:master.

EiffL added 3 commits June 3, 2017 23:21

Added approximate resistance distance and faster kron reduction

4502ee1

Added documentation

df029a2

Added to documentation

5709213

Fixed example for doctest

3234d66

Fixed example for doctest

df13afe

rodrigo-pena reviewed Jun 16, 2017

View reviewed changes

mdeff force-pushed the master branch 3 times, most recently from 8321f45 to 4af195d Compare August 21, 2017 07:16

mdeff force-pushed the master branch from 53caa9d to 659d309 Compare August 22, 2017 14:05

mdeff force-pushed the master branch from d018c57 to 6e6c820 Compare September 1, 2017 13:34

EiffL added 4 commits April 19, 2018 18:12

Fixes bug preventing multiresolution

548c357

fix for mr

9ecacf0

Merge branch 'master' into faster

c1d75ba

Fixes update issues

bf2183e

mdeff force-pushed the master branch from 8e559ee to 361f67e Compare June 18, 2018 11:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed and scalability improvements for graph multiresolution #3

Speed and scalability improvements for graph multiresolution #3

EiffL commented Jun 4, 2017

rodrigo-pena commented Jun 15, 2017

rodrigo-pena commented Jun 15, 2017 •

edited

EiffL commented Jun 15, 2017

rodrigo-pena left a comment

rodrigo-pena Jun 16, 2017

rodrigo-pena Jun 16, 2017

rodrigo-pena Jun 16, 2017

rodrigo-pena Jun 16, 2017

mdeff commented Aug 10, 2017

coveralls commented Apr 20, 2018

Speed and scalability improvements for graph multiresolution #3

Are you sure you want to change the base?

Speed and scalability improvements for graph multiresolution #3

Conversation

EiffL commented Jun 4, 2017

rodrigo-pena commented Jun 15, 2017

rodrigo-pena commented Jun 15, 2017 • edited

EiffL commented Jun 15, 2017

rodrigo-pena left a comment

Choose a reason for hiding this comment

rodrigo-pena Jun 16, 2017

Choose a reason for hiding this comment

rodrigo-pena Jun 16, 2017

Choose a reason for hiding this comment

rodrigo-pena Jun 16, 2017

Choose a reason for hiding this comment

rodrigo-pena Jun 16, 2017

Choose a reason for hiding this comment

mdeff commented Aug 10, 2017

coveralls commented Apr 20, 2018

rodrigo-pena commented Jun 15, 2017 •

edited