CoDeBetHe.jl

Community detection with the Bethe Hessian

This package is an efficient Julia implementation of the algorithms for spectral community detection introduced in:

* Dall'Amico, Couillet and Tremblay - Revisiting the Bethe-Hessian: improved community detection in sparse heterogeneous graphs (NeurIPS 2019)
* Dall'Amico, Couillet and Tremblay - A unified framework for spectral clustering in sparse graphs (JMLR 2021)
* Dall'Amico, Couillet and Tremblay - Community detection in sparse time-evolving graphs with a dynamical Bethe-Hessian (NeurIPS 2020)

If you use CoDeBetHe, please consider citing the references above.

@inproceedings{dall2019revisiting,
  title={Revisiting the Bethe-Hessian: improved community detection in sparse heterogeneous graphs},
  author={Dall'Amico, Lorenzo and Couillet, Romain and Tremblay, Nicolas},
  booktitle={Advances in Neural Information Processing Systems},
  pages={4039--4049},
  year={2019}}

@article{JMLR:v22:20-261,
  author  = {Lorenzo Dall'Amico and Romain Couillet and Nicolas Tremblay},
  title   = {A Unified Framework for Spectral Clustering in Sparse Graphs},
  journal = {Journal of Machine Learning Research},
  year    = {2021},
  volume  = {22},
  number  = {217},
  pages   = {1-56},
  url     = {http://jmlr.org/papers/v22/20-261.html}}

@article{dall2020community,
  title={Community detection in sparse time-evolving graphs with a dynamical Bethe-Hessian},
  author={Dall'Amico, Lorenzo and Couillet, Romain and Tremblay, Nicolas},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}}
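
For orientation, the papers above build on the Bethe-Hessian matrix, commonly written as H_r = (r² − 1)I + D − rA, where A is the adjacency matrix and D the diagonal degree matrix. The snippet below is a minimal plain-Python sketch of that formula on a dense list-of-lists matrix. It is not the package's implementation: the function name `bethe_hessian` is hypothetical, and how r is chosen (the contribution of the cited papers) is not reproduced here.

```python
def bethe_hessian(adj, r):
    """Return H_r = (r^2 - 1) I + D - r A as a dense list of lists.

    `adj` is assumed to be a symmetric 0/1 adjacency matrix (list of lists).
    Illustrative only: a real implementation would use sparse matrices.
    """
    n = len(adj)
    degrees = [sum(row) for row in adj]
    # Off-diagonal part: -r * A
    H = [[-r * adj[i][j] for j in range(n)] for i in range(n)]
    # Diagonal part: (r^2 - 1) + degree of node i
    for i in range(n):
        H[i][i] += r * r - 1 + degrees[i]
    return H


# Path graph on three nodes: 0 - 1 - 2
A = [[0, 1, 0],
     [1, 0, 1],
     [0, 1, 0]]
H = bethe_hessian(A, 2.0)
```

In the spectral methods described in the references, community information is read from the eigenvectors of matrices of this form; the dense representation here is only meant to make the formula concrete.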

Update

In the folder python we added two files that let you use our algorithm from Python as well. They rely on PyJulia and require Julia to be installed on your computer. Note that the first call to the function is particularly slow, because Julia compiles the code on first use; subsequent calls are faster. The main function is called as follows:

Use: ℓ, k, modularity, ζ = CD_BH(A)

Inputs: 
    * A (scipy sparse array): Adjacency matrix of the input graph

Optional inputs:
    * k (int): number of communities. If not specified (default), it will be estimated
    * verbose (int): sets the level of verbosity of the algorithm

Outputs:
    * ℓ (array): estimated label partition
    * k (int): number of communities
    * modularity (float): modularity of the partition
    * ζ (array): optimal zeta values used in the algorithm
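
To make the `modularity` output more concrete, here is a minimal sketch of the standard Newman modularity of a partition, computed from an edge list in plain Python. It assumes an undirected, unweighted graph without self-loops; the helper name `modularity_of` is hypothetical, and the package's own computation may differ in its conventions.

```python
from collections import defaultdict


def modularity_of(edges, labels):
    """Newman modularity Q = sum_c [ e_c / m - (D_c / (2 m))^2 ].

    edges  : list of (u, v) pairs, each undirected edge listed once
    labels : labels[u] is the community of node u
    e_c    : number of edges with both ends in community c
    D_c    : sum of the degrees of the nodes in community c
    """
    m = len(edges)
    internal = defaultdict(int)
    degree_sum = defaultdict(int)
    for u, v in edges:
        degree_sum[labels[u]] += 1
        degree_sum[labels[v]] += 1
        if labels[u] == labels[v]:
            internal[labels[u]] += 1
    return sum(internal[c] / m - (degree_sum[c] / (2 * m)) ** 2
               for c in degree_sum)


# Two triangles joined by a single edge (2, 3)
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
q_split = modularity_of(edges, [0, 0, 0, 1, 1, 1])   # one community per triangle
q_single = modularity_of(edges, [0, 0, 0, 0, 0, 0])  # everything in one community
```

Splitting the graph into its two triangles gives a positive modularity (5/14), while putting all nodes in a single community gives zero, which is the usual baseline.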

Beyond the implementation of the community reconstruction algorithms, the package contains functions to generate synthetic graphs according to the static and dynamic degree-corrected stochastic block models, as well as some real datasets.
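
As an illustration of what a static degree-corrected stochastic block model looks like, the sketch below samples an edge list in plain Python. It is not the package's generator: the function name `dcsbm_edges` and the parameters `theta`, `c_in` and `c_out` are assumptions made for this example, which connects nodes i and j with probability min(1, θ_i θ_j C / n), where C is `c_in` inside a community and `c_out` across communities. The dynamic model is not covered.

```python
import random


def dcsbm_edges(labels, theta, c_in, c_out, seed=0):
    """Sample an undirected edge list from a simple static DCSBM (illustrative).

    labels : labels[i] is the community of node i
    theta  : theta[i] is the degree-correction factor of node i
    c_in   : affinity between nodes of the same community
    c_out  : affinity between nodes of different communities
    """
    rng = random.Random(seed)
    n = len(labels)
    edges = []
    # O(n^2) loop over node pairs: fine for a small demo, too slow for large graphs
    for i in range(n):
        for j in range(i + 1, n):
            affinity = c_in if labels[i] == labels[j] else c_out
            p = min(1.0, theta[i] * theta[j] * affinity / n)
            if rng.random() < p:
                edges.append((i, j))
    return edges


n = 200
labels = [i % 2 for i in range(n)]   # two communities of equal size
theta = [1.0] * n                    # homogeneous degrees in this demo
edges = dcsbm_edges(labels, theta, c_in=12.0, c_out=2.0, seed=1)
```

Setting all entries of `theta` to 1 reduces the model to a plain stochastic block model; heterogeneous values of `theta` produce heterogeneous degrees.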

Content of the package

The directory src contains the file CoDeBetHe.jl with the source code.
The folder datasets contains dataset.zip, with some real datasets on which our algorithms can be run, and dataset_reference.txt, with the references for the corresponding datasets. Don't forget to unzip the archive before running the demos. If you use these datasets for research purposes, please consider citing the authors of the corresponding dataset.

Getting Started

These are the basic instructions to use CoDeBetHe on your computer.

Installing

You can install this toolbox either by typing `add https://github.com/lorenzodallamico/CoDeBetHe` in the pkg manager, or by cloning the repo locally and typing `add CoDeBetHe` in the pkg manager.
Don't forget to unzip the real-world graph data before running the experiments on real data.

Required packages

CoDeBetHe requires the following packages

Distributions, LinearAlgebra, DataFrames, StatsBase, IterativeSolvers, Clustering, SparseArrays, KrylovKit, LightGraphs, DelimitedFiles, ParallelKMeans

Usage

To get instructions on how to use the package CoDeBetHe, please refer to the documentation page. There you can find some scripts to easily use the main functions to generate synthetic static and dynamic graphs with communities, to load the datasets inside the datasets folder and run the community detection algorithms.

Authors

Lorenzo Dall'Amico, Nicolas Tremblay

License

This software is released under the GNU AFFERO GENERAL PUBLIC LICENSE (see included file LICENSE)
