Releases: gunrock/gunrock
Gunrock 2.0.0
gunrock/essentials is a continuation of our original work, gunrock/gunrock. To name some of the changes: essentials aims to add better support for graph and frontier data structures, new operators, new load-balancing algorithms, new graph algorithms, support for benchmarking kernels, a queue-based asynchronous programming model, ease of programmability, proper Doxygen and wiki documentation for the API and programming models, and more complete compilation testing through GitHub Actions. We also introduce experiments with a multi-GPU backend and gunrock/essentials-cpp, which takes Gunrock's programming model to modern-C++ parallelism.
The quality of the release documentation may vary as it is autogenerated based on the Pull Requests (PR) to the repository. If something is unclear, please refer to the corresponding PR.
What's Changed
🎉 Algorithms and Operators
- Merge API changes to master by @bkj in gunrock/essentials#10
- Reworking the graph_t and graph::build interface. by @neoblizz in gunrock/essentials#14
- Hotfix: update color implementation by @bkj in gunrock/essentials#16
- Add SSSP CPU implementation by @bkj in gunrock/essentials#18
- Implementing PageRank by @bkj in gunrock/essentials#40
- Documentations and more... by @neoblizz in gunrock/essentials#48
- BFS CPU reference implementation by @bkj in gunrock/essentials#54
- [APP] Betweenness Centrality (V3) by @bkj in gunrock/essentials#60
- Implementing COO graph functions by @bkj in gunrock/essentials#62
- Implementing CSC graph functions by @bkj in gunrock/essentials#64
- Local Graph Clustering & Batch Operator by @neoblizz in gunrock/essentials#66
- Improve sssp_cpu performance by @bkj in gunrock/essentials#72
- [APP] Parallel PR-Nibble by @bkj in gunrock/essentials#73
- Feature merge (dev to master) by @neoblizz in gunrock/essentials#75
- Vertex k-Core Decomposition App by @angeil in gunrock/essentials#77
- Asynchronous BFS Interface & Implementation by @bkj in gunrock/essentials#78
- HITS ported from gunrock/gunrock by @li-yi-dong in gunrock/essentials#83
- SpMV using Gunrock by @DanLoran in gunrock/essentials#86
- Improvements, Clean-up, Experimental Async Support by @neoblizz in gunrock/essentials#87
- Bring changes over, so I can improve kernels. by @neoblizz in gunrock/essentials#103
- Unit testing overhaul using Googletest, Stride, blocked standard kernels by @neoblizz in gunrock/essentials#100
- Command line parameters support for essentials using cxxopts. by @neoblizz in gunrock/essentials#107
- Merge all the progress to master branch. by @neoblizz in gunrock/essentials#109
- Advance Improvements, Frontier resizing, and more... by @neoblizz in gunrock/essentials#110
- Advance Improvements, Frontier resizing, and more... by @neoblizz in gunrock/essentials#118
- Merging changes to master (lots of enhancements, see commits for detail) by @neoblizz in gunrock/essentials#119
- Minimum Spanning Tree (MST) algorithm. by @annielytical in gunrock/essentials#127
- Add my changes from SPMV experiments by @jdwapman in gunrock/essentials#134
- Doxygen, CMake/Make improvements, MST algorithm, bug fixes and more! by @neoblizz in gunrock/essentials#135
- Add Triangle Counting algorithm by @maawad in gunrock/essentials#140
- Add TC benchmarking by @maawad in gunrock/essentials#141
- Add Triangle Counting CPU reference code by @maawad in gunrock/essentials#144
- Benchmarking, Triangle Counting, Intersection. by @neoblizz in gunrock/essentials#145
- Fixes for Graph Coloring and minor edits. by @neoblizz in gunrock/essentials#149
🐛 Bug Fixes
- [BUGFIX] bug in csr.get_source_vertex by @bkj in gunrock/essentials#53
- Bug-fixes, minor enhancements and clean-up with unit tests. by @neoblizz in gunrock/essentials#57
- [BUGFIX] Fix offsets_to_indices by @bkj in gunrock/essentials#61
- [BUGFIX] Order of values passed to csc.set by @bkj in gunrock/essentials#63
- Fix virtual_memory unittest by @porumbes in gunrock/essentials#85
- Use std::size_t in graph loader by @maawad in gunrock/essentials#132
- Fix CUDA namespace collision, file support by @jdwapman in gunrock/essentials#138
- Graph Coloring bugfix: Tiebreak condition by @neoblizz in gunrock/essentials#148
🏡 API Changes/Improvements
- Move default fetch dir to _cmake_fetch. by @li-yi-dong in gunrock/essentials#74
- Refactored & fixed. by @neoblizz in gunrock/essentials#81
- Windows support. by @neoblizz in gunrock/essentials#96
- Windows support and working windows workflow. by @neoblizz in gunrock/essentials#97
- Benchmarking support, cmake version++, and sample csr() by @neoblizz in gunrock/essentials#99
📝 Documentation Updates
- Bug fixes, README updates, documentation, and more... by @neoblizz in gunrock/essentials#108
- Updated documentation. by @neoblizz in gunrock/essentials#136
🧪 Testing
- Color CPU reference + correctness checking by @bkj in gunrock/essentials#67
- CPU Reference for coloring. by @neoblizz in gunrock/essentials#68
- Enabling Actions (GitHub) for dev branch. by @neoblizz in gunrock/essentials#94
- Add NVBench Algorithm Benchmarking by @annielytical in gunrock/essentials#139
New Contributors
- @bkj made their first contribution in gunrock/essentials#3
- @cameronshinn made their first contribution in gunrock/essentials#5
- @neoblizz made their first contribution in gunrock/essentials#14
- @porumbes made their first contribution in gunrock/essentials#22
- @li-yi-dong made their first contribution in gunrock/essentials#74
- @angeil made their first contribution in gunrock/essentials#77
- @DanLoran made their first contribution in gunrock/essentials#86
- @annielytical made their first contribution in gunrock/essentials#127
- @maawad made their first contribution in gunrock/essentials#132
- @jdwapman made their first contribution in gunrock/essentials#134
Full Changelog: https://github.com/gunrock/essentials/commits/0.0.1
Gunrock 1.2.0
Gunrock v1.2.0 is a minor feature and bug fix release. We have added several new example applications, fixed many outstanding memory issues, and eliminated some JSON output bugs. Along the way we solved many little issues and annoyances we faced. Thanks to everyone who has contributed, and a special thanks to any new contributors since version 1.1 (marked in bold).
Contributors
Achal Agarwal
Agnieszka Lupinska
Charles Rozhon
Hugo Braun
Jonathan Wapman
Leyuan Wang
Mahmood Naderan - #641
Muhammad Osama
Nima Johari
Serban Porumbescu
Yangzihao Wang
v1.2.0 Changelog
Apps
- K-nearest-neighbor application implemented by agalup
- HITS application implemented by jdwapman
- Label propagation application implemented by achalagarwal
Fixes
- SetDevice after spawning OpenMP threads (#757)
- Release with multiple command switches (#767)
- Specify --quick and --validation at the same time (#773)
- Subgraph not properly released in single-gpu mode (#746)
- Compile issues with certain SM configurations (d57b932)
- Compile issue linking with C compiler instead of C++ (804ea77)
- JSON writer improvements (8819f39)
- Update moderngpu to latest (7c14fa2)
- Modify device intrinsics to compile on all arches (6b05704)
Gunrock 1.1.0
Minor feature and clean-up release.
v1.1.0 Changelog
- Subgraph Matching (SM) fixes
- SM _app .cu and .cuh updates, following proper coding style
- GetSrcDestEdge; given v and u, get the E(v, u).
- Removing grmat app (uses older API)
- Adding grmat implementation to rmat generator (supports v0.5.1 functionality of grmat)
- Streamlining performance testing (#576) and other JSON schema fixes
- Fixed pagerank support if boost is not found
- Added binary search google test
Known Issues:
Gunrock 1.0.0
(documentation: https://gunrock.github.io/docs/)
Release v1.0 is a major API refactor release, along with some feature updates and new applications/primitives. Note that, going forward, v1.x is incompatible with v0.5.x Gunrock, and some applications have yet to be ported to the new API.
- New operator interfaces
- New graph representations
- New frontier structure
- New test driver
- Restructured enactor routines
- New parameter handling
- New operators
- Code restructuring
- Optional Boost dependency
v1.0 Changelog
Operators (i.e. advance, filter)
- Take in lambda functions for per-element operations, instead of static functions in a structure.
- <algo>_functor.cuh is merged into <algo>_enactor.cuh
- Use OprtrParameters structure to keep inputs, except for the graph, input / output frontiers, and the lambdas
- KernelPolicy is defined within each operator, instead of in the enactor
- Templatized options (Idempotence, mark-preds, advance types, reduce ops, reduce types, etc.) are provided as a combined 32-bit OprtrFlag
- Queue index and selector are automatically changed by the operator when needed
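As a rough illustration of the lambda-based operator style listed above, the following host-side C++ sketch applies a per-edge lambda to the neighbors of each frontier vertex in a CSR graph. It is not Gunrock's operator API: csr_t, advance, bfs_step, and their signatures are hypothetical names chosen for this example, and the loop runs sequentially on the CPU rather than in a CUDA kernel.

```cpp
#include <cassert>
#include <vector>

// Hypothetical minimal CSR graph; not Gunrock's graph type.
struct csr_t {
  std::vector<int> row_offsets;     // size: num_vertices + 1
  std::vector<int> column_indices;  // size: num_edges
};

// Hypothetical sequential "advance": visits every outgoing edge of every
// vertex in the input frontier and calls op(src, dst, edge). A neighbor is
// appended to the output frontier when op returns true.
template <typename Op>
std::vector<int> advance(const csr_t& g, const std::vector<int>& frontier,
                         Op op) {
  assert(!g.row_offsets.empty());
  std::vector<int> output;
  for (int src : frontier) {
    for (int e = g.row_offsets[src]; e < g.row_offsets[src + 1]; ++e) {
      int dst = g.column_indices[e];
      if (op(src, dst, e)) output.push_back(dst);
    }
  }
  return output;
}

// One BFS step written as a lambda instead of a static functor struct:
// a neighbor joins the next frontier only if it has not been visited yet.
std::vector<int> bfs_step(const csr_t& g, const std::vector<int>& frontier,
                          std::vector<int>& depth, int level) {
  return advance(g, frontier, [&](int /*src*/, int dst, int /*edge*/) {
    if (depth[dst] != -1) return false;  // already visited
    depth[dst] = level;
    return true;
  });
}
```

The per-element logic lives entirely in the lambda passed to advance, so a different primitive would only swap the lambda body and leave the traversal loop untouched.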
Graph representation
- A single structure encloses all graph related data
- Different representations (CSR, CSC, COO, etc.) can be selected based on algorithmic needs
- New graph representations could be added without changing other parts of Gunrock, except operator implementation that handles how to traverse such new representation
- CPU, GPU and sub-graphs use the same graph data structure, no more GraphSlice and GRGraph
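To make the difference between two of these representations concrete, the sketch below expands CSR row offsets into one explicit source index per edge, which together with the unchanged column indices forms a COO edge list. The function name offsets_to_row_indices is an assumption made for this example and is not taken from the Gunrock sources.

```cpp
#include <cassert>
#include <vector>

// Expands CSR row offsets into one source (row) index per edge.
// Illustrative helper only; the name is not part of Gunrock.
std::vector<int> offsets_to_row_indices(const std::vector<int>& row_offsets) {
  assert(!row_offsets.empty());
  std::vector<int> row_indices;
  row_indices.reserve(row_offsets.back());  // last offset == number of edges
  int num_vertices = static_cast<int>(row_offsets.size()) - 1;
  for (int v = 0; v < num_vertices; ++v) {
    // Vertex v owns the edge range [row_offsets[v], row_offsets[v + 1]).
    for (int e = row_offsets[v]; e < row_offsets[v + 1]; ++e) {
      row_indices.push_back(v);
    }
  }
  return row_indices;
}
```

CSR stores only num_vertices + 1 offsets, while COO stores a source per edge; which one is preferable depends on whether an algorithm iterates by vertex or by edge.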
Frontier
- A single structure gunrock/app/frontier.cuh:Frontier encloses all frontier related data
Test driver
- Allows multiple graph types (64bit-VertexT, 64bit-SizeT, 64bit-ValueT, directed vs. undirected) and multiple parameters combinations to run in a single execution
- Allows result validation for each run, instead of only the last run
- Result validation without reference for BFS and SSSP
- Moved common functions into gunrock/app/test_base.cuh
- Moved CPU reference code and result validation into gunrock/app/<algo>/<algo>_test.cuh
Enactor
- Common functions moved into gunrock/app/enactor_base.cuh
- Use OpenMP to maintain controlling threads on CPU
- Use instances of Iteration instead of static access to its functions
Command line parameters
- A dedicated Parameters struct to store all running parameters
- Parameters must be defined via the Use function before they are used
- Command line is parsed by get_opt
- Set to set parameter values
- Get to get parameter values
- Handles vectors as parameter values
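The declare-before-use pattern in this list can be pictured with a small registry like the one below. This is only a sketch: the method names echo the Use, Set, and Get verbs from the changelog, but the signatures, the string-only values, and the error handling are assumptions for illustration and do not reproduce Gunrock's Parameters struct.

```cpp
#include <cassert>
#include <map>
#include <stdexcept>
#include <string>

// Hypothetical parameter registry; signatures and behavior are assumptions.
class Parameters {
 public:
  // Declares a parameter with its default value. Must precede Set/Get.
  void Use(const std::string& name, const std::string& default_value) {
    assert(!name.empty());
    values_[name] = default_value;
  }

  // Overrides the value of a previously declared parameter.
  void Set(const std::string& name, const std::string& value) {
    Find(name) = value;
  }

  // Reads the current value of a previously declared parameter.
  const std::string& Get(const std::string& name) { return Find(name); }

 private:
  // Looks up a declared parameter; throws if Use was never called for it.
  std::string& Find(const std::string& name) {
    auto it = values_.find(name);
    if (it == values_.end()) {
      throw std::invalid_argument("parameter not declared: " + name);
    }
    return it->second;
  }

  std::map<std::string, std::string> values_;
};
```

Rejecting undeclared names in Set and Get is one way to catch misspelled options early, which is the usual motivation for requiring a declaration step.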
1D operators for Array1D
- Per-element operations, e.g. ForAll and ForEach
- Vector-Vector operations, e.g. Add, Minus, Mul, Div, Mad, Set
- Vector-Scalar operations
- Sort
Code restructuring
- Partitioners moved from gunrock/app to gunrock/partitioner
- LB operator moved from gunrock/oprtr/edge_map_partitioned_forward to gunrock/oprtr/LB_advance
- TWC operator moved from gunrock/oprtr/edge_map_forward to gunrock/oprtr/TWC_advance
Optional Boost dependency
- Utility functions changed to C++11 or implemented
- CPU references implemented for BFS and SSSP, and will be called when Boost is not available
- info will use RapidJson-based implementation, when Boost is not available
Known Issues
- Multi-GPU framework not fully tested
- Operators have decreased performance, due to more than 32 registers used by a single thread in the kernels
- RGG and GRMAT generators not ported yet
- SSSP may have incorrect predecessors, due to data racing in marking the predecessors within the operator kernels
Gunrock 0.5.1
Minor bug-fixes release.
v0.5.1 Changelog
- Added Ctest for TC and SALSA applications
- Fixed illegal memory access bug in WTF
- Removed test directory
- Removed duplicated templates
- Cleaned up sample application code
- Added support for CUDA 9+, updated moderngpu dependency
Gunrock 0.5.0
Gunrock release 0.5 is a feature (minor) release that adds:
- New primitives and better support for existing primitives.
- New operator: Intersection.
- Unit-testing support through Googletest infrastructure.
- CPU reference code for correctness checking for some of the primitives.
- Support for continuous integration (Jenkins) and code-coverage.
- Overall bug fixes and support for new CUDA architectures.
v0.5 Changelog
All notable changes to gunrock for v0.5 are documented below:
Added
- New primitives:
  - A*
  - Weighted Label Propagation (LP)
  - Minimum Spanning Tree (MST)
  - Random Walk (RW)
  - Triangle Counting (TC)
- Operator:
  - Intersection operator (for example, see TC)
- Unit-testing:
  - Googletest support (see unittests directory)
- Docs
  - Support using Slate (see https://github.com/gunrock/docs)
- CPU reference code
- Run scripts for all primitives
- Clang-format based on Google style
  - see commit aac9add (revert for diff)
- Support for Volta and Turing architectures
- Regression tests to ctest for better code-coverage
- Memset kernels
- Multi-gpu testing through Jenkins
Removed
- Subgraph matching (SM) and the join operator removed due to race conditions (SM is planned for a future release)
- Plots generation python scripts removed (moved to https://github.com/gunrock/io)
- MaxFlow primitive removed because it was not fully implemented in time for a release (an implementation exists in the new API for a future release)
- Outdated documentation
Fixed
- HITS now produces correct results
- Illegal memory access fixed for label propagation (LP) primitive
- WTF Illegal memory access fixed for frontier queue (see known issues below)
- Other minor bug fixes
Changed
- Updated README and other docs
- Moved previously tests directory to examples
- Doesn't require CMakeLists.txt (or cmake) to run make
- Moved all docs to Slate
Known Issues:
Gunrock 0.4.0
Gunrock release 0.4 is a feature release that adds
- New optimizations to both advance and filter operators
- Multi-iteration support for BFS, SSSP, BC, CC and PR
- Better error handling
- Updates on several interfaces
- Overall performance improvement for both single and multi-GPU execution
v0.4 ChangeLog
- Integrated direction-optimizing BFS with normal BFS. Now for BFS there is only one executable, named bfs. The direction-optimizing switch is enabled by the command-line option --direction-optimized.
- Added three new strategies for advance (triggered by setting ADVANCE_MODE accordingly):
  - ALL_EDGES, optimized for advance on all edges with all vertices of the graph. With ALL_EDGES, there is no need to use sorted search for load balancing, just binary search over the whole row offsets array; used in CC.
  - LB_CULL, fused LB advance with a subsequent CULL filter; used in BFS, SSSP and BC.
  - LB_LIGHT_CULL, fused LB_LIGHT advance with a subsequent CULL filter; used in BFS, SSSP and BC.
- Added three new strategies of filter (triggered by setting FILTER_MODE accordingly):
  - COMPACTED_CULL, optimized on several culling heuristics
  - SIMPLIFIED, another implementation of the CULL filter, without some optimizations
  - BY_PASS, optimized for a filter with no elements to remove from the input frontier; used in CC and PR.
- Added multi-iteration support for BFS, SSSP, BC, CC and PR. Users can set the number of iterations to run and specify the source node for each run (if necessary) via InitSetup() defined in gunrock.h.
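The binary search over row offsets mentioned for ALL_EDGES can be sketched as follows: when every edge is processed, each edge index can locate its own source vertex directly in the CSR row offsets array. This is a sequential C++ illustration under that reading of the changelog, not the Gunrock kernel; source_of_edge is a hypothetical name.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Returns the source vertex of edge e in a CSR graph, i.e. the last row v
// with row_offsets[v] <= e. Vertices without edges are skipped because
// upper_bound lands after any run of equal offsets.
int source_of_edge(const std::vector<int>& row_offsets, int e) {
  assert(!row_offsets.empty());
  assert(e >= 0 && e < row_offsets.back());
  auto it = std::upper_bound(row_offsets.begin(), row_offsets.end(), e);
  return static_cast<int>(it - row_offsets.begin()) - 1;
}
```

Each lookup costs O(log V) and needs no per-frontier preprocessing, which is why a plain binary search can be enough when the work set is simply the full edge list.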
v0.4 Known Issues
- HITS and SALSA do not have CPU reference yet
- HITS, SALSA, and Who-to-Follow (WTF) do not have multi-GPU support yet
- An out-of-memory error (for graphs that approach the memory limit
of GPUs) will cause result validation to fail
Gunrock 0.3.1
A release version for PPoPP 2016 artifact evaluation.
Gunrock 0.3.0
Gunrock release 0.3 is a feature release that adds two new graph primitives, Stochastic Approach for Link-Structure Analysis (SALSA) and Minimum Spanning Tree (MST), and improves several existing primitives. The new release uses a unified framework for both single-GPU and single-node multi-GPU execution. Five graph primitives (BFS, CC, PR, BC, and SSSP) can now be launched on multiple GPUs by adding the command-line option --device=GPU_index_1,GPU_index_2,...,GPU_index_n. A simple pure C interface allows users to easily integrate Gunrock into their own work. A stats-logging and performance-chart-generating pipeline is prototyped in this new release. All our future releases will require C++11 support.
Gunrock 0.2.0
A new Gunrock release that includes a complete refactoring of v0.1 with two traversal modes, two brand-new operators (advance and filter), and several new graph primitives. This release is the reference version that reproduces our experimental results for PPoPP '15.