Skip to content

Cascading calculations across a directed graph #97

Closed Answered by cwensel
sancarn asked this question in Q&A
Discussion options

You must be logged in to vote

FWIW, Cascading isn't a graph API, but a data flow API, that is frequently used to perform complex work on an Apache Hadoop cluster for very large datasets.

The corpus of data could be large, or an individual unit of work could be large, so that either doesn't fit into memory on a single machine.

In that case, you could chain graph operations together into a flow by partitioning the work as streams.

but I suspect you could perform each operation in memory, and if so, and using Java, i'd look at using JGraphT as a basis for the graph.

this may or may not be helpful, but it isn't immediately obvious to me Cascading would be helpful for 'cascading' events through a graph.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@sancarn
Comment options

Answer selected by sancarn
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants