Skip to content

A spark script for processing (large-scale) file system snapshot data.

Notifications You must be signed in to change notification settings

sandrain/spider2-snapshot-anon

Repository files navigation

spider2-snapshot-anon

Pyspark code for anonymizing the spider 2 snapshot files. This was used for obfusticating the snapshot data, which was used for the following study:

This software ran on the Andes cluster at OLCF with the magpie framework.

The anonymized snaphost will be available in public.

About

A spark script for processing (large-scale) file system snapshot data.

Topics

Resources

Stars

Watchers

Forks