
Add Collaborative IPFS Cluster Tooling For Data Replication & Downloader Bug Fix #286

Closed
wants to merge 18 commits into from

Conversation

bonedaddy
Collaborator

@bonedaddy bonedaddy commented Jun 5, 2020

Overview

Follow-up to #222. Enables closing #265 by creating a public "IPFS Follow Cluster" that anyone can join to replicate the set of media linked to by the incident reports in data_build. This PR also includes a bug fix for the video downloader, which previously did not work at all (closes #306). Additionally, a new feature was added that avoids re-downloading any media that was previously downloaded.

It provides two methods of operation:

  • Trusted peer setup (allowing anyone to create their own cluster as a trusted peer)
  • Follow peer setup (allowing anyone to participate in a public cluster as a follow peer which mirrors data)

A preliminary public cluster is online and can be joined with:

$> ipfs-cluster-follow 2020pb-dataset run --init 2020pb.temporal.cloud

A corresponding wiki page has been created
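
For readers curious how "avoid re-downloading media" can work, here is a minimal sketch in Python. It is not taken from this PR's diff: the function name `download_once`, the injected `fetch` callable, and the `.part` suffix are all hypothetical choices made for illustration. The idea is to skip any destination file that already exists, and to write new downloads to a temporary file that is renamed into place only on success, so an interrupted run does not leave a partial file behind.

```python
import os
import tempfile
from pathlib import Path
from typing import Callable


def download_once(url: str, dest: Path, fetch: Callable[[str], bytes]) -> bool:
    """Fetch `url` into `dest` unless `dest` already exists.

    `fetch` is a hypothetical stand-in for whatever actually retrieves the
    media bytes. Returns True if a download happened, False if it was skipped.
    """
    if dest.exists():
        # Already downloaded on an earlier run: do nothing.
        return False

    dest.parent.mkdir(parents=True, exist_ok=True)

    # Write to a temporary file in the same directory, then rename it into
    # place, so a failed download never leaves a partial file at `dest`.
    fd, tmp_name = tempfile.mkstemp(dir=str(dest.parent), suffix=".part")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(fetch(url))
        os.replace(tmp_name, dest)
    except BaseException:
        # Clean up the temporary file before re-raising the error.
        if os.path.exists(tmp_name):
            os.remove(tmp_name)
        raise
    return True
```

Calling `download_once` twice with the same `dest` invokes `fetch` only the first time. Keeping the temporary file in the destination directory means `os.replace` stays on one filesystem, where the rename is atomic on POSIX systems.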

TODO

  • Spin up a public cluster
  • Update configurations (copied and pasted them from a different follow cluster I run)
  • Finish readme
  • Finish webhook server to auto-update the cluster when new data is added (I'll do this in a different PR; if you want to follow development of that, see here)

Warning

As part of this PR I'll spin up a public cluster that enables anyone to start mirroring the data. If you value your anonymity and privacy, do not participate in any public cluster belonging to this dataset. IPFS is supremely self-doxxing, and if you participate in the cluster anyone interested in finding out who you are will be able to do so. Even if you come from behind something like Tor and a VPN, IPFS can still dox you.

@bonedaddy bonedaddy changed the title from "Add Video/Media Data Replicator (WIP - Do Not Merge)" to "Add Video/Media Data Replicator" Jun 5, 2020
@bonedaddy bonedaddy marked this pull request as ready for review June 5, 2020 22:55
@bonedaddy bonedaddy changed the title from "Add Video/Media Data Replicator" to "Add Data Replicator Tooling & Public Collaborative IPFS Cluster To Mirror Data On" Jun 5, 2020
@bonedaddy bonedaddy changed the title from "Add Data Replicator Tooling & Public Collaborative IPFS Cluster To Mirror Data On" to "Add Collaborative IPFS Cluster Tooling For Data Replication" Jun 5, 2020
@bonedaddy bonedaddy marked this pull request as draft June 5, 2020 23:32
@bonedaddy bonedaddy marked this pull request as ready for review June 5, 2020 23:36
@bonedaddy bonedaddy changed the title from "Add Collaborative IPFS Cluster Tooling For Data Replication" to "Add Collaborative IPFS Cluster Tooling For Data Replication & Downloader Bug Fix" Jun 5, 2020
@ubershmekel ubershmekel added the Engineering (Changes our tools and data pipeline) label Jun 6, 2020
@bonedaddy bonedaddy marked this pull request as draft June 7, 2020 04:14
@bonedaddy
Collaborator Author

I'm putting this back into draft mode as I need to update the downloader to use pb-id

@ubershmekel
Collaborator

Closing this PR. I would recommend moving this tool to its own repo. We could even have that repo under the 2020pb org if you like @bonedaddy

Labels
Engineering Changes our tools and data pipeline
Development

Successfully merging this pull request may close these issues.

BUG: Downloader Tool Leaves Data Fragments On Disk
2 participants