Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

All snapshots should have a root index listing all langs + snapshots #3

Open
jbenet opened this issue May 1, 2017 · 4 comments
Open

Comments

@jbenet
Copy link
Member

jbenet commented May 1, 2017

we should have a root "wikipedia on ipfs" index directory with:

  • all languages
  • all snapshots on ipfs (by date)
  • any metadata we want
  • (like dist.ipfs.io archive kinda)
@flyingzumwalt
Copy link
Contributor

starting to gather the info for this root page at https://github.com/ipfs/distributed-wikipedia-mirror/blob/master/snapshot-hashes.md

@flyingzumwalt flyingzumwalt changed the title root index of all langs + snapshots All snapshots should have a root index listing all langs + snapshots May 12, 2017
@alzinging
Copy link

This YAML file shows last updates in 2021, we really need them updated to current snapshots and maintain a log of all the changed Qm Merkledags--as you suggested 6 years ago.

The YML was here when I got here a few months ago, but has been recently updated to be more easily visible.

I also think we should run Ads on the whole thing to offset costs of storage and development of a Wiki on IPFS/Web3 framework that will interoperate with live Mediawiki/Wikipedia editing process, as they suggest in their "live mirror discussions." We could create a sort of conduit that will force upgrades of the Mediawiki software process to enable tiered administration of the editing and approval process.

QUARRY on mwcloud.org looks interesting, though it seems to be "alpha"

Can someone upload the "xml.tar.bz2" files from https://en.m.wikipedia.org/wiki/Wikipedia:Database_download

TL;DR: GET THE MULTISTREAM VERSION! (and the corresponding index file, pages-articles-multistream-index.txt.bz2) in all languages available. Wikisource.org also appears new.

I am interested in developing a framework for Browser based rich text editing of Wiki pages, I think it might "spinoff" into an active community of Web3 open source blogger.com/medium.com writers and readers. The GO IPFS implementation Kubo appears to be in active development making "good strides" towards where we need to be to feel "multi cloud stable" across S3/Azure/Google with HA-sharding and FIL/Dolt

https://docs.dolthub.com/introduction/what-is-dolt

They also have some "fresh good ideas" implemented and an active community of users. They are using a libp2p implementation of "swarm" that also appears to be in active hero status. I can't recall the name, something-gun.

I am willing to work "Gratis" as in I will even support others development with modest funding. Please contact me if you are capable and willing.

9546678083
getme.s.lamc.la

@kelson42
Copy link

kelson42 commented Dec 15, 2022

@jbenet This is not what https://ipfs.kiwix.org does?

@alzinging
Copy link

That site is using and references this project. Kiwix complicates the XML backup to "readable HTML"

It's dated 9/1/2021.

... though I wasn't going to backup the Wikipedia "image media" that adds another step. I am not familiar with Githubs automation, but I'm sure it works and will make this worthwhile.

I have the Wikimedia "import to MySQL/HTML" process ... "understood" ... it won't take me long.

I need help making a JavaScript client side wiki ... interface with IPFS. They are working on something like "git" level everything but editing in Kubo, I will probably use the IPFS JavaScript implementation instead, though, I imagine there is a client side git implementation also. I'll look.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants