🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
Updated
Jun 10, 2024 - Python
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
A suite of tools for mirroring and hoarding web pages you visit for later offline viewing. I.e. your own personal Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data, which also follows "archive everything now, figure out what to do with it later" philosophy.
FeedVault is an open-source web application that allows users to archive and search their favorite web feeds.
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
Home of the official apt/deb package for Ubuntu/Debian-based systems.
😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...
Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.
Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.
Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.
Wayback Machine API interface & a command-line tool
Home of the official docker image for ArchiveBox
Homebrew formula for the ArchiveBox self-hosted internet archiving solution.
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.
Navigator for Web Archive
Download and archive RSS feeds to Wayback Machine. Save a list of archived feed in locad db.
Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
upload stuff to the Internet Archive using a shell script
Desktop Electron app for ArchiveBox internet archiver. (ALPHA: not ready for general use)
Pick a date and explore websites from the early days of the internet to now all in an easy-to-use browser format! 💻
Add a description, image, and links to the internet-archiving topic page so that developers can more easily learn about it.
To associate your repository with the internet-archiving topic, visit your repo's landing page and select "manage topics."