web-archiving
Here are 108 public repositories matching this topic...
PowerShell scripts to use the Internet Archive Wayback Machine
Updated May 23, 2022 - PowerShell
Archive a list of URLs using the Wayback Machine
Updated Feb 21, 2024 - Python
A suite of tools for mirroring and hoarding the web pages you visit for later offline viewing: your own personal Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data. It follows the "archive everything now, figure out what to do with it later" philosophy.
Updated May 20, 2024 - Python
Discord Media Loader - Simply download all attachments
Updated Aug 14, 2018 - C#
A Tumblr Blog Backup Application
Updated Jul 3, 2018 - C#
Piazza course archiver and viewer
Updated Apr 15, 2024 - Python
Michael Kurzmeier, 4th-year PhD, Digital Humanities @maynooth University
Updated Jan 19, 2021
HTTPreserve Analysis of Million Dollar Web Page
Updated Jun 2, 2021
Command-line program to download videos from YouTube.com and other video sites
Updated Jul 4, 2018 - Python
Python Implementation for iipc/webarchive-commons
Updated Sep 27, 2023 - Python
Crawls websites and saves found URLs to a file.
Updated Feb 5, 2024 - JavaScript
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Updated May 15, 2024 - Java
Core Python Web Archiving Toolkit for replay and recording of web archives
Updated Apr 5, 2024 - JavaScript
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by ArchiveBox.io under the hood.
Updated Feb 2, 2024 - HTML
This system evaluates a series of mementos (archived web pages) to determine which are off topic. The series can be part of an Archive-It collection, a single TimeMap, or stored in a WARC file.
Updated Nov 7, 2017 - Python