宁波凯思奥教育科技有限公司
-
Updated
Apr 19, 2019 - HTML
宁波凯思奥教育科技有限公司
Data for testing the Offtopic detection software
Send records from an EPrints server to the Internet Archive and other web archives
minimalistic crawler
ArchiveSpark DataSpec to analyze the Internet Archive's Web archive through temporal search results returned by Tempas (v2)
Parse CDXJ(https://github.com/oduwsdl/ORS/wiki/CDXJ) files with node.js
A collection of the scripts and notebooks I wrote as part of my Data Science Bootcamp capstone project
Process web archives (WARC format) with StormCrawler and index content into Elasticsearch or Solr
Python scripts to generate static navigation pages from collection list and insert Web Archives records using the Archive-It CDX
This module builds our Waybacks in the various different configurations we require.
A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (mementos).
Create Robust Links from within Zotero
A Python utility for publishing a social media story built from archived web pages to multiple services.
Create "perfect" snapshots of web pages
Add-On for Google Sheets to help those working with web archives.
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.
Add a description, image, and links to the web-archives topic page so that developers can more easily learn about it.
To associate your repository with the web-archives topic, visit your repo's landing page and select "manage topics."