Skip to content

dhamaniasad/PhantomWARC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PhantomWARC

Generate WARC files from dynamic webpages


Installation

Download archive from latest release and unpack:

https://github.com/dhamaniasad/PhantomWARC/releases/latest

pip install -r requirements.txt
Usage
import phantomwarc

phantomwarc.init_browser("http://www.google.com", ia=True)
Internet Archive Uploading

Get your Internet Archive S3-Like API Keys from: https://archive.org/account/s3.php Set the Access Key and Secret Key as environment variables like so:

export IAS3_ACCESS_KEY=''
export IAS3_SECRET_KEY=''